BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3847
(664 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242008779|ref|XP_002425177.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212508879|gb|EEB12439.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 685
Score = 704 bits (1817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/523 (65%), Positives = 394/523 (75%), Gaps = 60/523 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY A++VV DVSWIPNKHYSGVYGL+KLTLPK+LP L K +VLDTDV FATDI
Sbjct: 122 PQVDVRFYPAENVVPDVSWIPNKHYSGVYGLMKLTLPKILPRNLTKVVVLDTDVTFATDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW L +++ +Q GLVENQSDWYLGKLWKNHKPWPALGRG+NTGVILL + +LR +
Sbjct: 182 AELWKLMTKMSDKQAFGLVENQSDWYLGKLWKNHKPWPALGRGYNTGVILLQMKRLRRLK 241
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
WA WR IAEK L+T TSLADQDI NAII ++P LVY LPCQWNVQLSDNTRS ELCY
Sbjct: 242 WAHLWRFIAEKDLVTMYSTSLADQDILNAIIKQYPELVYKLPCQWNVQLSDNTRS-ELCY 300
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
TE+TDLK+IHWNSPKKLKV+NKH+EFFRNLYLTFLEYDGNLLRRELFGCN+++ ++ Q E
Sbjct: 301 TEVTDLKVIHWNSPKKLKVQNKHVEFFRNLYLTFLEYDGNLLRRELFGCNISRPSVPQQE 360
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
L LNEED CYD RR++++ LRTHLYFL+Y+YE S+DG DVTLVAQLSMDRL
Sbjct: 361 LRKLNEEDQCYDFRRARISRLRTHLYFLDYDYEPSTDGADVTLVAQLSMDRL-------- 412
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
QMVE L KHWEGPISLTLYMSDAEAQQFLSYA +S
Sbjct: 413 -------------------------QMVENLLKHWEGPISLTLYMSDAEAQQFLSYALSS 447
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L SR+NIGYH+VYKEGNFYP+N LRNVAL QV TP+VFL DIDFLPMFGLY YLK+ +
Sbjct: 448 EILSSRKNIGYHIVYKEGNFYPVNLLRNVALQQVKTPFVFLTDIDFLPMFGLYEYLKTCV 507
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+ +++ K LVVPAFETQRYR FP S HAPTN
Sbjct: 508 QLLNL--ENSNKALVVPAFETQRYRITFPRSKADLLRMLDMGSLFTFRYDVWPKGHAPTN 565
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+++W AT+PY++ W PD+EPYIV +D+P YDTRFVGFGWNK
Sbjct: 566 YTKWRTATSPYEVSWEPDYEPYIVVKKDIPLYDTRFVGFGWNK 608
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 121/168 (72%), Positives = 142/168 (84%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
+E I CE+I VAIVCAGYN++RS+VTLIKSILFY +NPL H I D+VA +IL+TLF
Sbjct: 58 KEAEISKCEIIHVAIVCAGYNASRSVVTLIKSILFYTRNPLQLHFIADSVAWHILETLFK 117
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TWSVPQV+V FY A++VV DVSWIPNKHYSGVYGL+KLTLPK+LP L K +VLDTDV F
Sbjct: 118 TWSVPQVDVRFYPAENVVPDVSWIPNKHYSGVYGLMKLTLPKILPRNLTKVVVLDTDVTF 177
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LW L +++ +Q GLVENQSDWYLGKLWKNHKPWPALGRG+
Sbjct: 178 ATDIAELWKLMTKMSDKQAFGLVENQSDWYLGKLWKNHKPWPALGRGY 225
>gi|193669092|ref|XP_001944635.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
[Acyrthosiphon pisum]
gi|328699413|ref|XP_003240926.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
[Acyrthosiphon pisum]
Length = 709
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/523 (62%), Positives = 380/523 (72%), Gaps = 63/523 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P L FY A+ ++ DVSWI NKHYSGVYGL+KL L +V P+TL + +VLDTD+ FATDI
Sbjct: 156 PQLEIEFYPAEKIIADVSWINNKHYSGVYGLMKLCLTQVFPDTLKQVLVLDTDITFATDI 215
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
AQLW++ ++ Q++GLVENQSDWYLGKLWK HKPWPALGRG+NTGVIL+DL KLR I
Sbjct: 216 AQLWSMLKSFKKGQSLGLVENQSDWYLGKLWKKHKPWPALGRGYNTGVILMDLRKLRLIG 275
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W G WR +A K L+T WTSLADQDIFN II E P L+Y +PCQWNVQLSDNTRS ELCY
Sbjct: 276 WNGLWRSVAVKTLVTHYWTSLADQDIFNTIIKERPELLYPMPCQWNVQLSDNTRS-ELCY 334
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLL+RELFGCN + T E
Sbjct: 335 AEVIELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLQRELFGCNYSSTIQDVPE 394
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S E+D CY+ RR+ + SLRTHLYFL++EY++S D D+TLVAQLSMDRLQ
Sbjct: 395 IS---EDDLCYEFRRAGMASLRTHLYFLDFEYQSSPDECDITLVAQLSMDRLQ------- 444
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
MVEML ++W+GPISL+LYMSDAEAQQFLSYA NS
Sbjct: 445 --------------------------MVEMLCEYWDGPISLSLYMSDAEAQQFLSYAQNS 478
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E LR+R+NIGYHVVYKEG+FYPIN LRNVAL V TPYVFL DIDFLPM GLY YLK I
Sbjct: 479 EVLRNRKNIGYHVVYKEGDFYPINLLRNVALENVVTPYVFLSDIDFLPMSGLYSYLKKYI 538
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
M++ K LVVPAFETQRYRTAFP + HA TN
Sbjct: 539 FIMNI--KSSDKALVVPAFETQRYRTAFPQTKADILRMLDDGALFSFRYHVWPKGHAATN 596
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+++W +T PY++ W PD+EPY+V D+PRYDTRFVGFGWNK
Sbjct: 597 YAKWKGSTVPYKVSWEPDYEPYVVVRSDVPRYDTRFVGFGWNK 639
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 111/164 (67%), Positives = 136/164 (82%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I CEVI+V IVCAGYNSTRS+VTLIKSILFYRKNPL FH + D ++ IL+TLF TW +
Sbjct: 96 IDKCEVIEVGIVCAGYNSTRSVVTLIKSILFYRKNPLRFHFVVDVISRKILKTLFDTWKI 155
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
PQ+E+ FY A+ ++ DVSWI NKHYSGVYGL+KL L +V P+TL + +VLDTD+ FATDI
Sbjct: 156 PQLEIEFYPAEKIIADVSWINNKHYSGVYGLMKLCLTQVFPDTLKQVLVLDTDITFATDI 215
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
AQLW++ ++ Q++GLVENQSDWYLGKLWK HKPWPALGRG+
Sbjct: 216 AQLWSMLKSFKKGQSLGLVENQSDWYLGKLWKKHKPWPALGRGY 259
>gi|94733298|emb|CAK04442.1| novel protein similar to vertebrate discs, large homolog 3
(neuroendocrine-dlg, Drosophila) (DLG3) [Danio rerio]
Length = 750
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/527 (60%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 245
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR +
Sbjct: 246 AELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRMG 305
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + T L
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASSESTVL 424
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L L+E+D CYD RR ++ R HLYFL+YEY + DG D+TLVAQLSMDR
Sbjct: 425 QQA-LEELDEDDQCYDFRRERIMLHRVHLYFLQYEYSPTDDGTDITLVAQLSMDR----- 478
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L++R+N+GYH+VYKEG FYP+N +RNVAL V+TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALRNVNTPYVFLTDVDFLPMYGLYDYL 570
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ SI +DM KK LVVPAFET RYR +FP S H
Sbjct: 571 RKSIVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWTKGH 628
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTN+++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEADFEPYVVVRRDCPEYDQRFVGFGWNK 675
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 115/166 (69%), Positives = 137/166 (82%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P CE++ VA VCAG+N++R +VTL+KSILF+R+NPLHFH ITDTVA IL TLF +W
Sbjct: 124 PTAEKCELLHVACVCAGHNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSW 183
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V+VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFAT 243
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289
>gi|52218928|ref|NP_001004538.1| glycosyltransferase-like protein LARGE2 [Danio rerio]
gi|82083909|sp|Q66PG1.1|LARG2_DANRE RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
Full=Glycosyltransferase-like 1B
gi|51872297|gb|AAU12252.1| glycosyltransferase-like 1B [Danio rerio]
gi|190336631|gb|AAI62103.1| Glycosyltransferase-like 1b [Danio rerio]
Length = 750
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/527 (60%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 245
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR +
Sbjct: 246 AELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRMG 305
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + T L
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASSESTVL 424
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L L+E+D CYD RR ++ R HLYFL+YEY + DG D+TLVAQLSMDR
Sbjct: 425 QQA-LEELDEDDQCYDFRRERIMLHRVHLYFLQYEYSPTDDGTDITLVAQLSMDR----- 478
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L++R+N+GYH+VYKEG FYP+N +RNVAL V+TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALRNVNTPYVFLTDVDFLPMYGLYDYL 570
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ SI +DM KK LVVPAFET RYR +FP S H
Sbjct: 571 RKSIVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWTKGH 628
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTN+++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEADFEPYVVVRRDCPEYDQRFVGFGWNK 675
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 115/166 (69%), Positives = 137/166 (82%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P CE++ VA VCAG+N++R +VTL+KSILF+R+NPLHFH ITDTVA IL TLF +W
Sbjct: 124 PTAEKCELLHVACVCAGHNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSW 183
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V+VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFAT 243
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289
>gi|410912556|ref|XP_003969755.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Takifugu
rubripes]
Length = 748
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/527 (60%), Positives = 379/527 (71%), Gaps = 67/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLRKVIVLDTDITFATDI 245
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW +F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I
Sbjct: 246 AELWGIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIG 305
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AEK L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAEKELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC +Q++ +
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCP-SQSSPENIQ 423
Query: 403 -QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QA L L+E+D CYD RR +LT+ R HLYFL+YEY + D D+TLVAQLSMDR
Sbjct: 424 LQAALEELDEDDQCYDFRRERLTAHRVHLYFLQYEYTPTED--DITLVAQLSMDR----- 476
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 477 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 508
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L++R+N+GYH+VYKEG FYP+N +RNVAL TPYVFL D+DFLPM+GLY YL
Sbjct: 509 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALKNARTPYVFLTDVDFLPMYGLYEYL 568
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ MDM H KK LVVPAFET RYR +FP S H
Sbjct: 569 RKSVLQMDM-AH-TKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGH 626
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTN+++W ATTPY++EW PDFEPY+V RD P YD RFVGFGWNK
Sbjct: 627 APTNYAKWRTATTPYKVEWEPDFEPYVVVRRDCPEYDQRFVGFGWNK 673
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 113/166 (68%), Positives = 136/166 (81%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
PV+ CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITD VA IL +LF +W
Sbjct: 124 PVVQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDAVANQILSSLFQSW 183
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V+VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLRKVIVLDTDITFAT 243
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LW +F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289
>gi|298684035|gb|ADI96197.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVXXKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (650), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684009|gb|ADI96184.1| LARGE-1 [Mastomys kollmannspergeri]
Length = 671
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++ +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPDVQKCETIHVAIVCAGYNASXXVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684023|gb|ADI96191.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684019|gb|ADI96189.1| LARGE-1 [Mastomys natalensis]
gi|298684021|gb|ADI96190.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684033|gb|ADI96196.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VA VCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAXVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|348530274|ref|XP_003452636.1| PREDICTED: glycosyltransferase-like protein LARGE2-like
[Oreochromis niloticus]
Length = 750
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/527 (60%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLSKVIVLDTDITFATDI 245
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW +F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I
Sbjct: 246 AELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIG 305
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC TQ + +
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCP-TQASPESIK 423
Query: 403 -QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q L L+E+D CYD RR +LT R HLYFL+Y+Y + D D+TLVAQLSMDR
Sbjct: 424 LQKALEELDEDDQCYDFRRERLTVHRVHLYFLQYDYTPTEDNTDITLVAQLSMDR----- 478
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L++R+N+GYH+VYKEG FYP+N +RNVAL +TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALKNANTPYVFLTDVDFLPMYGLYDYL 570
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ MDM H KK LVVPAFET RYR +FP S H
Sbjct: 571 RKSVVQMDM-AH-TKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGH 628
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTN+++W ATTPY++EW PDFEPY+V R+ P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEPDFEPYVVVRRECPEYDQRFVGFGWNK 675
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 113/166 (68%), Positives = 136/166 (81%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P + CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITDTVA IL +LF +W
Sbjct: 124 PTVQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDTVANRILSSLFQSW 183
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V+VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLSKVIVLDTDITFAT 243
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LW +F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289
>gi|298684031|gb|ADI96195.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQXWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|260805248|ref|XP_002597499.1| hypothetical protein BRAFLDRAFT_281013 [Branchiostoma floridae]
gi|229282764|gb|EEN53511.1| hypothetical protein BRAFLDRAFT_281013 [Branchiostoma floridae]
Length = 677
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/525 (61%), Positives = 382/525 (72%), Gaps = 63/525 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY A+++ E+VSWIPNKHYSGVYGL+KL L + LP L K IVLDTDV FATDI
Sbjct: 117 PGVTANFYSAENLKEEVSWIPNKHYSGVYGLMKLVLTRALPTDLEKVIVLDTDVTFATDI 176
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW LF + + +++IGLVENQSDWYLGK+WKNH+PWPALGRGFNTGVILL L KLR +
Sbjct: 177 AELWKLFKKFKGKKSIGLVENQSDWYLGKIWKNHRPWPALGRGFNTGVILLHLEKLRGMK 236
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I +HPYLV+ LPC WNVQLSDNT+S++ CY
Sbjct: 237 WMQMWRLTAERELMSMLSTSLADQDIFNAVIKQHPYLVHKLPCAWNVQLSDNTKSEQ-CY 295
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ-QA 404
TE+TDLK+IHWNSPKKL+VKNKH+EFF+N+YLTFLEYDGNLLRRELFGC Q +
Sbjct: 296 TEVTDLKVIHWNSPKKLRVKNKHVEFFKNMYLTFLEYDGNLLRRELFGCEKVQHLDEVDQ 355
Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
+L+ L+E+D CYD RR ++ RTHLYFL+Y YE S D NDVTLVAQLSMDR
Sbjct: 356 KLNALDEDDQCYDFRRERVLMHRTHLYFLDYMYE-SVDDNDVTLVAQLSMDR-------- 406
Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
LQM+E L +HWEGPISL LYMSDAEAQQFL YA
Sbjct: 407 -------------------------LQMLEALCRHWEGPISLALYMSDAEAQQFLRYALG 441
Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
SE L R+N+GYH+VYKEG FYP+N LRN+AL QV+TPYVFL+DIDFLPM+ LYPYLK +
Sbjct: 442 SETLMQRKNVGYHIVYKEGQFYPVNLLRNIALQQVNTPYVFLMDIDFLPMYNLYPYLKKT 501
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
+ MDM KK LVVPAFET RYR +FP S HAPT
Sbjct: 502 VGMMDMRRQ--KKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWQKGHAPT 559
Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
N+++W ATTPY++ W PDFEPYIV +D LP YDTRFVGFGWNK
Sbjct: 560 NYAKWRTATTPYRVNWEPDFEPYIVVRKDQLPEYDTRFVGFGWNK 604
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 115/167 (68%), Positives = 138/167 (82%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
E +P CE I+VAIVCAGYN+TR +VTL+KSILFYRKNPLHFH ++D A ILQTL T
Sbjct: 54 EDQVPKCETIEVAIVCAGYNATRGVVTLVKSILFYRKNPLHFHFMSDRAATLILQTLLRT 113
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W+VP V +FY A+++ E+VSWIPNKHYSGVYGL+KL L + LP L K IVLDTDV FA
Sbjct: 114 WNVPGVTANFYSAENLKEEVSWIPNKHYSGVYGLMKLVLTRALPTDLEKVIVLDTDVTFA 173
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TDIA+LW LF + + +++IGLVENQSDWYLGK+WKNH+PWPALGRGF
Sbjct: 174 TDIAELWKLFKKFKGKKSIGLVENQSDWYLGKIWKNHRPWPALGRGF 220
>gi|38494377|gb|AAH61506.1| Large protein, partial [Mus musculus]
Length = 732
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 168 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 227
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 228 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 287
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 288 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 346
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 347 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 406
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 407 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 460
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 461 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 492
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 493 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 552
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 553 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 610
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 611 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 657
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 104 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 163
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 164 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 223
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 224 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 271
>gi|326912113|ref|XP_003202398.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
[Meleagris gallopavo]
Length = 720
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 156 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 215
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 216 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 275
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 276 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 334
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 335 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 394
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 395 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 448
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 449 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 480
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 481 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 540
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L++PAFET RYR +FP S H
Sbjct: 541 RKSVTQLDL--ANTKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 598
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V +D P YD RFVGFGWNK
Sbjct: 599 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 645
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 92 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 151
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V + FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 152 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 211
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 212 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 259
>gi|298684015|gb|ADI96187.1| LARGE-1 [Mastomys kollmannspergeri]
Length = 671
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+ + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLXAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|449276407|gb|EMC84939.1| Glycosyltransferase-like protein LARGE1, partial [Columba livia]
Length = 722
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 158 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 217
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 218 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 277
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 278 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 336
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 337 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 396
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 397 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 450
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 451 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 482
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 483 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 542
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 543 RKSVTQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 600
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V +D P YD RFVGFGWNK
Sbjct: 601 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 647
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 94 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 153
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V + FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 154 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 213
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 214 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 261
>gi|6754506|ref|NP_034817.1| glycosyltransferase-like protein LARGE1 [Mus musculus]
gi|22001690|sp|Q9Z1M7.1|LARGE_MOUSE RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
Full=Acetylglucosaminyltransferase-like 1A
gi|3954978|emb|CAA06945.1| acetylglucosaminyltransferase-like protein [Mus musculus]
gi|116138324|gb|AAI25490.1| Like-glycosyltransferase [Mus musculus]
gi|116138711|gb|AAI25488.1| Like-glycosyltransferase [Mus musculus]
gi|148678872|gb|EDL10819.1| like-glycosyltransferase [Mus musculus]
Length = 756
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|224095339|ref|XP_002199503.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Taeniopygia
guttata]
Length = 756
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 681
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V + FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|28972313|dbj|BAC65610.1| mKIAA0609 protein [Mus musculus]
Length = 740
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 176 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 235
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 236 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 295
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 296 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 354
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 355 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 414
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDRLQM+E
Sbjct: 415 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDRLQMLE 473
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+ KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 474 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 500
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 501 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 560
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 561 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 618
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 619 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 665
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 112 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 171
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 172 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 231
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 232 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 279
>gi|298684045|gb|ADI96202.1| LARGE-1 [Mus musculus]
Length = 671
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|355563615|gb|EHH20177.1| hypothetical protein EGK_02976, partial [Macaca mulatta]
Length = 721
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 276
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 277 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 335
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 336 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 395
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 396 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 449
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 450 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 481
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 482 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 541
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 542 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 599
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 600 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 646
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 93 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 260
>gi|298684013|gb|ADI96186.1| LARGE-1 [Mastomys kollmannspergeri]
Length = 671
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNV + +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVXMKHISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ + +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKXVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|297708696|ref|XP_002831094.1| PREDICTED: like-glycosyltransferase isoform 1 [Pongo abelii]
gi|297708698|ref|XP_002831095.1| PREDICTED: like-glycosyltransferase isoform 2 [Pongo abelii]
Length = 756
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|332859455|ref|XP_003317212.1| PREDICTED: like-glycosyltransferase [Pan troglodytes]
gi|397501767|ref|XP_003821547.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4 [Pan
paniscus]
Length = 688
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|402884050|ref|XP_003905506.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4 [Papio
anubis]
Length = 688
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|297708700|ref|XP_002831096.1| PREDICTED: like-glycosyltransferase isoform 3 [Pongo abelii]
Length = 688
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|296191746|ref|XP_002743761.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3
[Callithrix jacchus]
Length = 688
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 613
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|403283126|ref|XP_003932978.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4
[Saimiri boliviensis boliviensis]
Length = 688
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 613
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|4758664|ref|NP_004728.1| glycosyltransferase-like protein LARGE1 [Homo sapiens]
gi|19924141|ref|NP_598397.1| glycosyltransferase-like protein LARGE1 [Homo sapiens]
gi|114686090|ref|XP_001154608.1| PREDICTED: like-glycosyltransferase isoform 2 [Pan troglodytes]
gi|114686092|ref|XP_001154669.1| PREDICTED: like-glycosyltransferase isoform 3 [Pan troglodytes]
gi|397501761|ref|XP_003821544.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1 [Pan
paniscus]
gi|397501763|ref|XP_003821545.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2 [Pan
paniscus]
gi|397501765|ref|XP_003821546.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3 [Pan
paniscus]
gi|22001684|sp|O95461.1|LARGE_HUMAN RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
Full=Acetylglucosaminyltransferase-like 1A
gi|3954938|emb|CAA07571.1| acetylglucosaminyltransferase-like protein [Homo sapiens]
gi|47678551|emb|CAG30396.1| LARGE [Homo sapiens]
gi|55957087|emb|CAI17890.1| like-glycosyltransferase [Homo sapiens]
gi|109451358|emb|CAK54540.1| LARGE [synthetic construct]
gi|109451936|emb|CAK54839.1| LARGE [synthetic construct]
gi|109659262|gb|AAI17426.1| Like-glycosyltransferase [Homo sapiens]
gi|116496729|gb|AAI26405.1| Like-glycosyltransferase [Homo sapiens]
gi|119580445|gb|EAW60041.1| like-glycosyltransferase, isoform CRA_a [Homo sapiens]
gi|119580446|gb|EAW60042.1| like-glycosyltransferase, isoform CRA_a [Homo sapiens]
gi|313883560|gb|ADR83266.1| like-glycosyltransferase (LARGE), transcript variant 2 [synthetic
construct]
gi|410216402|gb|JAA05420.1| like-glycosyltransferase [Pan troglodytes]
gi|410216404|gb|JAA05421.1| like-glycosyltransferase [Pan troglodytes]
gi|410267726|gb|JAA21829.1| like-glycosyltransferase [Pan troglodytes]
gi|410267728|gb|JAA21830.1| like-glycosyltransferase [Pan troglodytes]
gi|410291548|gb|JAA24374.1| like-glycosyltransferase [Pan troglodytes]
gi|410291550|gb|JAA24375.1| like-glycosyltransferase [Pan troglodytes]
gi|410331887|gb|JAA34890.1| like-glycosyltransferase [Pan troglodytes]
gi|410331889|gb|JAA34891.1| like-glycosyltransferase [Pan troglodytes]
Length = 756
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|403283120|ref|XP_003932975.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1
[Saimiri boliviensis boliviensis]
gi|403283122|ref|XP_003932976.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2
[Saimiri boliviensis boliviensis]
gi|403283124|ref|XP_003932977.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3
[Saimiri boliviensis boliviensis]
Length = 756
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 681
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|386781137|ref|NP_001247834.1| glycosyltransferase-like protein LARGE1 [Macaca mulatta]
gi|402884044|ref|XP_003905503.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1 [Papio
anubis]
gi|402884046|ref|XP_003905504.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2 [Papio
anubis]
gi|402884048|ref|XP_003905505.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3 [Papio
anubis]
gi|355784935|gb|EHH65786.1| hypothetical protein EGM_02623 [Macaca fascicularis]
gi|380817940|gb|AFE80844.1| glycosyltransferase-like protein LARGE1 [Macaca mulatta]
Length = 756
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|296191742|ref|XP_002743759.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1
[Callithrix jacchus]
Length = 756
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 681
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|395819892|ref|XP_003783312.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Otolemur
garnettii]
Length = 756
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPVADNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMTHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|126339665|ref|XP_001366396.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
[Monodelphis domestica]
Length = 756
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPTSDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GL YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLCEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|427792509|gb|JAA61706.1| Putative glycosyltransferase-like 1b, partial [Rhipicephalus
pulchellus]
Length = 699
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/530 (60%), Positives = 387/530 (73%), Gaps = 67/530 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FYLAD++ +VSWIPNKHYSGVYGL+KL LPK LP+TL K IVLDTD+ FA+DI
Sbjct: 131 PAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPKALPDTLDKVIVLDTDITFASDI 190
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW +F + +Q IGLVENQSDWYLGKLWKNH+PWPALGRGFNTGVILL L +LR+ +
Sbjct: 191 AELWRIFHKYTSKQAIGLVENQSDWYLGKLWKNHRPWPALGRGFNTGVILLRLQRLRERN 250
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++IAEK L++ +TSLADQDIFNA++ +H L++ LPCQWNVQLSDNT S ELCY
Sbjct: 251 WMQMWKLIAEKELISMWFTSLADQDIFNAVLKQHTELLFRLPCQWNVQLSDNTLS-ELCY 309
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
+E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLLRREL GCN T ++
Sbjct: 310 SEVQELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLRRELIGCNGTTSSSAGVH 369
Query: 403 ----QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
Q L+ L+E+DPCYD RR+++ RTHLY++EY+YE S +GNDVTLVAQLSMDRL
Sbjct: 370 ASHVQDALNLLDEDDPCYDFRRARVAQHRTHLYYIEYDYEPSPEGNDVTLVAQLSMDRL- 428
Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
QMVE + KHWEGPISL LYMSD+E QQF
Sbjct: 429 --------------------------------QMVEAVCKHWEGPISLALYMSDSEVQQF 456
Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
LSY +SE L+SR+N+GYH+VYK+G FYP+N LRNVAL QV+TP+VFL DIDFLPM GLY
Sbjct: 457 LSYTLSSEILQSRKNVGYHIVYKDGTFYPVNMLRNVALQQVNTPFVFLTDIDFLPMHGLY 516
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
YLK S+ ++ + +K L+VPAFETQRYR +FP +
Sbjct: 517 EYLKRSVFALGL--ESSRKALIVPAFETQRYRLSFPKTKAELLSMLDMGTLFTFRYHVWQ 574
Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTN+++W ATTPY + W P+FEPY+V RD+P YD RFVGFGWNK
Sbjct: 575 KGHAPTNYAKWRTATTPYLVNWEPNFEPYVVVKRDVPEYDKRFVGFGWNK 624
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 122/171 (71%), Positives = 147/171 (85%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
+E P +P CEV+QVAIVCAGYN++RS++TL+KSILFYRKNPLHFH I+D VA +LQT
Sbjct: 64 GGNELPEVPKCEVLQVAIVCAGYNASRSVITLVKSILFYRKNPLHFHFISDAVAHLVLQT 123
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF +W++P VEVSFYLAD++ +VSWIPNKHYSGVYGL+KL LPK LP+TL K IVLDTD
Sbjct: 124 LFRSWNMPAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPKALPDTLDKVIVLDTD 183
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+ FA+DIA+LW +F + +Q IGLVENQSDWYLGKLWKNH+PWPALGRGF
Sbjct: 184 ITFASDIAELWRIFHKYTSKQAIGLVENQSDWYLGKLWKNHRPWPALGRGF 234
>gi|395538357|ref|XP_003771149.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Sarcophilus
harrisii]
Length = 756
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP + + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANMERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPTSDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GL YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLCEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP + + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANMERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|327272296|ref|XP_003220921.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Anolis
carolinensis]
Length = 756
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE + D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAPDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L+VPAFET RYR +FP S H
Sbjct: 577 RKSVVQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRQDCPEYDRRFVGFGWNK 681
Score = 254 bits (650), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 128 QQPAVEKCEAIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V + FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|344266604|ref|XP_003405370.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Loxodonta
africana]
Length = 756
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPADLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A S+ L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSDVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 112/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI DT+A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADTIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPADLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|291389926|ref|XP_002711463.1| PREDICTED: like-glycosyltransferase [Oryctolagus cuniculus]
Length = 756
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQALGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ + TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHIGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQALGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|432094353|gb|ELK25930.1| Glycosyltransferase-like protein LARGE1 [Myotis davidii]
Length = 730
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 166 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 225
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 226 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 285
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 286 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 344
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 345 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 404
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 405 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADNTDVTLVAQLSMDR----- 458
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 459 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 490
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 491 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 550
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 551 RKSVIQLDL--ANSKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 608
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 609 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 655
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL +LF
Sbjct: 102 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILASLFQ 161
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 162 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 221
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 222 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 269
>gi|221040868|dbj|BAH12135.1| unnamed protein product [Homo sapiens]
Length = 688
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYS +YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSEIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 60 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYS +YGL+KL L K LP L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSEIYGLMKLVLTKTLPANLERVIVLDTDITF 179
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227
>gi|410965461|ref|XP_003989266.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Felis catus]
Length = 756
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ASTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|417404394|gb|JAA48952.1| Putative glycosyltransferase-like protein large1 [Desmodus
rotundus]
Length = 756
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKTEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L RRN+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRRNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKTEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|345777120|ref|XP_531751.3| PREDICTED: like-glycosyltransferase [Canis lupus familiaris]
Length = 756
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYLFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|119580447|gb|EAW60043.1| like-glycosyltransferase, isoform CRA_b [Homo sapiens]
Length = 693
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 129 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 188
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 189 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 248
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 249 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 307
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 308 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 367
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDRLQM+E
Sbjct: 368 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLE 426
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+ KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 427 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 453
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 454 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 513
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 514 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 571
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 572 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 618
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 93/151 (61%), Positives = 113/151 (74%)
Query: 22 AGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSV 81
A + +T V +S R+NPLHFHLI D++A IL TLF TW VP V V FY AD +
Sbjct: 82 ASWQATAPSVGSSRSWRNARRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADEL 141
Query: 82 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +
Sbjct: 142 KSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQ 201
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 202 QVLGLVENQSDWYLGNLWKNHRPWPALGRGY 232
>gi|440903966|gb|ELR54547.1| Glycosyltransferase-like protein LARGE1, partial [Bos grunniens
mutus]
Length = 721
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 276
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 277 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 335
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 336 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 395
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 396 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 449
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 450 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 481
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L R N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 482 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 541
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 542 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 599
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 600 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 646
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 93 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 260
>gi|351708238|gb|EHB11157.1| Glycosyltransferase-like protein LARGE1 [Heterocephalus glaber]
Length = 759
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/530 (58%), Positives = 377/530 (71%), Gaps = 68/530 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F R R +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHRFRGQQALGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-------NLTQ 398
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + +
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEAGVADASN 430
Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
LQ+ +LS L+E+D CY+ RR + T RTHLYFL YE+E ++DG DVTLVAQLSMDR
Sbjct: 431 KDLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEHEPATDGTDVTLVAQLSMDR-- 487
Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
LQM+E + KHWEGPISL LY+SDAEAQQF
Sbjct: 488 -------------------------------LQMLEAICKHWEGPISLALYLSDAEAQQF 516
Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
L YA SE L R N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY
Sbjct: 517 LRYAQGSEVLMGRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLY 576
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
YL+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 577 EYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWT 634
Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY+++W DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 KGHAPTNFAKWRTATTPYRVQWEADFEPYVVLRRDCPEYDRRFVGFGWNK 684
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL LF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILAMLFH 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY A+ + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F R R +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHRFRGQQALGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|52138661|ref|NP_001004383.1| glycosyltransferase-like protein LARGE1 [Gallus gallus]
gi|82083914|sp|Q66PG3.1|LARGE_CHICK RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
Full=Acetylglucosaminyltransferase-like 1A
gi|51872293|gb|AAU12250.1| glycosyltransferase-like 1A [Gallus gallus]
Length = 756
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS +E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSEPDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWE PISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEDPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK L++PAFET RYR +FP S H
Sbjct: 577 RKSVTQLDL--ANTKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 681
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V + FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|432850590|ref|XP_004066824.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Oryzias
latipes]
Length = 756
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/526 (60%), Positives = 373/526 (70%), Gaps = 65/526 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPPDLNKVIVLDTDITFATDI 245
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW +F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR IS
Sbjct: 246 AELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIS 305
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQ- 402
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + ++Q
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASPESIQL 424
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L L+E+D CYD RR +LT R HL FL+YEY + D DVTLVAQLSMDR
Sbjct: 425 QGALEELDEDDQCYDFRRERLTVHRVHLEFLQYEYTPTED--DVTLVAQLSMDR------ 476
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
LQM+E + KHWEGPISL LYMSDAEAQQFL YA
Sbjct: 477 ---------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRYA 509
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
SE L++R+N+GYH+VYKEG FYP+N LRNVAL +TPY FL D+DFLPM+GLY YL+
Sbjct: 510 QASEVLKNRKNVGYHIVYKEGQFYPVNLLRNVALKNANTPYTFLTDVDFLPMYGLYDYLR 569
Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
S+ +DM KK LVVPAFET RYR +FP S HA
Sbjct: 570 RSVVHLDM--RNSKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGHA 627
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PTN+++W ATTPY++EW PDFEPY+V R P YD RFVGFGWNK
Sbjct: 628 PTNYAKWRTATTPYRVEWEPDFEPYVVVRRTCPEYDQRFVGFGWNK 673
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 114/166 (68%), Positives = 135/166 (81%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P I CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITDTVA IL +LF +W
Sbjct: 124 PTIQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDTVANRILSSLFRSW 183
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V+VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPPDLNKVIVLDTDITFAT 243
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LW +F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289
>gi|329663446|ref|NP_001192517.1| glycosyltransferase-like protein LARGE1 [Bos taurus]
gi|296487413|tpg|DAA29526.1| TPA: like-glycosyltransferase [Bos taurus]
Length = 756
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L R N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|194226762|ref|XP_001498804.2| PREDICTED: like-glycosyltransferase [Equus caballus]
Length = 756
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPPNLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHKLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDHTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPPNLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|301772300|ref|XP_002921570.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Ailuropoda
melanoleuca]
Length = 756
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY +EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYHVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFR 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY A+ + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|71682499|gb|AAI00400.1| Like-glycosyltransferase [Mus musculus]
Length = 756
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +G VENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGSVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ + L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAEREHMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E++ CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDNLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +G VENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGSVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|348553098|ref|XP_003462364.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Cavia
porcellus]
Length = 759
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/530 (58%), Positives = 375/530 (70%), Gaps = 68/530 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-------NLTQ 398
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + +
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEAGVADASS 430
Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
LQ+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 KDLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR-- 487
Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
LQM+E + KHWEGPISL LY+SDAEAQQF
Sbjct: 488 -------------------------------LQMLEAICKHWEGPISLALYLSDAEAQQF 516
Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
L YA SE L R N+GYH+VYKEG FYP+N LRNVAL + TPY+FL DIDFLPM+GLY
Sbjct: 517 LRYAQGSEVLMGRHNVGYHIVYKEGQFYPVNLLRNVALKHIRTPYMFLSDIDFLPMYGLY 576
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
YL+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 577 EYLRKSVVQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWT 634
Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 KGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 684
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFH 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|354485819|ref|XP_003505079.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Cricetulus
griseus]
gi|344238050|gb|EGV94153.1| Glycosyltransferase-like protein LARGE1 [Cricetulus griseus]
Length = 756
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/527 (58%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPATDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W AT PY++EW DFEPY+V RD YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATMPYRVEWEADFEPYVVVRRDCSEYDRRFVGFGWNK 681
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|390349971|ref|XP_781563.3| PREDICTED: glycosyltransferase-like protein LARGE1-like
[Strongylocentrotus purpuratus]
Length = 575
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/530 (60%), Positives = 376/530 (70%), Gaps = 66/530 (12%)
Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
K W P + FYLA++ V WIPNKHYSG+YGL+KL L ++LP + KTIVLDTD+
Sbjct: 11 KTWDVPGVKVSFYLAENYKSHVEWIPNKHYSGLYGLMKLLLIEILPSNMEKTIVLDTDIT 70
Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
FA+DIA+LW LF R+ +QT+ LVENQSDWYLG LWK HKPWPALGRGFNTGVILLDL K
Sbjct: 71 FASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGFNTGVILLDLQK 130
Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
LRD+ W W + AEK L+++ T+LADQDI NA+I +HPYLVYTLPC WNVQLSDNTRS
Sbjct: 131 LRDLKWMQMWLLTAEKELMSQFSTALADQDIINAVIKQHPYLVYTLPCAWNVQLSDNTRS 190
Query: 341 DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQ 398
ELCY E+TDLK IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC
Sbjct: 191 -ELCYKEVTDLKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCTDGKAN 249
Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
TL + +LS+L+E+D CY+ RR +L RTHLY+L+Y+ D DVTLVAQLSMDRLQ
Sbjct: 250 NTLHE-KLSSLDEDDQCYEFRREQLLMHRTHLYYLDYQLPKFLD-TDVTLVAQLSMDRLQ 307
Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
M+E + KHWEGPISL LYMSDAEAQQF
Sbjct: 308 MLEA---------------------------------ICKHWEGPISLALYMSDAEAQQF 334
Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
+ YA +SE L +R+NIGYHVVYK+G FYP+N LRN A++QV TPY+FL DIDFLPM+GLY
Sbjct: 335 IRYALSSETLMARKNIGYHVVYKDGQFYPVNYLRNTAMSQVKTPYMFLSDIDFLPMYGLY 394
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
YL+ +I +DM K L+VPAFET RYR FP S
Sbjct: 395 DYLRKAIGMLDM--ATSNKALIVPAFETLRYRLNFPKSKADLLHMLDMGTLFTFRYHVWQ 452
Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF+RW ATTPYQ++W PDFEPY+V +D P+YD RF+GFGWNK
Sbjct: 453 KGHAPTNFARWRTATTPYQVQWEPDFEPYVVVKKDCPKYDPRFLGFGWNK 502
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 79/115 (68%), Positives = 93/115 (80%)
Query: 58 ILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIV 117
IL TLF TW VP V+VSFYLA++ V WIPNKHYSG+YGL+KL L ++LP + KTIV
Sbjct: 5 ILSTLFKTWDVPGVKVSFYLAENYKSHVEWIPNKHYSGLYGLMKLLLIEILPSNMEKTIV 64
Query: 118 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
LDTD+ FA+DIA+LW LF R+ +QT+ LVENQSDWYLG LWK HKPWPALGRGF
Sbjct: 65 LDTDITFASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGF 119
>gi|298684047|gb|ADI96203.1| LARGE-1 [Rattus losea]
Length = 671
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 311/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LV LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVXQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YE+E S+D DVTLVAQLSMDR
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADHTDVTLVAQLSMDR----- 420
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLXY 452
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR+N+GYH+VYKEG Y +N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQXYXVNFLRNVAMKXISTPYMFLSDIDFLPMYGLYEYL 512
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|432941969|ref|XP_004082928.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
[Oryzias latipes]
Length = 756
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 319/527 (60%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L K IVLDTD+ FATDI
Sbjct: 192 PAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 371 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ L L+E+DPCY+ RR + T RTHLYFL YEYE SSDG DVTLVAQLSMDRLQM+E
Sbjct: 431 QKT-LMELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPSSDGTDVTLVAQLSMDRLQMLE 489
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+ KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 490 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRQVHTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR +FP S H
Sbjct: 577 RKSVVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W AT PY+++W PDFEPY++ RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATMPYRVQWEPDFEPYVMVRRDSPEYDRRFVGFGWNK 681
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 112/168 (66%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+RKNPLHFH ITD++A IL LF
Sbjct: 128 QQPTVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRKNPLHFHFITDSIAQKILSALFH 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V+FY AD + +VSWIPNKHYSG+YGL+KL L K LP L K IVLDTD+ F
Sbjct: 188 TWMVPAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 295
>gi|443688665|gb|ELT91285.1| hypothetical protein CAPTEDRAFT_168890 [Capitella teleta]
Length = 710
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 312/523 (59%), Positives = 372/523 (71%), Gaps = 61/523 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY AD + VSWIPNKHYSGVYGL+KLTLP LP+TL K IVLDTD+ FA+DI
Sbjct: 153 PTVTVTFYPADDLKSVVSWIPNKHYSGVYGLMKLTLPTTLPDTLNKVIVLDTDITFASDI 212
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A LW LF + +Q +GLVENQSDWYLGK+WKNH+PWPALGRGFNTGVILLDL KLR I+
Sbjct: 213 ADLWNLFRKFTGKQAMGLVENQSDWYLGKIWKNHRPWPALGRGFNTGVILLDLEKLRAIN 272
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+IAE+ L+T L T+LADQDIFNA+I +HP LVYTLPCQWNVQLSDNTRS++ CY
Sbjct: 273 WIQAWRLIAERELMTMLSTALADQDIFNAVIKQHPMLVYTLPCQWNVQLSDNTRSEQ-CY 331
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
T+++DLKIIHWNSP KL+VKNKH+E FR+LYLTFL+YDGNLLRRELFGC +
Sbjct: 332 TDVSDLKIIHWNSPHKLQVKNKHIEVFRSLYLTFLQYDGNLLRRELFGCGGESKPSLSNQ 391
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+ NE+D CYD RR + RTHLY+L++ + D NDVTLVAQLSMDRL
Sbjct: 392 ILEANEDDECYDFRRERNLVHRTHLYYLDFNFTPDPD-NDVTLVAQLSMDRL-------- 442
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
QM+EML +HWEGPISL LYMSD EAQQFL YA +S
Sbjct: 443 -------------------------QMLEMLCQHWEGPISLALYMSDTEAQQFLQYALSS 477
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L SR+NIG+H+VYK+G FYP+N LRNVAL+Q ++PYV L DIDFLPMFGLY YL+ ++
Sbjct: 478 EILMSRKNIGFHIVYKDGQFYPVNYLRNVALSQATSPYVLLSDIDFLPMFGLYEYLRKAV 537
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+ KK LV+PAFETQRYR FP S H+PTN
Sbjct: 538 SMTSVMQE--KKALVIPAFETQRYRLTFPTSKAELLTMLDMGTLFTFRYHVWPQGHSPTN 595
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+++W ATTPY++EW DFEPY+V R++P YDTRFVGFGWNK
Sbjct: 596 YAKWRTATTPYEVEWDQDFEPYVVVPREIPMYDTRFVGFGWNK 638
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 116/172 (67%), Positives = 140/172 (81%)
Query: 1 MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+ ++ P +P CEVI VAIVCA YNS+R +VTL+KSILFY K+PLHFH I+DT+A IL+
Sbjct: 85 LKCEQVPEVPKCEVIHVAIVCAQYNSSRDVVTLVKSILFYGKHPLHFHFISDTIAQRILE 144
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
T+F TW+VP V V+FY AD + VSWIPNKHYSGVYGL+KLTLP LP+TL K IVLDT
Sbjct: 145 TVFRTWAVPTVTVTFYPADDLKSVVSWIPNKHYSGVYGLMKLTLPTTLPDTLNKVIVLDT 204
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
D+ FA+DIA LW LF + +Q +GLVENQSDWYLGK+WKNH+PWPALGRGF
Sbjct: 205 DITFASDIADLWNLFRKFTGKQAMGLVENQSDWYLGKIWKNHRPWPALGRGF 256
>gi|432941971|ref|XP_004082929.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
[Oryzias latipes]
Length = 755
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 319/527 (60%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L K IVLDTD+ FATDI
Sbjct: 191 PAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITFATDI 250
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 251 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 429
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ L L+E+DPCY+ RR + T RTHLYFL YEYE SSDG DVTLVAQLSMDRLQM+E
Sbjct: 430 QKT-LMELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPSSDGTDVTLVAQLSMDRLQMLE 488
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+ KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 489 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 515
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV TPY+FL DIDFLPM+GLY YL
Sbjct: 516 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRQVHTPYMFLSDIDFLPMYGLYEYL 575
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR +FP S H
Sbjct: 576 RKSVVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 633
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W AT PY+++W PDFEPY++ RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATMPYRVQWEPDFEPYVMVRRDSPEYDRRFVGFGWNK 680
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 112/168 (66%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+RKNPLHFH ITD++A IL LF
Sbjct: 127 QQPTVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRKNPLHFHFITDSIAQKILSALFH 186
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V+FY AD + +VSWIPNKHYSG+YGL+KL L K LP L K IVLDTD+ F
Sbjct: 187 TWMVPAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITF 246
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 247 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294
>gi|410918397|ref|XP_003972672.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Takifugu
rubripes]
Length = 755
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 316/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 191 PAVKVNFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 250
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 251 AELWAVFHKFRGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDHNSENL 429
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ LS L+E+DPCY+ RR + T RTHLYFL YEYEA+SD DVTLVAQLSMDR
Sbjct: 430 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYEAASDNTDVTLVAQLSMDR----- 483
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 484 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 515
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A S+ L SR N+GYH+VYKEG FYP+N LRNVA+ QV+TPY+FL DIDFLPM+GLY YL
Sbjct: 516 AQGSDVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMQQVNTPYMFLSDIDFLPMYGLYEYL 575
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR ++P S H
Sbjct: 576 RKSVVQLDM--ANAKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 633
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATT Y+++W DFEPY++ RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATTAYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 680
Score = 258 bits (659), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 139/168 (82%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A +IL +LF
Sbjct: 127 QQPAVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQHILSSLFH 186
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V+V+FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 187 TWMVPAVKVNFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITF 246
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 247 ATDIAELWAVFHKFRGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294
>gi|426225215|ref|XP_004006763.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE1 [Ovis aries]
Length = 758
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 312/532 (58%), Positives = 375/532 (70%), Gaps = 70/532 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L R N+ YH+VYKEG FYP+N LRNVA+ VSTPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVSTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 K-----SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------------- 616
+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 577 RXGRAGKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHV 634
Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 635 WTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 686
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|221043764|dbj|BAH13559.1| unnamed protein product [Homo sapiens]
Length = 555
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 307/512 (59%), Positives = 369/512 (72%), Gaps = 65/512 (12%)
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q
Sbjct: 6 EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65
Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
+GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W WR+ AE+ L+
Sbjct: 66 LGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMG 125
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWNSPK
Sbjct: 126 MLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPK 184
Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEEDPCY 416
KL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D CY
Sbjct: 185 KLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDDLCY 243
Query: 417 DLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDG 476
+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDRLQM+E
Sbjct: 244 EFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEA-------------- 289
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
+ KHWEGPISL LY+SDAEAQQFL YA SE L SR N+GY
Sbjct: 290 -------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGY 330
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
H+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+ +D+ K
Sbjct: 331 HIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--ANTK 388
Query: 597 KVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPY 632
K ++VPAFET RYR +FP S HAPTNF++W ATTPY
Sbjct: 389 KAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTPY 448
Query: 633 QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 449 RVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 480
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 63/89 (70%), Positives = 76/89 (85%)
Query: 84 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q
Sbjct: 6 EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65
Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 66 LGLVENQSDWYLGNLWKNHRPWPALGRGY 94
>gi|348523141|ref|XP_003449082.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
[Oreochromis niloticus]
Length = 756
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 371 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ LS L+E+DPCY+ RR + T RTHLYFL YEY+ SSD DVTLVAQLSMDR
Sbjct: 431 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYKPSSDNTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ V+TPY+FL DIDFLPM+ LY YL
Sbjct: 517 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRHVNTPYMFLSDIDFLPMYSLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR ++P S H
Sbjct: 577 RKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 634
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY+++W DFEPY++ RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 681
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 114/173 (65%), Positives = 138/173 (79%), Gaps = 2/173 (1%)
Query: 2 SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SSD PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A IL
Sbjct: 123 SSDCVRHPVVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 182
Query: 60 QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
+LF TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLD
Sbjct: 183 SSLFHTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLD 242
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TD+ FATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 243 TDITFATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 295
>gi|348523139|ref|XP_003449081.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
[Oreochromis niloticus]
Length = 755
Score = 630 bits (1624), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 191 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 250
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 251 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 429
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ LS L+E+DPCY+ RR + T RTHLYFL YEY+ SSD DVTLVAQLSMDR
Sbjct: 430 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYKPSSDNTDVTLVAQLSMDR----- 483
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 484 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 515
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ V+TPY+FL DIDFLPM+ LY YL
Sbjct: 516 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRHVNTPYMFLSDIDFLPMYSLYEYL 575
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR ++P S H
Sbjct: 576 RKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 633
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY+++W DFEPY++ RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATTPYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 680
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 114/173 (65%), Positives = 138/173 (79%), Gaps = 2/173 (1%)
Query: 2 SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SSD PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A IL
Sbjct: 122 SSDCVRHPVVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 181
Query: 60 QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
+LF TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLD
Sbjct: 182 SSLFHTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLD 241
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TD+ FATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 242 TDITFATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294
>gi|291238057|ref|XP_002738950.1| PREDICTED: like-glycosyltransferase-like [Saccoglossus kowalevskii]
Length = 749
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 303/512 (59%), Positives = 366/512 (71%), Gaps = 62/512 (12%)
Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
V+DVSWIPNKHYSGVYGL+KL L KVLP+ + K IVLDTDV FATDIA+LW +F +L+ +
Sbjct: 201 VDDVSWIPNKHYSGVYGLMKLLLTKVLPKDMDKVIVLDTDVTFATDIAELWKIFRKLKNK 260
Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
+ IGLVENQSDWYLG LW+ HKPWPALGRGFNTGVILLD+ KLR + W WR+ AE+ L
Sbjct: 261 KAIGLVENQSDWYLGTLWRKHKPWPALGRGFNTGVILLDVKKLRKMKWMQMWRLTAEREL 320
Query: 299 LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNS 358
++ L TSLADQDIFNA+I +HP +V+ LPC WNVQLS+NTRS E CYTE++D+K+IHWNS
Sbjct: 321 MSMLSTSLADQDIFNAVIKQHPDIVHQLPCAWNVQLSENTRS-ETCYTEISDIKVIHWNS 379
Query: 359 PKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA--ELSNLNEEDPCY 416
P+KL+VKN H+EFFRNLYLTFLEYDGNLL+RELFGC Q ++ E++ L+E+D CY
Sbjct: 380 PQKLRVKNNHIEFFRNLYLTFLEYDGNLLKRELFGCQAYQHKPKEVVKEITELDEDDQCY 439
Query: 417 DLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDG 476
+ RR KL + R HLY+L+Y+Y DG DVTLVAQLSMDRLQM+E N
Sbjct: 440 EFRREKLLTHRIHLYYLDYQYIQDLDGTDVTLVAQLSMDRLQMIESICN----------- 488
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
HW GPIS LYMSDAE QQFL YA +S+ L R+N+GY
Sbjct: 489 ----------------------HWLGPISFALYMSDAEVQQFLRYALSSDILMKRKNVGY 526
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
H+VYK+G FYP+N LRNVA+ QV+TPYV+L DIDFLPM+GLY YLK +I MDM
Sbjct: 527 HIVYKDGQFYPVNFLRNVAMKQVNTPYVYLSDIDFLPMYGLYEYLKKAIGMMDM--SKAN 584
Query: 597 KVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPY 632
K L+VPAFET RYR FP S H+PTNF++W ATTPY
Sbjct: 585 KALIVPAFETLRYRLTFPKSKAELLTMLDMGSLYTFRYHVWQKGHSPTNFAKWRTATTPY 644
Query: 633 QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+++W DFEPY+V +D P YDTRFVGFGWNK
Sbjct: 645 KVQWESDFEPYVVVKKDSPLYDTRFVGFGWNK 676
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 110/183 (60%), Positives = 133/183 (72%), Gaps = 14/183 (7%)
Query: 1 MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+ ++ E +P CEVI VAIVCAGYN++R ++TLIKS+LFYRKNP+H H + D L
Sbjct: 112 ICTNVEENVPKCEVIHVAIVCAGYNASREVITLIKSVLFYRKNPIHLHFLADKPGQLTLS 171
Query: 61 TLFSTW-----------SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP 109
TLF TW S+ Q SF L V+DVSWIPNKHYSGVYGL+KL L KVLP
Sbjct: 172 TLFDTWDITGGMLIGESSLAQPIKSFNLT---VDDVSWIPNKHYSGVYGLMKLLLTKVLP 228
Query: 110 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALG 169
+ + K IVLDTDV FATDIA+LW +F +L+ ++ IGLVENQSDWYLG LW+ HKPWPALG
Sbjct: 229 KDMDKVIVLDTDVTFATDIAELWKIFRKLKNKKAIGLVENQSDWYLGTLWRKHKPWPALG 288
Query: 170 RGF 172
RGF
Sbjct: 289 RGF 291
>gi|52219122|ref|NP_001004537.1| glycosyltransferase-like protein LARGE1 [Danio rerio]
gi|82083911|sp|Q66PG2.1|LARGE_DANRE RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
Full=Acetylglucosaminyltransferase-like 1A
gi|51872295|gb|AAU12251.1| glycosyltransferase-like 1A [Danio rerio]
gi|190336617|gb|AAI62096.1| Like-glycosyltransferase [Danio rerio]
Length = 757
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG++GL+KL L K LP L K IVLDTD+ FATDI
Sbjct: 193 PAVHVNFYDADELKSEVSWIPNKHYSGIHGLMKLVLTKTLPSDLEKVIVLDTDITFATDI 252
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW +F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 253 AELWVVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLK 312
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 313 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCYWNVQLSDHTRS-EKCY 371
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 372 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDHNSENL 431
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ LS L+E+DPCY+ RR + T RTH+YFL YEYE + D DVTLVAQLSMDR
Sbjct: 432 QKT-LSELDEDDPCYEFRRERFTVHRTHVYFLHYEYEPTVDNTDVTLVAQLSMDR----- 485
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 486 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 517
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV+TPY+FL DIDFLPM+GLY YL
Sbjct: 518 AQGSEVLMSRSNVGYHIVYKEGQFYPVNLLRNVAMGQVNTPYMFLSDIDFLPMYGLYEYL 577
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +DM KK LVVPAFET RYR +FP S H
Sbjct: 578 RKSVVQLDM--GNTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 635
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+F++W ATTPY+++W DFEPY++ R+ P YD RFVGFGWNK
Sbjct: 636 APTDFAKWRTATTPYRVQWEADFEPYVMVRRESPEYDRRFVGFGWNK 682
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 109/167 (65%), Positives = 136/167 (81%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
+PV+ CE I +AIVCAGYN++R +VTL+KS+LF+R+NPLHFH+ITD++A IL LF T
Sbjct: 130 QPVVKKCETIHIAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHIITDSIARKILADLFHT 189
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W VP V V+FY AD + +VSWIPNKHYSG++GL+KL L K LP L K IVLDTD+ FA
Sbjct: 190 WMVPAVHVNFYDADELKSEVSWIPNKHYSGIHGLMKLVLTKTLPSDLEKVIVLDTDITFA 249
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TDIA+LW +F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 250 TDIAELWVVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 296
>gi|52138697|ref|NP_001004404.1| glycosyltransferase-like protein LARGE2 [Gallus gallus]
gi|82083916|sp|Q66PG4.1|LARG2_CHICK RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
Full=Glycosyltransferase-like 1B
gi|51872291|gb|AAU12249.1| glycosyltransferase-like 1B [Gallus gallus]
Length = 739
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 317/527 (60%), Positives = 373/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 175 PSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR +
Sbjct: 235 AELWAVFGKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDRLRRLG 294
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P LVY LPC WNVQLSD+TRS ELCY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPALVYRLPCFWNVQLSDHTRS-ELCY 353
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 354 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSPPSDQL 413
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L L+E+DPCYD RR LT R HL+FL+YE+ A + DVTLVAQLSMDR
Sbjct: 414 QQA-LEELDEDDPCYDFRRQHLTQHRVHLFFLQYEFLALPNPTDVTLVAQLSMDR----- 467
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHW GPISL LYMSDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWAGPISLALYMSDAEAQQFLRY 499
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L +RRN+ YH+VYKEG FYPIN LRNVAL TPYVFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSARRNVAYHIVYKEGQFYPINLLRNVALANTQTPYVFLTDIDFLPMYGLYDYL 559
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI+ +++ K L+VPAFET YR FP S H
Sbjct: 560 RNSIQQLELPHR--KAALIVPAFETLHYRLTFPKSKAELLSMLDMGSLYTFRYHVWPKGH 617
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT++++W AT PY++ W PDFEPY+V RD P+YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYRVAWQPDFEPYVVVRRDCPKYDQRFVGFGWNK 664
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 116/171 (67%), Positives = 142/171 (83%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
+ E+ ++ CE++ VAIVCAG+N++R +VTL+KSILF+RKNPLHFH ITD+VA ILQT
Sbjct: 108 ACPEQRMVRKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFITDSVAHQILQT 167
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF +W VP + VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD
Sbjct: 168 LFQSWMVPSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTD 227
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+ FATDIA+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 228 ITFATDIAELWAVFGKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278
>gi|327259715|ref|XP_003214681.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Anolis
carolinensis]
Length = 739
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/527 (59%), Positives = 370/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + D+SWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 175 PSVHISFYDADDLKPDISWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I
Sbjct: 235 AELWAVFGKFSDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLERLRRIG 294
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I + P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQSPALVYQLPCFWNVQLSDHTRSEQ-CY 353
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC +LT L
Sbjct: 354 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSLTSGQL 413
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L L+E+DPCYD RR L R HL+FL+Y++ AS+D DVTL+AQLSMDR
Sbjct: 414 QQA-LEELDEDDPCYDFRRQSLIQHRVHLFFLQYDFPASTDAADVTLIAQLSMDR----- 467
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHW GPISL LY+SDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWTGPISLALYLSDAEAQQFLRY 499
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L +RRN+ YH+VYKEG FYP+N LRNVAL T +VFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSNRRNVAYHIVYKEGQFYPVNFLRNVALKNAQTSHVFLTDIDFLPMYGLYDYL 559
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++S+ +++ K +VPAFET YR FP S H
Sbjct: 560 RTSVVQLELPER--KAAFIVPAFETLHYRLTFPKSKAELVSMLDMGSLYTFRYHVWPQGH 617
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT++++W AT PYQ+EW P FEPY+V RD P YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYQVEWQPHFEPYVVVRRDCPLYDQRFVGFGWNK 664
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 117/168 (69%), Positives = 141/168 (83%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++ ++ CE++ VAIVCAGYN++R +VTL+KSILF+RKNPLHFHLITD+VA ILQTLF
Sbjct: 111 KQRMVQKCELLHVAIVCAGYNASRDVVTLVKSILFHRKNPLHFHLITDSVARQILQTLFQ 170
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
+W VP V +SFY AD + D+SWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ F
Sbjct: 171 SWMVPSVHISFYDADDLKPDISWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITF 230
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 231 ATDIAELWAVFGKFSDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278
>gi|47209488|emb|CAF89604.1| unnamed protein product [Tetraodon nigroviridis]
Length = 571
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 313/532 (58%), Positives = 367/532 (68%), Gaps = 86/532 (16%)
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
+VSWIPNKHYSG+YGL+KLTLPK LP L++ +VLDTD+ FATDIA+LW +F + +Q
Sbjct: 4 EVSWIPNKHYSGIYGLMKLTLPKALPAHLSRVVVLDTDITFATDIAELWGVFRKFTDKQV 63
Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + W WR+ AEK L++
Sbjct: 64 IGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLHLERLRSLGWEQMWRLTAEKELMS 123
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CYTE++DLK+IHWNSPK
Sbjct: 124 MLSTSLADQDIFNAFIKQNPALVHQLPCFWNVQLSDHTRSEQ-CYTEVSDLKVIHWNSPK 182
Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNL---TQTTLQQAELSNLNEEDPCYD 417
KL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC +T QA L L+E+D CYD
Sbjct: 183 KLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSPPGPETVQLQAALEELDEDDQCYD 242
Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
RR +LT+ R HL+FL+Y+Y + D D+TLVAQLSMDRL
Sbjct: 243 FRRERLTAHRLHLFFLQYKYTPTED--DITLVAQLSMDRL-------------------- 280
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
QM+E + KHW GPISL LYMSDAEAQQFL YA SE L++RRN+GYH
Sbjct: 281 -------------QMLEAICKHWAGPISLALYMSDAEAQQFLRYAQASEVLKNRRNVGYH 327
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
VVYKEG FYP+N LRNVAL TPYVFL D+DFLPM+GLY YL+ + MDM H KK
Sbjct: 328 VVYKEGQFYPVNLLRNVALRNARTPYVFLTDVDFLPMYGLYEYLRKLVLQMDM-AH-TKK 385
Query: 598 VLVVPAFETQRYRTAFPAS----------------------------------------- 616
LVVPAFET RYR +FP S
Sbjct: 386 ALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYRSFTLGPGGLQSDLPFCCCRYHV 445
Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTN+++W ATTPY++EW PDFEPY+V RD P YD RFVGFGWNK
Sbjct: 446 WPKGHAPTNYAKWRTATTPYRVEWEPDFEPYVVVRRDCPEYDQRFVGFGWNK 497
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 66/89 (74%), Positives = 77/89 (86%)
Query: 84 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
+VSWIPNKHYSG+YGL+KLTLPK LP L++ +VLDTD+ FATDIA+LW +F + +Q
Sbjct: 4 EVSWIPNKHYSGIYGLMKLTLPKALPAHLSRVVVLDTDITFATDIAELWGVFRKFTDKQV 63
Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 64 IGLVENQSDWYLGNLWKNHKPWPALGRGF 92
>gi|224051108|ref|XP_002200170.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Taeniopygia
guttata]
Length = 739
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 314/527 (59%), Positives = 372/527 (70%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD+ FATDI
Sbjct: 175 PSVHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR +
Sbjct: 235 AELWAVFKKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDRLRRLG 294
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I + P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQDPSLVYRLPCFWNVQLSDHTRSEQ-CY 353
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
TEL+DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC + + L
Sbjct: 354 TELSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSPPRDQL 413
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L L+E+DPCYD R+ L R HL+FL+YE+ A + DVTLVAQLSMDR
Sbjct: 414 QQA-LEELDEDDPCYDFRQQHLMQHRIHLFFLQYEFLALPNPTDVTLVAQLSMDR----- 467
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHW GPISL LYMSDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWAGPISLALYMSDAEAQQFLRY 499
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L +RRN+ YH+VYKEG FYPIN LRNVAL TPYVFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSARRNVAYHIVYKEGQFYPINLLRNVALANTQTPYVFLTDIDFLPMYGLYDYL 559
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI+ +++ K L+VPAFET YR FP S H
Sbjct: 560 RNSIQQLELPQR--KAALIVPAFETLHYRLTFPKSKAELLSMLDMGSLYTFRYHVWPKGH 617
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT++++W AT PY++ W PDFEPY+V RD P+YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYRVAWQPDFEPYVVVRRDCPKYDQRFVGFGWNK 664
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 116/171 (67%), Positives = 141/171 (82%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
+ ++ + CE++ VAIVCAG+N++R +VTL+KSILF+RKNPLHFH ITD+VA ILQT
Sbjct: 108 ACPKQRTVHKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFITDSVAHQILQT 167
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF +W VP V VSFY AD + +VSWIPNKHYSG+YGL+KLTL K LP L+K IVLDTD
Sbjct: 168 LFQSWMVPSVHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTD 227
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+ FATDIA+LWA+F + ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 228 ITFATDIAELWAVFKKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278
>gi|297708702|ref|XP_002831097.1| PREDICTED: like-glycosyltransferase isoform 4 [Pongo abelii]
Length = 707
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|332859457|ref|XP_003317213.1| PREDICTED: like-glycosyltransferase [Pan troglodytes]
gi|397501769|ref|XP_003821548.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5 [Pan
paniscus]
Length = 707
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|403283128|ref|XP_003932979.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5
[Saimiri boliviensis boliviensis]
Length = 707
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVKRDCPEYDRRFVGFGWNK 632
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|402884052|ref|XP_003905507.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5 [Papio
anubis]
Length = 707
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|296191744|ref|XP_002743760.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2
[Callithrix jacchus]
Length = 707
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVKRDCPEYDRRFVGFGWNK 632
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|221040396|dbj|BAH11875.1| unnamed protein product [Homo sapiens]
Length = 707
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 296/503 (58%), Positives = 356/503 (70%), Gaps = 66/503 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKME 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRREL GC ++ L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELSGCPSEADVNSENL 430
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
+ + + HAPTNF++W ATTPY++EW DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
PY+V RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|391344075|ref|XP_003746329.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
[Metaseiulus occidentalis]
Length = 713
Score = 611 bits (1576), Expect = e-172, Method: Compositional matrix adjust.
Identities = 309/534 (57%), Positives = 361/534 (67%), Gaps = 68/534 (12%)
Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
K W P + FYL D V +DV+WIPNKHYSGVYGL+KL L K LP +LAKTIVLDTDV
Sbjct: 139 KTWEIPQVEYSFYLTDKVAKDVNWIPNKHYSGVYGLMKLVLTKALPLSLAKTIVLDTDVT 198
Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
FA DIAQLW LF I LVENQSDWYLGKLWK+HKPWPALGRGFNTGVILL L K
Sbjct: 199 FACDIAQLWKLFDSFGPEHAIALVENQSDWYLGKLWKHHKPWPALGRGFNTGVILLHLGK 258
Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
LR W WR +AE+ L + +TSLADQDIFNA+I+ +P L+ LPCQWNVQL DNT S
Sbjct: 259 LRSKGWDELWRNVAERELSSMYFTSLADQDIFNAVINSYPALLLPLPCQWNVQLGDNTLS 318
Query: 341 DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT 400
E+CY+E+ DLK+IHWNSPKKL+VKNKH+++FRNLYLTFLEYDGNLLRREL CN +
Sbjct: 319 -EVCYSEVEDLKVIHWNSPKKLRVKNKHVQYFRNLYLTFLEYDGNLLRRELISCNRSSPN 377
Query: 401 ----LQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
Q +L L+E+DPCY+ RR +T RTHLY++EYEY +S D VTLV QLSMDR
Sbjct: 378 ESGPSSQDQLQGLDEDDPCYEFRRGNITRYRTHLYYMEYEYPSSPDY--VTLVTQLSMDR 435
Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
L QMVE L KHW GPISL LY+SD+E Q
Sbjct: 436 L---------------------------------QMVEALCKHWSGPISLALYLSDSEVQ 462
Query: 517 QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
QFLS+ NSE L RRNIGYH+VY++G YP+N LRNVAL+Q TPYVFL DIDFLPMFG
Sbjct: 463 QFLSFTLNSEVLSKRRNIGYHIVYRDGPLYPVNHLRNVALDQADTPYVFLSDIDFLPMFG 522
Query: 577 LYPYLKSSIR--SMDMHGHGGKKVLVVPAFETQRYRTAFP-------------------- 614
L+ LK+ IR MD G + LVVPAFET RYR FP
Sbjct: 523 LFEALKNYIRFLRMDAPNMGLYRALVVPAFETHRYRVEFPTSKAKLLKMLDDGRVFTFRH 582
Query: 615 ----ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A HA T+F+RW ATTPY + W P FEPY+V +++ R+D+RF+GFGWNK
Sbjct: 583 DVWCAGHAATDFARWKTATTPYDVNWQPHFEPYVVVRKEVVRFDSRFLGFGWNK 636
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 123/161 (76%), Positives = 135/161 (83%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+VI V+IVCAGYNSTRSLVTLIKS+LFYRKNPLH HL+TD+VA +IL TLF TW +PQV
Sbjct: 87 CQVIHVSIVCAGYNSTRSLVTLIKSVLFYRKNPLHLHLVTDSVAHSILSTLFKTWEIPQV 146
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
E SFYL D V +DV+WIPNKHYSGVYGL+KL L K LP +LAKTIVLDTDV FA DIAQL
Sbjct: 147 EYSFYLTDKVAKDVNWIPNKHYSGVYGLMKLVLTKALPLSLAKTIVLDTDVTFACDIAQL 206
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W LF I LVENQSDWYLGKLWK+HKPWPALGRGF
Sbjct: 207 WKLFDSFGPEHAIALVENQSDWYLGKLWKHHKPWPALGRGF 247
>gi|47217228|emb|CAF96751.1| unnamed protein product [Tetraodon nigroviridis]
Length = 826
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 310/548 (56%), Positives = 369/548 (67%), Gaps = 100/548 (18%)
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q
Sbjct: 240 EVSWIPNKHYSGIYGLMKLVLTKTLPPDLHRVIVLDTDITFATDIAELWAVFHKFQGQQV 299
Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
+GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + W WR+ AE+ L++
Sbjct: 300 LGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLRWEQMWRLTAERELMS 359
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKI------- 353
L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY +++DLK+
Sbjct: 360 MLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCYKDVSDLKVSPKTLHP 418
Query: 354 -----------------------IHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRE 390
IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRE
Sbjct: 419 GSFLQGGPLNISPPSFSGSFAQVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRE 478
Query: 391 LFGC---------NLTQTTLQ-QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEAS 440
LFGC N+ Q Q LS L+E+DPCY+ RR + T RTHLYFL YEYE S
Sbjct: 479 LFGCPSETDHNSENVALFCPQLQKTLSELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPS 538
Query: 441 SDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHW 500
+DG DVTLVAQLSMDRLQM+E + KHW
Sbjct: 539 ---------------------------------ADGTDVTLVAQLSMDRLQMLEAICKHW 565
Query: 501 EGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
EGPISL LY+SDAEAQQFL YA S+ L SR N+GYH+VYKEG FYP+N LRNVA+ QV+
Sbjct: 566 EGPISLALYLSDAEAQQFLRYAQGSDVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMQQVN 625
Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---- 616
TPY+FL DIDFLPM+GLY YL+ S+ +DM KK LVVPAFET RYR ++P S
Sbjct: 626 TPYMFLSDIDFLPMYGLYEYLRKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAEL 683
Query: 617 --------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTR 656
HAPTNF++W ATT Y+++W DFEPY++ RD P YD R
Sbjct: 684 LSQLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTAYRVQWEADFEPYVMVRRDSPEYDRR 743
Query: 657 FVGFGWNK 664
FVGFGWNK
Sbjct: 744 FVGFGWNK 751
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 113/207 (54%), Positives = 141/207 (68%), Gaps = 36/207 (17%)
Query: 2 SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SSD ++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A IL
Sbjct: 122 SSDCVQQPAVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 181
Query: 60 QTLFSTWSVPQVEVSFYLADSVV----------------------------------EDV 85
+LF TW VP V+V+FY AD + +V
Sbjct: 182 SSLFHTWMVPAVKVNFYDADELKVRTNSSRFSQETAVLMWDKNYWCFFCLFVWFFSQSEV 241
Query: 86 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 145
SWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q +G
Sbjct: 242 SWIPNKHYSGIYGLMKLVLTKTLPPDLHRVIVLDTDITFATDIAELWAVFHKFQGQQVLG 301
Query: 146 LVENQSDWYLGKLWKNHKPWPALGRGF 172
LVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 302 LVENQSDWYLGNLWKNHRPWPALGRGF 328
>gi|156717856|ref|NP_001096468.1| glycosyltransferase-like 1B precursor [Xenopus (Silurana)
tropicalis]
gi|134025835|gb|AAI36205.1| LOC100125087 protein [Xenopus (Silurana) tropicalis]
Length = 723
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 317/538 (58%), Positives = 365/538 (67%), Gaps = 68/538 (12%)
Query: 155 LGKLWKNHKPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 212
LG L+K W P+L FY A + DV+WIPNKHYSG+YGLLKLTL K LP L+K
Sbjct: 151 LGNLFKT---WMVPSLQISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKV 207
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
IVLDTD+ FATDIA+LWA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTG
Sbjct: 208 IVLDTDITFATDIAELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTG 267
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
VILL L KLR I W WR+ AE+ L+ L TSLADQDIFNA+I P LVY LPC WNV
Sbjct: 268 VILLLLDKLRLIGWEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNV 327
Query: 333 QLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELF 392
QLSD+TRS++ CY+EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL
Sbjct: 328 QLSDHTRSEQ-CYSELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELI 386
Query: 393 GC--NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVA 450
GC Q QA LS L+EEDPCYD RR L S R HL FL + D DVTLVA
Sbjct: 387 GCPSEGEQQGGSQAALSQLDEEDPCYDFRRESLASHRVHLSFLPH-VTPPPDPYDVTLVA 445
Query: 451 QLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM 510
QLSMDRL QM+E++ +HW+GP+SL LY+
Sbjct: 446 QLSMDRL---------------------------------QMLELICRHWDGPMSLALYL 472
Query: 511 SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
SDAEAQQFL YA SE L+SR N+ YHVVYKEG YP+N LRNVAL TPYVFL DID
Sbjct: 473 SDAEAQQFLRYAQASEVLQSRTNVAYHVVYKEGQLYPVNLLRNVALKNSQTPYVFLSDID 532
Query: 571 FLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------- 616
FLPM+GLY YL+ SI D+ G K L+VPAFET RYR +FP S
Sbjct: 533 FLPMYGLYEYLRKSISQQDL--TGPPKALIVPAFETLRYRLSFPKSKAELLSMLDTGALY 590
Query: 617 ----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT++++W ATTPY++EWAPDFEPY+V RD P YD RF+GFGWNK
Sbjct: 591 TFRYHVWEKGHAPTDYAKWRTATTPYRVEWAPDFEPYVVVRRDCPEYDQRFLGFGWNK 648
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 113/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L LF TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDAVALRVLGNLFKTWMVPSL 163
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++SFY A + DV+WIPNKHYSG+YGLLKLTL K LP L+K IVLDTD+ FATDIA+L
Sbjct: 164 QISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDIAEL 223
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264
>gi|148224937|ref|NP_001083457.1| glycosyltransferase-like protein LARGE2-A precursor [Xenopus
laevis]
gi|82186763|sp|Q6PA90.1|LRG2A_XENLA RecName: Full=Glycosyltransferase-like protein LARGE2-A; AltName:
Full=Glycosyltransferase-like 1B-A
gi|38014447|gb|AAH60410.1| MGC68636 protein [Xenopus laevis]
Length = 723
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 313/525 (59%), Positives = 359/525 (68%), Gaps = 63/525 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P+L FY A + DV+WIPNKHYSG+YGLLKLTL K LP L+K IVLDTD+ FATDI
Sbjct: 161 PSLQISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDI 220
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L KLR I
Sbjct: 221 AELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDKLRLIG 280
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 281 WEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNVQLSDHTRSEQ-CY 339
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
+EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL GC Q Q
Sbjct: 340 SELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELIGCPSEGEQQGGSQ 399
Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
A LS L+EEDPCYD RR + S R HL FL + D DVTLVAQLSMDRL
Sbjct: 400 ATLSQLDEEDPCYDFRRESVASHRVHLSFLPH-LTPPPDPYDVTLVAQLSMDRL------ 452
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
QM+E++ +HWEGP+SL LY+SDAEAQQFL YA
Sbjct: 453 ---------------------------QMLELICRHWEGPMSLALYLSDAEAQQFLRYAQ 485
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
SE L+SR NIGYHVVYKEG YP+N LRNVAL TPYVFL DIDFLPM+GLY YL+
Sbjct: 486 ASEVLQSRTNIGYHVVYKEGQLYPVNLLRNVALKNSHTPYVFLSDIDFLPMYGLYEYLRK 545
Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
SI D G + L+VPAFET RYR +FP S HAP
Sbjct: 546 SIAQQD--PAGSPRALIVPAFETLRYRLSFPKSKADLLSMLDTGALYTFRYHVWEKGHAP 603
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++++W ATTPY++EWAPDFEPY+V RD P YD RF+GFGWNK
Sbjct: 604 TDYAKWRTATTPYRVEWAPDFEPYVVVRRDCPEYDQRFLGFGWNK 648
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 113/161 (70%), Positives = 134/161 (83%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L LF+TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDDVALRVLGNLFNTWMVPSL 163
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++SFY A + DV+WIPNKHYSG+YGLLKLTL K LP L+K IVLDTD+ FATDIA+L
Sbjct: 164 QISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDIAEL 223
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264
>gi|147900424|ref|NP_001089877.1| glycosyltransferase-like protein LARGE2-B precursor [Xenopus
laevis]
gi|90185260|sp|Q32NJ7.1|LRG2B_XENLA RecName: Full=Glycosyltransferase-like protein LARGE2-B; AltName:
Full=Glycosyltransferase-like 1B-B
gi|80477206|gb|AAI08591.1| Gyltl1b protein [Xenopus laevis]
Length = 723
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 312/525 (59%), Positives = 360/525 (68%), Gaps = 63/525 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P+L FY A + DV+WIPNKHYSG++GLLKLTL K LP L+K IVLDTD+ FATDI
Sbjct: 161 PSLTISFYNASELKPDVAWIPNKHYSGIFGLLKLTLTKALPSYLSKVIVLDTDITFATDI 220
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L KLR I
Sbjct: 221 AELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDKLRLIG 280
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 281 WEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNVQLSDHTRSEQ-CY 339
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
+EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL GC Q Q
Sbjct: 340 SELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELIGCPSEGEQQGGSQ 399
Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
A LS L+EEDPCYD RR L S R HL FL + + D +DVTLVAQLSMDRL
Sbjct: 400 AALSQLDEEDPCYDFRRESLASHRVHLSFLPH-LTPTPDPSDVTLVAQLSMDRL------ 452
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
QM+E++ +HWEGP+SL LY+SDAEAQQFL YA
Sbjct: 453 ---------------------------QMLELICRHWEGPMSLALYLSDAEAQQFLRYAQ 485
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
SE L+SR NIGYHV+YKEG YP+N LRNVAL TPYVFL DIDFLPM+GLY L+
Sbjct: 486 ASEVLQSRTNIGYHVIYKEGQLYPVNLLRNVALKNSHTPYVFLSDIDFLPMYGLYENLRK 545
Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
SI D G K L+VPAFET RYR +FP S HAP
Sbjct: 546 SIAQQD--PTGSPKALIVPAFETLRYRLSFPKSKADLLSMLDTGALYTFRYHVWEKGHAP 603
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
TN+++W ATTPY++EWAPDFEPY+V +D P YD RF+GFGWNK
Sbjct: 604 TNYAKWRTATTPYRVEWAPDFEPYVVVRQDCPEYDQRFLGFGWNK 648
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 112/161 (69%), Positives = 134/161 (83%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L+ LF+TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDDVALRVLRNLFNTWMVPSL 163
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+SFY A + DV+WIPNKHYSG++GLLKLTL K LP L+K IVLDTD+ FATDIA+L
Sbjct: 164 TISFYNASELKPDVAWIPNKHYSGIFGLLKLTLTKALPSYLSKVIVLDTDITFATDIAEL 223
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WA+F + Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264
>gi|334331837|ref|XP_003341528.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2-like [Monodelphis domestica]
Length = 751
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 297/527 (56%), Positives = 360/527 (68%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY AD + +V+WIPNKHYSG+YGLLKL L + LP TL + IVLDTD+ FA+DI
Sbjct: 155 PSVLVSFYNADELKPEVAWIPNKHYSGIYGLLKLALLRALPPTLPQVIVLDTDITFASDI 214
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A LWA+F+R +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 215 ADLWAVFARFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLQLERLRQVG 274
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ LLT L TSLADQDIFNA+I +HP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 275 WEQMWRLTAERELLTMLATSLADQDIFNAVIKQHPWLVHPLPCSWNVQLSDHTRA-ERCY 333
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+E +DLK+IHWNSPKKL VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ + +
Sbjct: 334 SEASDLKVIHWNSPKKLGVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCGGSEPQPEPGQ 393
Query: 406 ----LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
L +++ CY+ R+ +LT R HL FL +E A D +DVTLVAQLSMDRL
Sbjct: 394 TRPATGELEDDEACYEFRQQRLTVHRVHLTFLPFEPPA-PDPHDVTLVAQLSMDRL---- 448
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY+SDAEAQQFL +
Sbjct: 449 -----------------------------QMLEALCRHWPGPMSLALYLSDAEAQQFLRF 479
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L +RR++ YH+VY+EG YP+N LR VAL Q TPYVFL DIDFLP F LY YL
Sbjct: 480 AQASEVLAARRDVAYHIVYREGPLYPVNLLRXVALGQARTPYVFLSDIDFLPAFSLYDYL 539
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI+ + + K LVVPAFET RYR FP S H
Sbjct: 540 RTSIKELGLATR--KAALVVPAFETLRYRLNFPTSKKELLALLDAGSLYTFRYHVWPRGH 597
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PT+++RW A PY++EWA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 598 EPTDYARWREAQAPYRVEWAADYEPYVVVARDCPRYDPRFVGFGWNK 644
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 104/175 (59%), Positives = 125/175 (71%), Gaps = 4/175 (2%)
Query: 2 SSDEEPVIPTCEV----IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALN 57
S + P +P C+V +Q+A G + RKNPLHFHL+TDTVA +
Sbjct: 84 SQPQPPAVPKCQVSRGGLQMAEREGGATCCSQPFRTGRCGPRERKNPLHFHLVTDTVARS 143
Query: 58 ILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIV 117
IL+TLFSTW VP V VSFY AD + +V+WIPNKHYSG+YGLLKL L + LP TL + IV
Sbjct: 144 ILETLFSTWMVPSVLVSFYNADELKPEVAWIPNKHYSGIYGLLKLALLRALPPTLPQVIV 203
Query: 118 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
LDTD+ FA+DIA LWA+F+R +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 204 LDTDITFASDIADLWAVFARFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 258
>gi|241051730|ref|XP_002407451.1| conserved hypothetical protein [Ixodes scapularis]
gi|215492218|gb|EEC01859.1| conserved hypothetical protein [Ixodes scapularis]
Length = 636
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 299/539 (55%), Positives = 371/539 (68%), Gaps = 81/539 (15%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FYLAD++ +VSWIPNKHYSGVYGL+KL LP+ LP+ L K IVLDTD+ FA+DI
Sbjct: 66 PAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPRALPDDLDKVIVLDTDITFASDI 125
Query: 226 AQLWALFSRLRQRQ---TIGLVENQSDWYLGKLWKNHKPWP-----ALGRGFNTGVILLD 277
A+LW +F R +Q + ++ + + L ++ P P + GRGFNTGVILL
Sbjct: 126 AELWRIFHRFTAKQVPPSFQILPHNT-----FLTRDQIP-PIGRRLSQGRGFNTGVILLQ 179
Query: 278 LTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDN 337
L KLR+ +W WR+IAEK L++ +TSLADQDIFNA++ +H L+Y LPCQWNVQL DN
Sbjct: 180 LQKLRERNWMQMWRLIAEKELISMWFTSLADQDIFNAMLKQHTDLLYRLPCQWNVQLGDN 239
Query: 338 TRSD-ELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNL 396
T S+ LCY+E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLLRREL GCN
Sbjct: 240 TLSEASLCYSEVQELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLRRELIGCNG 299
Query: 397 TQTTLQ-------QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLV 449
+ ++ Q L+ L+E+DPCYD RR+++ RTHLY++EY+YE S +GNDVTLV
Sbjct: 300 SASSSAGAHASHLQDALNLLDEDDPCYDFRRARVAQHRTHLYYIEYDYEPSPEGNDVTLV 359
Query: 450 AQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY 509
AQLSMDRL QMVE L KHWEGPISL LY
Sbjct: 360 AQLSMDRL---------------------------------QMVEALCKHWEGPISLALY 386
Query: 510 MSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
MSD+E QQFLSY +SE L+SR+N+GYH+VYK+G FYP+N LRNVAL QV+TP+VFL DI
Sbjct: 387 MSDSEVQQFLSYTLSSEILQSRKNVGYHIVYKDGTFYPVNLLRNVALQQVNTPFVFLTDI 446
Query: 570 DFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------- 616
DFLPM GLY YLK S+ ++ + +K L+VPAFETQRYR +FP S
Sbjct: 447 DFLPMHGLYEYLKRSVFALGL--ESARKALIVPAFETQRYRLSFPKSKAELLSMLDMGTL 504
Query: 617 -----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY++ W +FEPY+V RD+P YD RFVGFGWNK
Sbjct: 505 FTFRYHVWQKGHAPTNFAKWRTATTPYRVAWEANFEPYVVVKRDVPEYDRRFVGFGWNK 563
Score = 209 bits (531), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 94/140 (67%), Positives = 116/140 (82%)
Query: 4 DEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLF 63
D P +P CEV+QVAIVCAGYN++RS+VTL+KS+LFYRKNPLHFH ++D VA +LQTLF
Sbjct: 1 DALPEVPKCEVLQVAIVCAGYNASRSVVTLVKSVLFYRKNPLHFHFVSDAVAHLVLQTLF 60
Query: 64 STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 123
TW+VP VEVSFYLAD++ +VSWIPNKHYSGVYGL+KL LP+ LP+ L K IVLDTD+
Sbjct: 61 RTWNVPAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPRALPDDLDKVIVLDTDIT 120
Query: 124 FATDIAQLWALFSRLRQRQT 143
FA+DIA+LW +F R +Q
Sbjct: 121 FASDIAELWRIFHRFTAKQV 140
>gi|321476466|gb|EFX87427.1| hypothetical protein DAPPUDRAFT_192525 [Daphnia pulex]
Length = 705
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 288/524 (54%), Positives = 354/524 (67%), Gaps = 64/524 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY AD + DV+WIPNKHYSGVYGLLKLTLPKVLP+++ K IV D DV F+ DI
Sbjct: 154 PHMNFSFYSADELTSDVAWIPNKHYSGVYGLLKLTLPKVLPQSVNKVIVFDCDVTFSADI 213
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A LWA+ S Q IGLVENQSDWYLGKLWKN++PWPALGRGFNTGVI +DL KLR ++
Sbjct: 214 ADLWAVISSFSSNQAIGLVENQSDWYLGKLWKNYEPWPALGRGFNTGVIAMDLKKLRSLN 273
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W + AEK L++ +TSLADQDIFNA++ + P+LVY LPCQ+NVQLSDNTRS E CY
Sbjct: 274 WNQLWLLTAEKDLVSHYFTSLADQDIFNAVLFQEPHLVYRLPCQYNVQLSDNTRS-ESCY 332
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+E+ DLKIIHWNSPKKLKVKNKH +FFRNLY TF+ +DGN LRR+LF CN + Q+
Sbjct: 333 SEVADLKIIHWNSPKKLKVKNKHSDFFRNLYYTFVGFDGNFLRRQLFHCN--DSGFNQSS 390
Query: 406 LSNLN-EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
+ +L E+D CY++R+++ RTHL++L Y+
Sbjct: 391 VDDLPVEDDACYEIRKARDAKYRTHLFYLSYD---------------------------- 422
Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
D ND+T VAQLS+DRLQM++ L + WEGPISL LY+SD EA QF S+
Sbjct: 423 ------AGKIDDNDITWVAQLSLDRLQMIDPLCRLWEGPISLALYLSDTEADQFHSFVTE 476
Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
S L SR NIGYHVVYK+G+ YP+N LRNVAL Q STP+VFL DIDFLPM LY LK +
Sbjct: 477 SHYLNSRTNIGYHVVYKQGSLYPVNLLRNVALEQASTPFVFLCDIDFLPMPNLYTTLKKA 536
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
++S+ + K LVVPAFE+QRY+T P + HA T
Sbjct: 537 VQSLKVATEN--KALVVPAFESQRYQTKVPRTKAEVIAALDMGDLLTFRYHDWALGHAAT 594
Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
NF W AT PY+I+W P+FEPY+V RD+ RYD RF+GFGWNK
Sbjct: 595 NFPMWRTATIPYKIKWEPNFEPYVVVRRDVTRYDPRFLGFGWNK 638
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 111/171 (64%), Positives = 135/171 (78%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
+ D ++ CE+I VAIVCAGY+++R++ TLIKS+LFYRKNP+H H I D +A NIL+T
Sbjct: 87 AKDASILVKRCEIIHVAIVCAGYDASRAVSTLIKSLLFYRKNPIHLHFIADAIAENILKT 146
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF TW +P + SFY AD + DV+WIPNKHYSGVYGLLKLTLPKVLP+++ K IV D D
Sbjct: 147 LFQTWDIPHMNFSFYSADELTSDVAWIPNKHYSGVYGLLKLTLPKVLPQSVNKVIVFDCD 206
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
V F+ DIA LWA+ S Q IGLVENQSDWYLGKLWKN++PWPALGRGF
Sbjct: 207 VTFSADIADLWAVISSFSSNQAIGLVENQSDWYLGKLWKNYEPWPALGRGF 257
>gi|410973795|ref|XP_003993333.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Felis catus]
Length = 702
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 293/526 (55%), Positives = 354/526 (67%), Gaps = 63/526 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY + + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 134 PAVGVSFYDLEELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDI 193
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 194 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 253
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 254 WGQMWKLTATRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 312
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ---TTLQ 402
+E +DLK+IHWNSPKKL+VKNKH+E+FRNLYLTFLEYDGNLLRRELFGC T
Sbjct: 313 SEASDLKVIHWNSPKKLRVKNKHVEYFRNLYLTFLEYDGNLLRRELFGCPSPPPPGTEQL 372
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L+ L+EED C++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 373 QRALAQLDEEDACFEFRQQQLTVHRVHISFLPHE-PLPPRPHDVTLVAQLSMDRL----- 426
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 427 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 458
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL+
Sbjct: 459 EASAVLSARQDVAYHVVYREGPLYPVNELRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 518
Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
+SI + + K LVVPAFET YR +FP+S HA
Sbjct: 519 ASIEQLKLDSE-RKVALVVPAFETLHYRFSFPSSKAELLALLDSGSLYTFRYHEWPRGHA 577
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PT+++RW A TPY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 578 PTDYARWREAQTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 623
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 112/163 (68%), Positives = 132/163 (80%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P +++ VAIVCAG+NS+R L TL+KS+LFYRKNPLHFHL+TD VA NIL+ LF TW VP
Sbjct: 75 PPPQLLHVAIVCAGHNSSRDLNTLVKSVLFYRKNPLHFHLVTDAVARNILEMLFHTWMVP 134
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V VSFY + + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DIA
Sbjct: 135 AVGVSFYDLEELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDIA 194
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 195 ELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 237
>gi|390470460|ref|XP_002755202.2| PREDICTED: glycosyltransferase-like protein LARGE2 [Callithrix
jacchus]
Length = 689
Score = 580 bits (1496), Expect = e-163, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTD+ FA+DI
Sbjct: 118 PAVRVSFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSTLPAELARVIVLDTDITFASDI 177
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 178 AELWALFAHFSDMQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRHAG 237
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 238 WEQMWRLTARQELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 296
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFL YDGNLLRR LFGC + L
Sbjct: 297 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLAYDGNLLRRGLFGCPSQPPSGAEQL 356
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E S +DVTLVAQLSMDRL
Sbjct: 357 QQA-LAQLDEEDPCFEFRQQQLTVHRVHITFLPHE-PPPSRPHDVTLVAQLSMDRL---- 410
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 411 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 441
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 442 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 501
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP S H
Sbjct: 502 RASIEQLGL-GSRRKAALVVPAFETLHYRFSFPRSKVELLALLDAGALYTFRYHEWPRGH 560
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY ++W D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 561 APTDYARWREAQAPYSVQWVADYEPYVVVPRDCPRYDPRFVGFGWNK 607
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/163 (69%), Positives = 133/163 (81%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP
Sbjct: 59 PKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVP 118
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTD+ FA+DIA
Sbjct: 119 AVRVSFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSTLPAELARVIVLDTDITFASDIA 178
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 179 ELWALFAHFSDMQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 221
>gi|291384909|ref|XP_002709120.1| PREDICTED: glycosyltransferase-like 1B [Oryctolagus cuniculus]
Length = 721
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 357/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PAL FY A+++ VSWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV FA+DI
Sbjct: 153 PALRVSFYDAEALKPQVSWIPNKHYSGLYGLMKLVLPAVLPPDLARVIVLDTDVTFASDI 212
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 213 AELWALFAHFSDKQLIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLGRLQQAG 272
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 273 WEQMWKLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 331
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+E +DLK+IHWNSPKKL+VKNKH EFFRN +LTFLEYDGNLLRRELFGC L
Sbjct: 332 SEASDLKVIHWNSPKKLRVKNKHAEFFRNFHLTFLEYDGNLLRRELFGCPSQPPAGAEQL 391
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EED C++ R+ +LT R HL FL ++ +DVTLVAQLSMDRLQM+E
Sbjct: 392 QQA-LAQLDEEDACFEFRQQRLTVHRVHLTFLPHKPPPPRP-HDVTLVAQLSMDRLQMLE 449
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+ L +HW GP+SL LY++DAEAQQFL +
Sbjct: 450 V---------------------------------LCRHWPGPMSLALYLTDAEAQQFLRF 476
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 477 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 536
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 537 RASIEQLELGGR-RKAALVVPAFETLHYRFSFPTSKAELLALLDAGSLHTFRYHEWPQGH 595
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 596 APTDYARWREARVPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 642
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 159/219 (72%), Gaps = 5/219 (2%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
+ P+ P CE++ VAIVCAG+NS+R +VTL+KS+LFYRKNPLH HL+TD VA NIL+ LF
Sbjct: 89 QPPLPPKCELLHVAIVCAGHNSSREVVTLVKSMLFYRKNPLHLHLVTDAVARNILEMLFH 148
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP + VSFY A+++ VSWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV F
Sbjct: 149 TWLVPALRVSFYDAEALKPQVSWIPNKHYSGLYGLMKLVLPAVLPPDLARVIVLDTDVTF 208
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSW 184
A+DIA+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGF ++ W
Sbjct: 209 ASDIAELWALFAHFSDKQLIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----W 264
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
+ +G + KLT + L TL T + D D+ A
Sbjct: 265 LGRLQQAGWEQMWKLTARREL-LTLPATSLADQDIFNAV 302
>gi|109106604|ref|XP_001112766.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 1
[Macaca mulatta]
Length = 690
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 299/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDVTFASDI 179
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 298
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+ +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC L
Sbjct: 299 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 358
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA LS LNEEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 359 QQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR----- 411
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 412 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRF 443
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 444 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP S H
Sbjct: 504 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 562
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQDPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 609
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)
Query: 3 SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SD P P CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 51 SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 110
Query: 60 QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
+ LF TW VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLD
Sbjct: 111 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLD 170
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TDV FA+DI++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 171 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223
>gi|109106606|ref|XP_001112795.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 2
[Macaca mulatta]
gi|109106608|ref|XP_001112825.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 3
[Macaca mulatta]
Length = 721
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 299/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDVTFASDI 210
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 329
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+ +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC L
Sbjct: 330 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 389
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA LS LNEEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 390 QQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 474
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 475 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP S H
Sbjct: 535 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 593
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQDPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 640
Score = 251 bits (641), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)
Query: 3 SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SD P P CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 82 SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 141
Query: 60 QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
+ LF TW VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLD
Sbjct: 142 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLD 201
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TDV FA+DI++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 202 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254
>gi|403255509|ref|XP_003920468.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Saimiri
boliviensis boliviensis]
Length = 694
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 294/527 (55%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTD+ FA+DI
Sbjct: 123 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPAELARVIVLDTDITFASDI 182
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF++ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 183 AELWALFAQFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 242
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 243 WEQMWKLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 301
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN+YLTFLEYDGNLLRR LFGC + L
Sbjct: 302 SEASDLKVIHWNSPKKLRVKNKHVEFFRNVYLTFLEYDGNLLRRGLFGCPSQPPSGAEQL 361
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ ++ +LT R H+ FL +E S +DVTLVAQLSMDRL
Sbjct: 362 QQA-LTQLDEEDPCFEFQQQQLTVHRVHVTFLPHE-PPPSRPHDVTLVAQLSMDRL---- 415
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 416 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 446
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 447 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 506
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET Y +FP S H
Sbjct: 507 RASIEQLGL-GSRRKAALVVPAFETLHYHFSFPRSKVELLALLDAGAVYTFRYHEWPRGH 565
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY ++W D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 566 APTDYARWREAQAPYGVQWVADYEPYVVVPRDCPRYDPRFVGFGWNK 612
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/161 (70%), Positives = 134/161 (83%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYR+NPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 66 CELLHVAIVCAGHNSSRDVITLVKSMLFYRRNPLHLHLVTDAVARNILETLFHTWMVPAV 125
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTD+ FA+DIA+L
Sbjct: 126 RVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPAELARVIVLDTDITFASDIAEL 185
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF++ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 186 WALFAQFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 226
>gi|395816209|ref|XP_003781599.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Otolemur
garnettii]
Length = 687
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 294/525 (56%), Positives = 351/525 (66%), Gaps = 63/525 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 121 PAVHVSFYDADELKHQVSWIPNKHYSGLYGLMKLVLPGALPPGLARVIVLDTDVTFASDI 180
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQS+WYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 181 AELWALFAHFSDKQAIGLVENQSNWYLGNLWKNHRPWPALGRGFNTGVILLQLDRLRTAG 240
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 241 WEQMWRLTASRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCIWNVQLSDHTLA-ERCY 299
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
+E +DLK+IHWNSPKKL+VKNKH+ FFRN YLTF EYDGNLLRRELFGC +QQ
Sbjct: 300 SEASDLKVIHWNSPKKLRVKNKHVAFFRNFYLTFTEYDGNLLRRELFGCPSQPLPGAVQQ 359
Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
A L L+EED C++ R+ +LT R H+ FL +E + +DVTLV QLSMDRL
Sbjct: 360 A-LEQLDEEDSCFEFRQQQLTVHRVHIAFLPHEL-PTPRPHDVTLVTQLSMDRL------ 411
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 412 ---------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFVE 444
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL+S
Sbjct: 445 ASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLRS 504
Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
SI + + G K LVVPAFET YR +FP S HAP
Sbjct: 505 SIEQLGL-GSQRKAALVVPAFETLHYRFSFPTSKEELLALLDAGALYTFRYHEWPRGHAP 563
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 564 TDYTRWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 608
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 116/163 (71%), Positives = 134/163 (82%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE + VAIVCAG+NS+R L+TL+KSILFYRKNPLH HL+T+ VA NIL+TLF TW VP
Sbjct: 62 PRCERLHVAIVCAGHNSSRDLITLVKSILFYRKNPLHLHLVTNAVARNILETLFHTWMVP 121
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DIA
Sbjct: 122 AVHVSFYDADELKHQVSWIPNKHYSGLYGLMKLVLPGALPPGLARVIVLDTDVTFASDIA 181
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+LWALF+ +Q IGLVENQS+WYLG LWKNH+PWPALGRGF
Sbjct: 182 ELWALFAHFSDKQAIGLVENQSNWYLGNLWKNHRPWPALGRGF 224
>gi|405965905|gb|EKC31250.1| Glycosyltransferase-like protein LARGE1 [Crassostrea gigas]
Length = 725
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 293/538 (54%), Positives = 360/538 (66%), Gaps = 73/538 (13%)
Query: 154 YLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 213
YL + W+ P + FY AD V ED+ WI NKHYSG+YGLLKL LPK LP +L K I
Sbjct: 162 YLFETWR----VPDVKISFYHADKVKEDIEWIHNKHYSGIYGLLKLVLPKTLPTSLEKVI 217
Query: 214 VLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGV 273
VLDTDV FATDIA+LW +FS L + +GLVENQSDWYLGK+WKNH PWPALGRGFNTGV
Sbjct: 218 VLDTDVSFATDIAELWKMFSFLNGLKVLGLVENQSDWYLGKIWKNHIPWPALGRGFNTGV 277
Query: 274 ILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
IL++L LR+++W W+ IA K L T +T LADQDIFNA++ EHPY VY LPCQWNVQ
Sbjct: 278 ILMNLRVLREMNWMEMWKKIAVKELETMQYTQLADQDIFNAVLKEHPYFVYHLPCQWNVQ 337
Query: 334 LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFG 393
LSDN++SD+ CY +L + KIIHWNSPKK KV+NKH+++FRNL+L F+EYDGNLLRRELFG
Sbjct: 338 LSDNSKSDK-CYNQLEEPKIIHWNSPKKFKVENKHVDYFRNLHLAFVEYDGNLLRRELFG 396
Query: 394 CNLTQT---TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVA 450
C ++ T Q E+S ED C++ RR + RTHL++L+Y+Y+ + DVTLV
Sbjct: 397 CEKSEDANPTASQEEVS----EDECFEFRREQSLVHRTHLFYLDYKYQP--EEYDVTLVT 450
Query: 451 QLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM 510
QLS+DRLQM+E+ N WEGPISL LY+
Sbjct: 451 QLSIDRLQMLELLSN---------------------------------RWEGPISLALYL 477
Query: 511 SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
SDAEAQQFL YA +SE L R NIGYH+VYK+G FYP+N LRNVALNQ T ++FL DID
Sbjct: 478 SDAEAQQFLKYAQDSEILTKRTNIGYHIVYKDGQFYPVNYLRNVALNQSQTSHMFLSDID 537
Query: 571 FLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------- 616
FLPM GLY +LKS + +DM K LV+PAFETQRYR FP +
Sbjct: 538 FLPMPGLYNHLKSVVSRIDMEKEN--KALVIPAFETQRYRLEFPTTKADLLTKLDKEEIF 595
Query: 617 ----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
H PTN+++W A Y ++W D+EPYIV + +P +D RFVGFGWNK
Sbjct: 596 TFRYDVWPQGHLPTNYTKWRTAKAMYTVKWERDYEPYIVVKKGIPLFDQRFVGFGWNK 653
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 115/164 (70%), Positives = 133/164 (81%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
+P CEVI +AIVC+G+ STR +VTLIKS+LFYRKNPLHFH ++DT A IL LF TW V
Sbjct: 110 VPKCEVIHIAIVCSGHKSTRDVVTLIKSVLFYRKNPLHFHFLSDTTAQVILNYLFETWRV 169
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
P V++SFY AD V ED+ WI NKHYSG+YGLLKL LPK LP +L K IVLDTDV FATDI
Sbjct: 170 PDVKISFYHADKVKEDIEWIHNKHYSGIYGLLKLVLPKTLPTSLEKVIVLDTDVSFATDI 229
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
A+LW +FS L + +GLVENQSDWYLGK+WKNH PWPALGRGF
Sbjct: 230 AELWKMFSFLNGLKVLGLVENQSDWYLGKIWKNHIPWPALGRGF 273
>gi|332259858|ref|XP_003279001.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 1
[Nomascus leucogenys]
Length = 720
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 297/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 152 PAVRVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 211
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 212 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 271
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 272 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 330
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFR+ YLTFLEYDGNLLRRELF C L
Sbjct: 331 SEASDLKVIHWNSPKKLRVKNKHVEFFRSFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 390
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 391 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 444
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 445 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 475
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 476 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 535
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP S H
Sbjct: 536 RASIEQLGL-GSRRKAALVVPAFETLHYRFSFPHSKVELLASLDVGALYTFRYHEWPRGH 594
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WAPD+EPY+V RD PRYD RFVGFGWNK
Sbjct: 595 APTDYARWRGAQDPYRVQWAPDYEPYVVVPRDCPRYDPRFVGFGWNK 641
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 95 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 154
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 155 RVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 214
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 215 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 255
>gi|119588426|gb|EAW68020.1| glycosyltransferase-like 1B, isoform CRA_a [Homo sapiens]
Length = 573
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 3 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 62
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 63 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 122
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 123 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 181
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 182 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 241
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 242 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR----- 294
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 295 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 326
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 327 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 386
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 387 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 445
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA ++EPY+V RD PRYD RFVGFGWNK
Sbjct: 446 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 492
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 75/105 (71%), Positives = 84/105 (80%)
Query: 68 VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+D
Sbjct: 2 VPAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASD 61
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
I++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 62 ISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 106
>gi|402893700|ref|XP_003910029.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Papio anubis]
Length = 690
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 297/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPAELAHVIVLDTDVTFASDI 179
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 298
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+ +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC L
Sbjct: 299 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 358
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 359 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDRL---- 412
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 443
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 444 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP S H
Sbjct: 504 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 562
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 609
Score = 251 bits (641), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)
Query: 3 SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
SD P P CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 51 SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 110
Query: 60 QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
+ LF TW VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLD
Sbjct: 111 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPAELAHVIVLD 170
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
TDV FA+DI++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 171 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223
>gi|116235482|ref|NP_689525.3| glycosyltransferase-like protein LARGE2 precursor [Homo sapiens]
gi|146345450|sp|Q8N3Y3.2|LARG2_HUMAN RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
Full=Glycosyltransferase-like 1B
gi|119588427|gb|EAW68021.1| glycosyltransferase-like 1B, isoform CRA_b [Homo sapiens]
Length = 721
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA ++EPY+V RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254
>gi|22713410|gb|AAH37291.1| GYLTL1B protein [Homo sapiens]
gi|123981284|gb|ABM82471.1| glycosyltransferase-like 1B [synthetic construct]
gi|123995351|gb|ABM85277.1| glycosyltransferase-like 1B [synthetic construct]
gi|124000717|gb|ABM87867.1| glycosyltransferase-like 1B [synthetic construct]
Length = 721
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA ++EPY+V RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254
>gi|21755403|dbj|BAC04675.1| unnamed protein product [Homo sapiens]
Length = 721
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPRVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA ++EPY+V RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPRVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254
>gi|426368099|ref|XP_004051050.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 1
[Gorilla gorilla gorilla]
gi|426368101|ref|XP_004051051.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 2
[Gorilla gorilla gorilla]
Length = 721
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 353/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 211 LELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAD 270
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWKLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 640
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 132/161 (81%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI +L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDILEL 213
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254
>gi|338712079|ref|XP_001915260.2| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like 1B [Equus
caballus]
Length = 695
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 295/526 (56%), Positives = 352/526 (66%), Gaps = 63/526 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 126 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPLDLARVIVLDTDVTFASDI 185
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR +
Sbjct: 186 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQVG 245
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 246 WEQMWRLTATRELLTLPATSLADQDIFNAVIKEHPGLVRPLPCVWNVQLSDHTLA-EGCY 304
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-NLTQTTLQ-- 402
+E +DLK+IHWNSPKKL+VKNKH+ FFRNLYLTFLEYDGNLLRRELFGC +L +
Sbjct: 305 SEASDLKVIHWNSPKKLRVKNKHVGFFRNLYLTFLEYDGNLLRRELFGCPSLPHPGAEQW 364
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L+ L+ ED C + R+ +LT R H+ FL +E DVTLVAQLSMDRL
Sbjct: 365 QRALAQLDGEDACSEFRQQQLTVHRVHVTFLPHELPPPRP-LDVTLVAQLSMDRL----- 418
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 419 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 450
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
S L +R+++ YHVVY+EG YPIN LRNVAL Q TPYVFL DIDFLP + LY YL+
Sbjct: 451 EASAVLSARQDVAYHVVYREGPLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 510
Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
+SI +++ G K LVVPAFET YR +FP S HA
Sbjct: 511 ASIEQLELGGR-RKAALVVPAFETLHYRFSFPRSKAEVLASLDAGALYTFRYHEWPRGHA 569
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 570 PTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 615
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 116/167 (69%), Positives = 135/167 (80%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
P+ P CE++ VAIVCAG+NS+R +VTL+KSILFYRKNPLH HL+TD VA +IL+ LF T
Sbjct: 63 RPLAPKCELLHVAIVCAGHNSSRDVVTLVKSILFYRKNPLHLHLVTDAVARSILEMLFHT 122
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA
Sbjct: 123 WMVPAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPLDLARVIVLDTDVTFA 182
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+DIA+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 183 SDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 229
>gi|301773854|ref|XP_002922345.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2-like [Ailuropoda melanoleuca]
Length = 716
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 293/526 (55%), Positives = 354/526 (67%), Gaps = 63/526 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 148 PAVWVSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPATLPPDLARVIVLDTDVTFASDI 207
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 208 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 267
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 268 WEQMWKLTATRELLTLPATSLADQDIFNAVIKEHPRLVQPLPCVWNVQLSDHTLA-ERCY 326
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
+E +DLK+IHWNSPKKL+VKNKH++ FRNLYLTFLEYDGNLLRRELFGC +
Sbjct: 327 SEASDLKVIHWNSPKKLRVKNKHVDHFRNLYLTFLEYDGNLLRRELFGCPSPPPSGAEQV 386
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L+ L+EED C++ R+ +LT R H+ FL E +DVTLVAQLS+DRL
Sbjct: 387 QRALAQLDEEDACFEFRQQQLTVHRVHVTFLPLEPPLPR-PHDVTLVAQLSIDRL----- 440
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 441 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 472
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
S L +R+N+ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL+
Sbjct: 473 EASAVLSARQNVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 532
Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
+SI + + G K LVVPAFET RYR +FP+S HA
Sbjct: 533 ASIEQLKL-GRERKAALVVPAFETLRYRFSFPSSKAELLALLDSGSLYTFRYHEWPQGHA 591
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PT+++RW A TPY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 592 PTDYARWREAQTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 637
Score = 248 bits (633), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 113/160 (70%), Positives = 133/160 (83%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
+++ VAIVCAG NS+R LVTL+KS+LFYRKNPLHFHL+TDT+A NIL+ LF TW VP V
Sbjct: 92 QLLHVAIVCAGQNSSRDLVTLVKSVLFYRKNPLHFHLVTDTMARNILEMLFHTWMVPAVW 151
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
VSFY A+ + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DIA+LW
Sbjct: 152 VSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPATLPPDLARVIVLDTDVTFASDIAELW 211
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 212 ALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 251
>gi|193786298|dbj|BAG51581.1| unnamed protein product [Homo sapiens]
Length = 690
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 295/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 179
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 298
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 299 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 358
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 359 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDRL---- 412
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 443
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 444 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RY +FP S H
Sbjct: 504 RASIEQLGL-GSRRKAALVVPAFETLRYCFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 562
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA ++EPY+V RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 609
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 133/161 (82%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 63 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 122
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 123 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 182
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 183 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223
>gi|397473583|ref|XP_003808287.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2 [Pan paniscus]
Length = 757
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 296/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 187 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 246
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 247 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 306
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ A + LL+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 307 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQHLPCVWNVQLSDHTLA-ERCY 365
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
+E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYD NLLRRELF C L
Sbjct: 366 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDXNLLRRELFVCPSQPPPGAEQL 425
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 426 QQA-LAQLDEEDPCFEFRQQPLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 479
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 480 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 510
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 511 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 570
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 571 RASIEQLGL-GSWRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 629
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 630 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 676
Score = 252 bits (643), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 113/161 (70%), Positives = 132/161 (81%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP V
Sbjct: 130 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVPAV 189
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 190 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 249
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 250 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 290
>gi|355566575|gb|EHH22954.1| hypothetical protein EGK_06315, partial [Macaca mulatta]
gi|355752188|gb|EHH56308.1| hypothetical protein EGM_05685, partial [Macaca fascicularis]
Length = 647
Score = 573 bits (1477), Expect = e-160, Method: Compositional matrix adjust.
Identities = 299/537 (55%), Positives = 352/537 (65%), Gaps = 75/537 (13%)
Query: 166 PALGRGFYLADSVVED----------VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 215
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVL
Sbjct: 67 PAVRVSFYNADELKAGAQPFPAPPSCVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVL 126
Query: 216 DTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVIL 275
DTDV FA+DI++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVIL
Sbjct: 127 DTDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVIL 186
Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
L L +LR W WR+ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLS
Sbjct: 187 LRLDRLRQAGWEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLS 246
Query: 336 DNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN 395
D+T + E CY+ +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC
Sbjct: 247 DHTLA-ERCYSAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCP 305
Query: 396 LT----QTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQ 451
LQQA LS LNEEDPC++ R+ +LT R H+ FL +E +DVTLVAQ
Sbjct: 306 SQPPPGAEQLQQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQ 363
Query: 452 LSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS 511
LSMDR LQM+E L +HW GP+SL LY++
Sbjct: 364 LSMDR---------------------------------LQMLEALCRHWPGPMSLALYLT 390
Query: 512 DAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
DAEAQQFL + S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDF
Sbjct: 391 DAEAQQFLRFIEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDF 450
Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS--------------- 616
LP + LY YL++SI + + G K LVVPAFET YR +FP S
Sbjct: 451 LPAYSLYDYLRASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYT 509
Query: 617 ---------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 510 FRYHEWPRGHAPTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 566
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 111/170 (65%), Positives = 130/170 (76%), Gaps = 10/170 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
+++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP V
Sbjct: 11 QLLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVPAVR 70
Query: 73 VSFYLADSVVED----------VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA IVLDTDV
Sbjct: 71 VSFYNADELKAGAQPFPAPPSCVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDV 130
Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
FA+DI++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 131 TFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 180
>gi|354469848|ref|XP_003497334.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 1
[Cricetulus griseus]
Length = 690
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 288/527 (54%), Positives = 357/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 182 AELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQTG 241
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 242 WEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 300
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 301 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAEQL 360
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +L+ R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 361 QQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR----- 413
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 414 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 445
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YH+VY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 446 VETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 505
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 506 RASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 564
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY ++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 565 APTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 611
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 122/214 (57%), Positives = 155/214 (72%), Gaps = 5/214 (2%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63 PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V + FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DIA
Sbjct: 123 AVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIA 182
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKH 189
+LWALF+ +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+
Sbjct: 183 ELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLR 238
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
+G + KL + L TL T + D DV A
Sbjct: 239 QTGWEQMWKLIAKQEL-LTLMATSLADQDVFNAV 271
>gi|311247925|ref|XP_003122879.1| PREDICTED: glycosyltransferase-like 1B [Sus scrofa]
Length = 726
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 295/527 (55%), Positives = 357/527 (67%), Gaps = 64/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA++I
Sbjct: 157 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASNI 216
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 217 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 276
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++IA + LLT TSLADQDIFNA+I EHP+LV LPC WNVQLSD+T + E CY
Sbjct: 277 WEQMWKLIATQELLTLPATSLADQDIFNAVIKEHPWLVQPLPCVWNVQLSDHTLA-ERCY 335
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
+E +DLK+IHWNSPKKL+VK KH+E FRNL+LTFLEYDGNLLRRELFGC
Sbjct: 336 SEASDLKVIHWNSPKKLRVKTKHVESFRNLHLTFLEYDGNLLRRELFGCPCPPPPGAEQL 395
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L+ L+EED C++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 396 QRALAQLDEEDACFEFRQQQLTVHRVHITFLPHKPPPPQ-PYDVTLVAQLSMDR------ 448
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 449 ---------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 481
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
S L +R+N+ YHVVY+EG YPIN LRNVAL Q TPYVFL DIDFLP + LY YL+
Sbjct: 482 ETSAVLSARQNVAYHVVYREGPLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYHYLR 541
Query: 583 SSIRSMDMHGHGGKKV-LVVPAFETQRYRTAFPAS------------------------H 617
+SI +++ G GG+KV LVVPAFET YR +FP+S H
Sbjct: 542 ASIEQLEL-GTGGRKVALVVPAFETLHYRFSFPSSKAELLALLDAGSLYTFRYHEWPRGH 600
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 601 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 647
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 116/167 (69%), Positives = 138/167 (82%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
P+ P CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF T
Sbjct: 94 RPLPPKCELLQVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHT 153
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W VP V VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA
Sbjct: 154 WMVPAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFA 213
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
++IA+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 214 SNIAELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 260
>gi|156405409|ref|XP_001640724.1| predicted protein [Nematostella vectensis]
gi|156227860|gb|EDO48661.1| predicted protein [Nematostella vectensis]
Length = 746
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 289/536 (53%), Positives = 359/536 (66%), Gaps = 73/536 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY + + VSW+PNKHYSGV+GL+KL L + LPE L K IVLDTDVIFA+D+
Sbjct: 174 PAVQTSFYHLEKYKDRVSWVPNKHYSGVFGLMKLVLTEALPEVLDKVIVLDTDVIFASDV 233
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LW L +L +++ IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDL KLR ++
Sbjct: 234 AELWKLLDQLTKKKAIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLNKLRRMN 293
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AEK L+T L T+LADQDIFNA+I P+L++ LPC WNVQL DNTRS++ CY
Sbjct: 294 WKDKWRLTAEKELMTMLSTALADQDIFNAVIKNDPHLLHKLPCTWNVQLGDNTRSEQ-CY 352
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----------N 395
TE+ +LK+IHWNSPKKL VKNKH +FFRNLYLTFLEYDGNLLRR LF C N
Sbjct: 353 TEVNELKVIHWNSPKKLLVKNKHGDFFRNLYLTFLEYDGNLLRRALFHCEKEGDKTTPSN 412
Query: 396 LTQTTLQQ---AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQL 452
+ T +Q +L ++ EDPCY+ ++ + + RTHLY+L+Y++ +DVTLVAQL
Sbjct: 413 NSLTASEQEIATKLPGVDGEDPCYEFKQEQHLTQRTHLYYLDYDFTVEYGPHDVTLVAQL 472
Query: 453 SMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD 512
SMDRL QMVE L ++WEGPISL LYMSD
Sbjct: 473 SMDRL---------------------------------QMVEALCQNWEGPISLALYMSD 499
Query: 513 AEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFL 572
+EAQQFL YA S L++R+NIGYH+V+KEG FYP+N LRN+AL TP+VFL DIDFL
Sbjct: 500 SEAQQFLRYALGSPVLKARKNIGYHIVFKEGQFYPVNYLRNIALEHARTPFVFLTDIDFL 559
Query: 573 PMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------- 616
PM L Y+ + + M + VVPAFET Y+ FP +
Sbjct: 560 PMPSLNVYINVRVNNKTMTLLS--QAYVVPAFETLHYKLEFPQTKSQLLSSWDLGTVFIF 617
Query: 617 --------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT++++W ATT Y ++W D+EPYIV +D+ RYDTRFVGFGWNK
Sbjct: 618 RYNEWSKGHAPTDYAKWRTATTLYTVQWEEDYEPYIVVPKDIVRYDTRFVGFGWNK 673
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 109/165 (66%), Positives = 134/165 (81%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
+P CEVI +A+VCAG++++R VTLIKS+LFYR NPLHFH ++D+VA IL TLF TW
Sbjct: 113 AVPQCEVIHIAVVCAGHSASRQAVTLIKSLLFYRHNPLHFHFVSDSVAELILGTLFRTWG 172
Query: 68 VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
VP V+ SFY + + VSW+PNKHYSGV+GL+KL L + LPE L K IVLDTDVIFA+D
Sbjct: 173 VPAVQTSFYHLEKYKDRVSWVPNKHYSGVFGLMKLVLTEALPEVLDKVIVLDTDVIFASD 232
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+A+LW L +L +++ IGLVENQSDWYLGKLWKNHKPWPALGRGF
Sbjct: 233 VAELWKLLDQLTKKKAIGLVENQSDWYLGKLWKNHKPWPALGRGF 277
>gi|432090380|gb|ELK23806.1| Glycosyltransferase-like protein LARGE2, partial [Myotis davidii]
Length = 632
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 294/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSW+PNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 65 PAVRVSFYDANELKPQVSWVPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDI 124
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ Q IGLVENQSDWYLG LW+NHKPWPALGRGFNTGVILL L +LR
Sbjct: 125 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHKPWPALGRGFNTGVILLWLDRLRQAG 184
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I HP LV LPC WNVQLSD+T + E CY
Sbjct: 185 WEQMWKLTATQELLTLPATSLADQDIFNAVIKGHPALVQPLPCVWNVQLSDHTLA-ERCY 243
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ----TTL 401
E D+K IHWNSPKKL+VKNKH+EFFRNLYLTFLE+DGNLLRRELFGC + L
Sbjct: 244 LEAADIKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEHDGNLLRRELFGCPSLRPPEAEQL 303
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ LNEED C++ R+ +LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 304 QQA-LAQLNEEDACFEFRQQQLTVHRVHITFLPHEPPPPR-PHDVTLVAQLSMDR----- 356
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 357 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRF 388
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
SE L +R+++ YHVVY+EG YPIN +RNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 389 VEASEVLSARQDVAYHVVYREGPLYPINQIRNVALAQALTPYVFLSDIDFLPAYSLYDYL 448
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET Y+ +FP S H
Sbjct: 449 RASIEQLEL-GSRRKAALVVPAFETLHYQFSFPTSKAELLSLLDAGSVYTFRYHEWPRGH 507
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 508 APTDYARWREAQAPYRVQWAVDYEPYVVVPRDCPRYDPRFVGFGWNK 554
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 5/216 (2%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
+P +++ VAIVCAG++++R +VTL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW
Sbjct: 4 CLPRPQLLHVAIVCAGHHTSRDVVTLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWM 63
Query: 68 VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
+P V VSFY A+ + VSW+PNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+D
Sbjct: 64 LPAVRVSFYDANELKPQVSWVPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASD 123
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPN 187
IA+LWALF+ Q IGLVENQSDWYLG LW+NHKPWPALGRGF ++ W+
Sbjct: 124 IAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHKPWPALGRGFNTGVILL----WLDR 179
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
+G + KLT + L TL T + D D+ A
Sbjct: 180 LRQAGWEQMWKLTATQEL-LTLPATSLADQDIFNAV 214
>gi|73983379|ref|XP_540758.2| PREDICTED: glycosyltransferase-like 1B [Canis lupus familiaris]
Length = 721
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 293/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + +SWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV FA+DI
Sbjct: 154 PAVRVSFYDAEELKPQISWIPNKHYSGLYGLMKLVLPGVLPPDLARVIVLDTDVTFASDI 213
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 214 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLWLDRLRQAG 273
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 274 WEQMWKLTAGQELLTLPSTSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 332
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
+E DLK+IHWNSPKKL+VKNKH+ +FRNLYLTFLEYDGNLLRRELFGC + L
Sbjct: 333 SEAADLKVIHWNSPKKLRVKNKHVAYFRNLYLTFLEYDGNLLRRELFGCPSPPPSGAKQL 392
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L++L+EED C++ R+ +L R H+ FL + +DVTLVAQLSMDRL
Sbjct: 393 QQA-LAHLDEEDACFEFRQQQLMVHRVHVTFLPH-VPPPGRPHDVTLVAQLSMDRL---- 446
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 447 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 477
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP F LY YL
Sbjct: 478 VEASAVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAFSLYDYL 537
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET YR +FP+S H
Sbjct: 538 RASIEQLQL-GSEQKAALVVPAFETLHYRFSFPSSKAELLALLDSGALYTFRFHEWPRGH 596
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A T++ RW A TPY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 597 AATDYVRWRQARTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 643
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/218 (58%), Positives = 156/218 (71%), Gaps = 5/218 (2%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
P +P CE++ VA+VCAG+NS+R LVTL+KS+LFYRKNPLHFHL+TD A NIL+ LF T
Sbjct: 91 RPPLPKCELLHVAVVCAGHNSSRDLVTLVKSVLFYRKNPLHFHLVTDATARNILEMLFHT 150
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W VP V VSFY A+ + +SWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV FA
Sbjct: 151 WMVPAVRVSFYDAEELKPQISWIPNKHYSGLYGLMKLVLPGVLPPDLARVIVLDTDVTFA 210
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWI 185
+DIA+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGF ++ W+
Sbjct: 211 SDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILL----WL 266
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
+G + KLT + L TL T + D D+ A
Sbjct: 267 DRLRQAGWEQMWKLTAGQEL-LTLPSTSLADQDIFNAV 303
>gi|348558768|ref|XP_003465188.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Cavia
porcellus]
Length = 695
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 297/529 (56%), Positives = 353/529 (66%), Gaps = 69/529 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PALG FY AD + +SWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV A+DI
Sbjct: 126 PALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLPGVLPPGLAQVIVLDTDVTLASDI 185
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR +
Sbjct: 186 AELWALFAYFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQVG 245
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT+ TSLADQDIFNA+I EHP LV LPC WNVQLSD+T + E CY
Sbjct: 246 WEQMWQLTARRELLTQPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 304
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
E DLK+IHWNSPKKL+VK KH EFFRNLYLTFL YDGNLLRRELFGC L
Sbjct: 305 LEAADLKVIHWNSPKKLRVKIKHAEFFRNLYLTFLGYDGNLLRRELFGCPSQSPAGAEQL 364
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQ+ L+ L+EED C++ R+ +LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 365 QQS-LAQLDEEDACFEFRQQQLTVHRVHITFLPHEPPPPRP-HDVTLVAQLSMDR----- 417
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 418 ----------------------------LQMLEALCRHWLGPMSLALYLTDAEAQQFLRF 449
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 450 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 509
Query: 582 KSSIRSMDMHGHGG--KKVLVVPAFETQRYRTAFPAS----------------------- 616
++SI + G GG K LVVPAFET RYR +FP S
Sbjct: 510 RASIEQL---GLGGLRKAALVVPAFETPRYRFSFPKSKAELLALLDAGALYTFRYHEWPQ 566
Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HA T+++RW A PY+++WA D+EPY+V RD P YD RFVGFGWNK
Sbjct: 567 GHAATDYARWREAQAPYRVQWAADYEPYVVVPRDCPHYDPRFVGFGWNK 615
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 113/163 (69%), Positives = 134/163 (82%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P C+++ VAIVCAG NS+R +VTL+KS+LFYRKNPLH HL+TD VA +IL+TLF TW VP
Sbjct: 67 PRCQLLHVAIVCAGLNSSREVVTLVKSMLFYRKNPLHLHLVTDAVAQSILETLFRTWMVP 126
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
+ VSFY AD + +SWIPNKHYSG+YGL+KL LP VLP LA+ IVLDTDV A+DIA
Sbjct: 127 ALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLPGVLPPGLAQVIVLDTDVTLASDIA 186
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 187 ELWALFAYFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 229
>gi|351697353|gb|EHB00272.1| Glycosyltransferase-like protein LARGE2, partial [Heterocephalus
glaber]
Length = 630
Score = 570 bits (1468), Expect = e-159, Method: Compositional matrix adjust.
Identities = 297/528 (56%), Positives = 351/528 (66%), Gaps = 67/528 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PALG FY AD + +SWIPNKHYSG+YGL+KL L VLP LA+ IVLDTDV FA DI
Sbjct: 62 PALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLLSVLPPGLAQVIVLDTDVTFAADI 121
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 122 AELWALFADFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRHEG 181
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT+ TSLADQDIFNA+I EHP LV +LPC WNVQLSD+T + E CY
Sbjct: 182 WEQMWQLTARRELLTQPATSLADQDIFNAVIKEHPGLVQSLPCVWNVQLSDHTLA-ERCY 240
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
E DLK+IHWNSPKKL+VKNK EFFRNLYLTFL YDGNLLRRELFGC T
Sbjct: 241 LEAADLKVIHWNSPKKLQVKNKDAEFFRNLYLTFLGYDGNLLRRELFGCPSQPPTGAEEL 300
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q L+ L+EED C++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 301 QHSLAQLDEEDTCFEFRQQQLTMHRVHITFLPHKPPPPQP-HDVTLVAQLSMDR------ 353
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 354 ---------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFV 386
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP LY YL+
Sbjct: 387 EASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAHSLYDYLR 446
Query: 583 SSIRSMDMHGHGG--KKVLVVPAFETQRYRTAFPAS------------------------ 616
+SI + G GG K LVVPAFET RYR +FP S
Sbjct: 447 ASIEQL---GLGGLRKAALVVPAFETLRYRFSFPRSKAELLDLLDAGALYTFRYHEWPWG 503
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 504 HAPTDYARWRAAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 551
Score = 239 bits (609), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 110/160 (68%), Positives = 130/160 (81%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
+++ VAIVCAG NS+R +VTL+KS+LFYRKNPLH HL+TD VA +IL+ LF TW VP +
Sbjct: 6 QLLHVAIVCAGLNSSREVVTLVKSMLFYRKNPLHLHLVTDAVARSILEMLFRTWMVPALG 65
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
VSFY AD + +SWIPNKHYSG+YGL+KL L VLP LA+ IVLDTDV FA DIA+LW
Sbjct: 66 VSFYDADQLKPHISWIPNKHYSGLYGLMKLVLLSVLPPGLAQVIVLDTDVTFAADIAELW 125
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 126 ALFADFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 165
>gi|344247816|gb|EGW03920.1| Glycosyltransferase-like protein LARGE2 [Cricetulus griseus]
Length = 692
Score = 569 bits (1467), Expect = e-159, Method: Compositional matrix adjust.
Identities = 288/529 (54%), Positives = 357/529 (67%), Gaps = 67/529 (12%)
Query: 166 PALGRGFYLADSVVED--VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
PA+ FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++
Sbjct: 122 PAVVIRFYDAEELKAGPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSS 181
Query: 224 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD 283
DIA+LWALF+ +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 182 DIAELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQ 241
Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDEL 343
W W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E
Sbjct: 242 TGWEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ER 300
Query: 344 CYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQT 399
CY E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC
Sbjct: 301 CYLEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAE 360
Query: 400 TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQM 459
LQQA L+ L+EE+PC++ R+ +L+ R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 361 QLQQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR--- 415
Query: 460 VEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
LQM+E L +HW GP+SL LY++DAEAQQFL
Sbjct: 416 ------------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFL 445
Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
+ S L +R+++ YH+VY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY
Sbjct: 446 HFVETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYD 505
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
YL++SI +++ G K LVVPAFET YR +FP S
Sbjct: 506 YLRASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQ 564
Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT+++RW A PY ++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 565 GHAPTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 613
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 122/216 (56%), Positives = 155/216 (71%), Gaps = 7/216 (3%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63 PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122
Query: 70 QVEVSFYLADSVVED--VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
V + FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++D
Sbjct: 123 AVVIRFYDAEELKAGPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSD 182
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPN 187
IA+LWALF+ +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+
Sbjct: 183 IAELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDR 238
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
+G + KL + L TL T + D DV A
Sbjct: 239 LRQTGWEQMWKLIAKQEL-LTLMATSLADQDVFNAV 273
>gi|329664130|ref|NP_001193121.1| glycosyltransferase-like protein LARGE2 precursor [Bos taurus]
gi|296479677|tpg|DAA21792.1| TPA: glycosyltransferase-like 1B [Bos taurus]
Length = 726
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 292/527 (55%), Positives = 354/527 (67%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA++I
Sbjct: 158 PAVQISFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNI 217
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 218 AELWALFAHFSDKQAIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 277
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP+LV LPC WNVQLSD+T + E CY
Sbjct: 278 WEHMWKLTATRELLTLPATSLADQDIFNAVIKEHPWLVQPLPCMWNVQLSDHTLA-ERCY 336
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
+E DLK+IHWNSPKKL VKNKH+E FRNLYLTFLEYDGNLLRRELFGC L
Sbjct: 337 SEAPDLKVIHWNSPKKLHVKNKHVESFRNLYLTFLEYDGNLLRRELFGCPCPPPPGAEQL 396
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L E+D C++ R+ +LT R H+ FL ++ +DVTLVAQLSM+RL
Sbjct: 397 QQA-LAQLEEQDACFEFRQQQLTVHRVHITFLPHKPPPPQ-PHDVTLVAQLSMERL---- 450
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 451 -----------------------------QMLEALCRHWPGPMSLALYLTDTEAQQFLRF 481
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG+ YPIN LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 482 VETSAVLSARQDVAYHVVYREGSLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 541
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP+S H
Sbjct: 542 RASIEQLELDGR-RKAALVVPAFETLHYRFSFPSSKAELLTLLDAGSLYTFRYHEWPQGH 600
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 601 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 647
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 113/161 (70%), Positives = 136/161 (84%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 101 CELLQVAIVCAGHNSSRDVITLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 160
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++SFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA++IA+L
Sbjct: 161 QISFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNIAEL 220
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 221 WALFAHFSDKQAIGLVENQSDWYLGNLWRNHRPWPALGRGF 261
>gi|148695646|gb|EDL27593.1| glycosyltransferase-like 1B, isoform CRA_a [Mus musculus]
Length = 629
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 61 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 120
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 121 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 180
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 181 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 239
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 240 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 299
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 300 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 352
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 353 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 384
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 385 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 444
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 445 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 503
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 504 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 550
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 104/183 (56%), Positives = 127/183 (69%), Gaps = 5/183 (2%)
Query: 41 RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+
Sbjct: 33 RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLM 92
Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
KL LP +LP +LA+ IVLDTDV F++DI +LWALF +Q +GLVENQSDWYLG LWK
Sbjct: 93 KLVLPSILPPSLARVIVLDTDVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWK 152
Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
NH+PWPALGRGF ++ W+ +G + K+T + L TL T + D D+
Sbjct: 153 NHRPWPALGRGFNTGVILL----WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIF 207
Query: 221 FAT 223
A
Sbjct: 208 NAV 210
>gi|109730755|gb|AAI16716.1| Gyltl1b protein [Mus musculus]
Length = 691
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 123 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 182
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 183 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 242
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 243 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 301
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 302 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 361
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 362 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 414
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 415 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 446
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 447 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 506
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 507 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 565
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 566 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 612
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 128/223 (57%), Positives = 158/223 (70%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPTCEV-IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
SS + + P CEV + VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+
Sbjct: 55 SSPQLLLPPKCEVMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILE 114
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 115 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDT 174
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 175 DVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 233
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + K+T + L TL T + D D+ A
Sbjct: 234 ---WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 272
>gi|262231828|ref|NP_766258.2| glycosyltransferase-like protein LARGE2 isoform 1 precursor [Mus
musculus]
gi|81910390|sp|Q5XPT3.1|LARG2_MOUSE RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
Full=Glycosyltransferase-like 1B
gi|53830728|gb|AAU95213.1| glycosyltransferase-like 1B splice form 1 [Mus musculus]
gi|66794531|gb|AAH96655.1| Gyltl1b protein [Mus musculus]
gi|148695649|gb|EDL27596.1| glycosyltransferase-like 1B, isoform CRA_d [Mus musculus]
Length = 690
Score = 563 bits (1452), Expect = e-158, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 182 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 241
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 242 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 300
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 301 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 360
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 361 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 413
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 414 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 445
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 446 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 505
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 506 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 564
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 565 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 611
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 127/222 (57%), Positives = 158/222 (71%), Gaps = 5/222 (2%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
SS + + P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+T
Sbjct: 55 SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILET 114
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTD
Sbjct: 115 LFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTD 174
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVED 181
V F++DI +LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 175 VTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL-- 232
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + K+T + L TL T + D D+ A
Sbjct: 233 --WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 271
>gi|148695648|gb|EDL27595.1| glycosyltransferase-like 1B, isoform CRA_c [Mus musculus]
Length = 642
Score = 563 bits (1451), Expect = e-158, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 74 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 133
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 134 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 193
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 194 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 252
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 253 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 312
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 313 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 365
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 366 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 397
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 398 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 457
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 458 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 516
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 517 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 563
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/212 (58%), Positives = 153/212 (72%), Gaps = 5/212 (2%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+TLF TW VP V
Sbjct: 17 CQMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILETLFRTWMVPAV 76
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI +L
Sbjct: 77 VVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDIVEL 136
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYS 191
WALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+ +
Sbjct: 137 WALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQQT 192
Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
G + K+T + L TL T + D D+ A
Sbjct: 193 GWEQMWKVTAKREL-LTLMATSLADQDIFNAV 223
>gi|26346524|dbj|BAC36913.1| unnamed protein product [Mus musculus]
Length = 629
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPN HYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 61 PAVVVSFYDAEELKPLVSWIPNNHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 120
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 121 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 180
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 181 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 239
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 240 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 299
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 300 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 352
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 353 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 384
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 385 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 444
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 445 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 503
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 504 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 550
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/183 (56%), Positives = 126/183 (68%), Gaps = 5/183 (2%)
Query: 41 RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ + VSWIPN HYSG+YGL+
Sbjct: 33 RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNNHYSGLYGLM 92
Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
KL LP +LP +LA+ IVLDTDV F++DI +LWALF +Q +GLVENQSDWYLG LWK
Sbjct: 93 KLVLPSILPPSLARVIVLDTDVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWK 152
Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
NH+PWPALGRGF ++ W+ +G + K+T + L TL T + D D+
Sbjct: 153 NHRPWPALGRGFNTGVILL----WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIF 207
Query: 221 FAT 223
A
Sbjct: 208 NAV 210
>gi|40018582|ref|NP_954538.1| glycosyltransferase-like protein LARGE2 [Rattus norvegicus]
gi|81885334|sp|Q6P7A1.1|LARG2_RAT RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
Full=Glycosyltransferase-like 1B
gi|38197418|gb|AAH61762.1| Glycosyltransferase-like 1B [Rattus norvegicus]
gi|149022667|gb|EDL79561.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
gi|149022668|gb|EDL79562.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
gi|149022669|gb|EDL79563.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
Length = 690
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 64/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI
Sbjct: 121 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDI 180
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I
Sbjct: 181 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 240
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 241 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 299
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC L
Sbjct: 300 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGVEQL 359
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 360 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 412
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 444
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 445 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 504
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + K LVVPAFET YR +FP S H
Sbjct: 505 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 564
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A T+++RW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 565 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 611
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 127/214 (59%), Positives = 154/214 (71%), Gaps = 5/214 (2%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD VA NIL+TLF TW VP
Sbjct: 62 PKCEMLHVAIVCAGYNSSREIITLMKSVLFYRKNPLHLHLITDAVARNILETLFRTWMVP 121
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V VSFY A+ + VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI
Sbjct: 122 AVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDIM 181
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKH 189
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+
Sbjct: 182 ELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQ 237
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
G + KLT + L TL T + D D+ A
Sbjct: 238 QIGWEQMWKLTAKREL-LTLTATSLADQDIFNAV 270
>gi|149022670|gb|EDL79564.1| glycosyltransferase-like 1B, isoform CRA_b [Rattus norvegicus]
Length = 621
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 64/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI
Sbjct: 52 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDI 111
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I
Sbjct: 112 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 171
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 172 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 230
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC L
Sbjct: 231 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGVEQL 290
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 291 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 343
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 344 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 375
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 376 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 435
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + K LVVPAFET YR +FP S H
Sbjct: 436 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 495
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A T+++RW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 496 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 542
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/183 (57%), Positives = 126/183 (68%), Gaps = 5/183 (2%)
Query: 41 RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+
Sbjct: 24 RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLM 83
Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
KL LP VLP +LA+ IVLDTDV F++DI +LWALF +Q +GLVENQSDWYLG LWK
Sbjct: 84 KLVLPSVLPLSLARVIVLDTDVTFSSDIMELWALFGHFSDKQVVGLVENQSDWYLGNLWK 143
Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
NH+PWPALGRGF ++ W+ G + KLT + L TL T + D D+
Sbjct: 144 NHRPWPALGRGFNTGVILL----WLDRLQQIGWEQMWKLTAKREL-LTLTATSLADQDIF 198
Query: 221 FAT 223
A
Sbjct: 199 NAV 201
>gi|298683993|gb|ADI96176.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 286/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
TN+SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTNYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 258 bits (658), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298684007|gb|ADI96183.1| LARGE-2 [Rattus losea]
Length = 654
Score = 560 bits (1442), Expect = e-156, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 350/527 (66%), Gaps = 64/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP VLP +L + IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLTRVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I
Sbjct: 169 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 552
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A T+++RW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 553 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 599
Score = 252 bits (643), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 127/217 (58%), Positives = 156/217 (71%), Gaps = 6/217 (2%)
Query: 8 VIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
++PT CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD VA NIL+TLF TW
Sbjct: 47 LLPTKCEMLHVAIVCAGYNSSREIITLMKSVLFYRKNPLHLHLITDAVARNILETLFRTW 106
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP VLP +L + IVLDTDV F++
Sbjct: 107 MVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLTRVIVLDTDVTFSS 166
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIP 186
DI +LWALF +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+
Sbjct: 167 DIMELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLD 222
Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
G + KLT + L TL T + D D+ A
Sbjct: 223 RLQQIGWEQMWKLTAKREL-LTLTATSLADQDIFNAV 258
>gi|298684001|gb|ADI96180.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPXLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683999|gb|ADI96179.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 258 bits (658), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683995|gb|ADI96177.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 258 bits (658), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683997|gb|ADI96178.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 246 bits (628), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 152/212 (71%), Gaps = 5/212 (2%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAI A YNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+TLF TW VP V
Sbjct: 52 CEMLHVAIEWAEYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILETLFRTWMVPAV 111
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI +L
Sbjct: 112 VVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIVEL 171
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYS 191
WALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++ W+ +
Sbjct: 172 WALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQQT 227
Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
G + KLT + L TL T + D D+ A
Sbjct: 228 GWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298684003|gb|ADI96181.1| LARGE-2 [Mastomys erythroleucus]
Length = 653
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683979|gb|ADI96169.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
TN+SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTNYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683991|gb|ADI96175.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 556 bits (1434), Expect = e-155, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 128/223 (57%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|431915739|gb|ELK16072.1| Glycosyltransferase-like protein LARGE2 [Pteropus alecto]
Length = 673
Score = 556 bits (1432), Expect = e-155, Method: Compositional matrix adjust.
Identities = 284/510 (55%), Positives = 343/510 (67%), Gaps = 63/510 (12%)
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DIA+LWALF+ Q I
Sbjct: 120 VSWIPNKHYSGLYGLMKLVLPGALPPALARVIVLDTDVTFASDIAELWALFAYFSDEQVI 179
Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
GLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR W W++ A + LLT
Sbjct: 180 GLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAGWEQMWKLTATQELLTL 239
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
TSLADQDIFNAII HP LV LPC WNVQLSD+T + E CY+E DLK+IHWNSPKK
Sbjct: 240 PATSLADQDIFNAIIKRHPGLVLPLPCIWNVQLSDHTLA-ERCYSEAADLKVIHWNSPKK 298
Query: 362 LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC---NLTQTTLQQAELSNLNEEDPCYDL 418
L+VKNKH+EFFRNL+LTFLEY+GNLLRRELFGC + Q L++L+EED C +
Sbjct: 299 LRVKNKHVEFFRNLHLTFLEYNGNLLRRELFGCPSPPPPEAEQLQRALAHLSEEDTCLEF 358
Query: 419 RRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGND 478
R+ +LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 359 RQQQLTVHRVHITFLPHEPPPPRP-HDVTLVAQLSMDR---------------------- 395
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
LQM+E L +HW GP+SL LY++DAEAQQFL + S L +R+++ YHV
Sbjct: 396 -----------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVEASAVLSARQDVAYHV 444
Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
VY+EG+ YPIN LRNVA+ Q TPYVFL DID LP + LY YL++SI +++ G G K
Sbjct: 445 VYREGSLYPINQLRNVAMAQALTPYVFLSDIDLLPSYSLYDYLRASIEQLEL-GSGRKAA 503
Query: 599 LVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQI 634
LVVPAFET Y +FP S +APT+++RW A PY++
Sbjct: 504 LVVPAFETLHYHFSFPNSKAELLALLDAGSLYTFRSHEWPQGYAPTDYARWREAQAPYRV 563
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 564 QWAADYEPYVVVPRDCPRYDPRFVGFGWNK 593
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/166 (53%), Positives = 109/166 (65%), Gaps = 33/166 (19%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P+ P CE++ VA+VCAG+N++R ++TL+KS+LF+R W
Sbjct: 75 PLPPKCELLHVAMVCAGHNTSRDVITLVKSMLFFR-----------------------AW 111
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
V + E VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+
Sbjct: 112 GVGRGE----------PQVSWIPNKHYSGLYGLMKLVLPGALPPALARVIVLDTDVTFAS 161
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DIA+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 162 DIAELWALFAYFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 207
>gi|298683983|gb|ADI96171.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 284/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRXLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T +SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTXYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683981|gb|ADI96170.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRXLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683989|gb|ADI96174.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLV NQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVXNQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 161/223 (72%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLV NQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVXNQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|345326808|ref|XP_001510588.2| PREDICTED: glycosyltransferase-like protein LARGE1 [Ornithorhynchus
anatinus]
Length = 502
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 271/455 (59%), Positives = 324/455 (71%), Gaps = 65/455 (14%)
Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKF 297
+Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W WR+ AE+
Sbjct: 10 QQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERE 69
Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 70 LMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 128
Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D
Sbjct: 129 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 187
Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDRLQM+E
Sbjct: 188 LCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA----------- 236
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
+ KHWEGPISL LY+SDAEAQQFL YA SE L SRRN
Sbjct: 237 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRRN 274
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
+GYH+VYKEG FYP+N LRNVA+ + TPY+FL DIDFLPM+GLY YL+ S+ +D+
Sbjct: 275 VGYHIVYKEGQFYPVNLLRNVAMKHIGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 332
Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
KK L+VPAFET RYR +FP S HAPTNF++W AT
Sbjct: 333 NTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTAT 392
Query: 630 TPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
TPY++EW DFEPY+V R+ P YD RFVGFGWNK
Sbjct: 393 TPYRVEWEADFEPYVVVRRNCPEYDRRFVGFGWNK 427
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/32 (81%), Positives = 30/32 (93%)
Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 10 QQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 41
>gi|298683985|gb|ADI96172.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 348/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R Q +GLVE QSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDXQVVGLVEXQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 127/223 (56%), Positives = 160/223 (71%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R Q +GLVE QSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDXQVVGLVEXQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|298683973|gb|ADI96166.1| LARGE-2 [Mastomys kollmannspergeri]
Length = 653
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 348/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDXQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ ++EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQVDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYXEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 127/220 (57%), Positives = 159/220 (72%), Gaps = 6/220 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F +DI +LWALF R Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFFSDIVELWALFDRFSDXQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
W+ +G + KLT + L TL T + D D+
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255
>gi|298684005|gb|ADI96182.1| LARGE-2 [Mus musculus]
Length = 653
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 281/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWI NKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++ I
Sbjct: 109 PAVVVXFYDAEELKPLVSWIXNKHYSGLYGLMKLVLPSILPXSLARVIVLDTDVTFSSXI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +Q + LVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDHFSDKQVVVLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 229 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 348 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY+++W+ D+EPY+V RD P YD RFVGFGWNK
Sbjct: 552 SSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPXYDPRFVGFGWNK 598
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 123/222 (55%), Positives = 154/222 (69%), Gaps = 5/222 (2%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
SS + + P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+T
Sbjct: 42 SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILET 101
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LF TW VP V V FY A+ + VSWI NKHYSG+YGL+KL LP +LP +LA+ IVLDTD
Sbjct: 102 LFRTWMVPAVVVXFYDAEELKPLVSWIXNKHYSGLYGLMKLVLPSILPXSLARVIVLDTD 161
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVED 181
V F++ I +LWALF +Q + LVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 162 VTFSSXIVELWALFDHFSDKQVVVLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL-- 219
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + K+T + L TL T + D D+ A
Sbjct: 220 --WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 258
>gi|298683987|gb|ADI96173.1| LARGE-2 [Mastomys natalensis]
Length = 653
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 281/527 (53%), Positives = 347/527 (65%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLV Q DWYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVXXQXDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ L T TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 252 bits (643), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 125/223 (56%), Positives = 159/223 (71%), Gaps = 6/223 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F++DI +LWALF R +Q +GLV Q DWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVXXQXDWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
W+ +G + KLT + L TL T + D D+ A
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258
>gi|281342977|gb|EFB18561.1| hypothetical protein PANDA_010467 [Ailuropoda melanoleuca]
Length = 496
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 270/455 (59%), Positives = 320/455 (70%), Gaps = 65/455 (14%)
Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKF 297
+Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W WR+ AE+
Sbjct: 4 QQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERE 63
Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 64 LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 122
Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D
Sbjct: 123 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 181
Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDRLQM+E
Sbjct: 182 LCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA----------- 230
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
+ KHWEGPISL LY+SDAEAQQFL YA SE L SR N
Sbjct: 231 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 268
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL+ S+ +D+
Sbjct: 269 VAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 326
Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
KK ++VPAFET RYR +FP S HAPTNF++W AT
Sbjct: 327 NTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTAT 386
Query: 630 TPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
TPY +EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 387 TPYHVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 421
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/32 (81%), Positives = 30/32 (93%)
Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 4 QQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 35
>gi|298683977|gb|ADI96168.1| LARGE-2 [Mastomys kollmannspergeri]
Length = 653
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 282/527 (53%), Positives = 345/527 (65%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF TGVILL L +L+
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFXTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +L R H+ FL DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLXVHRVHITFLP-HQPPPPRPYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 126/220 (57%), Positives = 159/220 (72%), Gaps = 6/220 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ E++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKYEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F +DI +LWALF R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFFSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFXTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
W+ +G + KLT + L TL T + D D+
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255
>gi|335288967|ref|XP_003126812.2| PREDICTED: glycosyltransferase-like protein LARGE1-like [Sus
scrofa]
Length = 618
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 270/458 (58%), Positives = 321/458 (70%), Gaps = 65/458 (14%)
Query: 235 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIA 294
L +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W WR+ A
Sbjct: 123 LTGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTA 182
Query: 295 EKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII 354
E+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+I
Sbjct: 183 ERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVI 241
Query: 355 HWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLN 410
HWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+
Sbjct: 242 HWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELD 300
Query: 411 EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEY 470
E+D CY+ RR + T RTHLYFL YEYE +SD DVTLVAQLSMDRLQM+E
Sbjct: 301 EDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA-------- 352
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
+ KHWEGPISL LY+SDAEAQQFL YA SE L
Sbjct: 353 -------------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMG 387
Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
R N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL+ S+ +D+
Sbjct: 388 RHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL 447
Query: 591 HGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWV 626
KK ++VPAFET RYR +FP S HAPTNF++W
Sbjct: 448 --ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWR 505
Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 506 TATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 543
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/35 (77%), Positives = 31/35 (88%)
Query: 138 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
L +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 123 LTGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 157
>gi|168278675|dbj|BAG11217.1| glycosyltransferase-like protein LARGE1 [synthetic construct]
gi|168985116|emb|CAQ09434.1| like-glycosyltransferase [Homo sapiens]
Length = 704
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 273/523 (52%), Positives = 332/523 (63%), Gaps = 109/523 (20%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+++DLK+ Q +
Sbjct: 371 RDVSDLKL-------------------------------------------------QKQ 381
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
LS L+E+D CY+ RR + T RTHLYFL YEYE ++
Sbjct: 382 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA------------------------ 417
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA S
Sbjct: 418 ---------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 468
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 469 EVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 528
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+D+ KK ++VPAFET RYR +FP S HAPTN
Sbjct: 529 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 586
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
F++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 587 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 629
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|58257733|dbj|BAA25535.3| KIAA0609 protein [Homo sapiens]
Length = 769
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 273/523 (52%), Positives = 332/523 (63%), Gaps = 109/523 (20%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 257 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 316
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 317 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 376
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 377 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 435
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+++DLK+ Q +
Sbjct: 436 RDVSDLKL-------------------------------------------------QKQ 446
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
LS L+E+D CY+ RR + T RTHLYFL YEYE ++
Sbjct: 447 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA------------------------ 482
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA S
Sbjct: 483 ---------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 533
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 534 EVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 593
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+D+ KK ++VPAFET RYR +FP S HAPTN
Sbjct: 594 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 651
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
F++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 652 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 694
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 193 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 252
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 253 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 312
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 313 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 360
>gi|298683975|gb|ADI96167.1| LARGE-2 [Mastomys kollmannspergeri]
Length = 653
Score = 543 bits (1399), Expect = e-151, Method: Compositional matrix adjust.
Identities = 280/527 (53%), Positives = 344/527 (65%), Gaps = 65/527 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF R Q +GL E QS WYLG LWKNH+PWPALGRGFNTGVILL L +L+
Sbjct: 169 VELWALFDRFSDXQXVGLXEXQSXWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +L R H+ FL ++ DVTLVAQLSMDR
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLXVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T++SRW A PY+++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598
Score = 246 bits (628), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 156/220 (70%), Gaps = 6/220 (2%)
Query: 2 SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S + ++P+ CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41 ASSSQLLLPSKCEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
TLF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
DV F +DI +LWALF R Q +GL E QS WYLG LWKNH+PWPALGRGF ++
Sbjct: 161 DVTFFSDIVELWALFDRFSDXQXVGLXEXQSXWYLGNLWKNHRPWPALGRGFNTGVILL- 219
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
W+ +G + KLT + L TL T + D D+
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255
>gi|301627879|ref|XP_002943094.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
[Xenopus (Silurana) tropicalis]
Length = 540
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 270/523 (51%), Positives = 333/523 (63%), Gaps = 109/523 (20%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP +L + IVLDTD+ FATDI
Sbjct: 28 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPASLERVIVLDTDITFATDI 87
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + +++Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 88 AELWAVFHKFKRQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 147
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 148 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRSEQ-CY 206
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+++DLK+ Q +
Sbjct: 207 RDVSDLKL-------------------------------------------------QKQ 217
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
LS L+E+D CY+ RR + T RTHLYFL YEY+ + D
Sbjct: 218 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYKPAGDDT--------------------- 256
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA S
Sbjct: 257 ------------DVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 304
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L +RRN+GYH+VYKEG FYP+N LRNVA+ VSTPY+FL DIDFLPM+GLY L+ ++
Sbjct: 305 EVLLNRRNVGYHIVYKEGQFYPVNLLRNVAMKHVSTPYMFLSDIDFLPMYGLYESLRKAV 364
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+DM KK L++PAFET RYR +FP S H PTN
Sbjct: 365 VQLDM--KITKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHGPTN 422
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
F++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 423 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 465
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 89/131 (67%), Positives = 107/131 (81%)
Query: 42 KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
+NPLHFHLI DT+A IL TLF TW VP V V FY AD + +VSWIPNKHYSG+YGL+K
Sbjct: 1 RNPLHFHLIADTIAKQILATLFQTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMK 60
Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
L L K LP +L + IVLDTD+ FATDIA+LWA+F + +++Q +GLVENQSDWYLG LWKN
Sbjct: 61 LVLTKTLPASLERVIVLDTDITFATDIAELWAVFHKFKRQQVLGLVENQSDWYLGNLWKN 120
Query: 162 HKPWPALGRGF 172
H+PWPALGRG+
Sbjct: 121 HRPWPALGRGY 131
>gi|344280770|ref|XP_003412155.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2-like [Loxodonta africana]
Length = 746
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 284/532 (53%), Positives = 345/532 (64%), Gaps = 68/532 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY VSWIPN HYSG+Y LLKL LP LP LA+ IVLDTDV FA+DI
Sbjct: 149 PAVRVSFYNTSKFQSQVSWIPNTHYSGLYELLKLVLPDALPPDLARVIVLDTDVTFASDI 208
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA F+ +Q IGLVENQS+WYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 209 AELWAFFAHFSDKQMIGLVENQSNWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQAG 268
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS----- 340
W W++ A + LLT TSLADQDIFNA+I EHP LV LPC WNVQLSD+T +
Sbjct: 269 WEQMWKVTARRLLLTLSSTSLADQDIFNAVIKEHPGLVLLLPCLWNVQLSDHTLALSDHT 328
Query: 341 -DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT-- 397
E CY+E +DLK+IHWNSPKKL+VKNK +EFFRNL+LTFLEYDGNLLRRELFGC
Sbjct: 329 LAERCYSEASDLKVIHWNSPKKLRVKNKDVEFFRNLHLTFLEYDGNLLRRELFGCPSQPP 388
Query: 398 -QTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
Q +Q L++L+EED C++ ++ +LT R HL FL + +DVTLVAQLS+DR
Sbjct: 389 PQAEQRQRALAHLDEEDACFEFQQQRLTVHRVHLTFLPLDLLPPE-LHDVTLVAQLSVDR 447
Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
L QM+E L HW GP+SL LY++DAEAQ
Sbjct: 448 L---------------------------------QMLEALCSHWPGPMSLALYLTDAEAQ 474
Query: 517 QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
QFL + S+ L +R N+ YHVVY+EG YP+N LRNVAL Q T +VFL DIDFLP +
Sbjct: 475 QFLRFVEASKVLSARSNVAYHVVYREGPLYPVNHLRNVALAQARTSHVFLSDIDFLPAYS 534
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------------- 616
LY YL++SI +D+ K LVVPAFE+ YR +FP S
Sbjct: 535 LYNYLRASIEQLDLSSQ-RKTALVVPAFESLHYRLSFPTSKAELLALLDAGSLHTFRYDV 593
Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
H PT+++RW A PYQ++WA D+EPY+V R+ PRYD+RFVGFGWNK
Sbjct: 594 WPQGHTPTDYARWREAQAPYQVQWAADYEPYVVVPRECPRYDSRFVGFGWNK 645
Score = 248 bits (634), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 113/165 (68%), Positives = 131/165 (79%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
++P CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW
Sbjct: 88 LLPKCELLHVAIVCAGYNSSRDVVTLVKSMLFYRKNPLHLHLVTDEVARNILETLFHTWM 147
Query: 68 VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
VP V VSFY VSWIPN HYSG+Y LLKL LP LP LA+ IVLDTDV FA+D
Sbjct: 148 VPAVRVSFYNTSKFQSQVSWIPNTHYSGLYELLKLVLPDALPPDLARVIVLDTDVTFASD 207
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
IA+LWA F+ +Q IGLVENQS+WYLG LWKNH+PWPALGRGF
Sbjct: 208 IAELWAFFAHFSDKQMIGLVENQSNWYLGNLWKNHRPWPALGRGF 252
>gi|449666871|ref|XP_002154954.2| PREDICTED: glycosyltransferase-like protein LARGE2-like [Hydra
magnipapillata]
Length = 727
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 270/535 (50%), Positives = 343/535 (64%), Gaps = 77/535 (14%)
Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
K W +G FY A+ + DV WIPN HYSGVYGL+KLTL + LPE L+K IVLDTDV
Sbjct: 171 KTWVLKQVGVSFYDAEKLKADVDWIPNTHYSGVYGLMKLTLTRALPEFLSKVIVLDTDVF 230
Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
F TD+A+LWA F+ + Q IGLVENQS WY GKLWK +K WPA+GRGFNTGV+L DL K
Sbjct: 231 FLTDLAELWAFFNNFTEDQAIGLVENQSQWYTGKLWKKYKIWPAIGRGFNTGVMLFDLQK 290
Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
LR WA WR+ AEK LL L T LADQD+ NA + ++P +VY LPCQWN+QLSDNT S
Sbjct: 291 LRKFQWAHLWRLTAEKQLLNLLSTVLADQDVINAALKDNPQIVYKLPCQWNIQLSDNTES 350
Query: 341 DELCYTELTDLKIIHWNSPKK-LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----- 394
E CY +L +LK IHWNSP K K KH+E+FRN+YLTFLEY+GNLLR+ +F C
Sbjct: 351 -EYCYNKLIELKAIHWNSPNKHTGNKLKHVEYFRNMYLTFLEYNGNLLRKSIFSCLIPGE 409
Query: 395 NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSM 454
N T+T+ ++ NL++ D CYD++ + T+ R HLY+++Y+Y+
Sbjct: 410 NATETSKEK----NLSDNDQCYDIKIEQSTTRRVHLYYMDYDYKP--------------- 450
Query: 455 DRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE 514
+ DVTL+ LSMDR+Q++E+L +HWEGP+S++LY SDAE
Sbjct: 451 --------------------EVGDVTLMTHLSMDRIQILELLSQHWEGPMSISLYASDAE 490
Query: 515 AQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
AQQF Y+ S LR R+NIG H+VYK+G+ YP+N LRNVAL V T YVFL DIDFLPM
Sbjct: 491 AQQFTRYSSVSSFLRKRKNIGLHIVYKDGDLYPVNYLRNVALEHVQTTYVFLSDIDFLPM 550
Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------ 616
GLY YL+ +M KK +VVPAFET +Y+ +P S
Sbjct: 551 IGLYNYLREIANITNME----KKAVVVPAFETYQYKLDYPQSKKQLIEYWGSQKVDTFRG 606
Query: 617 ------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
HA T+F RW ++T Y++ W DFEPYI+ + +P YD RFVGFGWNK
Sbjct: 607 EVWQQGHAATDFQRWKTSSTDYEVRWEVDFEPYILVKKSSIPLYDMRFVGFGWNK 661
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 116/215 (53%), Positives = 151/215 (70%), Gaps = 5/215 (2%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
+P C+VI +AI+CAGY T+ +VTLIKSILFYR++PLHFH I+D ++LQ LF TW +
Sbjct: 116 LPDCQVIHIAIICAGYKETQRVVTLIKSILFYRRHPLHFHFISDISGRHVLQVLFKTWVL 175
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
QV VSFY A+ + DV WIPN HYSGVYGL+KLTL + LPE L+K IVLDTDV F TD+
Sbjct: 176 KQVGVSFYDAEKLKADVDWIPNTHYSGVYGLMKLTLTRALPEFLSKVIVLDTDVFFLTDL 235
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNK 188
A+LWA F+ + Q IGLVENQS WY GKLWK +K WPA+GRGF ++ D+ +
Sbjct: 236 AELWAFFNNFTEDQAIGLVENQSQWYTGKLWKKYKIWPAIGRGFNTG-VMLFDLQKLRKF 294
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
++ ++ +LT K L L+ T++ D DVI A
Sbjct: 295 QWAHLW---RLTAEKQLLNLLS-TVLADQDVINAA 325
>gi|426246004|ref|XP_004016789.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2, partial [Ovis aries]
Length = 646
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 280/513 (54%), Positives = 345/513 (67%), Gaps = 57/513 (11%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA++I
Sbjct: 130 PAVQISFYDADELKPLVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNI 189
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR
Sbjct: 190 AELWALFAHFSDKQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 249
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ +T +++ QDIFNAII EHP+LV LPC WNVQLSD+T + E CY
Sbjct: 250 WEQMWKLTAKRGGVT---ATVSAQDIFNAIIKEHPWLVQPLPCMWNVQLSDHTLA-ERCY 305
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
+E +DL++IHWNSPKKL VKNKH+E FRNLYLTFLEY+GNLLRRELFGC L
Sbjct: 306 SEASDLQVIHWNSPKKLHVKNKHVESFRNLYLTFLEYNGNLLRRELFGCPCPPPPGAEQL 365
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L E+D C + R+ +L R H+ FL ++ +DVTLVAQLSM+RL
Sbjct: 366 QQA-LAQLEEQDACLEFRQQQLAVHRVHITFLPHKPPPPQP-HDVTLVAQLSMERL---- 419
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 420 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 450
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG+ YPIN LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 451 VETSAVLSARQDVAYHVVYREGSLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYSYL 510
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPT----------NFSRWVNATTP 631
++SI +++ G K LVVPAFET YR +FP+S A F RW
Sbjct: 511 RASIEQLELGGK-RKAALVVPAFETLHYRFSFPSSKAELLTLLDAGSLYTF-RWXRGH-- 566
Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 567 WRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 599
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 136/161 (84%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 73 CELLQVAIVCAGHNSSRDVITLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 132
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++SFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA++IA+L
Sbjct: 133 QISFYDADELKPLVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNIAEL 192
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 193 WALFAHFSDKQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 233
>gi|196015485|ref|XP_002117599.1| hypothetical protein TRIADDRAFT_32796 [Trichoplax adhaerens]
gi|190579768|gb|EDV19857.1| hypothetical protein TRIADDRAFT_32796 [Trichoplax adhaerens]
Length = 618
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 256/528 (48%), Positives = 338/528 (64%), Gaps = 69/528 (13%)
Query: 171 GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
FY S+ ++++WIPN+HYSG+YGL+KL L +LP +L + IVLDTD+ F DIA LW
Sbjct: 60 SFYNLQSIKKEIAWIPNRHYSGIYGLMKLILITILPNSLKRVIVLDTDLTFNADIADLWY 119
Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
F L + + +GLVENQSDWYLGKLWKNH PWPALGRGFNTGV+L++L+KLR +W W
Sbjct: 120 YFRYLTKDKILGLVENQSDWYLGKLWKNHTPWPALGRGFNTGVMLINLSKLRQNNWNYIW 179
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD 350
+ + EK L T +T+LADQDI NA+I +P LVY LPC WN+QLSDNTRS + CY + ++
Sbjct: 180 KEVTEKQLKTLTYTTLADQDIINAVIKSYPKLVYRLPCSWNIQLSDNTRS-QTCYKKSSE 238
Query: 351 LKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC---NLTQTTLQQAEL- 406
+KIIHWNSPKK+ V+NKH+++FRN+Y TF+E DGNLLRREL GC NL T+++ +
Sbjct: 239 IKIIHWNSPKKINVRNKHIQYFRNIYQTFIELDGNLLRRELIGCEKSNLKPTSVRTRNMH 298
Query: 407 ------SNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMV 460
SN N DPCY R S R H Y L+Y+Y+
Sbjct: 299 VQLKIVSNDNLCDPCYQFRVEAYISHRIHPYLLDYKYQ---------------------- 336
Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
PN D DVTLV LS+DRLQM++ L KHW GP+S+ LY++D + S
Sbjct: 337 ---PN---------DNYDVTLVTHLSIDRLQMLDGLAKHWTGPMSIALYVTDRDVPSIRS 384
Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
+ +S L +R+NIGYH+VY +G+ YP+N LRN+AL+ ++T YVFLLDIDFLP Y Y
Sbjct: 385 FIQHSNNLSNRKNIGYHIVYIQGDCYPVNKLRNIALDYITTDYVFLLDIDFLPSHNSYQY 444
Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------ 616
L+ + + + K L+VPAFETQRYR FP S
Sbjct: 445 LRYYFDFLSLQHNFNKAALIVPAFETQRYRVNFPESKERLIALLNKQELFTFRYHLYTKA 504
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
H+PTN+++W A YQ++W +FEPYIV +++ RYD RF+GFGWNK
Sbjct: 505 HSPTNYTKWQTANETYQVQWQENFEPYIVVTKNVARYDERFIGFGWNK 552
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 94/158 (59%), Positives = 119/158 (75%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVS 74
I VAIV GY+++R ++TLIKS+LFYR +PLHFH I D A NIL F TW + V++S
Sbjct: 1 IHVAIVVGGYDASRDVITLIKSVLFYRNHPLHFHFIADPSAQNILSKHFFTWQLHHVDIS 60
Query: 75 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
FY S+ ++++WIPN+HYSG+YGL+KL L +LP +L + IVLDTD+ F DIA LW
Sbjct: 61 FYNLQSIKKEIAWIPNRHYSGIYGLMKLILITILPNSLKRVIVLDTDLTFNADIADLWYY 120
Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
F L + + +GLVENQSDWYLGKLWKNH PWPALGRGF
Sbjct: 121 FRYLTKDKILGLVENQSDWYLGKLWKNHTPWPALGRGF 158
>gi|441618397|ref|XP_003264907.2| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE1-like [Nomascus leucogenys]
Length = 1173
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 265/527 (50%), Positives = 324/527 (61%), Gaps = 94/527 (17%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 638 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 697
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG L+
Sbjct: 698 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGQPFSSLSLS---- 753
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
LL +L +D ++++H L T + D+T
Sbjct: 754 ----------FSLLRKLTFPQSDPHPSTFLLTQH--LAST-----GIIRKDDT------- 789
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
+ L +IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ L
Sbjct: 790 --IHSLDVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 847
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++
Sbjct: 848 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA-------------------- 886
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 887 -------------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRY 933
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
A SE L SR N+GYH+VYKEG FYP+N LRNVA+ +S PY+ F GL YL
Sbjct: 934 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISIPYMLTCATHFXATKGLXEYL 993
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
+ S+ +D+ KK ++VPAFET RYR +FP S H
Sbjct: 994 RKSVIQLDLANT--KKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 1051
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 1052 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 1098
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 574 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 633
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 634 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 693
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 694 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 741
>gi|56403760|emb|CAI29669.1| hypothetical protein [Pongo abelii]
Length = 463
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 244/425 (57%), Positives = 293/425 (68%), Gaps = 65/425 (15%)
Query: 268 GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLP 327
G+NTGVILL L KLR + W WR+ AE+ L+ L TSLADQDIFNA I ++P+LVY LP
Sbjct: 1 GYNTGVILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLADQDIFNAAIKQNPFLVYQLP 60
Query: 328 CQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLL 387
C WNVQLSD+TRS++ CY +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLL
Sbjct: 61 CFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLL 119
Query: 388 RRELFGC----NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG 443
RRELFGC ++ LQ+ +LS L+E+D CY+ RR + T RTHLYFL YEYE ++
Sbjct: 120 RRELFGCPSEADVNSENLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA-- 176
Query: 444 NDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
D DVTLVAQLSMDRLQM+E + KHWEGP
Sbjct: 177 -------------------------------DSTDVTLVAQLSMDRLQMLEAICKHWEGP 205
Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
ISL LY+SDAEAQQFL YA SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY
Sbjct: 206 ISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPY 265
Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------- 616
+FL D DFLPM+GLY YL+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 266 MFLSDTDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSM 323
Query: 617 -----------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVG
Sbjct: 324 LDMGTLFTFRYHVWTKDHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVG 383
Query: 660 FGWNK 664
FGWNK
Sbjct: 384 FGWNK 388
>gi|354469850|ref|XP_003497335.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 2
[Cricetulus griseus]
Length = 655
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 259/527 (49%), Positives = 326/527 (61%), Gaps = 100/527 (18%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ +GVILL L +LR
Sbjct: 182 AELWALFAHF-----------------------------------SGVILLWLDRLRQTG 206
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 207 WEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 265
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 266 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAEQL 325
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +L+ R H+ FL ++ +DVTLVAQLSMDR
Sbjct: 326 QQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR----- 378
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 379 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 410
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YH+VY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 411 VETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 470
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ G K LVVPAFET YR +FP S H
Sbjct: 471 RASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 529
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY ++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 530 APTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 576
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 83/127 (65%), Positives = 106/127 (83%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63 PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V + FY A+ + +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DIA
Sbjct: 123 AVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIA 182
Query: 130 QLWALFS 136
+LWALF+
Sbjct: 183 ELWALFA 189
>gi|431905242|gb|ELK10287.1| Glycosyltransferase-like protein LARGE1, partial [Pteropus alecto]
Length = 906
Score = 483 bits (1244), Expect = e-133, Method: Compositional matrix adjust.
Identities = 250/458 (54%), Positives = 300/458 (65%), Gaps = 87/458 (18%)
Query: 236 RQR-QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIA 294
RQR QT+G E +SD+ VILL L KLR + W WR+ A
Sbjct: 433 RQRTQTLGNAEAESDFR---------------------VILLLLDKLRKMKWEQMWRLTA 471
Query: 295 EKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII 354
E+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+I
Sbjct: 472 ERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVI 530
Query: 355 HWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLN 410
HWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+
Sbjct: 531 HWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELD 589
Query: 411 EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEY 470
E+D CY+ RR + T RTHLYFL YEYE +
Sbjct: 590 EDDLCYEFRRERFTVHRTHLYFLHYEYEPA------------------------------ 619
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
SD DVTLVAQLSMDRLQM+E + KHWEGP+SL LY+SDAEAQQFL YA SE L S
Sbjct: 620 ---SDNTDVTLVAQLSMDRLQMLEAICKHWEGPVSLALYLSDAEAQQFLRYAQGSEVLMS 676
Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
R N+ YHVVYKEG FYP+N LRNVA+ V TPY+FL DIDFLPM+GLY YL+ S+ +D+
Sbjct: 677 RHNVAYHVVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL 736
Query: 591 HGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWV 626
KK ++VPAFET RYR +FP S HAPTNF++W
Sbjct: 737 --ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWR 794
Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 795 TATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 832
Score = 261 bits (667), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 125/222 (56%), Positives = 155/222 (69%), Gaps = 11/222 (4%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 93 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSW 184
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+ + D S
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGTAHPRDPSP 272
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 226
+P GL LP P T + VL DVI + A
Sbjct: 273 LP--------GLFFHQLPPAQPLT---SSVLSLDVISSRHCA 303
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 72/108 (66%), Positives = 87/108 (80%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGV 273
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGT 264
>gi|262231830|ref|NP_001160105.1| glycosyltransferase-like protein LARGE2 isoform 2 precursor [Mus
musculus]
gi|53830730|gb|AAU95214.1| glycosyltransferase-like 1B splice form 2 [Mus musculus]
Length = 655
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 256/527 (48%), Positives = 321/527 (60%), Gaps = 100/527 (18%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWALF +GVILL L +L+
Sbjct: 182 VELWALFDHF-----------------------------------SGVILLWLDRLQQTG 206
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 207 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 265
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN----LTQTTL 401
E DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC L
Sbjct: 266 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 325
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EE+PC++ R+ +LT R H+ FL ++
Sbjct: 326 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-------------------- 364
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
+DVTLVAQLSMDRLQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 365 --------------PHDVTLVAQLSMDRLQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 410
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L R+++ YHVVY++G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 411 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 470
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI +++ K LVVPAFET YR +FP S H
Sbjct: 471 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 529
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ T++SRW A PY ++W+ D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 530 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 576
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 87/126 (69%), Positives = 104/126 (82%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+TLF TW VP
Sbjct: 63 PKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILETLFRTWMVP 122
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V VSFY A+ + VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 123 AVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDIV 182
Query: 130 QLWALF 135
+LWALF
Sbjct: 183 ELWALF 188
>gi|444511887|gb|ELV09961.1| Glycosyltransferase-like protein LARGE1 [Tupaia chinensis]
Length = 436
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 234/409 (57%), Positives = 283/409 (69%), Gaps = 65/409 (15%)
Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDEL 343
+ W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++
Sbjct: 1 MKWEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ- 59
Query: 344 CYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQT 399
CY +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++
Sbjct: 60 CYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSE 119
Query: 400 TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQM 459
LQ+ +LS L+E+D CY+ RR +LT RTHLYFL YEYE ++
Sbjct: 120 NLQK-QLSELDEDDLCYEFRRERLTVHRTHLYFLHYEYEPAT------------------ 160
Query: 460 VEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL
Sbjct: 161 ---------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFL 205
Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
YA SE L SR N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY
Sbjct: 206 RYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYE 265
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
YL+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 266 YLRKSVVQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK 323
Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 324 GHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 372
>gi|322794621|gb|EFZ17629.1| hypothetical protein SINV_09453 [Solenopsis invicta]
Length = 664
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 249/523 (47%), Positives = 313/523 (59%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY A+ V VSWIPNKHYSGVYGLLKL LP + E + +VLDTDV TD+
Sbjct: 108 PHVNVTFYEAEVWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--RVLVLDTDVTILTDM 165
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW +F QT+GLVENQS+WY+ L +PWPALGRGFNTGV+L+ L +LRD
Sbjct: 166 YPLWKMFQMFSASQTLGLVENQSNWYVKALSYGQRPWPALGRGFNTGVMLMHLQRLRDGG 225
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+A W + ++ L TSLADQD+ NA+ISE P +VY + C WN+QLSD+T SD CY
Sbjct: 226 FATSWESVTKRVLEHIPETSLADQDVINAVISEQPTVVYKIECTWNIQLSDHTISDT-CY 284
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + I+HWNSP+K V NKH+ FR L+ FLE DGNLLR+ LFGC+ ++ L E
Sbjct: 285 RDTNRINILHWNSPRKQDVHNKHVNEFRKLHRVFLEMDGNLLRKRLFGCDRHESALPYYE 344
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
LS PC + + RTH + LEYEY S DV LV Q S
Sbjct: 345 LS------PCREFEKGATLLYRTHPFLLEYEYNVYSP-TDVALVTQCS------------ 385
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
M+R+ ++E L KHW G IS+ LY++DAE Q FL + S
Sbjct: 386 ---------------------MERVPLLEALSKHWPGTISVALYLTDAEVQNFLDFVRGS 424
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+RRNI YHVVYK+G+ YPIN LRN A++ V T +VF LDIDFLP FGLY + + I
Sbjct: 425 ADLRTRRNIAYHVVYKDGDLYPINYLRNTAVSYVPTSFVFQLDIDFLPQFGLYENIMNHI 484
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
++++ K L+VPAFET+RYR FPA+ HA TN
Sbjct: 485 NRLNIN-ESDKVALIVPAFETERYRFTFPANKEELLKFLKRGILYTFRYHVWTQGHAATN 543
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W N+T PYQ+ W PDFEPYIV + PRYDTRFVGFGWNK
Sbjct: 544 YSLWRNSTEPYQVSWEPDFEPYIVVSKSAPRYDTRFVGFGWNK 586
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/161 (60%), Positives = 116/161 (72%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+ I VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A L TLF TW +P V
Sbjct: 51 CDTIHVAMVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARKTLSTLFRTWDLPHV 110
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
V+FY A+ V VSWIPNKHYSGVYGLLKL LP + E + +VLDTDV TD+ L
Sbjct: 111 NVTFYEAEVWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--RVLVLDTDVTILTDMYPL 168
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F QT+GLVENQS+WY+ L +PWPALGRGF
Sbjct: 169 WKMFQMFSASQTLGLVENQSNWYVKALSYGQRPWPALGRGF 209
>gi|270005097|gb|EFA01545.1| hypothetical protein TcasGA2_TC007106 [Tribolium castaneum]
Length = 576
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 244/526 (46%), Positives = 320/526 (60%), Gaps = 73/526 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP-ETLAKTIVLDTDVIFATD 224
P FY + ++DV W+PN HYSG++GLLKL PK++P K IVLDTD+ F D
Sbjct: 20 PQSNTTFYDINDYIQDVRWVPNSHYSGIFGLLKLVFPKIIPLNVTKKIIVLDTDLTFVGD 79
Query: 225 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
I +LW LF + +Q +G+VENQSD+YLGK K KPWPA GRGFN+GV+L +L +LR +
Sbjct: 80 IIELWRLFEKFNNKQAVGIVENQSDYYLGKNTK-IKPWPASGRGFNSGVLLYNLERLRRL 138
Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
W W +A+K + T L DQDI NA++ +H L+Y +PC WN QLSD+ S C
Sbjct: 139 DWPSLWPSVAKKVAIIYGSTRLGDQDIINAVLLQHTDLLYQVPCFWNTQLSDHALSYN-C 197
Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA 404
Y + +K++HWNSPKK V NK ++FR LY TF+EY+GNLLRR+L+ C+ TQ + +
Sbjct: 198 YNKFR-VKVVHWNSPKKTHVLNKDGDYFRTLYQTFVEYNGNLLRRQLYYCDATQDAVNEP 256
Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG-NDVTLVAQLSMDRLQMVEIN 463
+ D C + R+K + RT L+F EY++ A + NDVT VAQLS DRL
Sbjct: 257 ------QTDLCAEFHRAKTSQWRTLLFFREYQHSAEENAKNDVTFVAQLSYDRL------ 304
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
QMVE L K+W+GPISLT Y++D E QQ ++
Sbjct: 305 ---------------------------QMVEELVKYWQGPISLTFYVTDPEFQQCYNFIE 337
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
NSE L+ R NI YH V+K+G ++PIN LRNV L V+TP+VFL DIDFLPM GLY LKS
Sbjct: 338 NSELLQDRTNIAYHAVFKDGEYHPINILRNVGLKNVATPFVFLADIDFLPMKGLYSVLKS 397
Query: 584 SIRSM-DMHGHGGKKVLVVPAFETQRYRTAFP------------------------ASHA 618
+ S+ DM K L+VPAFETQRYR+ P A HA
Sbjct: 398 HLNSIHDMKA----KALIVPAFETQRYRSRVPKNKAQLLGMWEKKSIFPFRSDVWVAGHA 453
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PTN+++W +A PY+++W PDFEPYIV D+ YD RF+GFGWNK
Sbjct: 454 PTNYTKWKSAIAPYKVKWEPDFEPYIVVSSDVTEYDNRFMGFGWNK 499
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 65/124 (52%), Positives = 87/124 (70%), Gaps = 2/124 (1%)
Query: 50 ITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP 109
+ + V+ +L+TLF TW+VPQ +FY + ++DV W+PN HYSG++GLLKL PK++P
Sbjct: 1 MVNKVSEKVLKTLFDTWNVPQSNTTFYDINDYIQDVRWVPNSHYSGIFGLLKLVFPKIIP 60
Query: 110 -ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL 168
K IVLDTD+ F DI +LW LF + +Q +G+VENQSD+YLGK K KPWPA
Sbjct: 61 LNVTKKIIVLDTDLTFVGDIIELWRLFEKFNNKQAVGIVENQSDYYLGKNTK-IKPWPAS 119
Query: 169 GRGF 172
GRGF
Sbjct: 120 GRGF 123
>gi|156552489|ref|XP_001602136.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Nasonia
vitripennis]
Length = 712
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 246/523 (47%), Positives = 309/523 (59%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ + V+WIPNKHYSGVYGLLKL LP + E +K IVLDTDV D+
Sbjct: 129 PHVAVSYYKAEEWIPKVAWIPNKHYSGVYGLLKLILPDAIRE--SKIIVLDTDVTVLNDV 186
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW LF+ Q Q +GLVENQS+WY+ + +PWPALGRGFN+GV+L+DL KLR S
Sbjct: 187 TLLWELFNNFNQEQVLGLVENQSNWYIKPSLYSTRPWPALGRGFNSGVMLMDLQKLRAKS 246
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+A W A L L TSLADQDI NA+I +P VYT+ C WNVQLSD+ S E CY
Sbjct: 247 FAKLWEHTAAVVLKDILETSLADQDIINAVIKNNPKFVYTIDCIWNVQLSDHALS-ENCY 305
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ L IIHWNSP+K V++K+ FR ++ FLE DGNLLRR LFGC E
Sbjct: 306 SNADQLNIIHWNSPRKQNVRHKYTHDFRKMHQVFLELDGNLLRRRLFGCQTND------E 359
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
S LNE D C+ + RTH +FLE+EY + D+ L Q S
Sbjct: 360 ASQLNERDLCFKFKEGATIIYRTHTFFLEFEYNIVNSA-DILLATQCS------------ 406
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
++R+ ++E L KHW G IS+ LY+SD E Q FL + NS
Sbjct: 407 ---------------------IERVALLEELAKHWPGVISIALYLSDNEVQSFLDFVQNS 445
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E LR+R+NI YH+VYKEG FYP+N LRNVA+ ++ PY+F +DIDFLP GL+ L + I
Sbjct: 446 EELRNRKNIAYHIVYKEGEFYPVNYLRNVAMTHITLPYIFQIDIDFLPQIGLHRILMNYI 505
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
++ + LV+PAFETQRYR +P+S HAPTN
Sbjct: 506 IQLN-SSELQQIALVIPAFETQRYRFTYPSSKADLLKYWDHGVLYTFRYHVWPQGHAPTN 564
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
FS + NAT PY+I W PDFEPYIV R P YDTRF+GFGWNK
Sbjct: 565 FSVYRNATEPYEISWEPDFEPYIVVLRSAPTYDTRFIGFGWNK 607
Score = 201 bits (512), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 94/161 (58%), Positives = 115/161 (71%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C I +AIVCAGYNST +LVT +KSILFYR PLHFH++ D +AL L LF+TW +P V
Sbjct: 72 CSTIHIAIVCAGYNSTVTLVTTVKSILFYRTKPLHFHIMVDEIALKTLNVLFTTWDIPHV 131
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VS+Y A+ + V+WIPNKHYSGVYGLLKL LP + E +K IVLDTDV D+ L
Sbjct: 132 AVSYYKAEEWIPKVAWIPNKHYSGVYGLLKLILPDAIRE--SKIIVLDTDVTVLNDVTLL 189
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W LF+ Q Q +GLVENQS+WY+ + +PWPALGRGF
Sbjct: 190 WELFNNFNQEQVLGLVENQSNWYIKPSLYSTRPWPALGRGF 230
>gi|307210698|gb|EFN87121.1| Glycosyltransferase-like protein LARGE1 [Harpegnathos saltator]
Length = 682
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/523 (46%), Positives = 315/523 (60%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY A+ V VSWIPNKHYSGVYGLLKL LP + E K +VLDTDV D+
Sbjct: 126 PHVNITFYGAEVWVPRVSWIPNKHYSGVYGLLKLILPDAIRED--KVLVLDTDVTVLNDV 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+ LW +F + QT+GL+ENQS+WY+ L +PWPALGRGFNTGV+L+ L +LRD
Sbjct: 184 SLLWGMFEKFSTGQTLGLIENQSNWYIKALSYGQRPWPALGRGFNTGVMLMHLQRLRDGK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W +A++ L TSLADQD+ NA+I+EHP +VY + C WN+QLSD T SD CY
Sbjct: 244 FTSSWESVAKRVLGYIPKTSLADQDVINAVINEHPSIVYRIECTWNIQLSDRTISDT-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + I+HWNSP+K V+NKH+ FR L+ FLE DGNLLRR LFGC+ ++ L
Sbjct: 303 RDSSRINILHWNSPRKQDVRNKHVNEFRKLHRVFLEMDGNLLRRRLFGCDKHESALP--- 359
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
NE PC + +S RTH + LEY+Y S DV LV Q
Sbjct: 360 ---YNESSPCREFEKSASILYRTHPFLLEYDYNMYSP-MDVALVTQ-------------- 401
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ +Y++DAE Q FL + S
Sbjct: 402 -------------------CSVERIPLLEALSKHWPGTISVAIYLTDAEVQNFLDFVRGS 442
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+RRNI YHVVYK+G YPIN LRN A++ VSTP++F LDIDFLP GL+ + I
Sbjct: 443 ADLRTRRNIAYHVVYKDGELYPINYLRNTAMSYVSTPFIFQLDIDFLPQLGLHENIMGYI 502
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+D++ K L+VPAFET+RYR FPA+ HA TN
Sbjct: 503 NKLDIN-ESDKIALIVPAFETERYRFDFPANKEELLKFLKRGVLYTFRYHVWTQGHAATN 561
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W +++ PY++ W PDFEPYIV + PRYDTRF+GFGWNK
Sbjct: 562 YSVWRSSSEPYEVSWEPDFEPYIVVSKSAPRYDTRFIGFGWNK 604
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/161 (58%), Positives = 117/161 (72%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+ + VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A L TLF TW +P V
Sbjct: 69 CDTVHVAMVCAGYNSTLTLVTVVKSILFYRTNPLHFHLLVDEIARRTLSTLFRTWDLPHV 128
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++FY A+ V VSWIPNKHYSGVYGLLKL LP + E K +VLDTDV D++ L
Sbjct: 129 NITFYGAEVWVPRVSWIPNKHYSGVYGLLKLILPDAIRED--KVLVLDTDVTVLNDVSLL 186
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + QT+GL+ENQS+WY+ L +PWPALGRGF
Sbjct: 187 WGMFEKFSTGQTLGLIENQSNWYIKALSYGQRPWPALGRGF 227
>gi|328788916|ref|XP_624139.2| PREDICTED: glycosyltransferase-like protein LARGE1-like [Apis
mellifera]
Length = 692
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 238/523 (45%), Positives = 307/523 (58%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D+
Sbjct: 128 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 185
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+ LW +F + Q +GL ENQS WY+ L +PWPALGRGFNTGV+L+ L +LR
Sbjct: 186 SLLWHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 245
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W + ++ L TSLADQDI NA+I E P++VY + C WN+QLSD+T SD +CY
Sbjct: 246 FTSLWEAVTKRVLAHIPETSLADQDIINAVIKERPFIVYKIKCTWNIQLSDHTISD-ICY 304
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + IIHWNSP+K V NKH+ FR L+ FLE DGNLLRR LFGC+ +T
Sbjct: 305 RDTSQISIIHWNSPRKQDVYNKHINEFRKLHRVFLEMDGNLLRRRLFGCDKHET------ 358
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S NE PC D + RTH + LEYEY + DV L+ Q
Sbjct: 359 VSQYNESSPCRDFTKGATMLYRTHPFLLEYEYNVYTPA-DVALITQ-------------- 403
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ LY++DAE Q FL + S
Sbjct: 404 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 444
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+R+NI YHVVYK+G YPIN LRN+A++ +STPY+F LD+DFLP + L+ L + I
Sbjct: 445 IELRNRKNIAYHVVYKDGELYPINYLRNIAMSYISTPYIFQLDVDFLPQYRLHENLMNYI 504
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPA------------------------SHAPTN 621
++M K L+VPAFET+RYR FPA HA TN
Sbjct: 505 VKLNM-SESDKVALIVPAFETERYRFTFPADKDELLKFLKRGVLYTFRYHVWTQGHAATN 563
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++ W N PY++ W PDFEPYIV R PRYDTRF+GFGWNK
Sbjct: 564 YTYWRNTMEPYEVSWEPDFEPYIVVSRLAPRYDTRFIGFGWNK 606
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/161 (54%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE + +A+VCAGYNST +LVT++KS+LFYR PLHFHL+ D +A L T+F TW +P V
Sbjct: 71 CETVHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFQTWDLPHV 130
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+++Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D++ L
Sbjct: 131 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 188
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q +GL ENQS WY+ L +PWPALGRGF
Sbjct: 189 WHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 229
>gi|380019964|ref|XP_003693869.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Apis florea]
Length = 692
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 239/523 (45%), Positives = 307/523 (58%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ V VSWIPNKHYSGVYGLLKL LP L E K +V DTDV D+
Sbjct: 128 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDALRED--KVLVFDTDVTVLNDV 185
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+ LW +F + Q +GL ENQS WY+ L +PWPALGRGFNTGV+L+ L +LR
Sbjct: 186 SLLWHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 245
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W + ++ L TSLADQDI NA+I E P++VY + C WN+QLSD+T SD +CY
Sbjct: 246 FTSLWEAVTKRVLAHIPETSLADQDIINAVIKERPFIVYKIKCTWNIQLSDHTISD-ICY 304
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + IIHWNSP+K V NKH+ FR L+ FLE DGNLLRR LFGC+ +T
Sbjct: 305 RDTSQISIIHWNSPRKQDVYNKHINEFRKLHRVFLEMDGNLLRRRLFGCDKHET------ 358
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S NE PC D + RTH + LEYEY + DV L+ Q
Sbjct: 359 VSQYNESSPCRDFTKGATMLYRTHPFLLEYEYNVYTPA-DVALITQ-------------- 403
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ LY++DAE Q FL + S
Sbjct: 404 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 444
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+R+NI YHVVYK+G YPIN LRN+A++ VSTPY+F LD+DFLP + L+ L + I
Sbjct: 445 IELRNRKNIAYHVVYKDGELYPINYLRNIAMSYVSTPYIFQLDVDFLPQYRLHETLMNYI 504
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPA------------------------SHAPTN 621
+++ K L+VPAFET+RYR FPA HA TN
Sbjct: 505 VKLNI-SESDKIALIVPAFETERYRFTFPADKDELLKFLKRGVLYTFRYHVWTQGHAATN 563
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++ W N PY++ W PDFEPYIV R PRYDTRF+GFGWNK
Sbjct: 564 YTYWRNTMEPYEVTWEPDFEPYIVVSRLAPRYDTRFIGFGWNK 606
Score = 194 bits (494), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 89/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE + +A+VCAGYNST +LVT++KS+LFYR PLHFHL+ D +A L T+F TW +P V
Sbjct: 71 CETVHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFQTWDLPHV 130
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+++Y A+ V VSWIPNKHYSGVYGLLKL LP L E K +V DTDV D++ L
Sbjct: 131 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDALRED--KVLVFDTDVTVLNDVSLL 188
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q +GL ENQS WY+ L +PWPALGRGF
Sbjct: 189 WHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 229
>gi|350417177|ref|XP_003491294.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Bombus
impatiens]
Length = 689
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 236/523 (45%), Positives = 308/523 (58%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D+
Sbjct: 125 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 182
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+ LW +F + Q +GL ENQS WY+ L +PWPALGRGFNTGV+L+ L +LR
Sbjct: 183 SLLWQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 242
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W +A++ L+ TSLADQDI NA++ EH +VY + C WN+QLSD+T SD +CY
Sbjct: 243 FMSLWETVAKRVLVHIPETSLADQDIINAVVKEHSSIVYKIQCTWNIQLSDHTISD-ICY 301
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + I+HWNSP+K V NKH+ FR L+ FLE DGNLLR+ LFGC+ +T
Sbjct: 302 RDTSQINIVHWNSPRKQDVYNKHINQFRKLHKVFLEMDGNLLRKRLFGCDKHET------ 355
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S NE C + R T RTH + LEYEY + DV L Q
Sbjct: 356 VSQYNESTLCREFTRGATTLYRTHPFLLEYEYNVYA-STDVALATQ-------------- 400
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ LY++DAE Q FL + S
Sbjct: 401 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 441
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+R+NI YHVVYK+G YPIN LRN A++ +STP++F LD+DFLP +GL+ L + I
Sbjct: 442 IELRNRKNIAYHVVYKDGELYPINYLRNTAMSYISTPFIFQLDVDFLPQYGLHESLMNYI 501
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+++ K L+VPAFET+RYR FPA+ HA TN
Sbjct: 502 IKLNI-SESDKVALIVPAFETERYRFIFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 560
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W N PY++ W PDFEPYIV R PRYDTRF+GFGWNK
Sbjct: 561 YSYWRNTMEPYEVSWEPDFEPYIVVSRKAPRYDTRFIGFGWNK 603
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 90/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE I +A+VCAGYNST +LVT++KS+LFYR PLHFHL+ D VA L T+F TW +P V
Sbjct: 68 CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEVANRTLTTVFQTWDLPHV 127
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+++Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D++ L
Sbjct: 128 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 185
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q +GL ENQS WY+ L +PWPALGRGF
Sbjct: 186 WQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 226
>gi|340715041|ref|XP_003396029.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Bombus
terrestris]
Length = 689
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 234/523 (44%), Positives = 307/523 (58%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D+
Sbjct: 125 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 182
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+ LW +F + Q +GL ENQS WY+ L +PWPALGRGFNTGV+L+ L +LR
Sbjct: 183 SLLWQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 242
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W +A++ L+ TSLADQDI NA++ E+ +VY + C WN+QLSD+T SD +CY
Sbjct: 243 FMSLWETVAKRVLVHIPETSLADQDIINAVVKEYSSIVYKIQCTWNIQLSDHTISD-ICY 301
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + I+HWNSP+K V NKH+ FR L+ FLE DGNLLR+ LFGC+ +
Sbjct: 302 RDTSQINIVHWNSPRKQDVYNKHINQFRKLHKVFLEMDGNLLRKRLFGCD------KHEA 355
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S NE C + R T RTH + LEYEY + DV L Q
Sbjct: 356 VSEYNESALCREFTRGATTLYRTHPFLLEYEYNVYA-STDVALATQ-------------- 400
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ LY++DAE Q FL + S
Sbjct: 401 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 441
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+R+NI YHVVYK+G YPIN LRN A++ +STP++F LD+DFLP +GL+ L + I
Sbjct: 442 IELRNRKNIAYHVVYKDGELYPINYLRNTAMSYISTPFIFQLDVDFLPQYGLHESLMNYI 501
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+++ K L+VPAFET+RYR FPA+ HA TN
Sbjct: 502 IKLNI-SESDKVALIVPAFETERYRFIFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 560
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W N PY++ W PDFEPYIV R PRYDTRF+GFGWNK
Sbjct: 561 YSYWRNTMEPYEVSWEPDFEPYIVVSRKAPRYDTRFIGFGWNK 603
Score = 194 bits (494), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 89/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE I +A+VCAGYNST +LVT++KS+LFYR PLHFHL+ D +A L T+F TW +P V
Sbjct: 68 CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIANRTLTTVFQTWDLPHV 127
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+++Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D++ L
Sbjct: 128 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 185
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q +GL ENQS WY+ L +PWPALGRGF
Sbjct: 186 WQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 226
>gi|332836402|ref|XP_003313074.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
LARGE2 [Pan troglodytes]
Length = 639
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 252/527 (47%), Positives = 308/527 (58%), Gaps = 116/527 (22%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 179
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTG + L S
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTG----GDSALGSPS 235
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ ++ +R ++ +D +I WN
Sbjct: 236 -------VRRPWMCSRCYSEASD---LRXVI------------HWN-------------- 259
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
SPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C L
Sbjct: 260 ------------SPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 307
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
QQA L+ L+EEDPC++ R+ +LT R H+ FL +E +DVTLVAQLSMDRL
Sbjct: 308 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 361
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 362 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 392
Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 393 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 452
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
++SI + + G K LVVPAFET RYR +FP S H
Sbjct: 453 RASIEQLGL-GSWRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 511
Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APT+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 512 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 558
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/185 (64%), Positives = 143/185 (77%), Gaps = 5/185 (2%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 63 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 122
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI++L
Sbjct: 123 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 182
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYL-ADSVVEDVS----WIP 186
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF DS + S W+
Sbjct: 183 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGGDSALGSPSVRRPWMC 242
Query: 187 NKHYS 191
++ YS
Sbjct: 243 SRCYS 247
>gi|383855636|ref|XP_003703316.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Megachile
rotundata]
Length = 689
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 238/523 (45%), Positives = 309/523 (59%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + +Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D+
Sbjct: 126 PHVNLTYYKAERWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVVVFDTDVTVVNDV 183
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW +F + Q +GL ENQS WY+ L +PWPALGRGFNTGV+L+ L +LR
Sbjct: 184 NLLWQMFEKFTNDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRTRK 243
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+ W + ++ L TSLADQDI NA+I +HP +VY + C WN+QLSD+T SD+ CY
Sbjct: 244 FMSLWEAVTKRVLSYIPETSLADQDIINAVIKDHPSIVYKVECTWNIQLSDHTISDQ-CY 302
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + + I+HWNSP+K V NKH+ FR L+ FL+ DGNLLRR LFGC+ +
Sbjct: 303 RDTSRINIVHWNSPRKQDVYNKHINEFRKLHKVFLDMDGNLLRRRLFGCD------KHEA 356
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+S NE PC + R T RTH + LEYEY + DV LV Q S++R+ ++E
Sbjct: 357 MSEYNESSPCREFTRGASTLYRTHPFLLEYEYNVYAP-TDVALVTQCSVERIPLLE---- 411
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
D++ KHW G IS+ LY++DAE Q FL + S
Sbjct: 412 ------------DIS-----------------KHWPGTISVALYLTDAEVQNFLEFVRGS 442
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR R+NI YHVVYK+G YPIN LRN+A++ VS+P++F LD+DFLP FGL+ L S I
Sbjct: 443 VELRKRKNIAYHVVYKDGELYPINYLRNIAMSYVSSPFIFQLDVDFLPQFGLHEALMSYI 502
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+++ K L+VPAFET+RYR FPA+ HA TN
Sbjct: 503 SKLNI-TESDKVALIVPAFETERYRFTFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 561
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W N PY++ W PDFEPYIV R P+YDTRF+GFGWNK
Sbjct: 562 YSYWRNTMEPYEVSWEPDFEPYIVVSRLAPKYDTRFIGFGWNK 604
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/161 (55%), Positives = 111/161 (68%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE I +A+VCAGYNST +LVT++KS+LFYR PLHFHL+ D +A L T+F TW +P V
Sbjct: 69 CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFRTWDLPHV 128
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+++Y A+ V VSWIPNKHYSGVYGLLKL LP + E K +V DTDV D+ L
Sbjct: 129 NLTYYKAERWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVVVFDTDVTVVNDVNLL 186
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q +GL ENQS WY+ L +PWPALGRGF
Sbjct: 187 WQMFEKFTNDQALGLTENQSHWYIKALSYGQRPWPALGRGF 227
>gi|307174208|gb|EFN64853.1| Glycosyltransferase-like protein LARGE1 [Camponotus floridanus]
Length = 613
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 232/523 (44%), Positives = 308/523 (58%), Gaps = 68/523 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + FY A+ + VSWIPN HYSGVYGLLKL LP + E K +VLDTDV D+
Sbjct: 51 PHVNVTFYEAEVWMPKVSWIPNTHYSGVYGLLKLILPDAIQED--KILVLDTDVTVMNDV 108
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW +F + +GL+ENQS+WY+ L +PWPALGRGFNTGV+++ L +LRD
Sbjct: 109 YSLWKMFESFSTVEILGLIENQSNWYVKALSYGQRPWPALGRGFNTGVMMMHLRRLRDRK 168
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
+A W + ++ L TSLADQD+ NA+I+E P +V+ + C WN+QLSD+T SD CY
Sbjct: 169 FASLWESVTKRVLGYIPETSLADQDVINAVINEQPNIVHEIDCTWNIQLSDHTISDT-CY 227
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ + I+HWNSP+K V NKH+ FR L+ FLE DGNLLR+ LFGC+ + L
Sbjct: 228 RDTNRINILHWNSPRKQDVHNKHINEFRKLHRVFLEMDGNLLRKRLFGCDKHENVLP--- 284
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
+E C + + RT+ + LEYEY S DV LV Q
Sbjct: 285 ---YDESSLCREFEKGAAILYRTYPFLLEYEYNVFS-SMDVALVTQ-------------- 326
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
S++R+ ++E L KHW G IS+ LY++DAE Q FL++ S
Sbjct: 327 -------------------CSVERIPLLETLAKHWPGTISVALYLTDAEVQYFLNFVRGS 367
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
LR+R+NI YHVVYK+G YPIN LRN+A++ VSTP++F LD+DFLP FGL+ L ++I
Sbjct: 368 IDLRTRKNIAYHVVYKDGELYPINYLRNIAISYVSTPFIFQLDVDFLPQFGLHENLMNNI 427
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+++ K L+VPAFET+RYR FP + HA TN
Sbjct: 428 NRLNI-NESDKIALIVPAFETERYRFTFPTNKAELLKFLKRGILYTFRYHVWTQGHAATN 486
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+S W N+ PY++ W PDFEPYIV + PRYDTRFVGFGWNK
Sbjct: 487 YSFWQNSMEPYEVSWEPDFEPYIVVPKSAPRYDTRFVGFGWNK 529
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 88/154 (57%), Positives = 108/154 (70%), Gaps = 2/154 (1%)
Query: 19 IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLA 78
+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A L TLF TW +P V V+FY A
Sbjct: 1 MVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARRTLSTLFQTWDLPHVNVTFYEA 60
Query: 79 DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
+ + VSWIPN HYSGVYGLLKL LP + E K +VLDTDV D+ LW +F
Sbjct: 61 EVWMPKVSWIPNTHYSGVYGLLKLILPDAIQED--KILVLDTDVTVMNDVYSLWKMFESF 118
Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+ +GL+ENQS+WY+ L +PWPALGRGF
Sbjct: 119 STVEILGLIENQSNWYVKALSYGQRPWPALGRGF 152
>gi|390368395|ref|XP_790103.3| PREDICTED: glycosyltransferase-like protein LARGE2-like, partial
[Strongylocentrotus purpuratus]
Length = 576
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 215/335 (64%), Positives = 249/335 (74%), Gaps = 38/335 (11%)
Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFN 270
KTIVLDTD+ FA+DIA+LW LF R+ +QT+ LVENQSDWYLG LWK HKPWPALGRGFN
Sbjct: 278 KTIVLDTDITFASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGFN 337
Query: 271 TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQW 330
TGVILLDL KLRD+ W W + AEK L+++ T+LADQDI NA+I +HPYLVYTLPC W
Sbjct: 338 TGVILLDLQKLRDLKWMQMWLLTAEKELMSQFSTALADQDIINAVIKQHPYLVYTLPCAW 397
Query: 331 NVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRE 390
NVQLSDNTRS ELCY E+TDLK IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRE
Sbjct: 398 NVQLSDNTRS-ELCYKEVTDLKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRE 456
Query: 391 LFGC--NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTL 448
LFGC TL + +LS+L+E+D CY+ RR +L RTHLY+L+Y+ D DVTL
Sbjct: 457 LFGCTDGKANNTLHE-KLSSLDEDDQCYEFRREQLLMHRTHLYYLDYQLPKFLD-TDVTL 514
Query: 449 VAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTL 508
VAQLSMDR LQM+E + KHWEGPISL L
Sbjct: 515 VAQLSMDR---------------------------------LQMLEAICKHWEGPISLAL 541
Query: 509 YMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
YMSDAEAQQF+ YA +SE L +R+NIGYHVVYK+G
Sbjct: 542 YMSDAEAQQFIRYALSSETLMARKNIGYHVVYKDG 576
Score = 192 bits (489), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 92/164 (56%), Positives = 109/164 (66%), Gaps = 32/164 (19%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
IP CE I V+IVCAG+N++R ++ LIKSILFYRKNPLHFH I+D + IL TLF TW V
Sbjct: 205 IPKCERIHVSIVCAGHNASRDVIILIKSILFYRKNPLHFHFISDEMGQLILSTLFKTWDV 264
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
P V+VSFYLA++ KTIVLDTD+ FA+DI
Sbjct: 265 PGVKVSFYLAENY--------------------------------KTIVLDTDITFASDI 292
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
A+LW LF R+ +QT+ LVENQSDWYLG LWK HKPWPALGRGF
Sbjct: 293 AELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGF 336
>gi|324506667|gb|ADY42842.1| Glycosyltransferase-like protein [Ascaris suum]
Length = 692
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 229/526 (43%), Positives = 305/526 (57%), Gaps = 66/526 (12%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY + + VSWIPN+HYS YGLLKL L + PE L K +VLDTD++F DI
Sbjct: 131 PSVNVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDI 190
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA +S++ Q + +VEN SDWY+ N KPWPA RGFNTGV+L+DL KLR ++
Sbjct: 191 ARLWAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGFNTGVMLMDLEKLRTMN 250
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W+ +A L T LADQDI NA+IS P +VY LPC+WN+Q+ +R + LC
Sbjct: 251 WTLLWKEVAADNLKEFGATQLADQDIINAVISRKPEIVYKLPCEWNLQMGYQSRQN-LCP 309
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
++DLKI+HWNSPKK + +N + FFR + TF + DGNLLR E+ C+
Sbjct: 310 ARISDLKIVHWNSPKKTRTRNPYAVFFRRHFQTFTQMDGNLLRTEMLRCS---------- 359
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
NE LR S+L S+ YEYE S DG A+ R+Q+
Sbjct: 360 ----NE------LRSSELRSV--------YEYEESDDGCGDLRHARNIKYRIQLF----- 396
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
+ Y SD DVTL QLSMDR + E L W GP+S +Y+SD+E Q + Y ++
Sbjct: 397 -IRRYAHQSDDADVTLTTQLSMDRFVLFESLLAFWTGPVSAAIYLSDSELTQLMQYLADT 455
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
+ R+R +I H V+KEG +PIN LRNVALN +TP+VFL D+DFLPM GLY ++ +
Sbjct: 456 KVFRNRTDIALHAVFKEGVHFPINYLRNVALNATNTPFVFLADVDFLPMPGLYELIRKRL 515
Query: 586 RS-MDMHGHGGKKVLVVPAFETQRYRTA-FPAS------------------------HAP 619
S + + G K VVPAFE++ YR++ PAS HA
Sbjct: 516 SSGLPLKG----KAFVVPAFESKGYRSSVVPASKAELLVMLDTGQVQIFRQDVWIQGHAS 571
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
T++ RW T Y + W D+EPY+V R + P YD RFVGFGWNK
Sbjct: 572 TDYDRWRTTQTEYSVAWRTDYEPYVVVARNETPPYDNRFVGFGWNK 617
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 86/161 (53%), Positives = 111/161 (68%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CEV+QVAIVCAGY S+ VTL+KS+L++R++ + H + D VA IL TLF TW +P V
Sbjct: 74 CEVLQVAIVCAGYESSFRSVTLVKSLLYHRRHAITMHFVVDDVAAQILPTLFDTWQLPSV 133
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
V FY + + VSWIPN+HYS YGLLKL L + PE L K +VLDTD++F DIA+L
Sbjct: 134 NVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDIARL 193
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WA +S++ Q + +VEN SDWY+ N KPWPA RGF
Sbjct: 194 WAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGF 234
>gi|357623999|gb|EHJ74927.1| hypothetical protein KGM_05471 [Danaus plexippus]
Length = 692
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 223/530 (42%), Positives = 313/530 (59%), Gaps = 77/530 (14%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + Y A + +V WIPN HYSGVY L+KL P +LP+ L + IVLD+D+ F +D+
Sbjct: 138 PDVKYSCYDAQDRLSEVKWIPNNHYSGVYALVKLLFPSILPDKLEQVIVLDSDLTFLSDV 197
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD-I 284
+QLW +F + Q IGLVEN+S+WY + + WPAL RG+NTGV+LLDL K+R I
Sbjct: 198 SQLWHMFRNMSSLQFIGLVENESNWYTNQKTR----WPALRRGYNTGVMLLDLYKIRSLI 253
Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
SW W+ + L T+LADQD+ NAII HP++VY + C +NVQ+S T + C
Sbjct: 254 SWTSVWQKTVNENLNRLKTTALADQDVINAIIKNHPHIVYDISCHYNVQMSTQTLAKN-C 312
Query: 345 YTE-LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQ 403
Y E + ++KI+HWNSP K ++ + ++F+N++ +++ +DGNLLR +L C+ + +
Sbjct: 313 YGEDVKNIKILHWNSPSKYNIRIRDADYFKNIHQSYVNFDGNLLREKLHRCSQNEVVTYK 372
Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
+N D C R ++ LRTH+Y+++Y Y + D DVTLV QLSMDR
Sbjct: 373 -----MNHSDLCLSFRAAQRVKLRTHIYYMDYSY-MNVDNFDVTLVLQLSMDR------- 419
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
LQ +E + K+WEGP+S +Y+SD E + S
Sbjct: 420 --------------------------LQFLERIVKYWEGPLSAAIYLSDCEVTKLESILR 453
Query: 524 N-SEALRSRRNIGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
+ S L + NIGYH+V+K + YP+N LRNVAL V+TPYVFL+D DF+PM GLY YL
Sbjct: 454 DWSSTLNIKNNIGYHLVFKHDSVHYPVNYLRNVALENVNTPYVFLMDADFVPMAGLYSYL 513
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------------FP 614
+ SI+ ++ + KK LVVPAFETQRYR + +P
Sbjct: 514 RESIKLIN--PYPQKKCLVVPAFETQRYRASPPRYKEELLSRLSIKHLGDVAPFRAREWP 571
Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
H TN+ RW AT PY+++W D+EPY+VAHR +P+YDTRF GFGWNK
Sbjct: 572 RGHRATNYIRWSTATAPYEVDWQSDYEPYLVAHRSIPKYDTRFSGFGWNK 621
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/161 (45%), Positives = 107/161 (66%), Gaps = 5/161 (3%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE + +A+VC G TR++ ++KS+L +R+NP+HFH + D ++ L LF TW +P V
Sbjct: 82 CETVHIALVCMG-KCTRNITPMLKSLLHHRQNPIHFHFVVDPESMRTLNKLFETWDLPDV 140
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+ S Y A + +V WIPN HYSGVY L+KL P +LP+ L + IVLD+D+ F +D++QL
Sbjct: 141 KYSCYDAQDRLSEVKWIPNNHYSGVYALVKLLFPSILPDKLEQVIVLDSDLTFLSDVSQL 200
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F + Q IGLVEN+S+WY + + WPAL RG+
Sbjct: 201 WHMFRNMSSLQFIGLVENESNWYTNQKTR----WPALRRGY 237
>gi|198435392|ref|XP_002128775.1| PREDICTED: similar to glycosyltransferase-like 1B [Ciona
intestinalis]
Length = 682
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 226/530 (42%), Positives = 306/530 (57%), Gaps = 73/530 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P FY A +++VSWIPNKHYSGVYGL+KLT+ VLP + K IVLDTD+IFA D+
Sbjct: 123 PHFVVSFYQASDFIDEVSWIPNKHYSGVYGLMKLTILTVLPANVGKIIVLDTDLIFAHDV 182
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
++W F + + + IGLVENQSDWYLG LW+N+ PWPA+GRGFNTG+ILLD KLR I
Sbjct: 183 TEIWQYFDKFGENEAIGLVENQSDWYLGTLWENYTPWPAIGRGFNTGLILLDCMKLRAIE 242
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W+ +A+ L L T+LADQDI N+++S + L++ LPC WN+QLSD+ R D LCY
Sbjct: 243 WQSKWKEVAQTDLKHFLSTALADQDIINSVLSHNASLLHVLPCYWNLQLSDHMRMD-LCY 301
Query: 346 TELT-DLKIIHWNSPKKLKVKNKHMEFFRNL---YLTFLEYDGNLLRRELFGCNLTQTTL 401
+ ++K+IHWN+ +K N+H E ++L + TF E +G LR C+ +
Sbjct: 302 KQHKHEVKVIHWNTRRKQ--NNRHDENIKSLTDVFRTFTEINGAFLRHPRITCSTDDPEI 359
Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
Q+ + + D C DL S + RTH+++L ++Y ++S DV+++ L++
Sbjct: 360 YQS--LDDTDFDMCSDLISSAASLHRTHIFYLPFDYHSNS-ATDVSIIVHLTI------- 409
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
DRL + E + KHWEGPIS + +DAEAQ + +
Sbjct: 410 --------------------------DRLTVFEAMCKHWEGPISAAFFTTDAEAQTLVDF 443
Query: 522 AGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
S L R NIGYH+VYK N YPIN LRNVAL Q T +VFL D+DF+PM GL+
Sbjct: 444 VKLSPVLMDRTNIGYHIVYKNSNHAVYPINLLRNVALKQAVTDHVFLYDVDFVPMPGLFT 503
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-------------------TAF-----PA 615
YL S + H K +VPAFE+ Y+ T+F P
Sbjct: 504 YLTSYLAE---HAKSPKTAYIVPAFESFWYKFKVPQTKSVLHKQIDRGVFTSFRSYMWPQ 560
Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
HA TNFSRW A Y + W +FEPY+V + +LP YD F+GFGWNK
Sbjct: 561 GHAATNFSRWKTAKEAYPVRWETNFEPYVVVKKQNLPLYDQAFIGFGWNK 610
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 93/161 (57%), Positives = 120/161 (74%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C + + IVCAGY ++R +VTLIKS+LFYR + LH H I + V+ IL+ LF TW +P
Sbjct: 66 CTTVHICIVCAGYTASRQVVTLIKSLLFYRTSKLHVHFIVNGVSRTILRNLFQTWDIPHF 125
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY A +++VSWIPNKHYSGVYGL+KLT+ VLP + K IVLDTD+IFA D+ ++
Sbjct: 126 VVSFYQASDFIDEVSWIPNKHYSGVYGLMKLTILTVLPANVGKIIVLDTDLIFAHDVTEI 185
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W F + + + IGLVENQSDWYLG LW+N+ PWPA+GRGF
Sbjct: 186 WQYFDKFGENEAIGLVENQSDWYLGTLWENYTPWPAIGRGF 226
>gi|339257814|ref|XP_003369093.1| glycosyl transferase family 8 [Trichinella spiralis]
gi|316966736|gb|EFV51279.1| glycosyl transferase family 8 [Trichinella spiralis]
Length = 763
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 220/528 (41%), Positives = 300/528 (56%), Gaps = 69/528 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ Y D +ED++WIPN HYSG++GL KL LPK+L ++ K +VLD D++F +DI
Sbjct: 99 PSVSLSIYRLDEALEDINWIPNGHYSGLFGLSKLILPKILSTSVTKVVVLDVDILFVSDI 158
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LW S+ Q +G+ ENQSDWYLG L+ ++KPWPALGRG+N+GVIL++L KLR I+
Sbjct: 159 FELWNFLSKFNDSQALGMTENQSDWYLGNLFLDYKPWPALGRGYNSGVILMNLLKLRAIN 218
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W ++ + ++ L T T+LADQDIFNA+I P LVY LPC +N+QL+DN R ELC
Sbjct: 219 WTELYKAVTKELLNTYNRTNLADQDIFNAVIQRLPALVYRLPCVYNLQLNDNARR-ELCS 277
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ D+KIIHWN+ +K ++N + +R+LY F++ DGN LR C L
Sbjct: 278 YDQADMKIIHWNNYRKPLLENPNESHYRSLYQQFVDLDGNFLRHINKNCKLETAKPSVEN 337
Query: 406 LSNLNEE--DPCYDLRRSKLTSLRTHLYF---LEYEYEASSDGNDVTLVAQLSMDRLQMV 460
+SN + + C + + R HLYF ++++ + NDVTL+ QLS DRL
Sbjct: 338 ISNNYDSFTNLCLPFKTASNLMHRMHLYFGNVVQFDLKL----NDVTLLTQLSFDRL--- 390
Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
+S++RL + W GPI LY++DAEA Q +
Sbjct: 391 ------------------------VSLNRLLI------QWTGPIIAVLYLTDAEAFQLAT 420
Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
+ N + L R N+ HVVYKEGN YP+N LRN+AL TP+VF+ DIDF+P GLY +
Sbjct: 421 FMEN-DFLNIRNNVIIHVVYKEGNLYPVNKLRNIALQSSITPFVFMNDIDFVPATGLYEH 479
Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------ 616
L I KK +VPAFE Y P S
Sbjct: 480 L-LQILPQQRDAIASKKAYIVPAFEVYSYHVIIPESKTDLLKSLDSGEIHIFRSKEWVKG 538
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
H T++ RW NA PY+I W+ DFEPY V RD+P YD RFVGFGWNK
Sbjct: 539 HLATDYDRWRNALEPYKIAWSTDFEPYFVVRRDVPLYDERFVGFGWNK 586
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 151/225 (67%), Gaps = 7/225 (3%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDT-VALNILQTLFSTWSVPQ 70
CEVI +A+V G+ S L+TL+KS+LFYR PLHFH++TD+ ++ IL+TLFSTW++P
Sbjct: 41 CEVIHIAVVSGGFQSRLQLLTLLKSVLFYRSTPLHFHIVTDSDLSEKILRTLFSTWNLPS 100
Query: 71 VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
V +S Y D +ED++WIPN HYSG++GL KL LPK+L ++ K +VLD D++F +DI +
Sbjct: 101 VSLSIYRLDEALEDINWIPNGHYSGLFGLSKLILPKILSTSVTKVVVLDVDILFVSDIFE 160
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHY 190
LW S+ Q +G+ ENQSDWYLG L+ ++KPWPALGRG Y + ++ ++ + ++
Sbjct: 161 LWNFLSKFNDSQALGMTENQSDWYLGNLFLDYKPWPALGRG-YNSGVILMNLLKLRAINW 219
Query: 191 SGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
+ +Y + K L T +T + D D IF I +L AL RL
Sbjct: 220 TELYK----AVTKELLNTYNRTNLADQD-IFNAVIQRLPALVYRL 259
>gi|220732255|emb|CAX15215.1| like-glycosyltransferase [Homo sapiens]
Length = 325
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 195/325 (60%), Positives = 236/325 (72%), Gaps = 41/325 (12%)
Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1 LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59
Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D
Sbjct: 60 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118
Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDRLQM+E
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEA----------- 167
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
+ KHWEGPISL LY+SDAEAQQFL YA SE L SR N
Sbjct: 168 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 205
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+ +D+
Sbjct: 206 VGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 263
Query: 594 GGKKVLVVPAFETQRYRTAFPASHA 618
KK ++VPAFET RYR +FP S A
Sbjct: 264 NTKKAMIVPAFETLRYRLSFPKSKA 288
>gi|320162741|gb|EFW39640.1| glycosyltransferase-like protein LARGE1 [Capsaspora owczarzaki ATCC
30864]
Length = 815
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 218/561 (38%), Positives = 302/561 (53%), Gaps = 110/561 (19%)
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
V+ IP HYSG +GL+KL L +LPE++ + LDTD++F D+A LW FS + Q
Sbjct: 229 VASIPTVHYSGKFGLIKLVLANILPESVTHVLSLDTDLLFMRDVAHLWGHFSEFTEDQMF 288
Query: 242 GLVENQSDWYLGKLWKNHK-----------PWPALGRGFNTGVILLDLTKLRDISWAGFW 290
GLVENQS+WY+ + ++ K PWPAL GFNTGV+LL L +LR +W W
Sbjct: 289 GLVENQSEWYIEQEARSSKTTSASKLLPSLPWPALSHGFNTGVMLLRLDRLRLRAWDETW 348
Query: 291 RIIAEKFLLTR----LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY- 345
+A K L R T LADQD+ NA+ +VY LPC WN+QLSDN+ +
Sbjct: 349 LSVAMKELALRPAGKQTTQLADQDVINAVALTQSSIVYRLPCTWNLQLSDNSLAAPCVIA 408
Query: 346 --------------------------TELTDLK-------IIHWNSPKKLKVKNKHMEFF 372
T ++ LK I+H+NSP+K+KV+N E++
Sbjct: 409 ALNGHDSVKNWQVSTFAPEGNPSDPKTVISKLKNAPEAVGIVHFNSPRKMKVQNAFTEYY 468
Query: 373 RNLYLTFLEYDGNLLRRELFGCN---LTQTTLQQAELSN--LNEEDPCYDLRRSKLTSLR 427
R+L+ TF + DG+ LRREL C L T A S + +DPC + R + T R
Sbjct: 469 RHLHATFSQLDGSTLRRELISCRPELLDPATSSHAMHSEAEVESDDPCIEYRLAASTKPR 528
Query: 428 THLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSM 487
TH+Y+ +Y I P A G +VT ++Q+S
Sbjct: 529 THVYYFDY--------------------------IPPT------AKETGQEVTWISQMSW 556
Query: 488 DRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYP 547
DRL M+E + + W GP+S++LY++D + + + S+ LR+R+NI H+V+KEG +P
Sbjct: 557 DRLGMLEQIAERWSGPLSISLYLNDYDVDDLVDHFETSDILRTRKNIALHLVFKEGTLFP 616
Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
+N LRNVAL+ +TPYVFL DIDFLP + YL S S+D+ K LVVPAFET
Sbjct: 617 VNYLRNVALDHATTPYVFLADIDFLPSAQMATYLASMAASLDLDNKNLPKALVVPAFETF 676
Query: 608 RYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
Y+ AFPA+ H+PT+F+++ AT PY I+W DFEPY
Sbjct: 677 HYKFAFPANKNKLNSMLDAESIVSFRANEWPRGHSPTDFNKFRLATAPYVIQWQEDFEPY 736
Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
I+ RD+PRYD RF+GFGWNK
Sbjct: 737 ILVRRDVPRYDPRFLGFGWNK 757
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 106/177 (59%), Gaps = 11/177 (6%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P TCE I + +VCAG+ S R +V L+KSILFYR NPLH H + D + + TL TW
Sbjct: 151 PRAATCETIHIVVVCAGFRSVRMMVPLLKSILFYRTNPLHLHFVVDEASREPIGTLIETW 210
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
++ +VE+S Y V+ IP HYSG +GL+KL L +LPE++ + LDTD++F
Sbjct: 211 NLSRVELSLYDLAPFESKVASIPTVHYSGKFGLIKLVLANILPESVTHVLSLDTDLLFMR 270
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK-----------PWPALGRGF 172
D+A LW FS + Q GLVENQS+WY+ + ++ K PWPAL GF
Sbjct: 271 DVAHLWGHFSEFTEDQMFGLVENQSEWYIEQEARSSKTTSASKLLPSLPWPALSHGF 327
>gi|220732254|emb|CAX15214.1| like-glycosyltransferase [Homo sapiens]
Length = 276
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 178/288 (61%), Positives = 212/288 (73%), Gaps = 39/288 (13%)
Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1 LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59
Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D
Sbjct: 60 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118
Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMDR
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----------------- 161
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
LQM+E + KHWEGPISL LY+SDAEAQQFL YA SE L SR N
Sbjct: 162 ----------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 205
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL
Sbjct: 206 VGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 253
>gi|312069691|ref|XP_003137800.1| glycosyl transferase family 8 protein [Loa loa]
gi|307767035|gb|EFO26269.1| glycosyl transferase family 8 protein [Loa loa]
Length = 698
Score = 357 bits (917), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 203/531 (38%), Positives = 285/531 (53%), Gaps = 74/531 (13%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A ++ SWIPN+HYSG YGLLKL L +LP + K I LDTDV+ DI
Sbjct: 127 PAVKVTFYNASQYLDRFSWIPNRHYSGRYGLLKLILNDILPADVDKVIALDTDVLIMGDI 186
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
AQLW+ FS++ Q IGLVEN SDWYL + WPA GRGFN+GV+LLDL KLR++
Sbjct: 187 AQLWSFFSKMANLQAIGLVENLSDWYLFNRSTPQRTVWPAWGRGFNSGVMLLDLAKLRNM 246
Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
SW+ W A K + LADQD+ NA+I++H ++V LPC+WN QL ++ + LC
Sbjct: 247 SWSHIWEESASKNVKEYGPVELADQDVINAVINDHRWIVQKLPCEWNFQLGFQSQQN-LC 305
Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA 404
E++ LK++HWNSP K ++ N++ R Y + + DG++ R C + + +
Sbjct: 306 AVEISHLKLVHWNSPLKTRIVNRYAVLLRRYYDSVRDIDGSMFRSNTIHCRYG-SQREHS 364
Query: 405 ELSNLNE----EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMV 460
L ++E +D C ++R ++ + RT LY Y + A+
Sbjct: 365 TLDKVSEYDDNDDGCSEIRHARWITYRTLLYVRPYNFTAA-------------------- 404
Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
P NV L+ Q S+DRL L ++W GP+S +Y++D+E +
Sbjct: 405 ---PKNV------------VLITQFSVDRLMHFNALLQYWTGPVSAAVYVTDSELSLLIQ 449
Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLPMFGLYP 579
+ ++ L +R N+ H VYKEG +YPIN LRNVALN + +VFL D+DF P GLY
Sbjct: 450 FFDDT--LVNRTNVALHAVYKEGTYYPINYLRNVALNNSNDASFVFLADVDFTPAPGLYT 507
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFE---------------------TQRY----RTAFP 614
L + D K+ VVPAFE +R R+ +
Sbjct: 508 MLYKKLVKTD---STNKRAFVVPAFEYTGNRVPAVPLTKNELLRELDARRMQIFRRSVWI 564
Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
HA T++ RW +A Y + W D+EPY+V R LP YD RFVGFGWNK
Sbjct: 565 QGHAATDYDRWRHADQEYSVSWKADYEPYVVVRRSGLPPYDQRFVGFGWNK 615
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/172 (49%), Positives = 110/172 (63%), Gaps = 2/172 (1%)
Query: 3 SDEEPVIPTCEVIQVA-IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
SD E C+V+ +A +VC + L TLIKSIL YR+ + HL+ D + ++I+
Sbjct: 60 SDNEHRSQHCQVLHLALVVCGAVQLSYPLSTLIKSILRYRQQAIVLHLLVDDITMSIISL 119
Query: 62 LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
LFSTW +P V+V+FY A ++ SWIPN+HYSG YGLLKL L +LP + K I LDTD
Sbjct: 120 LFSTWRLPAVKVTFYNASQYLDRFSWIPNRHYSGRYGLLKLILNDILPADVDKVIALDTD 179
Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
V+ DIAQLW+ FS++ Q IGLVEN SDWYL + WPA GRGF
Sbjct: 180 VLIMGDIAQLWSFFSKMANLQAIGLVENLSDWYLFNRSTPQRTVWPAWGRGF 231
>gi|426394261|ref|XP_004063418.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Gorilla gorilla
gorilla]
Length = 780
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 160/229 (69%), Positives = 192/229 (83%), Gaps = 1/229 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 292 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 351
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 352 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 411
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 412 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 470
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC 394
+++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC
Sbjct: 471 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGC 519
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 111/168 (66%), Positives = 137/168 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 228 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 287
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 288 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 347
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 348 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 395
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 63/109 (57%), Gaps = 26/109 (23%)
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
Y + S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 599 YFRKSVIQLDLAN--TKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK 656
Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 657 GHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 705
>gi|170589603|ref|XP_001899563.1| Glycosyl transferase family 8 protein [Brugia malayi]
gi|158593776|gb|EDP32371.1| Glycosyl transferase family 8 protein [Brugia malayi]
Length = 603
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 201/529 (37%), Positives = 278/529 (52%), Gaps = 81/529 (15%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
+Y A ++ SWIPN+HYSG YG LKL L +LP + K I+LDTD + DIAQLW+
Sbjct: 47 YYNASQYLDRFSWIPNRHYSGRYGFLKLILNDILPADVDKVIILDTDALIMDDIAQLWSF 106
Query: 232 FSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
FS++ Q IGL EN S+WYL + WPA GRGFN+GV+LLDLTKLR+I+W+ W
Sbjct: 107 FSKMTSLQAIGLAENLSNWYLINRNTSQRIVWPAWGRGFNSGVMLLDLTKLRNINWSHIW 166
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD 350
+ IA K + LADQD+ NA+I+ H ++V+ LPC+WN QL ++ + LC E++
Sbjct: 167 KEIASKNIKDYGPVELADQDVINAVINHHQWIVHKLPCEWNFQLGFQSQQN-LCPVEISR 225
Query: 351 LKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLN 410
LK++HWNSP K + N++ R Y + + DGN+ R + C+ Q E S L+
Sbjct: 226 LKLVHWNSPLKTRTVNRYAVLLRRYYDSMRDMDGNMFRSNIIHCHYD----SQREYSALD 281
Query: 411 E-------EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
+ +D C ++R ++ + RT LY Y + A +V L+ Q SM
Sbjct: 282 KVSEYDDNDDGCSEIRHARWITYRTLLYVRPYNFIAVP--KNVVLITQFSM--------- 330
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
DRL L ++W GP+S +Y++D+E + +
Sbjct: 331 ------------------------DRLMHFNALLQYWTGPVSAAVYVTDSELSLLIQFF- 365
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLPM-FGLYPYL 581
+ L +R N+ H VYKE +YPIN LRNVALN + +VFL D+DF+P GLY L
Sbjct: 366 -DDTLANRTNVALHAVYKESTYYPINYLRNVALNSSNDASFVFLADVDFIPRPGGLYATL 424
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFE---------------------TQRY----RTAFPAS 616
+ + M K+ VVPAFE QR R +
Sbjct: 425 ---CKKLVMTNSTNKRAFVVPAFEYMGNRVPVIPLTKNELLIELDAQRMQIFRRNKWIQG 481
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVA-HRDLPRYDTRFVGFGWNK 664
H TN+ RW A Y + W D+EPYIV R LP YD RFVGFGWNK
Sbjct: 482 HVATNYDRWRYADQEYSVSWRTDYEPYIVVRRRGLPPYDQRFVGFGWNK 530
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 1/103 (0%)
Query: 71 VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
++V +Y A ++ SWIPN+HYSG YG LKL L +LP + K I+LDTD + DIAQ
Sbjct: 43 LKVMYYNASQYLDRFSWIPNRHYSGRYGFLKLILNDILPADVDKVIILDTDALIMDDIAQ 102
Query: 131 LWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
LW+ FS++ Q IGL EN S+WYL + WPA GRGF
Sbjct: 103 LWSFFSKMTSLQAIGLAENLSNWYLINRNTSQRIVWPAWGRGF 145
>gi|355694034|gb|AER99533.1| glycosyltransferase-like 1B [Mustela putorius furo]
Length = 266
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/229 (68%), Positives = 183/229 (79%), Gaps = 1/229 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY A+ + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 28 PAVRVSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPAALPPDLARVIVLDTDVTFASDI 87
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWALF+ Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR
Sbjct: 88 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 147
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W++ A + LLT TSLADQDIFNA+I EHP LV TLPC WNVQLSD+T + E CY
Sbjct: 148 WGQMWKLTATRELLTLPATSLADQDIFNAVIKEHPELVQTLPCIWNVQLSDHTLA-ERCY 206
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC 394
+E +DLK+IHWNSPKKL+VKNKH+ +FRNL+LTFL YDGNLLRRELFGC
Sbjct: 207 SEASDLKVIHWNSPKKLRVKNKHVGYFRNLHLTFLGYDGNLLRRELFGC 255
Score = 202 bits (514), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 93/131 (70%), Positives = 105/131 (80%)
Query: 42 KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
KNPLHFHL+TD VA NIL+ LF TW VP V VSFY A+ + VSWIPNKHYSG+YGL+K
Sbjct: 1 KNPLHFHLVTDAVARNILEMLFYTWMVPAVRVSFYDAEELKPQVSWIPNKHYSGLYGLMK 60
Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
L LP LP LA+ IVLDTDV FA+DIA+LWALF+ Q IGLVENQSDWYLG LW+N
Sbjct: 61 LVLPAALPPDLARVIVLDTDVTFASDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRN 120
Query: 162 HKPWPALGRGF 172
H+PWPALGRGF
Sbjct: 121 HRPWPALGRGF 131
>gi|395742848|ref|XP_002821888.2| PREDICTED: glycosyltransferase-like 1B [Pongo abelii]
Length = 624
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 174/345 (50%), Positives = 213/345 (61%), Gaps = 64/345 (18%)
Query: 348 LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTLQQ 403
L ++IHWNSPKKL+VKNKH+EFFRN YLTFL YDGNLLRRELF C LQQ
Sbjct: 235 LPGSQVIHWNSPKKLRVKNKHVEFFRNFYLTFLGYDGNLLRRELFVCPSQPPPGAEQLQQ 294
Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
A L+ L+EEDPC++ R+ +LT R H+ FL +E
Sbjct: 295 A-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP--------------------- 332
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
+DVTLVAQLSMDRLQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 333 -------------HDVTLVAQLSMDRLQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVE 379
Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
S L +R+++ YHVVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL++
Sbjct: 380 ASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYNYLRA 439
Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
SI + + G K LVVPAFET RYR +FP S HAP
Sbjct: 440 SIEQLGL-GSQRKAALVVPAFETLRYRFSFPHSKVELLALLDAGALYTFRYHEWPRGHAP 498
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
T+++RW A PY+++WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 499 TDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 543
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 132/161 (81%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 65 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 124
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
VSFY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI+ L
Sbjct: 125 RVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISAL 184
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 185 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 225
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 75/107 (70%), Positives = 84/107 (78%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + VSWIPNKHYSG+YGL+KL LP LP LA+ IVLDTDV FA+DI
Sbjct: 122 PAVRVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
+ LWALF+ Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTG
Sbjct: 182 SALWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTG 228
>gi|324511340|gb|ADY44727.1| Glycosyltransferase-like protein [Ascaris suum]
Length = 477
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 165/378 (43%), Positives = 222/378 (58%), Gaps = 35/378 (9%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P++ FY + + VSWIPN+HYS YGLLKL L + PE L K +VLDTD++F DI
Sbjct: 131 PSVNVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDI 190
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA +S++ Q + +VEN SDWY+ N KPWPA RGFNTGV+L+DL KLR ++
Sbjct: 191 ARLWAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGFNTGVMLMDLEKLRTMN 250
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W W+ +A L T LADQDI NA+IS P +VY LPC+WN+Q+ +R + LC
Sbjct: 251 WTLLWKEVAADNLKEFGATQLADQDIINAVISRKPEIVYKLPCEWNLQMGYQSRQN-LCP 309
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
++DLKI+HWNSPKK + +N + FFR + TF + DGNLLR E+ C+
Sbjct: 310 ARISDLKIVHWNSPKKTRTRNPYAVFFRRHFQTFTQMDGNLLRTEMLRCS---------- 359
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
NE LR S+L S+ YEYE S DG A+ R+Q+
Sbjct: 360 ----NE------LRSSELRSV--------YEYEESDDGCGDLRHARNIKYRIQLF----- 396
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
+ Y SD DVTL QLSMDR + E L W GP+S +Y+SD+E Q + Y ++
Sbjct: 397 -IRRYAHQSDDADVTLTTQLSMDRFVLFESLLAFWTGPVSAAIYLSDSELTQLMQYLADT 455
Query: 526 EALRSRRNIGYHVVYKEG 543
+ R+R +I H V+KEG
Sbjct: 456 KVFRNRTDIALHAVFKEG 473
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 86/161 (53%), Positives = 111/161 (68%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CEV+QVAIVCAGY S+ VTL+KS+L++R++ + H + D VA IL TLF TW +P V
Sbjct: 74 CEVLQVAIVCAGYESSFRSVTLVKSLLYHRRHAITMHFVVDDVAAQILPTLFDTWQLPSV 133
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
V FY + + VSWIPN+HYS YGLLKL L + PE L K +VLDTD++F DIA+L
Sbjct: 134 NVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDIARL 193
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
WA +S++ Q + +VEN SDWY+ N KPWPA RGF
Sbjct: 194 WAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGF 234
>gi|298684037|gb|ADI96198.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 153/286 (53%), Positives = 182/286 (63%), Gaps = 59/286 (20%)
Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
Q +LS L+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 367 QKQLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA--------------------- 405
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA
Sbjct: 406 ------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYA 453
Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+
Sbjct: 454 QGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLR 513
Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
S+ +D+ KK ++VPAFET RYR +FP S HA
Sbjct: 514 KSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVXTKGHA 571
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 572 PTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (650), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 108/180 (60%), Positives = 132/180 (73%), Gaps = 1/180 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TS DQ A+I +L LPC VQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSXXDQXXXXAVIXXXXFLXXXLPCXXXVQLSDHTRSEQ-CY 306
>gi|358337461|dbj|GAA55819.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 767
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 191/562 (33%), Positives = 269/562 (47%), Gaps = 91/562 (16%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + Y +VSWIP HYSG +GL KL LP +LP T+ K I LD D++ DI
Sbjct: 156 PGVQFSLYSVHDYENEVSWIPTTHYSGTFGLCKLLLPNILPPTVEKVIGLDVDLLLNADI 215
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LW F R Q IGLV NQS WYL + WPALG G+NTGV+LL L +++ +
Sbjct: 216 RELWDCFDRFTHSQMIGLVRNQSPWYLQG--AQNTVWPALGYGYNTGVMLLHLQRMKLMR 273
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTR------ 339
W W+ +A+ L LADQD+ NA++ + P V L C+WNVQL+ +
Sbjct: 274 WKEAWQSVAKSTLQYLPHAPLADQDVINALLVQMPDAVNELACEWNVQLNSRAQPMQCPV 333
Query: 340 --------SDELCYTELTDLKIIHWNSPKKLKVKNKHMEF----------------FRNL 375
S + + + LKI H+NSP K ++ N+ F
Sbjct: 334 HWLTVQKPSPDNINSHIGQLKIAHYNSPVKPELMNQAFSAEDESSPGDIGLRLRTNFVQQ 393
Query: 376 YLTFLEYDGNLLR-----RELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHL 430
Y F +DG +L+ R C +T QA L+ C +L R H
Sbjct: 394 YRFFQNFDGVILKHQVRERRSEECEDHETDEAQAPLAG-----ECSELIAQVQIHHRIHP 448
Query: 431 YFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRL 490
+++ E + + P E DVTLV+QL+ DR+
Sbjct: 449 FYMHCEIQPKA---------------------VPTKCAEGCVVDQDADVTLVSQLTFDRI 487
Query: 491 QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINT 550
VE + K WEGP+SL LY++D +A + + S+ L +R NIGYH+VY +G YPIN
Sbjct: 488 HRVEEIAKRWEGPMSLALYVTDRDAAVLVDFVSRSQLLVNRTNIGYHLVYVDGVLYPINK 547
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI----RSMDMHGHGGKKVLVVPAFET 606
LRNVA+ T ++F+LD+DF+P +Y LK+ I R + + LVVPAFET
Sbjct: 548 LRNVAMQFAQTKFLFILDVDFIPSPNMYRELKTIIERQLRGPNKETRTERWCLVVPAFET 607
Query: 607 QRYRTAFP------------------------ASHAPTNFSRWVNATTPYQIEWAPDFEP 642
P A H T++ RW N + Y++ W+ D+EP
Sbjct: 608 FGEHVQLPETKDALLAQWAQNSVVPFRHEIWTAGHMATDYERWKNTSDLYEVFWSADYEP 667
Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
YIV R++ ++D FVGFGWNK
Sbjct: 668 YIVIPRNVLQFDEHFVGFGWNK 689
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 92/184 (50%), Gaps = 28/184 (15%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFY--------------------------RKNPLHFH 48
IQ+A+V G + R TL+KSI+++ R LH H
Sbjct: 76 IQLALVVGGAQAARQTSTLLKSIVYFAQWTYNCSGRNRSHDRNKQKVSGDNSRFPQLHLH 135
Query: 49 LITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVL 108
+I D A L +L TW +P V+ S Y +VSWIP HYSG +GL KL LP +L
Sbjct: 136 MIVDRHAFISLNSLLQTWELPGVQFSLYSVHDYENEVSWIPTTHYSGTFGLCKLLLPNIL 195
Query: 109 PETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL 168
P T+ K I LD D++ DI +LW F R Q IGLV NQS WYL + WPAL
Sbjct: 196 PPTVEKVIGLDVDLLLNADIRELWDCFDRFTHSQMIGLVRNQSPWYLQG--AQNTVWPAL 253
Query: 169 GRGF 172
G G+
Sbjct: 254 GYGY 257
>gi|345317667|ref|XP_001521231.2| PREDICTED: glycosyltransferase-like protein LARGE2-like, partial
[Ornithorhynchus anatinus]
Length = 285
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 192/284 (67%), Gaps = 42/284 (14%)
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
+L DIFNA+I +HP+LV+ LPC WNVQLSD+TRS++ CY+E++DLK+IHWNSPKKL+V
Sbjct: 37 TLTPGDIFNAVIKQHPWLVHPLPCSWNVQLSDHTRSEQ-CYSEVSDLKVIHWNSPKKLRV 95
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCN-------LTQTTLQQAELSNLNEEDPCYD 417
KNKH+EFFRNLYLTFLEYDGNLLR ELFGC+ + + Q L L+E+D CY+
Sbjct: 96 KNKHVEFFRNLYLTFLEYDGNLLRGELFGCSGSPPPAAVRRPAQSQRALDELDEDDVCYE 155
Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
RR +LT R HL FL +E A +DVTLVAQLS DRL
Sbjct: 156 FRRQRLTVHRVHLTFLPHEPPAPRP-HDVTLVAQLSTDRL-------------------- 194
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
QM+E L +HW GP+SL LY+SDAEAQ+FL +A SE L +RR++ YH
Sbjct: 195 -------------QMLEALCRHWPGPMSLALYLSDAEAQRFLRFAQASEVLAARRDVAYH 241
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
VVY+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL
Sbjct: 242 VVYREGPLYPVNLLRNVALAQARTPYVFLSDIDFLPAYSLYDYL 285
>gi|298684029|gb|ADI96194.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/284 (53%), Positives = 181/284 (63%), Gaps = 59/284 (20%)
Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
+LS L+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA----------------------- 405
Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA
Sbjct: 406 ----------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455
Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
+ +D+ KK ++VPAFET RYR +FP S HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573
Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
NF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 115/175 (65%), Positives = 139/175 (79%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NT VILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTXVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLS +TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSXHTRS 302
>gi|298684027|gb|ADI96193.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/284 (53%), Positives = 181/284 (63%), Gaps = 59/284 (20%)
Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
+LS L+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA----------------------- 405
Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA
Sbjct: 406 ----------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455
Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
+ +D+ KK ++VPAFET RYR +FP S HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573
Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
NF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 258 bits (659), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 117/177 (66%), Positives = 141/177 (79%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
W WR+ AE+ L+ L TSLADQDIFNA+I + P+LVY LPC WNVQLSD TRS++
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQXPFLVYQLPCFWNVQLSDXTRSEQ 304
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|355698993|gb|AES00982.1| like-glycosyltransferase [Mustela putorius furo]
Length = 339
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 156/301 (51%), Positives = 186/301 (61%), Gaps = 64/301 (21%)
Query: 392 FGC----NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVT 447
FGC ++ LQ+ +LS L+E+D CY+ RR + T RTHLYFL YEYE +S
Sbjct: 1 FGCPSEADVNSENLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAS------ 53
Query: 448 LVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLT 507
D DVTLVAQLSMDRLQM+E + KHWEGPISL
Sbjct: 54 ---------------------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLA 86
Query: 508 LYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLL 567
LY+SDAEAQQFL YA SE L SR N+ YH+VYKEG FYP+N LRNVA+ V TPY+FL
Sbjct: 87 LYLSDAEAQQFLRYAQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLS 146
Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------- 616
DIDFLPM+GLY YL+ S+ +D+ KK ++VPAFET RYR +FP S
Sbjct: 147 DIDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMG 204
Query: 617 -------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWN 663
HAPTNF++W ATTPY++EW DFEPY+V RD P YD RFVGFGWN
Sbjct: 205 TLFTFRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWN 264
Query: 664 K 664
K
Sbjct: 265 K 265
>gi|298684017|gb|ADI96188.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 151/282 (53%), Positives = 179/282 (63%), Gaps = 59/282 (20%)
Query: 407 SNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNN 466
S L+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 371 SELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA------------------------- 405
Query: 467 VDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSE 526
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA SE
Sbjct: 406 --------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSE 457
Query: 527 ALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 458 VLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVI 517
Query: 587 SMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNF 622
+D+ KK ++VPAFET RYR +FP S HAPTNF
Sbjct: 518 QLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFXYHXXTKGHAPTNF 575
Query: 623 SRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 576 AKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 119/184 (64%), Positives = 148/184 (80%), Gaps = 1/184 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306
Query: 346 TELT 349
+++
Sbjct: 307 RDVS 310
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684039|gb|ADI96199.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 151/283 (53%), Positives = 179/283 (63%), Gaps = 59/283 (20%)
Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
LS L+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 370 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA------------------------ 405
Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA S
Sbjct: 406 ---------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 456
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
E L SR+N+GYH+VYKEG F P+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 457 EVLMSRQNVGYHIVYKEGQFXPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 516
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
+D+ KK ++VPAFET RYR +FP S HAPTN
Sbjct: 517 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 574
Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
F++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 575 FAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 108/168 (64%), Positives = 134/168 (79%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTXTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYL LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLXNLWKNHRPWPALGRGY 231
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 111/174 (63%), Positives = 135/174 (77%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTXTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYL LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLXNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTR 339
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LP NVQLS +TR
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPXXXNVQLSXHTR 301
>gi|298684043|gb|ADI96201.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/284 (52%), Positives = 178/284 (62%), Gaps = 59/284 (20%)
Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
+LS L+E+D CY+ RR + T RTHLYFL
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFL------------------------------- 397
Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
+E +D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA
Sbjct: 398 --XXXFEPXADNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455
Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
SE L SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
+ +D+ KK ++VPAFET RYR +FP S HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573
Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
NF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 116/177 (65%), Positives = 142/177 (80%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLS +TRS++
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSXHTRSEQ 304
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|298684041|gb|ADI96200.1| LARGE-1 [Mastomys erythroleucus]
Length = 671
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/279 (52%), Positives = 175/279 (62%), Gaps = 59/279 (21%)
Query: 410 NEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDE 469
+E+D CY+ RR + T RTHLYFL YE+E S+
Sbjct: 374 DEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA---------------------------- 405
Query: 470 YEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR 529
D DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA SE L
Sbjct: 406 -----DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLM 460
Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
SR+N+GYH+VYKEG FYP+N LRNVA+ +STPY+ L DIDFLPM+GLY YL+ S+ +D
Sbjct: 461 SRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMXLSDIDFLPMYGLYEYLRKSVIQLD 520
Query: 590 MHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRW 625
+ KK ++VPAFET RYR +FP S HAPTNF++W
Sbjct: 521 L--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYXVWTKGHAPTNFAKW 578
Query: 626 VNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
ATTPYQ+EW DFEPY+V RD P D RFVGFGWNK
Sbjct: 579 RTATTPYQVEWEADFEPYVVVRRDCPEXDRRFVGFGWNK 617
Score = 258 bits (658), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 117/175 (66%), Positives = 141/175 (80%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRS 302
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|21751142|dbj|BAC03909.1| unnamed protein product [Homo sapiens]
Length = 274
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/284 (54%), Positives = 188/284 (66%), Gaps = 40/284 (14%)
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
++ QDIFNA+I EHP LV LPC WNVQLSD+T + E CY+E +DLK+IHWNSPKKL+V
Sbjct: 2 GVSAQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCYSEASDLKVIHWNSPKKLRV 60
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTLQQAELSNLNEEDPCYDLRR 420
KNKH+EFFRN YLTFLEYDGNLLRRELF C LQQA L+ L+ EDPC++ R+
Sbjct: 61 KNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQLQQA-LAQLDGEDPCFEFRQ 119
Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
+LT R H+ FL +E +DVTLVAQLSMDR
Sbjct: 120 QQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR------------------------ 154
Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
LQM+E L +HW GP+SL LY++DAEAQQFL + S L +R+++ YHVVY
Sbjct: 155 ---------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVEASPVLAARQDVAYHVVY 205
Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
+EG YP+N LRNVAL Q TPYVFL DIDFLP + LY YL+ +
Sbjct: 206 REGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLREA 249
>gi|168985115|emb|CAQ09433.1| like-glycosyltransferase [Homo sapiens]
Length = 187
Score = 271 bits (694), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 124/183 (67%), Positives = 153/183 (83%), Gaps = 1/183 (0%)
Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q
Sbjct: 6 EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65
Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
+GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W WR+ AE+ L+
Sbjct: 66 LGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMG 125
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWNSPK
Sbjct: 126 MLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPK 184
Query: 361 KLK 363
KL+
Sbjct: 185 KLR 187
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 63/89 (70%), Positives = 76/89 (85%)
Query: 84 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
+VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDIA+LWA+F + + +Q
Sbjct: 6 EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65
Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 66 LGLVENQSDWYLGNLWKNHRPWPALGRGY 94
>gi|198435803|ref|XP_002121835.1| PREDICTED: similar to like-glycosyltransferase [Ciona intestinalis]
Length = 671
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 176/541 (32%), Positives = 264/541 (48%), Gaps = 66/541 (12%)
Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW--PALGRGFYLADSVVEDVSWIPNKHYS 191
LF R LV+N S L +L+++ W P L Y +V++DVSW+P H+S
Sbjct: 85 LFHRKDSLHFHYLVDNVSKPILKELFRS---WDIPDLKVSMYEDIAVLDDVSWLPFTHHS 141
Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
G+ + KL + KVLP + K IVLD+D++FATDIA+LW F + Q+Q G+VENQSDWY
Sbjct: 142 GINSVYKLAILKVLPLYIDKVIVLDSDMVFATDIAELWLQFRHMDQQQAFGMVENQSDWY 201
Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
LG L + WPA+GRG N+G++LLD KLR +W W+ A++ + LADQD+
Sbjct: 202 LGTLKFEYVVWPAIGRGLNSGMMLLDCEKLRRANWDSEWKETAQQGIKRFKTVPLADQDV 261
Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELT-DLKIIHWNSPKKLKVKNKHME 370
N + + +++Y L C WN QL R ELCY + ++KI+HWN+ +K + + ++
Sbjct: 262 INLFLVNNKHMLYKLECNWNFQLP-YERKMELCYNDHKHNVKIVHWNNVRKHDINSDQIQ 320
Query: 371 FFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHL 430
NL+ FLE + LR + GC A+ + E + C + RTH+
Sbjct: 321 GLTNLFTIFLEMNSMSLRHQAPGCTQRGEVKLYADDNLTKETEICDKFTAQASSRHRTHI 380
Query: 431 YFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRL 490
Y+L ++Y+ + D D+TL V L + +
Sbjct: 381 YYLPFKYQ-NKDSRDITL---------------------------------VTHLQLKDM 406
Query: 491 QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGN--FYPI 548
Q E L K W GPIS+ ++++D E Q + +S+ L+ R NIGYH+V+ N PI
Sbjct: 407 QTFETLCKTWTGPISVAIFVADRETDQLEHFIDSSDILKQRDNIGYHLVFNGVNEQVPPI 466
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
+ L N + +QV T Y+ + D+ + LY L + D+ + V V+P FET
Sbjct: 467 SYLLNTSASQVLTEYLLIYDVKYQQAQQLYDILYNYFSKFDV--LSLESVYVLPVFETTS 524
Query: 609 Y-------------------RTAFPASHAPTNFS--RWVNATTPYQIEWAPDFEPYIVAH 647
+ +F RW ATTPY +W P I A
Sbjct: 525 NVLDSTPNTKVSIVKQIAVGEIKYSNDQRANSFDIVRWATATTPYLTDWRNIGTPIIFAL 584
Query: 648 R 648
R
Sbjct: 585 R 585
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 82/160 (51%), Positives = 114/160 (71%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
CE I V +VC GY + + +V L+KSILF+RK+ LHFH + D V+ IL+ LF +W +P +
Sbjct: 59 CETIHVFLVCTGYKTVKDMVVLVKSILFHRKDSLHFHYLVDNVSKPILKELFRSWDIPDL 118
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+VS Y +V++DVSW+P H+SG+ + KL + KVLP + K IVLD+D++FATDIA+L
Sbjct: 119 KVSMYEDIAVLDDVSWLPFTHHSGINSVYKLAILKVLPLYIDKVIVLDSDMVFATDIAEL 178
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
W F + Q+Q G+VENQSDWYLG L + WPA+GRG
Sbjct: 179 WLQFRHMDQQQAFGMVENQSDWYLGTLKFEYVVWPAIGRG 218
>gi|157823495|ref|NP_001101909.1| glycosyltransferase-like protein LARGE1 [Rattus norvegicus]
gi|149032476|gb|EDL87367.1| like-glycosyltransferase (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 385
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 121/187 (64%), Positives = 150/187 (80%), Gaps = 1/187 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370
Query: 346 TELTDLK 352
+++DLK
Sbjct: 371 RDVSDLK 377
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
>gi|392927212|ref|NP_509833.3| Protein LGE-1 [Caenorhabditis elegans]
gi|92058691|sp|Q21389.3|LGE1_CAEEL RecName: Full=Glycosyltransferase-like protein LARGE
gi|255068767|emb|CAA91997.4| Protein LGE-1 [Caenorhabditis elegans]
Length = 631
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 176/522 (33%), Positives = 252/522 (48%), Gaps = 108/522 (20%)
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
V+WIPN HYS YGL KL +P+++ + K + +D D+IF T+I LW F Q
Sbjct: 113 VAWIPNSHYSKYYGLSKLLIPEIIGNDIGKIMFMDVDIIFQTNIFDLWKQFRNFNNSQVF 172
Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
G+VEN SDWYL K K WPALGRGFNTG+I+ DL KLR WA WR++A K+L
Sbjct: 173 GMVENLSDWYLNKDGKK-SVWPALGRGFNTGIIMFDLDKLRKNGWASKWRVVANKYLRIH 231
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIHWNSPK 360
T+++DQDIFNA I ++P + +PC +N QL T+S ELC T L +H+NS
Sbjct: 232 GKTAMSDQDIFNAYIHDYPTEIIQIPCAYNYQLGALTKSKELCPETPLA----LHFNSQN 287
Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
K KN FF + F E DG+ L+R ++ N++D C+
Sbjct: 288 KTVGKN--YAFFDKIRKAFDEMDGSDLKR------------RRRSFKGNNQKDICH---- 329
Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
EY L +D +++ PN + + ++
Sbjct: 330 -------------EY----------------LPLDNFRII---PNAIGRMTKPA---ELC 354
Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
+V Q S DRL W PIS +Y D + L A L +R +I H+V+
Sbjct: 355 MVTQFSKDRLNHFLESANAWRHPISTAVYGKDKD---LLDIAKAVTEL-NRTDITIHLVF 410
Query: 541 KE-------GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
+E + YPIN LRNVA+ + Y+ + D+DF+ + G Y + +D G+
Sbjct: 411 EEPTESWMLDSLYPINFLRNVAIEHANCKYILMTDVDFV-VLGDYGTI------IDQTGN 463
Query: 594 -GGKKVLVVPAFET----------------------------QRYR-TAFPASHAPTNFS 623
K+VLV+PA E Q +R T +P+SH PTN S
Sbjct: 464 LKQKEVLVIPALEMTYPQLRLNLSNFLSRKDLVIEHLLNKTIQTFRETIWPSSHVPTNIS 523
Query: 624 RWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
+W+ + Y + + ++EPY V + + P YD RF GFGWNK
Sbjct: 524 KWIKSNRTYMVNYEKNYEPYFVIKKEECPFYDQRFGGFGWNK 565
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 90/166 (54%), Gaps = 2/166 (1%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P + I++A + G TR + + KS+L Y + HLITD + + L ++W
Sbjct: 35 AFPEEDYIRLAYIIGGNFMTRLMFMQHFKSVLKYSDHFFRLHLITDENHRSDIHELMTSW 94
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
++ E F+ + V+WIPN HYS YGL KL +P+++ + K + +D D+IF T
Sbjct: 95 NISNCEWFFHNLTEFEKRVAWIPNSHYSKYYGLSKLLIPEIIGNDIGKIMFMDVDIIFQT 154
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
+I LW F Q G+VEN SDWYL K K WPALGRGF
Sbjct: 155 NIFDLWKQFRNFNNSQVFGMVENLSDWYLNKDGKK-SVWPALGRGF 199
>gi|167516064|ref|XP_001742373.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778997|gb|EDQ92611.1| predicted protein [Monosiga brevicollis MX1]
Length = 716
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 168/534 (31%), Positives = 254/534 (47%), Gaps = 72/534 (13%)
Query: 165 WPALGRGFYL--ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 222
WP G ++ AD+ V+WIP H +G LKL P++LP+ +A+ +V+D D++
Sbjct: 115 WPLAGVEWHAHPADAGQARVAWIPTLHKAGWVAHLKLVYPQLLPQ-VAQLVVIDLDLLVQ 173
Query: 223 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFNTGVILLDLTK 280
D+ QL + + + V QSDWYLG L + + WPA+ RG NTGV++L L +
Sbjct: 174 ADLCQLAGILAD-HPKALFAAVPQQSDWYLGTLPDTRENVVWPAIERGLNTGVMVLHLER 232
Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
+R W W+ + + LL R T LADQD++N + EHP L C WNVQL D++ +
Sbjct: 233 MRAWGWDQRWQSLVRRELLVRHHTGLADQDVYNLVALEHPETWTMLSCAWNVQLHDHSVT 292
Query: 341 DELCYTE--LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ 398
L + +++HWNSPKK + R+L + G
Sbjct: 293 QHCDQPSHLLANARVLHWNSPKKYDAAFAEATYVRSLLAAAEAHSG-------------- 338
Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
A + P YD S L+ + S G D A D ++
Sbjct: 339 ----LAASAQFTLRCPGYDAGGSSLS-------LAWAAVDTSWAGEDPAARA----DIVE 383
Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE---A 515
++ NN A G+ +TLV L+ DRL ++ W+GP+ +Y++DA+
Sbjct: 384 GGHLDENNPAATRAP--GSRITLVTHLTPDRLAALQQTLHQWDGPVHAAVYLADADLWRL 441
Query: 516 QQFLSYAGNSEALRSRRNIGYHVVYKEGN-FYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
+QFL EA+ N+ HV +++G YP+N LRN+ L+ V+T +VF+ D+D P+
Sbjct: 442 RQFLEQVATLEAI----NLTLHVAFEDGGRRYPVNHLRNLLLDHVTTSHVFMADVDLCPL 497
Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRT----------------------- 611
G +L+ ++ S G + LVVPAFE YR
Sbjct: 498 AGSRAHLEQAL-SASATGSAQRVALVVPAFEYVGYRAPTFLSLTTVQDLDLADIRPFRVD 556
Query: 612 AFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
+P H PTN++R +A+ YQ+ WAP +EPY+V PRY GFGWNK
Sbjct: 557 VWPRGHGPTNYTRLQSASDYYQVSWAPGYEPYVVLRTATAPRYPEALAGFGWNK 610
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 86/174 (49%), Gaps = 11/174 (6%)
Query: 1 MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S E+P + V ++ G + L ++K ++ R PL H++ D +
Sbjct: 57 VSCSEQP-------LHVGMIALGDQAPSELYRVLKDVVARRSCPLVLHVVCDPTTCEAVA 109
Query: 61 TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
L TW + VE + AD+ V+WIP H +G LKL P++LP+ +A+ +V+D
Sbjct: 110 ALKQTWPLAGVEWHAHPADAGQARVAWIPTLHKAGWVAHLKLVYPQLLPQ-VAQLVVIDL 168
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGF 172
D++ D+ QL + + + V QSDWYLG L + + WPA+ RG
Sbjct: 169 DLLVQADLCQLAGILAD-HPKALFAAVPQQSDWYLGTLPDTRENVVWPAIERGL 221
>gi|341903546|gb|EGT59481.1| hypothetical protein CAEBREN_08278 [Caenorhabditis brenneri]
Length = 643
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 171/521 (32%), Positives = 259/521 (49%), Gaps = 94/521 (18%)
Query: 180 EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQ 239
+ V WIPN HYS YGL KL +P+++ + + K + +D D++F +DI LW F +Q
Sbjct: 115 KKVKWIPNSHYSKYYGLTKLLIPEIMSDNVGKVMYVDIDIVFRSDIFYLWQKFRDFNNQQ 174
Query: 240 TIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLL 299
+IG++EN SDWYL K K W A GRGFN+G+I+ DL KLR+ ++ WR +A K+L
Sbjct: 175 SIGMIENLSDWYLNKDGKK-SVWTARGRGFNSGIIMFDLDKLREENFKEKWRSVANKYLK 233
Query: 300 TRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY-TELTDLKIIHWNS 358
T+++DQDIFNA + ++P + TLPC++N QL + +S+ELC+ T L +H+NS
Sbjct: 234 IHGKTTMSDQDIFNAYVHDYPNQIKTLPCEYNFQLGNLAKSEELCHETPLA----LHFNS 289
Query: 359 PKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDL 418
K KN Y + N +R+E+ G + + DL
Sbjct: 290 QNKTVRKN---------YALY-----NAIRKEIDGIDGS-------------------DL 316
Query: 419 RRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGND 478
RR K S + +E Y + + V LQ I PN + E ++
Sbjct: 317 RRRKRRSFDA-MQKMEETYGMRNRNECSSYVP------LQNFRILPNALGRLEKTAH--- 366
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
+ LV Q S+DRL K W PIS +Y +D E LS ++ +R +I H+
Sbjct: 367 LCLVTQFSIDRLHNFLENVKTWMHPISAAIYGNDEE----LSIIVDAIKALNRTDITIHM 422
Query: 539 VYKEGNF------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
V++E + YPIN LRN A+N + ++ + D+DF+ ++G L+ + M
Sbjct: 423 VFREPDHQIISQTYPINFLRNFAMNHSNCEHILMADVDFM-IYGDDSSLQEQVMKM---- 477
Query: 593 HGGKKVLVVPAFET----------------QRYRTAF------------PASHAPTNFSR 624
K +L++PAFET + R F P +H TN +
Sbjct: 478 -KEKDLLIIPAFETSDENITDVSKFPQTKEKVARGIFSGKVRIFREVIWPNAHNATNVAE 536
Query: 625 WVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
W+ A Y +E+ ++EPY V + + P YD RF GFGWNK
Sbjct: 537 WIIAEDTYTVEYGKNYEPYFVIRKSECPIYDQRFGGFGWNK 577
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 88/166 (53%), Gaps = 2/166 (1%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P + + + + G TR L + KS+L Y + H I D + L ++W
Sbjct: 39 AFPEEDHVSLVYILGGNFMTRLLFMQHFKSVLNYSDSNFRLHFIVDYENKEDVNDLMTSW 98
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
+V VE F+ + + V WIPN HYS YGL KL +P+++ + + K + +D D++F +
Sbjct: 99 NVSNVEWFFHNLTNYEKKVKWIPNSHYSKYYGLTKLLIPEIMSDNVGKVMYVDIDIVFRS 158
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
DI LW F +Q+IG++EN SDWYL K K W A GRGF
Sbjct: 159 DIFYLWQKFRDFNNQQSIGMIENLSDWYLNKDGKK-SVWTARGRGF 203
>gi|149032477|gb|EDL87368.1| like-glycosyltransferase (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 328
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 110/168 (65%), Positives = 136/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 72/107 (67%), Positives = 87/107 (81%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTG 298
>gi|343958624|dbj|BAK63167.1| glycosyltransferase-like protein LARGE1 [Pan troglodytes]
Length = 286
Score = 252 bits (643), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 118/201 (58%), Positives = 140/201 (69%), Gaps = 26/201 (12%)
Query: 488 DRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYP 547
RLQM+E + KHWEGPISL LY+SDAEAQQFL YA SE L SR N+GYH+VYKEG FYP
Sbjct: 13 SRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYP 72
Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
+N LRNVA+ +STPY+FL DIDFLPM+GLY YL+ S+ +D+ KK ++VPAFET
Sbjct: 73 VNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETL 130
Query: 608 RYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
RYR +FP S HAPTNF++W ATTPY++EW DFEPY
Sbjct: 131 RYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPY 190
Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
+V RD P YD RFVGFGWNK
Sbjct: 191 VVVRRDCPEYDRRFVGFGWNK 211
>gi|298684025|gb|ADI96192.1| LARGE-1 [Mastomys natalensis]
Length = 671
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 135/168 (80%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALG G+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGXGY 231
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 82/127 (64%), Positives = 99/127 (77%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALG G+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGXGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRI 292
W R+
Sbjct: 248 WEQMXRL 254
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 63/109 (57%), Gaps = 26/109 (23%)
Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
YL+ S+ +D+ KK ++VPAFET RY +FP S
Sbjct: 511 YLRKSVIQLDLAN--TKKAMIVPAFETLRYGLSFPKSKAELLSMLDMGTLFTFRYHVWTK 568
Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPTNF++W ATTPYQ+EW DFEPY+V RD P YD RFVGFGWNK
Sbjct: 569 GHAPTNFAKWRTATTPYQVEWEADFEPYVVXRRDCPEYDRRFVGFGWNK 617
>gi|298684011|gb|ADI96185.1| LARGE-1 [Mastomys kollmannspergeri]
Length = 302
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 114/175 (65%), Positives = 138/175 (78%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+G FY AD + +VSW PNKHYSG+YGL+KL L K LP L + VLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWNPNKHYSGIYGLMKLVLTKTLPANLERVTVLDTDITFATDI 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR +
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
W WR+ AE+ L+ L TSLADQDIFNA+I ++P+L Y LPC WNVQLSD+TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLXYQLPCFWNVQLSDHTRS 302
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 108/168 (64%), Positives = 134/168 (79%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 64 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSW PNKHYSG+YGL+KL L K LP L + VLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWNPNKHYSGIYGLMKLVLTKTLPANLERVTVLDTDITF 183
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231
>gi|355694037|gb|AER99534.1| glycosyltransferase-like 1B [Mustela putorius furo]
Length = 282
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 113/205 (55%), Positives = 138/205 (67%), Gaps = 25/205 (12%)
Query: 484 QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
QLSMDRLQM+E L +HW GP+SL LY++DAEAQQFL + S L +R+N+ YHVVY+EG
Sbjct: 1 QLSMDRLQMLEALCRHWPGPMSLALYLTDAEAQQFLRFVEASAVLSARQNVAYHVVYREG 60
Query: 544 NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPA 603
YP+N LRNVAL Q TPYVFL DIDFLP + LY YL++SI + + G K LVVPA
Sbjct: 61 PLYPVNQLRNVALAQSLTPYVFLSDIDFLPAYSLYDYLRASIEQLKL-GSERKAALVVPA 119
Query: 604 FETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPD 639
FET YR FP+S HAPT+++RW A TPY+++WA D
Sbjct: 120 FETLHYRFRFPSSKAELLALLDSGSLYTFRYHEWPRGHAPTDYARWREAQTPYRVQWAAD 179
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EPY+V RD PRYD RFVGFGWNK
Sbjct: 180 YEPYVVVPRDCPRYDPRFVGFGWNK 204
>gi|332019413|gb|EGI59897.1| Glycosyltransferase-like protein LARGE1 [Acromyrmex echinatior]
Length = 636
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 121/281 (43%), Positives = 157/281 (55%), Gaps = 59/281 (20%)
Query: 408 NLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNV 467
N NE +PC + + RTH + LEYEY S
Sbjct: 313 NYNESNPCREFEKGANILYRTHPFLLEYEYNVYSPM------------------------ 348
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
DV LV Q S++R+ ++E L KHW G IS+ LY++DAE Q FL++ S
Sbjct: 349 ----------DVALVTQCSIERIPLLEALSKHWPGTISVALYLTDAEVQNFLNFVRGSLD 398
Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
LR+RRNI YHVVYK+G+ YPIN LRN A++ V TP+VF LD+DFLP FGLY + + I
Sbjct: 399 LRTRRNIAYHVVYKDGDLYPINYLRNTAISYVPTPFVFQLDVDFLPQFGLYENIMNYINR 458
Query: 588 MDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFS 623
++++ K L+VPAFET+RYR FPA+ HA TN+S
Sbjct: 459 LNIN-ESDKVALIVPAFETERYRFTFPANKDELLKFLKRGILYTFRYHVWTQGHAATNYS 517
Query: 624 RWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W N+T PYQ+ W PDFEPYI+ + PRYDTRF+GFGWNK
Sbjct: 518 FWRNSTDPYQVSWEPDFEPYIIVSKSAPRYDTRFIGFGWNK 558
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 96/161 (59%), Positives = 115/161 (71%), Gaps = 2/161 (1%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+ I VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A L TLF TW +P
Sbjct: 73 CDTIHVAMVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARKTLSTLFRTWDLPHA 132
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
V+FY A+ V VSWIPNKHYSGVYGLLKL LP + E K +VLDTDV TD+ L
Sbjct: 133 NVTFYEAEIWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVLDTDVTVLTDMHPL 190
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
W +F R + +GLVENQS+WY+ L +PWPALGRGF
Sbjct: 191 WKMFERFSVDEALGLVENQSNWYVKALSYGQRPWPALGRGF 231
Score = 192 bits (487), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 6/221 (2%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P FY A+ V VSWIPNKHYSGVYGLLKL LP + E K +VLDTDV TD+
Sbjct: 130 PHANVTFYEAEIWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVLDTDVTVLTDM 187
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW +F R + +GLVENQS+WY+ L +PWPALGRGFNTGV+L+ L +LRD
Sbjct: 188 HPLWKMFERFSVDEALGLVENQSNWYVKALSYGQRPWPALGRGFNTGVMLMHLRRLRDRK 247
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
++ W + ++ L TSLADQD+ NA+I+E P +VY + C WN+QLSD+T SD +CY
Sbjct: 248 FSRSWENVTKRVLGHIPETSLADQDVINAVINEQPSIVYKIECTWNIQLSDHTLSD-ICY 306
Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNL 386
D I++N + K + LEY+ N+
Sbjct: 307 ---RDTNRINYNESNPCREFEKGANILYRTHPFLLEYEYNV 344
>gi|349805191|gb|AEQ18068.1| putative glycosyltransferase isoform 4 [Hymenochirus curtipes]
Length = 279
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 113/187 (60%), Positives = 132/187 (70%), Gaps = 20/187 (10%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVTLVAQLSMDRLQM+E++ +HWEGP+SL LY+SDAEAQQFL YA SE L+SR N+GYH
Sbjct: 56 DVTLVAQLSMDRLQMLELICRHWEGPMSLALYLSDAEAQQFLRYAQASEVLQSRTNVGYH 115
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
VVYKEG YP+N LRNVAL TPYVF DIDFLPM+GLY
Sbjct: 116 VVYKEGQLYPVNLLRNVALKNAQTPYVF-SDIDFLPMYGLY------------------S 156
Query: 598 VLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRF 657
+L + A T RY + HAPTN+++W A TPY +EWAPDFEPY+V RD P YD RF
Sbjct: 157 MLDMGALYTFRYHV-WEKGHAPTNYAKWRTAVTPYSVEWAPDFEPYVVVKRDCPEYDQRF 215
Query: 658 VGFGWNK 664
+GFGWNK
Sbjct: 216 LGFGWNK 222
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 30/34 (88%), Gaps = 1/34 (2%)
Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSP 359
LPC WNVQLSD+TRS++ CY ++ DLK+IHWNSP
Sbjct: 22 LPCFWNVQLSDHTRSEQ-CYRDVADLKVIHWNSP 54
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 18/20 (90%), Positives = 18/20 (90%)
Query: 148 ENQSDWYLGKLWKNHKPWPA 167
E QSDWYLG LWKNHKPWPA
Sbjct: 1 EKQSDWYLGNLWKNHKPWPA 20
>gi|168985117|emb|CAQ09435.1| like-glycosyltransferase [Homo sapiens]
Length = 183
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 110/162 (67%), Positives = 134/162 (82%), Gaps = 6/162 (3%)
Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L+ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1 LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59
Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC ++ LQ+ +LS L+E+D
Sbjct: 60 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118
Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMD 455
CY+ RR + T RTHLYFL YEYE ++D DVTLVAQLSMD
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMD 160
>gi|350645709|emb|CCD59684.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 757
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 145/423 (34%), Positives = 210/423 (49%), Gaps = 76/423 (17%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P FY + + VSWIP+ HYSG++GL KL +P++LPE++ K I LD D++ +D+
Sbjct: 273 PNFNVYFYQLELYLHKVSWIPSTHYSGLFGLSKLLIPEILPESVEKVINLDVDLLVNSDL 332
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWA F + Q IGLV NQS WYL K N WPA G GFNTGV+LLDL +LR+
Sbjct: 333 LELWAHFQKFNTSQMIGLVANQSPWYLRK--TNRIVWPAWGPGFNTGVMLLDLVRLRESH 390
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W+ W + L ++ LADQDI NA + E P ++Y LPC+WNVQL+ N + +LC
Sbjct: 391 WSEHWYKTTKSALQHIPYSMLADQDIINAALVEIPSIIYQLPCEWNVQLTSNM-NIKLCA 449
Query: 346 TELTD---------------------LKIIHWNSPKK----------------LKVKNKH 368
++ LKI H+N P K L++K +
Sbjct: 450 IIWSNDYSIHNVQSFHSNPFESFNYHLKIAHFNHPIKADMITLNEIIDDDQLRLRLKRQF 509
Query: 369 MEFFRNLYLTFLEYDGNLLRR--------ELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
M +R + +DG +++ E + + + Q N D C +
Sbjct: 510 MRMYR----YYQNFDGTIVQSPGKLSSCSEYYNPSTFNRKINQTSSKNTITIDGCEEFSE 565
Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
L + R H Y+ +Y + + +M + V IN N V+
Sbjct: 566 DLLKTHRIHPYYYKYNF------------STFTMKDKENV-INMKN-----------HVS 601
Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
LV+QL+ DRL +E L W GP+SL LY++D EA + Y NS L++R NIGYH+VY
Sbjct: 602 LVSQLTFDRLHRLEELAIRWLGPMSLALYLTDREATLLIEYITNSRILQNRTNIGYHIVY 661
Query: 541 KEG 543
+G
Sbjct: 662 VDG 664
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 101/198 (51%), Gaps = 42/198 (21%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRK-------------------------------- 42
I +++V G + R LVTLIKSIL RK
Sbjct: 179 IDISLVIGGAQAARGLVTLIKSILLQRKWERSVEFSKFPRPATLKSTVTFNGKETIMDNV 238
Query: 43 --------NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYS 94
++ HLI+D A L TLF TW++P V FY + + VSWIP+ HYS
Sbjct: 239 NKHNECYIRHINLHLISDYHAFKSLSTLFDTWNIPNFNVYFYQLELYLHKVSWIPSTHYS 298
Query: 95 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
G++GL KL +P++LPE++ K I LD D++ +D+ +LWA F + Q IGLV NQS WY
Sbjct: 299 GLFGLSKLLIPEILPESVEKVINLDVDLLVNSDLLELWAHFQKFNTSQMIGLVANQSPWY 358
Query: 155 LGKLWKNHKPWPALGRGF 172
L K N WPA G GF
Sbjct: 359 LRK--TNRIVWPAWGPGF 374
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++W+ D+EPY++ R +D F+GFGWNK
Sbjct: 665 VQWSADYEPYVIVSRHTTMFDEIFIGFGWNK 695
>gi|21706836|gb|AAH33922.1| Gyltl1b protein, partial [Mus musculus]
Length = 285
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 108/207 (52%), Positives = 136/207 (65%), Gaps = 25/207 (12%)
Query: 482 VAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYK 541
VAQLSMDRLQM+E L +HW GP+SL LY++D EAQQFL + S L R+++ YHVVY+
Sbjct: 1 VAQLSMDRLQMLEALCRHWPGPMSLALYLTDEEAQQFLHFVETSPVLSMRKDVAYHVVYR 60
Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
+G YP+N LRNVAL Q TPYVFL DIDFLP + LY YL++SI +++ K LVV
Sbjct: 61 DGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLRASIEQLELDSR-RKTALVV 119
Query: 602 PAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWA 637
PAFET YR +FP S H+ T++SRW A PY+++W+
Sbjct: 120 PAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGHSSTDYSRWREAQAPYRVQWS 179
Query: 638 PDFEPYIVAHRDLPRYDTRFVGFGWNK 664
D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 180 ADYEPYVVVPRDCPRYDPRFVGFGWNK 206
>gi|402590775|gb|EJW84705.1| glycosyltransferase, partial [Wuchereria bancrofti]
Length = 354
Score = 219 bits (558), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 147/231 (63%), Gaps = 2/231 (0%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ +Y A + SWIPN+HYSG YG LKL L +LP + K I LDTD + DI
Sbjct: 122 PAVKVIYYNASQYLGRFSWIPNRHYSGRYGFLKLILNDILPADVDKVITLDTDALIMDDI 181
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
AQLW+ FS++ Q IGLVEN S+WYL + WPA GRGFN+GV+LLDLTKLR+I
Sbjct: 182 AQLWSFFSKMTSLQAIGLVENLSNWYLINRNTSQRIVWPAWGRGFNSGVMLLDLTKLRNI 241
Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
SW+ W+ IA K + LADQD+ NA+I+ H ++V+ LPC+WN QL ++ D LC
Sbjct: 242 SWSYIWKEIASKNVKDYGPVELADQDVINAVINHHQWIVHKLPCEWNFQLGFQSQQD-LC 300
Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN 395
E++ LK++HWNSP K + N++ R Y + + DGN+ R + C+
Sbjct: 301 PVEISRLKLVHWNSPLKTRTVNRYAVLLRRYYDSMRDMDGNMFRSNIIHCH 351
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 78/164 (47%), Positives = 104/164 (63%), Gaps = 2/164 (1%)
Query: 11 TCEVIQVA-IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
C+V+ +A +VC G + + L TLIKSIL YR+ + HL+ D + + I+ LF+TW +P
Sbjct: 63 NCQVLHLALVVCGGVHLSYPLSTLIKSILRYRQQAIVLHLLVDDITMRIISLLFNTWRLP 122
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
V+V +Y A + SWIPN+HYSG YG LKL L +LP + K I LDTD + DIA
Sbjct: 123 AVKVIYYNASQYLGRFSWIPNRHYSGRYGFLKLILNDILPADVDKVITLDTDALIMDDIA 182
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
QLW+ FS++ Q IGLVEN S+WYL + WPA GRGF
Sbjct: 183 QLWSFFSKMTSLQAIGLVENLSNWYLINRNTSQRIVWPAWGRGF 226
>gi|256079175|ref|XP_002575865.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 757
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/432 (32%), Positives = 206/432 (47%), Gaps = 94/432 (21%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P FY + + VSWIP+ HYSG++GL KL +P++LPE++ K I LD D++ +D+
Sbjct: 273 PNFNVYFYQLELYLHKVSWIPSTHYSGLFGLSKLLIPEILPESVEKVINLDVDLLVNSDL 332
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
+LWA F + Q IGLV NQS WYL K N WPA G GFNTGV+LLDL +LR+
Sbjct: 333 LELWAHFQKFNTSQMIGLVANQSPWYLRK--TNRIVWPAWGPGFNTGVMLLDLVRLRESH 390
Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
W+ W + L ++ LADQDI NA + E P ++Y LPC+WNVQL+ N + +LC
Sbjct: 391 WSEHWYKTTKSALQHIPYSMLADQDIINAALVEIPSIIYQLPCEWNVQLTSNM-NIKLCA 449
Query: 346 TELTD---------------------LKIIHWNSPKK----------------LKVKNKH 368
++ LKI H+N P K L++K +
Sbjct: 450 IIWSNDYSIHNVQSFHSNPFESFNYHLKIAHFNHPIKADMITLNEIIDDDQLRLRLKRQF 509
Query: 369 MEFFRNLYLTFLEYDGNLLRR--------ELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
M +R + +DG +++ E + + + Q N D C +
Sbjct: 510 MRMYR----YYQNFDGTIVQSPGKLSSCSEYYNPSTFNRKINQTSSKNTITIDGCEEFSE 565
Query: 421 SKLTSLRTHLYFLEYEYEASS---------DGNDVTLVAQLSMDRLQMVEINPNNVDEYE 471
L + R H Y+ +Y + + N V+LV+QL+ DRL +E
Sbjct: 566 DLLKTHRIHPYYYKYNFSTFTMKDKENVINMKNHVSLVSQLTFDRLHRLE---------- 615
Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR 531
E+ + W GP+SL LY++D EA + Y NS L++R
Sbjct: 616 ----------------------ELAIR-WLGPMSLALYLTDREATLLIEYITNSRILQNR 652
Query: 532 RNIGYHVVYKEG 543
NIGYH+VY +G
Sbjct: 653 TNIGYHIVYVDG 664
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 100/198 (50%), Gaps = 42/198 (21%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRK-------------------------------- 42
I +++V G + R LVTLIKSIL RK
Sbjct: 179 IDISLVIGGAQAARGLVTLIKSILLQRKWERSVEFSKFPRPATLKSTVTFNGKETIMDNV 238
Query: 43 --------NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYS 94
++ HLI D A L TLF TW++P V FY + + VSWIP+ HYS
Sbjct: 239 NKHNECYIRHINLHLIIDYHAFKSLSTLFDTWNIPNFNVYFYQLELYLHKVSWIPSTHYS 298
Query: 95 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
G++GL KL +P++LPE++ K I LD D++ +D+ +LWA F + Q IGLV NQS WY
Sbjct: 299 GLFGLSKLLIPEILPESVEKVINLDVDLLVNSDLLELWAHFQKFNTSQMIGLVANQSPWY 358
Query: 155 LGKLWKNHKPWPALGRGF 172
L K N WPA G GF
Sbjct: 359 LRK--TNRIVWPAWGPGF 374
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
++W+ D+EPY++ R +D F+GFGWNK
Sbjct: 665 VQWSADYEPYVIVSRHTTMFDEIFIGFGWNK 695
>gi|444511886|gb|ELV09960.1| Glycosyltransferase-like protein LARGE1, partial [Tupaia chinensis]
Length = 255
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 86/138 (62%), Positives = 108/138 (78%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 93 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
TW VP V V FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212
Query: 125 ATDIAQLWALFSRLRQRQ 142
ATDIA+LWA+F + + Q
Sbjct: 213 ATDIAELWAVFHKFKASQ 230
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/74 (59%), Positives = 55/74 (74%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
PA+ FY AD + +VSWIPNKHYSG+YGL+KL L K LP L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216
Query: 226 AQLWALFSRLRQRQ 239
A+LWA+F + + Q
Sbjct: 217 AELWAVFHKFKASQ 230
>gi|326428547|gb|EGD74117.1| Gyltl1b protein [Salpingoeca sp. ATCC 50818]
Length = 830
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 171/591 (28%), Positives = 255/591 (43%), Gaps = 119/591 (20%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
Y D + +SW+ HYSG Y LLKL LP +L + + I LDTD+ F DIA+LW +
Sbjct: 180 LYPGDRHLSAISWVETSHYSGQYPLLKLLLPTLLAD-VQHVIALDTDLTFRADIAELWRI 238
Query: 232 FSRLRQRQTIGLVENQSD-----------WYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
F+ + IG+VEN S W H P + + L+
Sbjct: 239 FAMFEAEEAIGVVENLSGGERRGGRRLCIWRARARVSPHSP-----------LCMASLSC 287
Query: 281 LRDI-SWAGFWRIIAEKFL----LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
R +WA WR L + T+LADQDI NA++++HP + L C WN QLS
Sbjct: 288 RRSQRNWANVWRDTTRAVLSDNRIGVTATALADQDIINAVLNQHPEWMLLLSCVWNFQLS 347
Query: 336 DNTRSDELCYTELTDLKIIHWNSPKK----LKVKNKHMEFFRNLYLTFLEYDGNLLRREL 391
+++ + C D ++HWN+ K + + R+ Y DG LL
Sbjct: 348 EHSLAQTQCSAR--DALVVHWNTRAKDMQATLLDDNAFRALRDAYEISRRVDGALLTSTT 405
Query: 392 F--GCNLTQTTLQQ--------------------------AELSNLNEEDPCYDLRRSKL 423
C+ T + EL N E+ R+S
Sbjct: 406 LDVACDSHTTAVNNRQQQQQIQIQQQLQHQHQHQQQQQQQKELENQGEDGGGGHTRQSAH 465
Query: 424 TSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDE-------------Y 470
R L L +SS + + ++ Q S+ R V P + Y
Sbjct: 466 RQPRPQLSALSSSSSSSSQSSILGVMGQTSIPRAVRVACPPTYLSALQAPLRVHPYFIAY 525
Query: 471 EASSDG-NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR 529
+AS + +++TLV ++DRLQ + + W+GPIS +++S+ +A F S A AL
Sbjct: 526 DASLETESEITLVTHFTLDRLQAFVAMAETWQGPISAAVFISEHDA--FTSLA----ALH 579
Query: 530 SRRNIGYHVVYKE----GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
N+ HVVY++ G+ YPIN LRN+A++ TPY LD+DF L+ L +
Sbjct: 580 -LTNVALHVVYQDPDHDGSLYPINRLRNMAISLCRTPYFLHLDVDFSFRSDLHAVLGRHV 638
Query: 586 RSMDMH--------GHGGKKVLVVPAFETQRYR------------------------TAF 613
R++ + G LVVPAFE+ Y+ + +
Sbjct: 639 RAVSIGELNIEASMGAARGTALVVPAFESVSYKRHVPDTKERLLWELSTGQMQPFRVSEW 698
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
P H+ T+F+RW A PY I+ FEPY+V + P +D F+GFGWNK
Sbjct: 699 PQGHSATDFARWKQADAPYLIQHQSGFEPYLVLPKTAPPFDETFLGFGWNK 749
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 81/140 (57%), Gaps = 1/140 (0%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
C+ + +A V GY + L T++KSIL +R P+H H+ITD L +L + TW +
Sbjct: 117 CDTVHLAAVSVGYAQSAHLRTMLKSILLHRSLPIHLHVITDDATLPVLDVMLGTWGIRAF 176
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
+ Y D + +SW+ HYSG Y LLKL LP +L + + I LDTD+ F DIA+L
Sbjct: 177 NYTLYPGDRHLSAISWVETSHYSGQYPLLKLLLPTLLAD-VQHVIALDTDLTFRADIAEL 235
Query: 132 WALFSRLRQRQTIGLVENQS 151
W +F+ + IG+VEN S
Sbjct: 236 WRIFAMFEAEEAIGVVENLS 255
>gi|60360244|dbj|BAD90366.1| mKIAA4105 protein [Mus musculus]
Length = 249
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 140/243 (57%), Gaps = 36/243 (14%)
Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEY 435
+L F ++L+ LF + LQQA L+ L+EE+PC++ R+ +LT R H+ FL +
Sbjct: 28 WLAFGRSCPSVLKDMLFFRSWNPAQLQQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPH 86
Query: 436 EYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEM 495
+ +DVTLVAQLSMDRLQM+E
Sbjct: 87 QPPPPQ----------------------------------PHDVTLVAQLSMDRLQMLEA 112
Query: 496 LFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVA 555
L HW GP+SL LY++D EAQQFL + S L R+++ YHVVY++G YP+N LRNVA
Sbjct: 113 LCGHWPGPMSLALYLTDEEAQQFLHFVETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVA 172
Query: 556 LNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPA 615
L Q TPYVFL DIDFLP + LY YL++SI +++ K LVVPAFET YR +FP
Sbjct: 173 LAQALTPYVFLSDIDFLPAYSLYDYLRASIEQLELDSR-RKTALVVPAFETLHYRFSFPN 231
Query: 616 SHA 618
S A
Sbjct: 232 SKA 234
>gi|345324407|ref|XP_001507723.2| PREDICTED: glycosyltransferase-like protein LARGE2-like
[Ornithorhynchus anatinus]
Length = 542
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 84/124 (67%), Positives = 103/124 (83%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
+++ VAIVCAG+NS+R VTL+KSILF+RKNPLHFHL+TD VA IL+TLF +W VP V
Sbjct: 32 QLLHVAIVCAGHNSSRDAVTLVKSILFHRKNPLHFHLVTDAVARLILETLFRSWMVPSVL 91
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
VSFY AD + +V+WIPNKHYSG+YGLLKLTLP+ LP L + IVLDTD+ FA+DIA LW
Sbjct: 92 VSFYDADELKPEVAWIPNKHYSGIYGLLKLTLPRALPPDLPRVIVLDTDITFASDIADLW 151
Query: 133 ALFS 136
A+FS
Sbjct: 152 AVFS 155
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 64/110 (58%), Gaps = 26/110 (23%)
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
P+ ++SI +D+ K LVVPAFET RYR +FPAS
Sbjct: 228 PFPRASIEQLDLATR--KAALVVPAFETLRYRLSFPASKAELLTLLDAGSLYTFRYHVWP 285
Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
HAPT+++RW A PY++ WA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 286 RGHAPTDYARWREAPAPYRVAWAADYEPYVVVRRDCPRYDPRFVGFGWNK 335
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/62 (67%), Positives = 51/62 (82%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY AD + +V+WIPNKHYSG+YGLLKLTLP+ LP L + IVLDTD+ FA+DIA LWA+
Sbjct: 94 FYDADELKPEVAWIPNKHYSGIYGLLKLTLPRALPPDLPRVIVLDTDITFASDIADLWAV 153
Query: 232 FS 233
FS
Sbjct: 154 FS 155
>gi|395543773|ref|XP_003773787.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Sarcophilus
harrisii]
Length = 448
Score = 181 bits (460), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 89/177 (50%), Positives = 115/177 (64%), Gaps = 36/177 (20%)
Query: 177 SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLR 236
S +V+WIPNKHYSG+YGLLKLTLP+ LP +L + IVLDTD+ FA+DIA+LWA+F+R
Sbjct: 22 STAPEVAWIPNKHYSGIYGLLKLTLPRALPPSLPRVIVLDTDITFASDIAELWAVFARF- 80
Query: 237 QRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK 296
+GVILL L +LR W WR+ AE+
Sbjct: 81 ----------------------------------SGVILLHLERLRQAGWEQMWRLTAER 106
Query: 297 FLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKI 353
LLT L TSLADQD+FNA+I +HP+LV+ LPC WNVQLSD+TR+++ CY E +DLK+
Sbjct: 107 ELLTMLATSLADQDVFNAVIKQHPWLVHPLPCSWNVQLSDHTRAEQ-CYLEGSDLKV 162
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 54/93 (58%), Gaps = 24/93 (25%)
Query: 596 KKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP 631
K LVVPAFET RYR +FP S H PT+++RW A P
Sbjct: 184 KAALVVPAFETLRYRLSFPTSKAELLSLLDDGSLYTFRYHVWPRGHEPTDYARWREARAP 243
Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
Y++EWA D+EPY+V RD PRYD RFVGFGWNK
Sbjct: 244 YRVEWAADYEPYVVVPRDCPRYDPRFVGFGWNK 276
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/59 (66%), Positives = 50/59 (84%)
Query: 80 SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
S +V+WIPNKHYSG+YGLLKLTLP+ LP +L + IVLDTD+ FA+DIA+LWA+F+R
Sbjct: 22 STAPEVAWIPNKHYSGIYGLLKLTLPRALPPSLPRVIVLDTDITFASDIAELWAVFARF 80
>gi|268578177|ref|XP_002644071.1| C. briggsae CBR-LGE-1 protein [Caenorhabditis briggsae]
Length = 471
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 136/448 (30%), Positives = 206/448 (45%), Gaps = 107/448 (23%)
Query: 266 GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
GRGFNTGVI++DL +LRD+ + WR +A K+L T+++DQDIFNA + E+P V T
Sbjct: 15 GRGFNTGVIMMDLDRLRDMDFKTKWRAVANKYLRIHGKTTMSDQDIFNAYLYEYPQEVKT 74
Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGN 385
LPC WN QL ++SD+LC+ E + +H+NS K KN YL + N
Sbjct: 75 LPCAWNYQLGVLSKSDKLCFEE---PQALHFNSQNKTVRKN---------YLLY-----N 117
Query: 386 LLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGND 445
+R+E+ G + T+ R + + + TH +S +
Sbjct: 118 AVRKEIDGIDGTEL--------------------RKRRSLVSTH------PAPSSPTPSK 151
Query: 446 VTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPIS 505
Q LQ + PN + + + + ++ LV Q S DRL + W PIS
Sbjct: 152 TNTCEQFL--PLQNFRVLPNALGKIK---ENPELCLVTQFSKDRLHSFVKNARKWNRPIS 206
Query: 506 LTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVF 565
+Y D++ + A +AL +R +I H+V++E + YPIN LRNVA+N +
Sbjct: 207 AAIYGKDSDLAEI---ADVIKAL-NRSDIAIHMVFEEESTYPINFLRNVAINHTECVNML 262
Query: 566 LLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET------------------- 606
L D DF+ ++G Y L+ R++ K+VLV+PAFET
Sbjct: 263 LADADFI-IYGDYKELEDQARNL-----TEKEVLVIPAFETTNASMTDVSKFPQTKKDVV 316
Query: 607 ----------------------------QRYR-TAFPASHAPTNFSRWVNATTPYQIEWA 637
Q +R ++P +H TN W+ A Y + +
Sbjct: 317 LSFFNKTVKIMLNPLRIFFTLPKTILKVQMFRGKSWPNAHNATNIVEWIAAEHVYSVNYN 376
Query: 638 PDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
++EPY I+ P YD RF G+GWNK
Sbjct: 377 RNYEPYFIIKKTSFPMYDLRFGGYGWNK 404
>gi|291242113|ref|XP_002740954.1| PREDICTED: conserved hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 656
Score = 165 bits (417), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 136/519 (26%), Positives = 232/519 (44%), Gaps = 96/519 (18%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
+P + S Y L L K + + + +D +VIF +D A++W+ F+ Q +G+V
Sbjct: 125 VPGVNCSTYYNTEFLELMKSPATRIPRLMYIDWNVIFLSDPAEIWSQFNNFAPGQAVGVV 184
Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
+++P NT ++LLD+ + +++ + +K L
Sbjct: 185 G-----------LSYRP-----EDMNTNLLLLDMNYVMELNLQD----VLDKIGLGSDNY 224
Query: 305 SLADQD-----IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSP 359
+ D + ++ + +P ++ LPC +N+ D T S CY ++ ++ + + SP
Sbjct: 225 DITDHNSKLHTTITSLSTTNPDALFLLPCNYNL---DFTNSSMDCYLQMGNM--LSFQSP 279
Query: 360 KKLK----VKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPC 415
+ + + + FRN Y + L+YDGNL R ++ GC + ++ N E C
Sbjct: 280 ESFGNDPWLNIDYHQSFRNQYFSLLDYDGNLFRHKIIGCGINIARPEKIVKLFRNGELDC 339
Query: 416 YDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSD 475
+ LRTH YF+ GN T E S
Sbjct: 340 SVFFMRETMILRTHTYFI---------GNIPT-----------------------EKDSP 367
Query: 476 GNDVTLVAQLSMDRL-QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNI 534
++TL Q + DR QM+E + +W GPIS+ +Y +D + Q + S+ L++R N+
Sbjct: 368 PIEITLSLQSTFDRFYQMLERIATYWTGPISVAVYATDTQVQLIPAMFEMSQILKNRNNV 427
Query: 535 GYHV--VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
HV V +E YP N +RN+A+ +V+T Y+++ +IDFL LY LK ++ G
Sbjct: 428 AVHVGFVEREVGQYPFNHMRNIAMEEVNTTYLYIGEIDFLYKPTLYEELKDIVQWFVTSG 487
Query: 593 HGGK-KVLVVPAFETQRYRTA-------------------------FPASHAPTNFSRWV 626
G + K LVVPA+E R + FP H P+N+++ +
Sbjct: 488 GGMEMKALVVPAYEVNRLNVSYIPSTKAELIRLWDNEDLHLFHFWDFPQGHLPSNYTKIM 547
Query: 627 NATTPYQIEWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
PY I W +EPY+V ++ LP + RF G G +K
Sbjct: 548 LEDQPYPIIWEKGYEPYVVLLNKALPDFPLRFSGQGHDK 586
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 65/137 (47%), Gaps = 21/137 (15%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALN-ILQTLFSTWSVPQ 70
C+ I + I + +R+L T +KS+ FYR LHFH I D +L+TL+ TW+VP
Sbjct: 68 CDDIHLLINIPFKDLSRNLTTTLKSLFFYRTTRLHFHYIVDQERTGYMLRTLYKTWNVPG 127
Query: 71 VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
V S Y Y L L K + + + +D +VIF +D A+
Sbjct: 128 VNCSTY--------------------YNTEFLELMKSPATRIPRLMYIDWNVIFLSDPAE 167
Query: 131 LWALFSRLRQRQTIGLV 147
+W+ F+ Q +G+V
Sbjct: 168 IWSQFNNFAPGQAVGVV 184
>gi|291242121|ref|XP_002740957.1| PREDICTED: glycosyltransferase-like 1b-like [Saccoglossus
kowalevskii]
Length = 630
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/490 (25%), Positives = 216/490 (44%), Gaps = 100/490 (20%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 268
+ K IVLD +VIF DI++LW F + +E P++ R
Sbjct: 134 IDKVIVLDLNVIFIRDISELWDEFRQFGDNNCFATIE-----------------PSIERF 176
Query: 269 FNTG-VILLDLTKL-RDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTL 326
TG V+L+++ L R FW E L + +S + I AI ++ + L
Sbjct: 177 DQTGNVVLIEIETLSRFADQTAFW----ENILQSNEQSSFTTK-IMKAIETQPGERLKLL 231
Query: 327 PCQWNVQLSDNTRSDELCYTELTDL---KIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYD 383
C++NV ++ + + C ++++D+ ++H + + F ++Y + +E D
Sbjct: 232 ACKFNVDVNSHIQQ---CVSDISDIVSFNMVHVTEKSLNENDALFVHHFNSIYDSIVEGD 288
Query: 384 GNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG 443
GNLLR + C ++ T + + S + + R +L LR+H+++L + D
Sbjct: 289 GNLLRNRMVKCPVSLTPKNKTKCSFVTD--------REQLV-LRSHMFYLGENITSPDDD 339
Query: 444 NDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
+++ Q R +QM++ + +W P
Sbjct: 340 FSISMAMQTDFSRF--------------------------------VQMIDKISVYWTCP 367
Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNF--YPINTLRNVALNQVST 561
+S+T+Y +D EA+Q L + S L R++I H+ Y E F YP+N LRNVA +T
Sbjct: 368 MSITIYATDEEARQLLKFRRISSTLSQRKDISIHISYVEREFEDYPVNHLRNVAQEGATT 427
Query: 562 PYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR------------ 608
Y+F+ +ID+ + L++ + + D KK +VVPAFE+ R
Sbjct: 428 QYIFITEIDYFYSLDICQNLQNLVSMIEDRERTMDKKAIVVPAFESNRLSDRMIPETKEE 487
Query: 609 -------------YRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYD 654
+ T +P H PTN+ RW++++ Y++ W +EPY+V R P +
Sbjct: 488 IVDKYLNEEYHYFHHTIYPGGHKPTNYERWIDSSEEYRVRWNRGYEPYLVLKRKHSPEFP 547
Query: 655 TRFVGFGWNK 664
F G G+NK
Sbjct: 548 KIFSGRGYNK 557
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 26/141 (18%)
Query: 12 CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLIT-DTVALNILQTLFSTWSVPQ 70
CE + + + + Y R +L+KSI + K+ L+FH+I+ D +++ TL TW +P
Sbjct: 52 CEYVNILMYVSSYTEVRQFASLVKSITAHSKSSLYFHVISADEKIRSVVSTLMKTWLLPS 111
Query: 71 VEVSFYLADSVVE---DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
V + Y D ++ D+ P + K IVLD +VIF D
Sbjct: 112 VAYTCYSVDDFLDRSRDLWATP----------------------IDKVIVLDLNVIFIRD 149
Query: 128 IAQLWALFSRLRQRQTIGLVE 148
I++LW F + +E
Sbjct: 150 ISELWDEFRQFGDNNCFATIE 170
>gi|308494773|ref|XP_003109575.1| CRE-LGE-1 protein [Caenorhabditis remanei]
gi|308245765|gb|EFO89717.1| CRE-LGE-1 protein [Caenorhabditis remanei]
Length = 434
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 121/438 (27%), Positives = 194/438 (44%), Gaps = 118/438 (26%)
Query: 275 LLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
++DL +LR+ W WR +A K+L T+++DQDIFNA I ++P + +LPC++N QL
Sbjct: 1 MMDLDRLREGDWKSKWRAVANKYLKLHGKTTMSDQDIFNAYIHDYPTEIKSLPCEYNFQL 60
Query: 335 SDNTRSDELCY-TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFG 393
T+SDELC T L +H+NS K +N Y+ + N +R+E+ G
Sbjct: 61 GTLTKSDELCVETPLA----LHFNSQNKTVRRN---------YVVY-----NAVRKEIEG 102
Query: 394 CNLTQTTLQQA--------ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGND 445
+ + ++ +S + + C + L + RT L E +
Sbjct: 103 IDGSDLRRRRRSLKKRALPSVSKTDSKTSCEEYM--PLQNFRTLPNALGRLMETAK---- 156
Query: 446 VTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPIS 505
+ LV Q S DRL N KHW+ PIS
Sbjct: 157 LCLVTQFSKDRLDSFIENA---------------------------------KHWKQPIS 183
Query: 506 LTLYMSDAEAQQFLSYAGNSEALRS--RRNIGYHVVYKE-------GNFYPINTLRNVAL 556
+Y +D + + + EA+++ R ++ H+V+KE + YPIN LRN A+
Sbjct: 184 AAIYGTDQDMNEIV------EAVKTLNRPDLVLHMVFKEPTDQTMSQDIYPINYLRNTAI 237
Query: 557 NQVSTPYVFLLDIDFLPMFGLYPYLKS-SIRSMDMHGHGGKKVLVVPAFET--------- 606
+ + + D+DF+ +Y + ++S D+ K+VLV+PAFET
Sbjct: 238 KYSNCEKILMTDVDFM----IYEDTQDLVVQSKDL---KDKEVLVIPAFETAENNITDVT 290
Query: 607 ------QRYRTAF-------------PASHAPTNFSRWVNATTPYQIEWAPDFEPY-IVA 646
++ AF P++H TN + W+ A Y + + ++EPY I+
Sbjct: 291 TFPHTKEQLVKAFFKKKINVFRGITWPSAHNSTNVAEWIIAEDVYSVNYEKNYEPYFIIK 350
Query: 647 HRDLPRYDTRFVGFGWNK 664
P YD RF GFGWNK
Sbjct: 351 KSSCPMYDQRFGGFGWNK 368
>gi|10834722|gb|AAG23791.1|AF258588_1 PP5656 [Homo sapiens]
Length = 257
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 81/134 (60%), Gaps = 25/134 (18%)
Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFP 614
L Q TPYVFL DIDFLP + LY YL++SI + + G K LVVPAFET RYR +FP
Sbjct: 18 GLGQALTPYVFLSDIDFLPAYSLYDYLRASIEQLGL-GSRRKAALVVPAFETLRYRFSFP 76
Query: 615 AS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL 650
S HAPT+++RW A PY+++WA ++EPY+V RD
Sbjct: 77 HSKVELLALLDAGTLYTFRYHEWPRGHAPTDYARWREAQAPYRVQWAANYEPYVVVPRDC 136
Query: 651 PRYDTRFVGFGWNK 664
PRYD RFVGFGWNK
Sbjct: 137 PRYDPRFVGFGWNK 150
>gi|443686400|gb|ELT89685.1| hypothetical protein CAPTEDRAFT_152623 [Capitella teleta]
Length = 314
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 105/211 (49%), Gaps = 28/211 (13%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
D N +T V QL+ DRL+ + + + W GP++ Y++ + F+ L+ R N
Sbjct: 37 GDLNQITFVTQLTPDRLERLRRVAESWRGPMAANFYVTPKKIADFMHAYCTDPVLQKREN 96
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
I HV EG FYP N +RN+A++ V T YVFL D+DF+P+ G+ ++ IR +
Sbjct: 97 IQLHVTQTEGLFYPSNIMRNIAVDPVKTKYVFLCDVDFVPIPGIEDIMQRYIRDGFV--- 153
Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
K++V+PAFE + A P S H T RW A
Sbjct: 154 SDDKLIVIPAFEIKEKEIAHPKSKREVLDLLDKEKIQLMDIDHFDGAHKWTELDRWKTAE 213
Query: 630 TPYQIEWAPDFEPYIVAHRD-LPRYDTRFVG 659
PY ++W FEPY+V +D PRY+ RFVG
Sbjct: 214 EPYIVQWREYFEPYVVMPKDKTPRYNPRFVG 244
>gi|440797224|gb|ELR18319.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 807
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 122/230 (53%), Gaps = 40/230 (17%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY-MSDAEAQQFLSYAGNSEALRSR 531
S D V++ QL++DRL + M+ ++W+GPIS ++ M +AE + L + + L
Sbjct: 488 SPDPKGVSIATQLTIDRLPSLRMMAQNWDGPISAAVFVMDEAEVDEVLYFRSTWDKLAD- 546
Query: 532 RNIGYHVVYKEGNF-------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
NI +H+V E N+ YP+NTLRN+AL + T + +LD DF+P + +
Sbjct: 547 -NIDFHLVISERNYAYKTLGLYPVNTLRNIALARCRTEHALILDADFIPSPDSHSVIMDM 605
Query: 585 IRSMDMHGHGGKK------VLVVPAFE-----------------------TQRYRTAFPA 615
+ + G ++ V VVPAFE Q + +
Sbjct: 606 MPRLGRISEGAQEPDSPGVVWVVPAFEIEPGVPLPDSRKELVEMSRRDLIRQVHEVKWAP 665
Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
+HAPTN++ W+N++ PY+I + P++EPY++ RD +P YD RF+G+G++K
Sbjct: 666 AHAPTNYTAWMNSSEPYEITFRPEYEPYMMVPRDSVPLYDERFIGYGYDK 715
>gi|167523375|ref|XP_001746024.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775295|gb|EDQ88919.1| predicted protein [Monosiga brevicollis MX1]
Length = 306
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 110/234 (47%), Gaps = 47/234 (20%)
Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
Q+V P V + + VTL+ + DRL + L + +EGP+S L+ +D EA +
Sbjct: 28 QIVRELPRFVCLERPALETGAVTLITAFTPDRLDLFLSLVRSYEGPVSAALFANDEEADR 87
Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEG---NFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
+ R R + H++ +G PIN LRN+A + TP+VF +D+DF
Sbjct: 88 VAQALAETSFARRHRRVCVHLLMPDGLAPGAVPINRLRNLAWDHAPTPWVFHVDVDF--- 144
Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------- 612
++S R+ VVPAFET +YR
Sbjct: 145 -------EASNRT----------AFVVPAFETPQYRLPRFGTKAALLEAWDLGLLRPFRM 187
Query: 613 --FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+P +H PT++ RW T PY I+W PD+EPY++ PR+D RF GFGWNK
Sbjct: 188 DLWPVAHRPTDYHRWSGTTDPYAIDWQPDYEPYLLMSPHAPRFDERFSGFGWNK 241
>gi|326437988|gb|EGD83558.1| hypothetical protein PTSG_04163 [Salpingoeca sp. ATCC 50818]
Length = 1024
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 117/271 (43%), Gaps = 53/271 (19%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P + F +++V+WIP +H +G YG KL L + LP+ + IVLDTD+ DI
Sbjct: 139 PCVSSTFRDITPALDEVAWIPTRHTAGAYGFAKLVLERALPD-VDHVIVLDTDLSVHADI 197
Query: 226 AQLWALF----------------------SRLRQ-------------RQTIGLVENQSDW 250
QLW L RL R+ QSDW
Sbjct: 198 GQLWRLAFFPSDTAHGSNNGNMHNNHNAGDRLHSLDDSTQQQQQQQQRRLFAAALQQSDW 257
Query: 251 YLGKL--WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR-LWTSLA 307
YLG L + WPA+GRG NTGV+ L L +R W W + L +R T+LA
Sbjct: 258 YLGTLPGIPSSLIWPAVGRGLNTGVLALHLAHMRQDGWRDLWETALRRQLASRRATTALA 317
Query: 308 DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK--------------I 353
+QD+FN + EH L LPC+WN+QL D+T + + I
Sbjct: 318 EQDVFNVVFLEHRELGGVLPCEWNLQLHDHTSAPACLMPAYQGHQHPRQYWLPPPRQPAI 377
Query: 354 IHWNSPKKLKVKNKHMEFFRNLYLTFLEYDG 384
+HWNSP K + ++ + +E +G
Sbjct: 378 LHWNSPLKQEADIPLRPHLMRMHQSAVEANG 408
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 73/150 (48%), Gaps = 31/150 (20%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM--DMHGHG-GKKVLVVP 602
YPIN +RNVA ++ +T +V +LD DF+ Y +++++ D G VVP
Sbjct: 716 YPINYMRNVAWDEATTEHVLMLDADFVISSNAYARIRATLHRQLNDRDARDTGATAFVVP 775
Query: 603 AFETQRYRTAFP------------------------ASHAPTNFSRWVNATT----PYQI 634
A ET Y P HAPTN +RW+++T PY++
Sbjct: 776 ALETDEYAFRVPDTVDGVRRLAKEGELRAFRSREWEKGHAPTNMTRWLSSTQQQQQPYRV 835
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+W P +EPY+V PRYD R GFGWNK
Sbjct: 836 DWQPGYEPYLVLPATAPRYDERLSGFGWNK 865
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 86/197 (43%), Gaps = 41/197 (20%)
Query: 16 QVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW---SVPQVE 72
VA+V AG ++ R I+S+L +R +H H++ D A + LQ +P V
Sbjct: 83 HVAMVVAGVDAARGASITIRSLLQHRTVCVHLHIVCDAEARHSLQRFLQGGLQARLPCVS 142
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+F +++V+WIP +H +G YG KL L + LP+ + IVLDTD+ DI QLW
Sbjct: 143 STFRDITPALDEVAWIPTRHTAGAYGFAKLVLERALPD-VDHVIVLDTDLSVHADIGQLW 201
Query: 133 ALF----------------------SRLRQ-------------RQTIGLVENQSDWYLGK 157
L RL R+ QSDWYLG
Sbjct: 202 RLAFFPSDTAHGSNNGNMHNNHNAGDRLHSLDDSTQQQQQQQQRRLFAAALQQSDWYLGT 261
Query: 158 L--WKNHKPWPALGRGF 172
L + WPA+GRG
Sbjct: 262 LPGIPSSLIWPAVGRGL 278
>gi|443695311|gb|ELT96252.1| hypothetical protein CAPTEDRAFT_226558 [Capitella teleta]
Length = 453
Score = 125 bits (314), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 110/219 (50%), Gaps = 25/219 (11%)
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
+A+ DG D+T V +S +R+ M+E + HWEGP+S+ + + Q + SE+L+
Sbjct: 173 QATEDGRDITFVTHISYNRMYMLERIAAHWEGPMSVAIDILPENFDQLVENFCASESLQL 232
Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
+ N+ H+ K+G FYP N +R VALN T YVFL D DF+P+ G + ++ +
Sbjct: 233 KENLMIHLSLKQGEFYPANFIRTVALNGSCTNYVFLSDGDFVPVQGSELRMMEYVKEHGV 292
Query: 591 HGHGGKKVLVVPAFETQRYRTAF------------------------PASHAPTNFSRWV 626
G K +V+PAFE R T F P H+ T F W
Sbjct: 293 LAPGEKHAVVLPAFELLRDGTPFPREKRDVIDLWKEKQLRPFHLREWPPGHSATLFPYWK 352
Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
A Y+I+W FEPY++ D +P Y+ FVG ++K
Sbjct: 353 IADKIYRIKWRRQFEPYLIMRSDEVPAYNQEFVGRFFDK 391
>gi|256072706|ref|XP_002572675.1| glycosyltransferase-related [Schistosoma mansoni]
gi|360044230|emb|CCD81777.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 249
Score = 122 bits (306), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 99/168 (58%), Gaps = 12/168 (7%)
Query: 176 DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
D ++ + WI +KH +G +KL LP++LP ++ K IVLD D+I T+I +LW F R
Sbjct: 5 DQTLDRLQWIVSKHPAGAKPFVKLLLPEILPPSVHKVIVLDLDMILNTNIVELWNHFERF 64
Query: 236 RQRQTIGLVENQSDWYLGKLWKNHKPWPAL-----GRGFNTGVILLDLTKLRDISWAGFW 290
++ Q IG+ Q+ ++ HK +L G G+N G++L DL+KLR + W W
Sbjct: 65 QETQMIGIGLEQNPYF-------HKVMKSLISDWKGYGYNGGILLFDLSKLRLMMWNDIW 117
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNT 338
I + ++ + +QD+ N II ++ +L+Y +PC+WNVQLSD +
Sbjct: 118 LSITAHLMQSKGYLVTGEQDVMNMIIFKYKHLLYEIPCEWNVQLSDGS 165
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 48/76 (63%)
Query: 79 DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
D ++ + WI +KH +G +KL LP++LP ++ K IVLD D+I T+I +LW F R
Sbjct: 5 DQTLDRLQWIVSKHPAGAKPFVKLLLPEILPPSVHKVIVLDLDMILNTNIVELWNHFERF 64
Query: 139 RQRQTIGLVENQSDWY 154
++ Q IG+ Q+ ++
Sbjct: 65 QETQMIGIGLEQNPYF 80
>gi|443733888|gb|ELU18085.1| hypothetical protein CAPTEDRAFT_135861, partial [Capitella teleta]
Length = 270
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 28/196 (14%)
Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGN 544
L + R ++E + +WEGP S+ ++ ++ S S+ L+ R+N+ +H+VY+ G
Sbjct: 1 LLIGRFGVLEKMLSNWEGPASVVFHVKHSDLSVLDSLYTKSKVLKDRKNVSFHLVYRRGE 60
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
FYPIN LRNV +NQ TPY +D+D +P G+Y K I+ M K LV+P+F
Sbjct: 61 FYPINYLRNVGVNQSRTPYTIFIDVDLVPNPGIYQQAKVYIQQ--MLKPNLKVALVIPSF 118
Query: 605 ETQRYRTAFPAS-------------------------HAPTNFSRWVNATTPYQIEWAPD 639
E + FP + H PTNF +W+ AT PY+ W
Sbjct: 119 EAIEEKLTFPKNKAELLQQIQQNKTLPYKMHRGWNLGHKPTNFPKWLTATEPYEALWRDA 178
Query: 640 FEPYIVAHRDL-PRYD 654
FEPY + DL P Y+
Sbjct: 179 FEPYTITQTDLMPPYN 194
>gi|358332625|dbj|GAA51264.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 458
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 108/217 (49%), Gaps = 34/217 (15%)
Query: 173 YLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 232
Y A+ + + IPN H +GV+GLLKL +P +LPET++K IV+D D ++ D+ +LW F
Sbjct: 154 YEAEKHLRVLKLIPNTHPAGVWGLLKLEIPSLLPETVSKVIVVDCDTLWNADVQELWLQF 213
Query: 233 SRLRQRQTIGLVENQS----DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG 288
+ + Q++ QS D G ++++P G N GV L++L LR+ W
Sbjct: 214 NTFTENQSLSFAREQSADVGDCDTG--LRSNRP----EAGVNGGVALMNLENLRESGWWQ 267
Query: 289 FWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD------------ 336
WR K LL +Q ++NAI + P+L LPC+WN+QL D
Sbjct: 268 LWRSTVAKILLAEPTLKEGEQTVYNAIRNYAPFLFGELPCEWNIQLWDQKAHECCPVIWP 327
Query: 337 --NTRSDELCY----------TELTDLKIIHWNSPKK 361
N R C LT+ KIIH+NS +K
Sbjct: 328 DLNQRHKMQCKLGGSLSAAYPNGLTEAKIIHYNSAQK 364
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 80/164 (48%), Gaps = 27/164 (16%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYR---------------------------KNPLHF 47
I +A+ G N+ + +++KS+L++R +PL
Sbjct: 66 IHIAVTLHGMNAIYNFRSMLKSLLYFRGTHNGGTYNCLLRKTSVTQCPKINRVTASPLVV 125
Query: 48 HLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKV 107
H+I D + +W+ V++ Y A+ + + IPN H +GV+GLLKL +P +
Sbjct: 126 HVILDESTKLGYEDHIKSWADDHVQLIEYEAEKHLRVLKLIPNTHPAGVWGLLKLEIPSL 185
Query: 108 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 151
LPET++K IV+D D ++ D+ +LW F+ + Q++ QS
Sbjct: 186 LPETVSKVIVVDCDTLWNADVQELWLQFNTFTENQSLSFAREQS 229
>gi|167523373|ref|XP_001746023.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775294|gb|EDQ88918.1| predicted protein [Monosiga brevicollis MX1]
Length = 1116
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 94/160 (58%), Gaps = 7/160 (4%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYR--KNPLHFHLITDTVALNILQTLFSTWSVPQ 70
E I+VA+V G N R L LIKSILF+ PL FH+ITD L+ L+ +W +P
Sbjct: 876 EAIEVAVVAVGSNHARRLQVLIKSILFHHLPPQPLRFHVITDHETAASLRHLYRSWRLPA 935
Query: 71 VEVSFYLADSVVE--DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
V+V FY + ++ D+ + HY+G Y +KL + +LP +L + +VLDTD++F I
Sbjct: 936 VQVRFYSITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLGPI 995
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPA 167
A+LW F +V+N S+WY+ G+L + +PWPA
Sbjct: 996 AELWDQFKGWSASAIFAVVDNFSEWYIPGRLQR--QPWPA 1033
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 106/182 (58%), Gaps = 12/182 (6%)
Query: 166 PALGRGFYLADSVVE--DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
PA+ FY + ++ D+ + HY+G Y +KL + +LP +L + +VLDTD++F
Sbjct: 934 PAVQVRFYSITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLG 993
Query: 224 DIAQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGR-GFNTGVILLDLTKL 281
IA+LW F +V+N S+WY+ G+L + +PWPA G NTGV LL L +L
Sbjct: 994 PIAELWDQFKGWSASAIFAVVDNFSEWYIPGRLQR--QPWPAPAPLGINTGVTLLHLARL 1051
Query: 282 RDISWAGFW-----RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD 336
R ++ W R++A+ L + LADQD+ N + ++P L++ LPC++N QLS+
Sbjct: 1052 RHQNFPKVWTSAVARVLADP-RLNITYAPLADQDVMNTVFYDNPTLLHRLPCRFNYQLSE 1110
Query: 337 NT 338
NT
Sbjct: 1111 NT 1112
>gi|443717625|gb|ELU08592.1| hypothetical protein CAPTEDRAFT_191108 [Capitella teleta]
Length = 364
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 105/214 (49%), Gaps = 35/214 (16%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
+DVTLV QL++ R+ +E + W+GP+S ++ D + +++ L +R N+
Sbjct: 87 HDVTLVTQLTISRVGRLEQVLDKWKGPVSAAFHV-DESPNDVIQAICDNKVLSNRNNLYI 145
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
H++ K G YPIN LRN+AL VST + FL D+DF+P LY K I S ++ G K
Sbjct: 146 HLMRKTGRLYPINQLRNLALKCVSTGFSFLSDVDFIPHQNLYNESK-QIVSKNIEG-SKK 203
Query: 597 KVLVVPAFETQRYR-------TAFPAS------------------------HAPTNFSRW 625
+ VV AF Q + FP S H TN++RW
Sbjct: 204 RAYVVAAFGVQEAKGTNKAKAVEFPESKEKLLKLWDSKMIEPVHFAKFIHGHYQTNYTRW 263
Query: 626 VNATTPYQIEWAPDFEPY-IVAHRDLPRYDTRFV 658
NAT Y I+W +EPY IV + P YD F+
Sbjct: 264 RNATEIYPIQWYTHYEPYVIVKTTETPAYDGTFI 297
>gi|440804874|gb|ELR25737.1| fibronectin type III domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 642
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 116/226 (51%), Gaps = 39/226 (17%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
+ + +DV + +Q+++DRL ++E + + W GPIS + +S+ EA+ ++ + + R+
Sbjct: 301 TPESDDVAIASQMTIDRLPVLESMAESWAGPISAVILLSNPEAELKQIHSLRNRSAVIRK 360
Query: 533 NIGYHVVYKEGN--------FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
N+ +H+V+ E N YP+N+LRN A+ + T +V +LDIDF+P + +
Sbjct: 361 NVDFHLVHGEENVYALHTLGLYPVNSLRNEAVVKSRTAHVMVLDIDFIPSRD----AREN 416
Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTA--------------------------FPASHA 618
I + H + V VV +FE + FPA H
Sbjct: 417 ILRLWDQLHEPQTVFVVASFEMDDFNMRDVPHTKDQLLRMVAQDRIRQVHKDKWFPA-HG 475
Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
PT++ W + YQ+ + D+EPY + +D+P +D RFVG+G++K
Sbjct: 476 PTDYEHWYSTRELYQVPYQRDYEPYYIVRKDVPLWDERFVGYGFDK 521
>gi|449017583|dbj|BAM80985.1| similar to glycosyltransferase [Cyanidioschyzon merolae strain 10D]
Length = 505
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 125/255 (49%), Gaps = 43/255 (16%)
Query: 453 SMDRLQMVEI-NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS 511
++D L++ + P + + +V+LV S+ +L + L +HW PIS L +
Sbjct: 180 ALDTLKLENVLGPQKIQRSHRALTEINVSLVTHASLSKLSELRRLTEHWPAPISCALVLP 239
Query: 512 D-AEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
D +++ Q L + R ++ Y+ +YP N RN+AL+ T +VFL+D+D
Sbjct: 240 DVSQSHQVLDMYPKGRSDRVDIHLLLADDYERPTYYPFNAARNLALDNARTDWVFLVDVD 299
Query: 571 FLPMFGLYPYLKSSIR---SMDMHGHGGKKVLVVPAFET----QRYRTAFPAS------- 616
F+P L ++ ++R + + VL+VPAFE + ++T+ PA+
Sbjct: 300 FVPSPNLVESVQRTLRRFPELREEMQQRRAVLIVPAFEKLQAREEHQTSLPATRAELVAE 359
Query: 617 -----------------HAPTNFSRWV---NATT------PYQIEWAPDFEPYIVAHR-D 649
H PT++ RW+ N T+ PY++ + +EPYIVA+R
Sbjct: 360 TRAGRIVPFHVSWYWPGHGPTDYIRWLADANETSEHEPHKPYRVRYRDGYEPYIVAYRHG 419
Query: 650 LPRYDTRFVGFGWNK 664
LPRY+ FVG+GWNK
Sbjct: 420 LPRYEDSFVGYGWNK 434
>gi|345326810|ref|XP_003431086.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
[Ornithorhynchus anatinus]
Length = 390
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 70/108 (64%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++P + CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A IL TLF
Sbjct: 269 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 328
Query: 65 TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
TW VP V V FY AD + ++ + + L + P+ LP L
Sbjct: 329 TWMVPAVRVDFYDADELKWNLEPLSSPVQESFAILENVLCPQSLPVCL 376
>gi|358255463|dbj|GAA57164.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 409
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 85/163 (52%), Gaps = 5/163 (3%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
Y D + VSWI N+H SGV L K+ +P +LP ++ K IVLDTD I DI +LW L
Sbjct: 156 LYDLDLYLPRVSWIRNQHCSGVGALSKMLVPDILPTSVNKAIVLDTDTILNADILELWKL 215
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
F Q Q G+ W + + P G N+GV+L+ L+++R W+ +
Sbjct: 216 FDNFTQIQIFGMA-----WETDPVGPENAETPIPKSGVNSGVVLMYLSRMRRAKWSQLFE 270
Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+FL S+ +Q N +I + P L + +PC+WN+QL
Sbjct: 271 WAVREFLFHYGLLSVGEQSALNTLIQQRPELYHRIPCEWNLQL 313
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 56/103 (54%)
Query: 45 LHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTL 104
+H HLI + L+ + S W V Q + Y D + VSWI N+H SGV L K+ +
Sbjct: 126 IHLHLIVEQKLLSAAYSYVSQWRVAQFNWTLYDLDLYLPRVSWIRNQHCSGVGALSKMLV 185
Query: 105 PKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 147
P +LP ++ K IVLDTD I DI +LW LF Q Q G+
Sbjct: 186 PDILPTSVNKAIVLDTDTILNADILELWKLFDNFTQIQIFGMA 228
>gi|443712918|gb|ELU06002.1| hypothetical protein CAPTEDRAFT_208613 [Capitella teleta]
Length = 403
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 106/216 (49%), Gaps = 35/216 (16%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
TLV Q + DR + M+ K+W+GPIS +Y+ + + F+ E +RRNI H+
Sbjct: 130 TTLVTQFTPDRFNRLSMVAKNWDGPISAGIYLPVEKLESFVQDFCTDENTYNRRNIQIHI 189
Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK- 597
+ G FYP+N LRN+AL+ T +VFL DID +PM G+ +I + ++ K+
Sbjct: 190 TFTHGEFYPMNFLRNLALDAAMTSHVFLSDIDLVPMPGI-----DTIVAKYLYEEKFKRG 244
Query: 598 -VLVVPAFETQ------------------------RYR---TAFPASHAPTNFSRWVNAT 629
LVV FE +Y T + A TN++ W
Sbjct: 245 QALVVACFEKTPNVETPLDKKQLLSLWSKHELEPCQYSFKWTKKRKAQAATNYALWRKTV 304
Query: 630 TPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
Y+I WAP++EP+I+ + D+P Y +FVG +K
Sbjct: 305 DIYRITWAPEYEPFIIMFKDDVPAYYPKFVGRNRDK 340
>gi|443719772|gb|ELU09782.1| hypothetical protein CAPTEDRAFT_220855 [Capitella teleta]
Length = 472
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 111/220 (50%), Gaps = 35/220 (15%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNI 534
+G+ VTLV Q+S +RL + KHW GP+S+ +Y+ + F+ +E+L R NI
Sbjct: 190 NGSRVTLVTQVSFNRLFNLTTQAKHWPGPMSVAMYLPLFRLETFVMDYCEAESLNKRENI 249
Query: 535 GYHVVYKEGN-FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
H++ EG F+P N +RN+A+ V T + F+ D D +P P ++++I G+
Sbjct: 250 QIHIMLDEGCYFFPTNVMRNLAMEYVYTSHSFITDGDHIPN----PNMENNIIKHAEDGY 305
Query: 594 -GGKKVLVVPAFE---------------------TQRYRTAF------PASHAPTNFSRW 625
G V+++PAFE ++Y F PA + PTN++ W
Sbjct: 306 LKGATVMIIPAFEYTVMPEGFTMVHDKEELMKHWDEKYLKPFYIDRFLPA-YGPTNYTYW 364
Query: 626 VNATTPYQIEWAPDFEPYIVAHRDLPR-YDTRFVGFGWNK 664
A PY IE+ FEPYIVA L Y+ F+G +NK
Sbjct: 365 RTAEHPYTIEYGDQFEPYIVAETKLTHSYNNSFIGRFYNK 404
>gi|443687234|gb|ELT90283.1| hypothetical protein CAPTEDRAFT_177310 [Capitella teleta]
Length = 361
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 101/201 (50%), Gaps = 27/201 (13%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
+TLV QL +RLQ +E L W GP S+ LY SD EA F Y ++ LR+ ++ H+
Sbjct: 46 ITLVTQLQPNRLQGLERLLMEWRGPASVALYASDEEAVDFAKYFRSNTKLRTLPDLVVHL 105
Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
V+K YPIN LRNVA+ +V T Y +LD+DF+P GL + ++ +
Sbjct: 106 VFKRQKQYPINYLRNVAMREVLTQYHLMLDVDFVPNSGLAQHAQNIFEKLST-TEETPVA 164
Query: 599 LVVPAFETQR--YRTAFPAS-----------------------HAPTNFSRWVNATTPYQ 633
LVVP FET Y+ A S H T++++W AT PY+
Sbjct: 165 LVVPTFETSNSFYKIAENRSEFLHHLDEGKVLSYHQTAQWNLGHRYTSYAKWKTATDPYK 224
Query: 634 IEWAPDFEPYIVAH-RDLPRY 653
+ A +EPY+V +D P Y
Sbjct: 225 VYGAFGYEPYLVIRAQDSPEY 245
>gi|358340195|dbj|GAA48141.1| glycosyltransferase-like protein LARGE, partial [Clonorchis
sinensis]
Length = 695
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 93/205 (45%), Gaps = 48/205 (23%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT---- 240
+PN H+SG+ ++KL +P +LP T+ K IV+D+D++F ++ +LW LF R Q
Sbjct: 105 MPNSHHSGIPSMMKLLVPVILPSTVEKVIVMDSDMLFNHNVLELWELFYRFNSTQVSVEC 164
Query: 241 -IGLVENQSDWYLG---------------KLWKNHK------------------------ 260
+ + DW +LW K
Sbjct: 165 QVTVTARHFDWCTNLIKYNNTDFHEVPDTELWVTCKLEDLEYQKGELGSKVYGTVYTKKL 224
Query: 261 ----PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
P P RG N G+ ++ LT+LR+ W W+ ++KF ADQDIFN +I
Sbjct: 225 TTVPPKPQQDRGLNGGLSIMHLTRLRENGWHDKWQEASKKFFERNTMLKQADQDIFNTVI 284
Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSD 341
+P + + +PC+WN+QL T +D
Sbjct: 285 EMNPTMYFRIPCEWNIQLCSKTTAD 309
>gi|281212139|gb|EFA86299.1| putative glycosyltransferase [Polysphondylium pallidum PN500]
Length = 393
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 112/213 (52%), Gaps = 32/213 (15%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGNSEALRSRRNIGY 536
VT+V Q +++R+ + ++ W+ PIS ++++ D E Q N+ L + + +
Sbjct: 104 VTIVTQATVNRVDRIVLMADRWKAPISTSIFIRDPDTELDQLNQMIANNPILM--KFVDF 161
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
H++Y YP+N LRN+++ T YV ++D DF+P +G++ YL ++ +D G+ K
Sbjct: 162 HLLYANNTRYPVNNLRNLSIRNSHTDYVLVMDADFIPPYGMHDYL---LKYIDNTGNEEK 218
Query: 597 KVLVVPAFETQR------------------------YRTAFPASHAPTNFSRWVNATTPY 632
+VP+F + + P H+P++++RW T PY
Sbjct: 219 VAYIVPSFSSSQDPRILPNDKPTLVEMVEKNLVNPSNLNVCPKCHSPSDYNRWFETTVPY 278
Query: 633 QIEWAPDFEPYIVAHRDLPR-YDTRFVGFGWNK 664
+++++ +EPY++ +R+ +D RF G+G++K
Sbjct: 279 EVKYSWIYEPYLIFNRNNTELFDERFKGYGFDK 311
>gi|443726598|gb|ELU13717.1| hypothetical protein CAPTEDRAFT_190301 [Capitella teleta]
Length = 449
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/229 (31%), Positives = 112/229 (48%), Gaps = 41/229 (17%)
Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR 531
+ +D VTLV LS+D L+ + +L HW+GPIS +++ + + + EAL SR
Sbjct: 146 SGADDGLVTLVTHLSLDHLESLAILIDHWQGPISAAVWVPKNKTASAVMGFCSIEALLSR 205
Query: 532 RNIGYHVVYKE------GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
NI H+V ++ FYP N LRNVAL + T YVFL D+D +PM + L +
Sbjct: 206 DNILLHLVIEDHVFTFNKAFYPSNYLRNVALEEAKTIYVFLSDVDVIPMASMTRRLGFQL 265
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTN-----------------FSRWVNA 628
+ G K+VLVVP FE+ ++F S P + F R++
Sbjct: 266 AKRPL---GSKQVLVVPTFES--LNSSFCCSTIPNSKIALLKLWAQHRIGPFYFKRYIPG 320
Query: 629 TTP------------YQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
+P Y++ + FEPYIV R+ P+++T F G ++K
Sbjct: 321 YSPIRYDIWKVAKELYKVNFHKQFEPYIVMQRETTPKFNTSFFGLYFDK 369
>gi|328865206|gb|EGG13592.1| putative glycosyltransferase [Dictyostelium fasciculatum]
Length = 405
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 112/219 (51%), Gaps = 31/219 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRN 533
D + +T+V Q+S++RL+ + + W PIS +Y+ +D E S NS ++ +
Sbjct: 92 DHSKITIVTQVSVERLERIAWMADKWRAPISAAVYIKNDNEIHNVTSLINNSYSVTQFVD 151
Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM--- 590
I H+++ YP+NTLRN+++ + ST +V L+D DF+P L+ +++ + +++
Sbjct: 152 I--HLLFANKTRYPVNTLRNLSIKKASTKWVLLMDADFIPPLDLHDTIQTHVDRLELDSD 209
Query: 591 HGHGGKKVLVVPAFETQRYR------------------------TAFPASHAPTNFSRWV 626
H VVP+F + R P H+PTN++RW
Sbjct: 210 HNQNDLVSFVVPSFASSLPRFQLPDDKESFIQSVKSQTIVPTNLNVCPKCHSPTNYTRWY 269
Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
N + PY++E+ +EP++V D + +D R G+G++K
Sbjct: 270 NESIPYEVEYQWIYEPFLVFRTDQVEPFDERLKGYGFDK 308
>gi|90079249|dbj|BAE89304.1| unnamed protein product [Macaca fascicularis]
Length = 215
Score = 106 bits (265), Expect = 4e-20, Method: Composition-based stats.
Identities = 45/74 (60%), Positives = 58/74 (78%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187
Query: 65 TWSVPQVEVSFYLA 78
TW VP V V L+
Sbjct: 188 TWMVPAVRVDSLLS 201
>gi|281342976|gb|EFB18560.1| hypothetical protein PANDA_010466 [Ailuropoda melanoleuca]
Length = 162
Score = 105 bits (263), Expect = 6e-20, Method: Composition-based stats.
Identities = 44/69 (63%), Positives = 56/69 (81%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A IL TLF
Sbjct: 93 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFR 152
Query: 65 TWSVPQVEV 73
TW VP V V
Sbjct: 153 TWMVPAVRV 161
>gi|443691848|gb|ELT93598.1| hypothetical protein CAPTEDRAFT_227063 [Capitella teleta]
Length = 463
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 40/230 (17%)
Query: 468 DEYEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
D ++A + N +TLV QLS RL+ + W GPIS+ LY+ +
Sbjct: 143 DVFDAKINPNISQITLVTQLSHVRLERFALHAAKWPGPISMALYVPQGMLLPTMLAWCRF 202
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
+ L +R NI HV+ ++G FYP+N LRNVA+ V T +++D D +PM G+ L +
Sbjct: 203 KELCTRDNIQLHVIVEKGIFYPVNYLRNVAVKYVKTDLTYVVDADVMPMPGMEAMLHEHL 262
Query: 586 RSMDMHGHGGKKVLVVPAFET------------------------------QRYRTAFPA 615
+ H +VLV+P FE+ +RY+ A+
Sbjct: 263 QR---HPLNTTQVLVIPTFESYDGSCCDTMPKTKHEMIKLWDQGVVGPFYFERYKPAY-- 317
Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
AP + RW AT Y + + FEPYI+ H + +P + T F G ++K
Sbjct: 318 --APIRYQRWKVATEAYIVPFHKQFEPYIIMHTKRVPAFHTGFFGLYFDK 365
>gi|390347596|ref|XP_001200797.2| PREDICTED: glycosyltransferase-like protein LARGE2-like
[Strongylocentrotus purpuratus]
Length = 560
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 110/230 (47%), Gaps = 43/230 (18%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--------SDAEAQQFLSYAGNSEAL 528
+DVTL+ S+DR+ V + W+ P S++LY+ D E ++ + L
Sbjct: 261 DDVTLITHASLDRIDGVLRMLNSWDAPCSISLYLPIKADERDEDIEWKKIYVFK-KMRIL 319
Query: 529 RSRRNIGYHVVYKEG--NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
++R +I +VY + YPIN +RNVA++Q T ++ LLD DFLP L+ + +
Sbjct: 320 QTRADIDLTLVYSNTMEDIYPINYMRNVAISQSKTKFILLLDADFLPSPTLHKHFLDVMA 379
Query: 587 SMDMHGHGGKK-VLVVPAF---ETQRYRTAFPA--------------------------S 616
+ +K VVPAF E ++ + P+ S
Sbjct: 380 VWQVSEQRDRKTAFVVPAFDYIEEKQDQAGLPSTKDELISLMKQDLPAVSIFRFYESPLS 439
Query: 617 HAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHRD-LPRYDTRFVGFGWNK 664
H PTN+ +W A PY+I + D +EPY++ + +P YD RF G+G NK
Sbjct: 440 HRPTNYVQWYMANEPYKISYYEDKYEPYLILQKSGMPMYDERFTGYGMNK 489
>gi|332267349|ref|XP_003282645.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
[Nomascus leucogenys]
Length = 79
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/80 (61%), Positives = 63/80 (78%), Gaps = 1/80 (1%)
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
VILL L KLR + W WR+ AE+ L+ L TSLADQDIFNA+I ++P+LVY LPC WNV
Sbjct: 1 VILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNV 60
Query: 333 QLSDNTRSDELCYTELTDLK 352
QLSD+TRS++ CY +++DLK
Sbjct: 61 QLSDHTRSEQ-CYRDVSDLK 79
>gi|428184713|gb|EKX53567.1| hypothetical protein GUITHDRAFT_160868 [Guillardia theta CCMP2712]
Length = 389
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 94/199 (47%), Gaps = 19/199 (9%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVT V QL+ DR+ + + + + W GPIS LY D + E L R + +H
Sbjct: 159 DVTWVTQLTTDRIPVFKKMLERWTGPISAALYSQDVSKD-----TKDIEDLLDR--VDFH 211
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG-GK 596
+V YP+NTLRNVA+N+ +T ++ L D+DF+P G+Y KS + + G K
Sbjct: 212 IVGASQGLYPVNTLRNVAINKANTDFIILADVDFVPSEGIYEATKSFVEKLASTGAAHDK 271
Query: 597 KVLVVPAFETQRYRTAFPAS-----------HAPTNFSRWVNATTPYQIEWAPDFEPYIV 645
KV V+PAFE P + H N A Q + Y
Sbjct: 272 KVYVLPAFEIDGADQTVPPNFDELQKMGSRIHQVHNDKGRDIAHKYTQYDKRQIHTEYPT 331
Query: 646 AHRDLPRYDTRFVGFGWNK 664
A R +PR+D RF G+G +K
Sbjct: 332 APRTIPRFDVRFFGYGNDK 350
>gi|66809567|ref|XP_638506.1| hypothetical protein DDB_G0284567 [Dictyostelium discoideum AX4]
gi|74854201|sp|Q54PG8.1|GNT14_DICDI RecName: Full=Glycosyltransferase-like protein gnt14
gi|60467115|gb|EAL65152.1| hypothetical protein DDB_G0284567 [Dictyostelium discoideum AX4]
Length = 640
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 105/212 (49%), Gaps = 28/212 (13%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
VT+V Q ++DRL V + + W PIS++L++ L A ++ + + +H+
Sbjct: 348 VTIVTQTTVDRLYKVAAMAERWRSPISVSLFIKSQGDIDKLEKAISANKILATF-ADFHL 406
Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
Y YP+N LRN+A+ T +V LLD+DF+P L+ ++ I ++
Sbjct: 407 FYHNNTRYPVNNLRNLAIRNALTEHVLLLDVDFIPPSHLHDHIAHYINLNYLNQDDSLNA 466
Query: 599 LVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQI 634
V+P+F + P S H+PT+++RW+ +T PY I
Sbjct: 467 FVIPSFSSNLQPKDIPDSKFEFIEMLTKNKIEPSNLKVCPKCHSPTDYTRWMTSTEPYAI 526
Query: 635 EWAPDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
E+ +EP++V +R LP +D RF G+G++K
Sbjct: 527 EYHWIYEPFLVYNRSQTLP-FDERFKGYGFDK 557
>gi|358336370|dbj|GAA54896.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 399
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 92/175 (52%), Gaps = 11/175 (6%)
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
V+ IP ++++G L KL P +LPE + K I LD+DV+F ++ LW F + + Q
Sbjct: 157 VASIPTRYFAGTPPLYKLLAPYILPEDVTKVIALDSDVLFNYNVLDLWKEFGQFKPGQAF 216
Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
G V Q + K P+P G NTGV+L+DL +LR I+W WR K +
Sbjct: 217 GAVCEQITFCPYSCPKEDLPFPTF--GINTGVLLMDLAELRRINWWQLWRQELLKEVNEW 274
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHW 356
+ +Q + + ++S++ +L + L C+WN+Q+ RSD + + IHW
Sbjct: 275 GYLPAGEQKVISQVMSKYNHLYHRLRCEWNIQI---FRSDGM------NCCPIHW 320
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 2/145 (1%)
Query: 25 NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
N ++V + + YR P+ H + D A N + ++F W V + Y + +
Sbjct: 99 NRPSNIVCKHQPLEAYR--PVRLHWVADIDARNRILSIFEEWQPTDVRIFLYEYERYLPY 156
Query: 85 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
V+ IP ++++G L KL P +LPE + K I LD+DV+F ++ LW F + + Q
Sbjct: 157 VASIPTRYFAGTPPLYKLLAPYILPEDVTKVIALDSDVLFNYNVLDLWKEFGQFKPGQAF 216
Query: 145 GLVENQSDWYLGKLWKNHKPWPALG 169
G V Q + K P+P G
Sbjct: 217 GAVCEQITFCPYSCPKEDLPFPTFG 241
>gi|260801759|ref|XP_002595763.1| hypothetical protein BRAFLDRAFT_117562 [Branchiostoma floridae]
gi|229281010|gb|EEN51775.1| hypothetical protein BRAFLDRAFT_117562 [Branchiostoma floridae]
Length = 554
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 106/234 (45%), Gaps = 50/234 (21%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ-----QFLSYAGNSEALRSR 531
+ VTLV Q+SMDR+ M++ + +HW+GP+++ LY+ + + Y N + +
Sbjct: 253 DTVTLVTQVSMDRIAMLDRIIRHWDGPMAVALYVPTTSQKPDTNHDWKRYYINKKLRHPK 312
Query: 532 RNIGYHVVYKEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPM------FGLYPYL 581
V+ N YPIN LRN+AL V + YVFLLD DF+P F +YP
Sbjct: 313 FRTFCSVLAVYANVVNDEYPINYLRNLALRMVKSKYVFLLDADFIPSPNFQDNFCVYP-- 370
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFE-----------------TQR----------YRTA-F 613
+ D + VVPAFE Q+ +R+
Sbjct: 371 --RVLINDQSDPTDRTAYVVPAFEFLEEQEDQPPPQDKEALKQQIFADDANILPFRSVES 428
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
P SH PT++ RW Y + D FEPY+V LP YD RF G+G NK
Sbjct: 429 PESHRPTDYQRWYAEEQVYPVSTYQDKFEPYLVLRMSPSLPLYDERFSGYGMNK 482
>gi|330799891|ref|XP_003287974.1| hypothetical protein DICPUDRAFT_152182 [Dictyostelium purpureum]
gi|325081998|gb|EGC35495.1| hypothetical protein DICPUDRAFT_152182 [Dictyostelium purpureum]
Length = 460
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 105/210 (50%), Gaps = 25/210 (11%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-AEAQQFLSYAGNSEALRSRRNIGY 536
++T+V Q ++DR+ + M+ W PIS +Y+ + ++ NS ++ +I
Sbjct: 118 EITIVTQFTIDRMDRIAMMADIWRAPISAAVYIRNQSDIDSVFKLVRNSFSVAQFVDI-- 175
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM------ 590
H +Y YP+N LRN+AL T + LLD+DF+ G+Y YL+S ++ +D
Sbjct: 176 HFLYSNQTRYPVNNLRNLALVNARTDWTLLLDVDFVTNEGMYEYLQSLLKEIDKNKLVSF 235
Query: 591 -----------HGHGGKKVLVVPAFETQRY----RTAFPASHAPTNFSRWVNATTPYQIE 635
+ KV ++ E + P H PT++SRW +A PY+++
Sbjct: 236 IIPSFSSSISHYDFPTGKVDLIQMVEAGDIKIINKNVCPRCHGPTDYSRWFSAVEPYKVD 295
Query: 636 WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
+ +EP+++ ++ + YD R G+G++K
Sbjct: 296 YRWLYEPFLLYNKSQIELYDERLKGYGFDK 325
>gi|443732711|gb|ELU17338.1| hypothetical protein CAPTEDRAFT_85145, partial [Capitella teleta]
Length = 241
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 100/210 (47%), Gaps = 36/210 (17%)
Query: 482 VAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYK 541
V QL++ RL + W+GP+S +++ D ++ +++ L R N+ H++ K
Sbjct: 1 VTQLTLSRLGSFNEVLMRWQGPVSAAIHVEDP-VLNVINAVCHNQVLTERGNVYIHLMRK 59
Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
G YP+N LRN+AL+ V+T YVFL D DF+P LY + + + S D + K+ VV
Sbjct: 60 IGKLYPVNQLRNLALSSVTTDYVFLTDADFIPNKDLYKSMLNVVASSDT--NRTKRAYVV 117
Query: 602 PAF------------------ETQR--------------YRTAFPASHAPTNFSRWVNAT 629
AF ET+ + T F H TN++RW AT
Sbjct: 118 AAFGVFEEKEKNKPAKFPDFPETRETLLQLWDSKAIEPVHFTKFIHGHYQTNYTRWRKAT 177
Query: 630 TPYQIEWAPDFEPYIVAHR-DLPRYDTRFV 658
Y I W +EPY+V D+P Y+ F+
Sbjct: 178 ELYPITWFHHYEPYVVVRTIDVPLYEGAFL 207
>gi|358336371|dbj|GAA54897.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 400
Score = 95.9 bits (237), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 97/209 (46%), Gaps = 21/209 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
Y + V V IP HY+G + K+ +P +LPE + K IVLD+DVIF ++ LW
Sbjct: 146 LYDYERFVSYVEAIPINHYAGTPAMYKILVPYILPEEVTKVIVLDSDVIFNHNVLDLWKE 205
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
F + +Q G Q + + P G N+GV+L++LT+LR I+W W+
Sbjct: 206 FELFKPQQVFGAACEQITYCPHSCSSDDTNIPLF--GINSGVVLMNLTELRRINWWYQWQ 263
Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL-------------SDNT 338
K + + +A+Q + N I+ + L Y L C+WNVQ+ D
Sbjct: 264 QEVLKEVTEFGYLPIAEQKVINEIMRRNIKLYYQLGCEWNVQVFKADGIQCCPVQWVDRL 323
Query: 339 RSDELCYTELTD------LKIIHWNSPKK 361
+ C L D +I+H+N +K
Sbjct: 324 PHEMYCKNTLDDANPREPARIVHYNIKRK 352
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 53/107 (49%)
Query: 44 PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLT 103
P+ H + D A + + +F W V + Y + V V IP HY+G + K+
Sbjct: 115 PVRLHWVADVAARDRIIYIFEQWHPSDVHIFLYDYERFVSYVEAIPINHYAGTPAMYKIL 174
Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 150
+P +LPE + K IVLD+DVIF ++ LW F + +Q G Q
Sbjct: 175 VPYILPEEVTKVIVLDSDVIFNHNVLDLWKEFELFKPQQVFGAACEQ 221
>gi|443710196|gb|ELU04494.1| hypothetical protein CAPTEDRAFT_221047 [Capitella teleta]
Length = 304
Score = 95.9 bits (237), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 92/202 (45%), Gaps = 37/202 (18%)
Query: 495 MLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNV 554
ML +HW GPIS ++Y+S+ + + N + NI HV++ G+ YP N LRN
Sbjct: 1 MLAQHWSGPISASVYVSNIQDSSMID---NLCQFSTASNIFIHVLHAVGDIYPANILRNF 57
Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE------TQR 608
AL +V T YVFL D+DF+P F LY I+ + H + VV AFE T
Sbjct: 58 ALEEVFTEYVFLSDVDFMPDFSLYNASVDFIQHNERHLNSS--AFVVAAFEILNDSKTWG 115
Query: 609 YRTAFPAS-------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
FP + H TN+ +W A Y + W P +EPY
Sbjct: 116 QPVQFPRNKTELIDLWGAQKLQPFHFSKTKNYGHIETNYEKWTVAHEIYAVTWMPYYEPY 175
Query: 644 IVAHRD-LPRYDTRFVGFGWNK 664
++ +PRY FVG ++K
Sbjct: 176 VIVRTSTVPRYHNAFVGRYFDK 197
>gi|358342270|dbj|GAA49775.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 398
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 91/171 (53%), Gaps = 6/171 (3%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P L FY IPN +++G G+ KL +P +LP + +TI +DTD ++ ++
Sbjct: 138 PYLECSFYNLAEYQSFAERIPNTNHAGAAGIAKLWVPYILPHWVIRTIFVDTDTLWNENV 197
Query: 226 AQLWALFSRLRQRQTIGLV-ENQS-DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD 283
LW +F Q +G+ E QS D + + + P G N+G++L+ L+++R
Sbjct: 198 ETLWKIFDNFTSTQMLGIAWEQQSQDAHCRRSSETSLP----VTGVNSGLVLMHLSRMRV 253
Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+W FL R S ADQDI+N++++ P +Y+LPC+WNVQ+
Sbjct: 254 ANWGLITEQHIVSFLAVRKHLSQADQDIYNSVLALKPEWLYSLPCEWNVQV 304
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 76/166 (45%), Gaps = 29/166 (17%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYR----------------------------KNPLH 46
I + + G S VT++KS+L+++ +NP+H
Sbjct: 56 IHITQIVKGKKSAIRSVTMLKSLLYFQNHLNSASQECLRVEPFETSPCFTELHPAQNPIH 115
Query: 47 FHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK 106
H IT+ +L++ S+P +E SFY IPN +++G G+ KL +P
Sbjct: 116 VHFITEYDLWPVLKSKLGNCSLPYLECSFYNLAEYQSFAERIPNTNHAGAAGIAKLWVPY 175
Query: 107 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQS 151
+LP + +TI +DTD ++ ++ LW +F Q +G+ E QS
Sbjct: 176 ILPHWVIRTIFVDTDTLWNENVETLWKIFDNFTSTQMLGIAWEQQS 221
>gi|156602774|ref|XP_001618715.1| hypothetical protein NEMVEDRAFT_v1g224874 [Nematostella vectensis]
gi|156200021|gb|EDO26615.1| predicted protein [Nematostella vectensis]
Length = 158
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 39/62 (62%), Positives = 50/62 (80%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
+P CEVI +A+VCAG++++R VTLIKS+LFYR NPLHFH ++D+VA IL TLF TW
Sbjct: 72 AVPQCEVIHIAVVCAGHSASRQAVTLIKSLLFYRHNPLHFHFVSDSVAELILGTLFRTWG 131
Query: 68 VP 69
VP
Sbjct: 132 VP 133
>gi|328869315|gb|EGG17693.1| putative glycosyltransferase [Dictyostelium fasciculatum]
Length = 407
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 104/232 (44%), Gaps = 51/232 (21%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR----RNI 534
VT+V Q ++DR++ V + K W+GPIS +++ + + L + LR +
Sbjct: 124 VTIVTQATVDRIERVVAMAKKWKGPISTAVFIKNPKDIPQLK-----KLLRKHWILAEYV 178
Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM------ 588
H++Y YP+N LRN+++ + YV L+D DF+P G+ Y+ + I ++
Sbjct: 179 DIHLLYANHTRYPVNNLRNLSIKYAQSDYVLLMDADFVPHLGMNEYVSAQIETLLQQQQQ 238
Query: 589 -----------DMHGHGGKKVLVVPAF---------------------ETQRYRTAF--- 613
+ K VVP+F E + Y T
Sbjct: 239 QQQNKHPLSSHEQQQQEEKIAFVVPSFASSENPNHLPNDKQSLLKMIKENKVYPTNLNVC 298
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP-RYDTRFVGFGWNK 664
P H PTN+SRW YQ+E+ +EPY++ +R +D R G+G++K
Sbjct: 299 PKCHTPTNYSRWYETDDLYQVEYRWIYEPYLIFNRSTCLAFDERLKGYGFDK 350
>gi|443682653|gb|ELT87166.1| hypothetical protein CAPTEDRAFT_200998 [Capitella teleta]
Length = 201
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 14/159 (8%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---- 528
S ND+TLV QL+ DR+ + ++ W+GPIS ++Y D E LS G L
Sbjct: 42 SQSSNDITLVTQLTSDRIDRLLLIMNIWKGPISASVY-PDTE----LSIRGEDICLLKKQ 96
Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
SR + H+V K G F+P+N LRN+AL+ T +VFL D+DF+P LY +++++
Sbjct: 97 DSRCRVQLHLVQKSGVFFPVNKLRNIALDMAVTSHVFLTDVDFIPDQNLY---ENALQQ- 152
Query: 589 DMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVN 627
+H G + LV+PAFE + P+ +N
Sbjct: 153 -LHSLQGMQSLVIPAFEMIEMKNLVRKEDLPSGKPELLN 190
>gi|358342269|dbj|GAA49774.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 289
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 92/192 (47%), Gaps = 26/192 (13%)
Query: 161 NHKPWPALGR------------GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPET 208
+HK WP L FY ++ + IPN H+SG + KL P +LP
Sbjct: 12 DHKLWPLLKSHLYRCELSYLKCSFYSLENYYDFAGRIPNTHHSGPTAMAKLWTPYILPHW 71
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPAL-G 266
L K I++D+D ++ +I LW F R Q IG+ E QSD L N +
Sbjct: 72 LEKIILIDSDTLWNENIYTLWNHFDRFNSTQMIGIAWEQQSD----DLHCNRSSTMVIPS 127
Query: 267 RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS----LADQDIFNAIISEHPYL 322
G N G++L+ L ++R++ W I+E ++ RL +DQ I+N IIS P
Sbjct: 128 EGVNAGLVLMHLWRMRNVRWDA----ISELLVVARLERDKMLYQSDQSIYNDIISLKPEW 183
Query: 323 VYTLPCQWNVQL 334
Y L C+WNVQL
Sbjct: 184 YYPLACEWNVQL 195
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 1/112 (0%)
Query: 42 KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
+NP+H H + D +L++ + ++ SFY ++ + IPN H+SG + K
Sbjct: 2 QNPIHIHFVADHKLWPLLKSHLYRCELSYLKCSFYSLENYYDFAGRIPNTHHSGPTAMAK 61
Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSD 152
L P +LP L K I++D+D ++ +I LW F R Q IG+ E QSD
Sbjct: 62 LWTPYILPHWLEKIILIDSDTLWNENIYTLWNHFDRFNSTQMIGIAWEQQSD 113
>gi|358332611|dbj|GAA51245.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 313
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 79/165 (47%), Gaps = 28/165 (16%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR-----KNP-------------------- 44
PT I + V G + +TL+KS+L+YR K P
Sbjct: 76 PTPHTIHLVHVIKGEATLARSITLLKSVLYYRGSLRSKRPECHLRRIPSPHLCAKQRAIE 135
Query: 45 ---LHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
+H HL+ + L+ + S W V ++ + Y D + V+WIPNKH SGV L K
Sbjct: 136 QIAIHLHLVVEQRLLSAAYSYVSQWRVARLNWTLYELDPYLPRVNWIPNKHGSGVAALSK 195
Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
+ +P +LP + K IVLDTD++F DI +LW LF Q Q G+
Sbjct: 196 MLIPDMLPTNVTKAIVLDTDILFNADILELWKLFDDFDQSQKNGV 240
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 64/128 (50%), Gaps = 26/128 (20%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
Y D + V+WIPNKH SGV L K+ +P +LP + K IVLDTD++F DI +LW L
Sbjct: 169 LYELDPYLPRVNWIPNKHGSGVAALSKMLIPDMLPTNVTKAIVLDTDILFNADILELWKL 228
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
F Q Q KN G N+GV+L+ L+++R+ W+ +
Sbjct: 229 FDDFDQSQ-----------------KN---------GVNSGVVLMHLSRMREAKWSELFD 262
Query: 292 IIAEKFLL 299
K L+
Sbjct: 263 WAVRKVLI 270
>gi|384252903|gb|EIE26378.1| hypothetical protein COCSUDRAFT_64466 [Coccomyxa subellipsoidea
C-169]
Length = 389
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 107/250 (42%), Gaps = 65/250 (26%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR--- 531
D ND+T++ Q S+DRL M+E W+G S+ +Y ++ + N++ LR
Sbjct: 45 DKNDITIITQCSLDRLYMLEEQCNTWQGLTSVAVYYP-----LYIFQSTNADRLREAITT 99
Query: 532 -RNIGYHVVYKEGNF------------------YPINTLRNVALNQVSTPYVFLLDIDFL 572
N + G YP N LRN AL + T + LLD DFL
Sbjct: 100 VENFHEQIDRSHGKCRMDIVLLSEVIKPHDEWSYPYNALRNHALVRSETDVLLLLDADFL 159
Query: 573 PMFGLYPYLKSSIRSMDMHGH--GGKKVLVVPAFETQR---------------------- 608
G++ L ++ ++ + KK++V+PAFET R
Sbjct: 160 VSEGVHEALTATDKAAALMEDLTANKKIIVLPAFETPRTLSEETGKSYALQAQASTKAEL 219
Query: 609 -----------YRTAFPASHAPTNFSRWVNA--TTPYQIEWAPDFEPYI-VAHRDLPRYD 654
+ +P H +N+ +W ++ T PY I + +EPY+ V R +P YD
Sbjct: 220 KSMFDEGKIIQFADFYPVGHGSSNYKKWFSSEGTEPYPITYKTGYEPYVLVNRRTVPWYD 279
Query: 655 TRFVGFGWNK 664
RF G+GW+K
Sbjct: 280 ERFRGYGWDK 289
>gi|443734790|gb|ELU18647.1| hypothetical protein CAPTEDRAFT_226416 [Capitella teleta]
Length = 461
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 106/241 (43%), Gaps = 53/241 (21%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
+DVTLV Q S+DRL + + W G IS ++Y+S + FL + L+ RRNI
Sbjct: 145 DDVTLVTQFSVDRLDRFYEIAQLWPGTISASVYVSRSHVLAFLQQWRSLPHLKLRRNILL 204
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS-----------SI 585
H+V + G YP N LRN+A+ +++L D+D LPM GL L+S I
Sbjct: 205 HMVTQTGPIYPSNALRNIAVKNSVGRFLYLTDVDILPMKGLQSRLRSHLDNSFAKSKKRI 264
Query: 586 RSMD--------------MHGHGGKKVLV-VPAFETQR---------------------- 608
+++D + G K VP FE
Sbjct: 265 QNVDDLVAKTHCWVLTTVIMGVAAKGAAAYVPTFERVHREGRGILPEDKAEVLQQWDAGR 324
Query: 609 ----YRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWN 663
Y T + ++ P +++W + Y++++ FEPYIVA RD + Y R G ++
Sbjct: 325 IRPFYSTRYQPAYLPILYNKWRRSNAEYKVKYYRQFEPYIVAARDAVHNYSERIPGLYFD 384
Query: 664 K 664
K
Sbjct: 385 K 385
>gi|358333649|dbj|GAA52132.1| glycosyltransferase-like protein LARGE2 [Clonorchis sinensis]
Length = 323
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 90/173 (52%), Gaps = 10/173 (5%)
Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
P L FY +S V+ IPN H +G + KL +P +LP + KTIV+D+D+++ +
Sbjct: 65 PYLKCSFYDVESYVDFAQRIPNGHIAGPSAVAKLWIPYILPPWVIKTIVIDSDLLWNENA 124
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
LW +F + + Q IGL Q + + K P RG N G++L+ L ++R+I
Sbjct: 125 LMLWKIFEQFKSTQMIGLAWEQQSYDPLCIEDPAKLLPR--RGVNGGLVLMHLERMRNIR 182
Query: 286 WAGFWRIIAEKFLLTRL----WTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
W I E+ + RL +Q++++ +I P Y +PC+WN+Q+
Sbjct: 183 ----WDITIEQIIHQRLNQVGILGEGEQEVYSFLIHLKPDWYYGIPCEWNIQV 231
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 65/111 (58%), Gaps = 1/111 (0%)
Query: 42 KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
+N +H HL+ D ++L++ S+P ++ SFY +S V+ IPN H +G + K
Sbjct: 38 QNIIHMHLVCDRNLWSMLESQLQNCSLPYLKCSFYDVESYVDFAQRIPNGHIAGPSAVAK 97
Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQS 151
L +P +LP + KTIV+D+D+++ + LW +F + + Q IGL E QS
Sbjct: 98 LWIPYILPPWVIKTIVIDSDLLWNENALMLWKIFEQFKSTQMIGLAWEQQS 148
>gi|26350809|dbj|BAC39041.1| unnamed protein product [Mus musculus]
Length = 379
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + + Y ++D+D +P GL+ L+ +D H LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R HAPTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|443719771|gb|ELU09781.1| hypothetical protein CAPTEDRAFT_220854 [Capitella teleta]
Length = 512
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 5/174 (2%)
Query: 445 DVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN-DVTLVAQLSMDRLQMVEMLFKHWEGP 503
D + +L+ + + + +N+ + A DG+ V+ V S DRL + +HW G
Sbjct: 84 DADKICKLTEEESEYDRVVLDNILDSGAPDDGDLRVSYVTHFSYDRLDKFLAVVQHWPGY 143
Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
IS + + + ++F++ L +R NIG H+V ++G FYP N RN+AL++ T +
Sbjct: 144 ISAAVQIKADKVEEFVNNYCREPELAARDNIGIHLVIQKGIFYPYNHYRNIALDRAPTSH 203
Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE-TQRYRTAFPAS 616
VF+ DID +P L + S++ G K+VLVVP+FE T+ + P S
Sbjct: 204 VFVTDIDLVPSDDL---ERRSLQHFKEQKVGEKQVLVVPSFEITKEHGYTVPRS 254
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 72/146 (49%), Gaps = 29/146 (19%)
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
FYP N RN+AL++ T +VF+ DID +P L + S++ G K+VLVVP+F
Sbjct: 298 FYPYNHYRNIALDRAPTSHVFVTDIDLVPSDDL---ERRSLQHFKEQKVGEKQVLVVPSF 354
Query: 605 ETQR-------------------------YRTAFPASHAPTNFSRWVNATTPYQIEWAPD 639
E + Y T + ++ P N++ W AT PY +++
Sbjct: 355 EITKEHGYSVPRSKEDLLEIWDRGELQEFYATRYAPAYMPLNYTVWRTATEPYWVKYYDQ 414
Query: 640 FEPYIVAHR-DLPRYDTRFVGFGWNK 664
FEPY++A + L Y RFVG +NK
Sbjct: 415 FEPYVIAEKAKLQPYTNRFVGRFYNK 440
>gi|157822857|ref|NP_001099794.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Rattus norvegicus]
gi|392344822|ref|XP_003749084.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Rattus
norvegicus]
gi|149062023|gb|EDM12446.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6
(predicted) [Rattus norvegicus]
Length = 415
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 107/265 (40%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATREEAQLATVLAYALSSHCPEMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNISYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
LRN+A + + Y ++D+D +P GL+ S+R M D H LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLW----RSLREMLDQSNHWDGTALVVPAFEIRR 264
Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
R HAPTN+SRWVN Y + W
Sbjct: 265 ARRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|443703882|gb|ELU01224.1| hypothetical protein CAPTEDRAFT_189662, partial [Capitella teleta]
Length = 265
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 76/137 (55%), Gaps = 14/137 (10%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---- 528
S ND+TLV QL+ DR+ + ++ W+GPIS ++Y D E LS G L
Sbjct: 107 SQSSNDITLVTQLTSDRIDRLLLIMNIWKGPISASVY-PDTE----LSIRGEEICLLKKQ 161
Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
SR + H+V K G F+P+N LRN+AL+ T +VFL D+DF+P LY + S+
Sbjct: 162 DSRCRVQLHLVQKSGVFFPVNKLRNIALDMAVTSHVFLTDVDFIPDQNLYANALQQLHSL 221
Query: 589 DMHGHGGKKVLVVPAFE 605
+ + LV+PAFE
Sbjct: 222 QV-----MQSLVIPAFE 233
>gi|384486166|gb|EIE78346.1| hypothetical protein RO3G_03050 [Rhizopus delemar RA 99-880]
Length = 402
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 103/230 (44%), Gaps = 48/230 (20%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----EAQQFLSYAGNSEALRS--- 530
DVTL++Q S++RL+ + + W+GPIS+ +Y++ E QFL N E +
Sbjct: 84 DVTLISQFSVNRLKSFIQVIEAWQGPISIAIYLTQPDDIDELIQFLKIPKNLEIYSALTI 143
Query: 531 ---RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
+ N G+ +E YPIN LRN+A+ + ST ++F+ D DF+P LY +++S +
Sbjct: 144 TLVKPNYGH----QEHLAYPINHLRNIAITESSTEFIFVTDADFVPSSNLYTFIRSRLVP 199
Query: 588 MDMHGHGGKK--VLVVPAFETQRYRTAFP-----------------------ASHAPTNF 622
++ G + V+P F P A H PT
Sbjct: 200 YVIYQSGKLQPTAWVIPCFAIHEAYAQLPIPNTYNELRKLVGQGVAYITDPGAGHGPT-L 258
Query: 623 SRWVNATTP--------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ + P Y++ + +EPY V HR P YD RF G +K
Sbjct: 259 ATEIAMVRPLLMGNPLAYEVCYESQWEPYYVLHRSAPLYDVRFRNQGGDK 308
>gi|157111360|ref|XP_001651502.1| hypothetical protein AaeL_AAEL015300 [Aedes aegypti]
gi|108868339|gb|EAT32564.1| AAEL015300-PA [Aedes aegypti]
Length = 331
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 82/166 (49%), Gaps = 18/166 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +LP + + +D+D IF + + +LW++F Q G+ D G W
Sbjct: 106 RLFLPSLLPH-IDSVLYVDSDTIFLSPVQELWSMFRNFNASQFAGMAPEHEDKNAG--WY 162
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++RD W I +++ L +W DQDI N +
Sbjct: 163 NRFARHPYYGDLGVNSGVMLMNLTRMRDFKWEEHIMPIYKEYRLKLVW---GDQDIINIL 219
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
HP ++Y PC WN R+D Y + D +KIIH N
Sbjct: 220 FHHHPDMLYIFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN 259
>gi|428181260|gb|EKX50124.1| hypothetical protein GUITHDRAFT_103938 [Guillardia theta CCMP2712]
Length = 577
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 103/213 (48%), Gaps = 37/213 (17%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DV+ V QL+ DRL + + S + +E Q+ L LR R + +H
Sbjct: 181 DVSWVTQLTSDRLDGFKAMMARRANSCS----SASSELQRDLDLI---RPLRDR--VDFH 231
Query: 538 VVYKE---GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
+V + G YP+NTLRN+A+++ T ++ L+D DF+P +Y L + G
Sbjct: 232 LVLADLHPGRLYPVNTLRNLAMDKCRTQWMVLVDADFVPNDDVYDLLLPRLEPWGKTG-- 289
Query: 595 GKKVLVVPAFETQRYRTAFPA-----------------------SHAPTNFSRWVNATTP 631
G ++ V+PAF+ Q P +H T++ RW ++
Sbjct: 290 GNEIYVLPAFQMQDKEKGIPRNKMELLSLGPAVSQVHPEKGRDIAHKYTDYERWRVSSEI 349
Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
Y+I++ +EPY VA++ +PRY++ FVG+G +K
Sbjct: 350 YEIQYRMPYEPYYVANKRIPRYESAFVGYGNDK 382
>gi|428183316|gb|EKX52174.1| hypothetical protein GUITHDRAFT_102076 [Guillardia theta CCMP2712]
Length = 688
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 105/224 (46%), Gaps = 39/224 (17%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSEALRSRRNI 534
G+D+++ L++DRL ++ F+ W GP+ +Y+ + E ++ + + + R+N
Sbjct: 366 GDDISISTHLTVDRLDRLQSYFRRWAGPMVACIYIRNKLEEKELVDRFVKALDVEGRKN- 424
Query: 535 GYHVVY---KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH 591
YH++ ++ YP N LRN+AL V + +VFLLDID +P G Y L+S I + +
Sbjct: 425 -YHLILANARDATTYPANILRNLALETVKSTFVFLLDIDLIPDPGFYKSLQSQINVLRVM 483
Query: 592 GHGGKKVLVVPAFE----TQRYRTAFPA--------------------------SHAPTN 621
V +PAF+ T FP + + N
Sbjct: 484 SKN--HVFTIPAFQGVGAAASTETQFPVDKDDLMTLLRAAKIKPILSGKDEFWPAFSCLN 541
Query: 622 FSRWVNATTPYQIEWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
+ W NAT Y ++ EPY++ A +P +D RF+ +G +K
Sbjct: 542 YDAWYNATQSYLADYRWPCEPYLLGATSSMPFFDERFIHYGNDK 585
>gi|157128205|ref|XP_001655090.1| hypothetical protein AaeL_AAEL011072 [Aedes aegypti]
gi|108872658|gb|EAT36883.1| AAEL011072-PA [Aedes aegypti]
Length = 373
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 82/166 (49%), Gaps = 18/166 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +LP + + +D+D IF + + +LW++F Q G+ D G W
Sbjct: 148 RLFLPSLLPH-IDSVLYVDSDTIFLSPVQELWSMFRNFNASQFAGMAPEHEDKNAG--WY 204
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++RD W I +++ L +W DQDI N +
Sbjct: 205 NRFARHPYYGDLGVNSGVMLMNLTRMRDFKWEEHIMPIYKEYRLKLVW---GDQDIINIL 261
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
HP ++Y PC WN R+D Y + D +KIIH N
Sbjct: 262 FHHHPDMLYIFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN 301
>gi|354494706|ref|XP_003509476.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Cricetulus
griseus]
gi|344243237|gb|EGV99340.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Cricetulus griseus]
Length = 415
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQMATVLAYALSSHCPEMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGTNYALGTNISYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + + Y ++D+D +P GL+ L+ + D H LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRRLREML---DQSNHWDGMALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R HAPTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|148701142|gb|EDL33089.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1,
isoform CRA_a [Mus musculus]
Length = 415
Score = 89.0 bits (219), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150
Query: 532 RNIGYHVV------------YKEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + + Y ++D+D +P GL+ L+ +D H LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R HAPTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|402578443|gb|EJW72397.1| hypothetical protein WUBG_16696, partial [Wuchereria bancrofti]
Length = 144
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 3/97 (3%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
+V L+ Q SMDRL L ++W GP+S +Y++D+E + + ++ L +R N+ H
Sbjct: 48 NVVLITQFSMDRLTHFNALLQYWTGPVSAAVYVTDSELSLLIQFFDDT--LANRTNVALH 105
Query: 538 VVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLP 573
VYKE +YPIN LRNVALN + +VFL D+DF+P
Sbjct: 106 AVYKESTYYPINYLRNVALNSSNDASFVFLADVDFIP 142
>gi|30039682|ref|NP_780592.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Mus
musculus]
gi|67460548|sp|Q8BWP8.1|B3GN1_MOUSE RecName: Full=N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase; AltName:
Full=I-beta-1,3-N-acetylglucosaminyltransferase;
Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1
gi|26341082|dbj|BAC34203.1| unnamed protein product [Mus musculus]
gi|47124409|gb|AAH69927.1| B3gnt1 protein [Mus musculus]
gi|74182782|dbj|BAE34720.1| unnamed protein product [Mus musculus]
gi|148701143|gb|EDL33090.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1,
isoform CRA_b [Mus musculus]
Length = 415
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + + Y ++D+D +P GL+ L+ +D H LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R HAPTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|326427667|gb|EGD73237.1| hypothetical protein PTSG_04953 [Salpingoeca sp. ATCC 50818]
Length = 625
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 105/234 (44%), Gaps = 46/234 (19%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSE-----ALRSR 531
+ +++V Q+S+DRL ++ L W GP+S+ ++ SD + L+
Sbjct: 318 HPLSIVTQVSLDRLSLLRYLPSTWSGPVSMAVHHSDDSDGGDGGRESAAAWVPAFVLKHL 377
Query: 532 RNIGYH-------VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
R + + + G YPIN LRN A+ +T +VF++D DF+P L +
Sbjct: 378 RKLAPTQPWAATLTLGRTGEPYPINALRNAAIEGAATDFVFIMDADFVPTSNLKDQFALA 437
Query: 585 IRSMDMHGHG-GKKVLVVPAFET-QRYRTAF----------------------------- 613
+D +G + VVPAFE Q R +
Sbjct: 438 AGFVDGNGDSVARTAFVVPAFEMLQGDRASIGVGDLPKTMEQLVAAMEAGTVRPFRESVS 497
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
P SHA T++S W +A TPY +++ D +EPY++ + LP +D F G+G NK
Sbjct: 498 PLSHAATDYSYWRHAYTPYTVDYQVDLYEPYVIVQKTNHLPLFDPNFTGYGMNK 551
>gi|198425664|ref|XP_002127967.1| PREDICTED: similar to MGC83717 protein [Ciona intestinalis]
Length = 397
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 132/316 (41%), Gaps = 48/316 (15%)
Query: 64 STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV- 122
ST + + +FY A++ +V + +G + + K + + A D V
Sbjct: 51 STQTFTKESKAFYRAENHPINVKSDREEQIAGTFEGKQAKAAKDVAQKHADEFKNDYHVL 110
Query: 123 -IFATDIAQLWALF----------SRLRQRQTIGL---VENQSDWYLGKLWKNHKPWPAL 168
IFA +L F ++ +R+T L E + ++ + +K H P
Sbjct: 111 FIFAHKNPKLEKKFTTAFGSMFKHAKFNERETFHLHFVCEKEGKEFVQQYFKEHVSHPKF 170
Query: 169 GRGFYLADSV---------VEDV-SWIPNKH----YSGVYGLLKLTLPKVLPETLAKTIV 214
+ D++ V+ + + + N H + ++ L + LP ++PE + + +
Sbjct: 171 NLKIHFHDTIKLARQVHPLVQGIQAMLGNDHPYFSETAIF-FLSMVLPNIMPENIHRLVQ 229
Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKPWPAL 265
LD D+ F T+I LW+ F + IG+ Y LW K +P+PA
Sbjct: 230 LDLDLKFVTNIRDLWSEFDYFTKDTVIGIGNENQPVYRHLLWQYRNENPNTKVGEPYPAG 289
Query: 266 GRGFNTGVILLDLTKLRDISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
GFN+GV+LL L +R S A + I +K+ L DQD + + EH
Sbjct: 290 QPGFNSGVVLLHLDNMRTSSLYNSYMNATNLQRIVQKY---HFKGHLGDQDFYTLLAFEH 346
Query: 320 PYLVYTLPCQWNVQLS 335
P L + LPC WN QL
Sbjct: 347 PPLFHVLPCGWNKQLC 362
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/170 (22%), Positives = 70/170 (41%), Gaps = 25/170 (14%)
Query: 27 TRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEVSFYLADSV---- 81
T + ++ K F + H H + + +Q F S P+ + + D++
Sbjct: 125 TTAFGSMFKHAKFNERETFHLHFVCEKEGKEFVQQYFKEHVSHPKFNLKIHFHDTIKLAR 184
Query: 82 -----VEDV-SWIPNKH----YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
V+ + + + N H + ++ L + LP ++PE + + + LD D+ F T+I L
Sbjct: 185 QVHPLVQGIQAMLGNDHPYFSETAIF-FLSMVLPNIMPENIHRLVQLDLDLKFVTNIRDL 243
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKPWPALGRGF 172
W+ F + IG+ Y LW K +P+PA GF
Sbjct: 244 WSEFDYFTKDTVIGIGNENQPVYRHLLWQYRNENPNTKVGEPYPAGQPGF 293
>gi|355751935|gb|EHH56055.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca fascicularis]
Length = 383
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 101/245 (41%), Gaps = 62/245 (25%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSRRN 533
NDV L S+D L + L + WEGP+S++++ + E Q L YA +S R
Sbjct: 70 NDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLDYALSSHCPDMRAR 129
Query: 534 IGYHVV-----------------------YKEGN--FYPINTLRNVALNQVSTPYVFLLD 568
+ H+V Y G YP N LRN+A + Y ++D
Sbjct: 130 VAMHLVCPSRYEAAEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGAN--YALVID 187
Query: 569 IDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------- 612
+D +P GL+ L+ +D G LVVPAFE +R R
Sbjct: 188 VDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEV 244
Query: 613 -------FPASHAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
APTN+SRWVN Y + W +EP+ VA +P +D RF
Sbjct: 245 RPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQ 304
Query: 660 FGWNK 664
+G+N+
Sbjct: 305 YGFNR 309
>gi|444510201|gb|ELV09536.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Tupaia chinensis]
Length = 415
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 107/266 (40%), Gaps = 83/266 (31%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
LRN+A + Y ++D+D +P GL+ S+R M D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSNQWGGTALVVPAFEIRR 264
Query: 609 YRTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAP 638
R P S APTN+SRWVN Y + W
Sbjct: 265 ARR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQD 323
Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 324 PWEPFYVAGGKVPPFDERFRQYGFNR 349
>gi|241133589|ref|XP_002404534.1| Gyltl1B protein, putative [Ixodes scapularis]
gi|215493625|gb|EEC03266.1| Gyltl1B protein, putative [Ixodes scapularis]
Length = 281
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 95/219 (43%), Gaps = 46/219 (21%)
Query: 487 MDRLQMVEMLFKHWEGPISLTLYMSDAEAQ-----QFLSYAGNSEALR----SRRNIGYH 537
MDRL ++E + WEGP+SL +Y Q Q L + L+ S ++ Y
Sbjct: 1 MDRLPVLEHTLQQWEGPVSLVIYAPVKAGQPEGDWQKLYVQKKLQKLKLHPDSHVSLAYG 60
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
V YPIN LRN+A+ Q + ++FL D DF P + + S RS+ + K
Sbjct: 61 V--PGSGDYPINALRNLAIRQTNCEHLFLADADFQPSPDFHKHFLLSTRSV---SNTDKV 115
Query: 598 VLVVPAFE----------------------------TQRYR-TAFPASHAPTNFSRWVNA 628
VVPAFE QR+R P SH T++ +W
Sbjct: 116 AFVVPAFEYLELPQKSDGVAQTKEELMQLLHRQDPFVQRFRHDVTPDSHQSTDYWKWYRT 175
Query: 629 TTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
PY + D FEPY+V LP YD RF G+G NK
Sbjct: 176 DRPYPVRTFVDMFEPYMVVKNVPQLPLYDERFTGYGSNK 214
>gi|256083969|ref|XP_002578207.1| TATA-binding protein-associated phosphoprotein [Schistosoma
mansoni]
Length = 316
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 69/127 (54%), Gaps = 7/127 (5%)
Query: 178 VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQ 237
+ E + + H +G LKL + ++LP + K IVLDTD++ DI+ LW+LF +
Sbjct: 170 IKERIKSFKSSHPTGHKNYLKLLITQILPNEIRKVIVLDTDILLNDDISNLWSLFDEFNK 229
Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW-----RI 292
Q IG+ Q+ + + + WP LG G+N+G++L+ LTKLR W W +
Sbjct: 230 DQCIGIAAEQNPTFYSNM--GQRFWPTLGYGYNSGILLIHLTKLRARGWDEIWMKTAFNL 287
Query: 293 IAEKFLL 299
I EK +L
Sbjct: 288 IEEKRIL 294
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 2/92 (2%)
Query: 81 VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQ 140
+ E + + H +G LKL + ++LP + K IVLDTD++ DI+ LW+LF +
Sbjct: 170 IKERIKSFKSSHPTGHKNYLKLLITQILPNEIRKVIVLDTDILLNDDISNLWSLFDEFNK 229
Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
Q IG+ Q+ + + + WP LG G+
Sbjct: 230 DQCIGIAAEQNPTFYSNM--GQRFWPTLGYGY 259
>gi|351710870|gb|EHB13789.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Heterocephalus glaber]
Length = 415
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 105/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEQAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVTRPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDMDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R HAPTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKAELLQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPSYIVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPAFDERFRQYGFNR 349
>gi|384486813|gb|EIE78993.1| hypothetical protein RO3G_03698 [Rhizopus delemar RA 99-880]
Length = 415
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 68/229 (29%), Positives = 112/229 (48%), Gaps = 38/229 (16%)
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
E + D+T+V ++ +R+ + L K ++GPIS T+++SD E ++ +S
Sbjct: 98 EMDMESKDITIVTLVTRNRIPNLARLAKKYKGPISTTIHISDDEEGEYTLEVLEDTIEKS 157
Query: 531 ---RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
R+ + H+V E + +N RNVA T YV LLDIDF P L + ++ ++
Sbjct: 158 SEMRKYVDVHIVRDEFD-RELNLWRNVAKLFARTDYVMLLDIDFYPCSNLRDNVLNNPKA 216
Query: 588 MDMHGHGGKKVLVVPAFETQR------YRTAFPAS-----------------------HA 618
M + G+ LV+PAFE + YRT FP + HA
Sbjct: 217 MSL-LESGEAALVIPAFEFSKQQDGLDYRT-FPKNKQELIMAYQDNKVEMFHSFWLPGHA 274
Query: 619 PTNFSRWVNA-TTPYQI-EWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
PT++ RW++ + YQ+ + +EPY++ ++ P D RF G+G NK
Sbjct: 275 PTDYDRWMSTDDSIYQVTTYQHSYEPYVIFKNQGSPWCDERFAGYGSNK 323
>gi|291416316|ref|XP_002724391.1| PREDICTED: UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1 [Oryctolagus
cuniculus]
Length = 415
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRTCQEVFDKLARVAQPGVNYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSSQWGGTALVVPAFEIRRA 265
Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
R P S APTN+SRWVN Y + W
Sbjct: 266 RR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|395851721|ref|XP_003798401.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Otolemur
garnettii]
Length = 415
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
R P S APTN+SRWVN Y + W
Sbjct: 266 RR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|189442730|gb|AAI67687.1| LOC100170600 protein [Xenopus (Silurana) tropicalis]
Length = 435
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 110/263 (41%), Gaps = 80/263 (30%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------EAQQFL 519
DV L + S++ L ++ L +HW+G ISL L+ S+A +Q +
Sbjct: 111 DVVLASHASVNNLGHLQDLVQHWDGRISLALFASNAVQAKLAVMFVYALSQLCPHVRQRV 170
Query: 520 SYA-----------------------GNSEALRSRR-NIGYHVVYKEGNF-YPINTLRNV 554
S+ N EA+ S+ ++G VV GN YP N LRNV
Sbjct: 171 SFHLVCKSSDRVTFPELEDLSDFATLPNCEAVFSKAADMGIKVVNYAGNASYPNNLLRNV 230
Query: 555 ALNQV-STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR-- 608
A + S YV ++DID +P GL +S ++ G V VVPAFE T+R
Sbjct: 231 ARAGIESAAYVLVVDIDMVPSEGL----RSGFVNLATGGVDQHLVFVVPAFEIRHTRRLP 286
Query: 609 ------------------YRTAFPASHAPTNFSRWVNA---------TTPYQIEWAPDFE 641
Y P APTN+S W+N Y +EW +E
Sbjct: 287 STKEELMRLYQVGEVRAFYEELCPRCQAPTNYSLWINLPEKKPAAKLGVAYVVEWKDPWE 346
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
P+ + D+P YD RF +G+N+
Sbjct: 347 PFYIGRADVPAYDERFKQYGFNR 369
>gi|193786796|dbj|BAG52119.1| unnamed protein product [Homo sapiens]
Length = 415
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 105/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L + S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILASHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|426252530|ref|XP_004019962.1| PREDICTED: LOW QUALITY PROTEIN: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Ovis aries]
Length = 421
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 105/266 (39%), Gaps = 81/266 (30%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRS 530
SD NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S
Sbjct: 96 SDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDM 155
Query: 531 RRNIGYHVVY------------KEGNF------------------------------YPI 548
R + H+V + G F YP
Sbjct: 156 RARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPN 215
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQ 607
N LRN+A + Y ++D+D +P GL+ S+R M D LVVPAFE +
Sbjct: 216 NLLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIR 269
Query: 608 RYRTA-----------------------FPASHAPTNFSRWVNAT------TPYQIEWAP 638
R R APTN+SRWVN Y + W
Sbjct: 270 RARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQD 329
Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 330 PWEPFYVAGGKVPTFDERFRQYGFNR 355
>gi|296218850|ref|XP_002807418.1| PREDICTED: LOW QUALITY PROTEIN: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Callithrix
jacchus]
Length = 415
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|301619991|ref|XP_002939368.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Xenopus
(Silurana) tropicalis]
Length = 421
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 110/263 (41%), Gaps = 80/263 (30%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------EAQQFL 519
DV L + S++ L ++ L +HW+G ISL L+ S+A +Q +
Sbjct: 97 DVVLASHASVNNLGHLQDLVQHWDGRISLALFASNAVQAKLAVMFVYALSQLCPHVRQRV 156
Query: 520 SYA-----------------------GNSEALRSRR-NIGYHVVYKEGNF-YPINTLRNV 554
S+ N EA+ S+ ++G VV GN YP N LRNV
Sbjct: 157 SFHLVCKSSDRVTFPELEDLSDFATLPNCEAVFSKAADMGIKVVNYAGNASYPNNLLRNV 216
Query: 555 ALNQV-STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR-- 608
A + S YV ++DID +P GL +S ++ G V VVPAFE T+R
Sbjct: 217 ARAGIESAAYVLVVDIDMVPSEGL----RSGFVNLATGGVDQHLVFVVPAFEIRHTRRLP 272
Query: 609 ------------------YRTAFPASHAPTNFSRWVNA---------TTPYQIEWAPDFE 641
Y P APTN+S W+N Y +EW +E
Sbjct: 273 STKEELMRLYQVGEVRAFYEELCPRCQAPTNYSLWINLPEKKPAAKLGVAYVVEWKDPWE 332
Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
P+ + D+P YD RF +G+N+
Sbjct: 333 PFYIGRADVPAYDERFKQYGFNR 355
>gi|197101823|ref|NP_001126908.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Pongo abelii]
gi|67460445|sp|Q5R4S2.1|B3GN1_PONAB RecName: Full=N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase; AltName:
Full=I-beta-1,3-N-acetylglucosaminyltransferase;
Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1
gi|55733120|emb|CAH93244.1| hypothetical protein [Pongo abelii]
Length = 415
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|5802984|ref|NP_006867.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Homo sapiens]
gi|114638684|ref|XP_508571.2| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Pan
troglodytes]
gi|397517017|ref|XP_003828716.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Pan paniscus]
gi|426369296|ref|XP_004051629.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Gorilla
gorilla gorilla]
gi|23396461|sp|O43505.1|B3GN1_HUMAN RecName: Full=N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase; AltName:
Full=I-beta-1,3-N-acetylglucosaminyltransferase;
Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1
gi|2745741|gb|AAC39538.1| i-beta-1,3-N-acetylglucosaminyltransferase [Homo sapiens]
gi|18314366|gb|AAH21965.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Homo
sapiens]
gi|66394590|gb|AAY46155.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6 [Homo
sapiens]
gi|119594923|gb|EAW74517.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6,
isoform CRA_a [Homo sapiens]
gi|119594924|gb|EAW74518.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6,
isoform CRA_a [Homo sapiens]
gi|123984573|gb|ABM83632.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
[synthetic construct]
gi|123998563|gb|ABM86883.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
[synthetic construct]
gi|410208370|gb|JAA01404.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
troglodytes]
gi|410247954|gb|JAA11944.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
troglodytes]
gi|410301258|gb|JAA29229.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
troglodytes]
gi|410353541|gb|JAA43374.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
troglodytes]
Length = 415
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|403293582|ref|XP_003937792.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Saimiri
boliviensis boliviensis]
Length = 415
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|343958360|dbj|BAK63035.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Pan
troglodytes]
Length = 415
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAVHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEMRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|328719955|ref|XP_001943688.2| PREDICTED: glucoside xylosyltransferase 1-like isoform 1
[Acyrthosiphon pisum]
gi|328719957|ref|XP_003246910.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2
[Acyrthosiphon pisum]
Length = 376
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 16/151 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KL 255
+L LP VL L + +DTD +F + + +W FS++ Q +V D +G
Sbjct: 155 RLFLPTVLNH-LDSILYVDTDTLFLSSVEDVWKHFSQMNSSQLAAMVPEHEDMNVGWYNR 213
Query: 256 WKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
+ NH P G+ G N+GV+L++LT++R+ W + EK+ RL+ + DQDI N
Sbjct: 214 FANH---PYYGKLGLNSGVMLMNLTRMREFDWISRLAPVLEKY---RLYLTWGDQDIINV 267
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
I +HP VY PC++N RSD Y
Sbjct: 268 IFHDHPDKVYVYPCRYNY------RSDHCMY 292
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 71/169 (42%), Gaps = 23/169 (13%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNP-LHFHLITDTVALNILQTLFST---WSV 68
EV+ A++C N T + +IKS + +R N L +I + NI Q T W +
Sbjct: 67 EVVIAAVICG--NRTSQALVMIKSAIVFRGNSDLRIVVIAEK---NIQQEFDETLRYWRL 121
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL------KLTLPKVLPETLAKTIVLDTDV 122
+ Y S++ P +H L +L LP VL L + +DTD
Sbjct: 122 LTNDSFSYEIHSII-----FPKEHSDEWRKLFQPCASERLFLPTVLNH-LDSILYVDTDT 175
Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KLWKNHKPWPALG 169
+F + + +W FS++ Q +V D +G + NH + LG
Sbjct: 176 LFLSSVEDVWKHFSQMNSSQLAAMVPEHEDMNVGWYNRFANHPYYGKLG 224
>gi|402892745|ref|XP_003909569.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Papio anubis]
Length = 415
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
+ NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGSTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLAEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|410974574|ref|XP_003993719.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Felis catus]
Length = 415
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265
Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
R PA+ APTN+SRWVN Y + W
Sbjct: 266 RR-MPANKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|161078603|ref|NP_001097911.1| CG9996 [Drosophila melanogaster]
gi|158030380|gb|AAF56394.2| CG9996 [Drosophila melanogaster]
gi|162944842|gb|ABY20490.1| IP22168p [Drosophila melanogaster]
Length = 362
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 85/162 (52%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 136 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 192
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 193 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 249
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C T +K+IH N
Sbjct: 250 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSQTGVKVIHGN 290
>gi|281208069|gb|EFA82247.1| putative glycosyltransferase [Polysphondylium pallidum PN500]
Length = 391
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 96/206 (46%), Gaps = 31/206 (15%)
Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
+S+DRL+ + + W PIS +Y+ +D E + + S ++ +I H++Y
Sbjct: 70 VSVDRLERLAWMADKWRAPISAAVYIQNDTEIEAIANLIRGSYSVTQFVDI--HLLYANN 127
Query: 544 NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPA 603
YPINTLRN+++ T +V L+D DF+P LY K+ ++ V+P+
Sbjct: 128 TRYPINTLRNLSIKHAETDWVLLMDADFIPPISLY---KNIVQHTYKVPRDELTAFVIPS 184
Query: 604 FETQ--RYR----------------------TAFPASHAPTNFSRWVNATTPYQIEWAPD 639
F + R++ P H P+N++RW Y+ ++
Sbjct: 185 FASSLDRFKIPDDKLDLLKEIKKKTIVPTNLNVCPKCHTPSNYTRWYKEKDTYEAKYQWV 244
Query: 640 FEPYIVAHR-DLPRYDTRFVGFGWNK 664
+EPY+V + D +D R G+G++K
Sbjct: 245 YEPYLVYRKIDAEPFDERLKGYGFDK 270
>gi|452824557|gb|EME31559.1| glycosyltransferase-like protein LARGE [Galdieria sulphuraria]
Length = 655
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 111/217 (51%), Gaps = 30/217 (13%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWE-GPI-SLTLYMSDAEAQQFLSYAGNSEALRSRR 532
+ +++TL LSM R+ +++ L K ++ GP+ ++ S+ E ++ L Y R
Sbjct: 384 NKSELTLAIPLSMHRMFILKELLKFYKTGPVVAVIAIQSEQEKKELLEYLSIIPN-NMRM 442
Query: 533 NIGYHVVYKE--GNFYPINTLRNVALNQVSTPYVFLLDIDFLPM---FGLYPYLKSSIRS 587
N+ + +V + +PIN LRN+AL +T +V +LD+D P+ F +P + +
Sbjct: 443 NVEFVIVLISPWKDRFPINQLRNIALKYTTTDFVCVLDVDTFPVSSAFTAFPQIME--KE 500
Query: 588 MDMHGHGGKKVLVV-------------PAFE---TQRYRTAFP---ASHAPTNFSRWVNA 628
++ K+ LVV P+ E ++ +T FP AS +P +F RW+
Sbjct: 501 PELLPMNRKRCLVVTNWIASDSEQLVYPSVEDLKSKYLKTWFPYCEASQSPISFRRWLYD 560
Query: 629 TTPYQIEWAPDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
Y + + P+FEPY I+ RD +D RF G+G+NK
Sbjct: 561 NNSYFVSFRPNFEPYCIMRTRDFILFDERFHGYGFNK 597
>gi|332249596|ref|XP_003273944.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Nomascus
leucogenys]
Length = 415
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAF+ +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFQIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|109939728|sp|Q5EA01.2|B3GN1_BOVIN RecName: Full=N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase; AltName:
Full=I-beta-1,3-N-acetylglucosaminyltransferase;
Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1
gi|75773634|gb|AAI05207.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 2 [Bos
taurus]
gi|296471480|tpg|DAA13595.1| TPA: N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Bos taurus]
gi|440899377|gb|ELR50680.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Bos
grunniens mutus]
Length = 415
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
LRN+A + Y ++D+D +P GL+ S+R M D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264
Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
R APTN+SRWVN Y + W
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|77736590|ref|NP_001029980.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Bos
taurus]
gi|59857901|gb|AAX08785.1| beta-1,3-N-acetylglucosaminyltransferase bGnT-6 [Bos taurus]
Length = 415
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFAPLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
LRN+A + Y ++D+D +P GL+ S+R M D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264
Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
R APTN+SRWVN Y + W
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|61553937|gb|AAX46482.1| beta-1,3-N-acetylglucosaminyltransferase bGnT-6 [Bos taurus]
Length = 388
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
LRN+A + Y ++D+D +P GL+ S+R M D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264
Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
R APTN+SRWVN Y + W
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349
>gi|67970700|dbj|BAE01692.1| unnamed protein product [Macaca fascicularis]
Length = 415
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 103/262 (39%), Gaps = 79/262 (30%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSRRN 533
NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 93 NDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMRAR 152
Query: 534 IGYHVVY------------KEGNF------------------------------YPINTL 551
+ H+V + G F YP N L
Sbjct: 153 VAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNNLL 212
Query: 552 RNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRT 611
RN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R R
Sbjct: 213 RNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRARR 267
Query: 612 A-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDFEP 642
APTN+SRWVN Y + W +EP
Sbjct: 268 MPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEP 327
Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
+ VA +P +D RF +G+N+
Sbjct: 328 FYVAGGKVPTFDERFRQYGFNR 349
>gi|358336521|dbj|GAA55008.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 213
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 80/160 (50%), Gaps = 29/160 (18%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRK-----------------------------N 43
+ I +AI+ +G N L+ +K++L+Y+ N
Sbjct: 43 DTIDIAIILSGPNQEYQLMNFLKNLLYYQGRFNSSSEECKLNYTVQRDYVCPKQLTVLYN 102
Query: 44 PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLT 103
PL+ HLI D++++ + + TW + VE +FY DS++ V IP+ H +G+ L K+
Sbjct: 103 PLNIHLILDSLSMEFILNVIHTWRIRDVEFTFYHFDSLMRYVQAIPDIHMAGIAALAKIA 162
Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
+P+V+ + K I+LD+D++F+ +I LW F + Q
Sbjct: 163 VPEVISSGVNKVILLDSDMLFSENINLLWKEFENFNETQV 202
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 41/69 (59%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY DS++ V IP+ H +G+ L K+ +P+V+ + K I+LD+D++F+ +I LW
Sbjct: 134 FYHFDSLMRYVQAIPDIHMAGIAALAKIAVPEVISSGVNKVILLDSDMLFSENINLLWKE 193
Query: 232 FSRLRQRQT 240
F + Q
Sbjct: 194 FENFNETQV 202
>gi|90076258|dbj|BAE87809.1| unnamed protein product [Macaca fascicularis]
gi|355566281|gb|EHH22660.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca mulatta]
gi|380788399|gb|AFE66075.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca mulatta]
gi|383411805|gb|AFH29116.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca mulatta]
gi|384943476|gb|AFI35343.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca mulatta]
Length = 415
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
+ NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|432854536|ref|XP_004067949.1| PREDICTED: xyloside xylosyltransferase 1-like [Oryzias latipes]
Length = 384
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 86/180 (47%), Gaps = 22/180 (12%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + ++PE+L + LD D+ + T+I L+
Sbjct: 173 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTHIVQLDLDLKYRTNIRNLFQE 232
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK--PWPALGR-------GFNTGVILLDLTKLR 282
F + IG++ Y W+ K P +G GFN+GV+LLDL +R
Sbjct: 233 FDKFPPGAVIGIIREMQPVYRHTFWQYRKENPQTKVGEPPPDGLPGFNSGVMLLDLQGMR 292
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ A + +++ A+K+ R L DQD F I EHP L Y+LPC WN QL
Sbjct: 293 --ASALYNQLLEPSNVAKLADKY---RFRGHLGDQDFFTMIGMEHPNLFYSLPCGWNRQL 347
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/157 (19%), Positives = 66/157 (42%), Gaps = 14/157 (8%)
Query: 29 SLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW---SVPQVEVSFY--------- 76
++++++K F + L H ++D + + Q + + + + EV F+
Sbjct: 113 AMLSMVKHGHFLKGEVLVLHFVSDKASQQLGQRMLQDFLQDATFKYEVLFHDVAELTQTL 172
Query: 77 --LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
+ +++ + S +YS L + + ++PE+L + LD D+ + T+I L+
Sbjct: 173 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTHIVQLDLDLKYRTNIRNLFQE 232
Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
F + IG++ Y W+ K P G
Sbjct: 233 FDKFPPGAVIGIIREMQPVYRHTFWQYRKENPQTKVG 269
>gi|194374501|dbj|BAG57146.1| unnamed protein product [Homo sapiens]
Length = 408
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 84 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 143
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 144 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 203
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVP FE +R
Sbjct: 204 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPTFEIRRA 258
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 259 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 318
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 319 EPFYVAGGKVPTFDERFRQYGFNR 342
>gi|198434732|ref|XP_002132013.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 444
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 84/268 (31%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM---SDAEAQQFLSYAGNSEALRSRRN 533
NDVTLV Q S++ L + + W+GPIS+ L++ S A+ + + LR
Sbjct: 79 NDVTLVTQCSVNNLYHLIETLQLWKGPISVALFVPGSSIIHAELAIKFIKQCFPLRK--- 135
Query: 534 IGYHVVYKE--------------------------------------------GNFYPIN 549
+ +H+VY E G YP N
Sbjct: 136 VQFHLVYPEHHTPTFDGNNLEIWVGLLFANDRLRCKDLFETLSSIEGENYATWGIPYPQN 195
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYP-YLKSSIRSMDMHGHGGKKVLVV-PAFETQ 607
TLRNVA V T Y+ +LDID +P +GL +L+ +R+ + K+ + PAFE +
Sbjct: 196 TLRNVAKLAVQTEYLIVLDIDVMPSYGLRENFLEKVLRNRGRNKTAETKIAYIPPAFELK 255
Query: 608 ------------------------RYRTAFPASHAPTNFSRW------VNATTPYQIEWA 637
Y T P+ H P + W +N Y+ EW
Sbjct: 256 INEQMPENKALLLTAWKSNQIRPFHYETC-PSCHKPARYDEWKAISTNLNLGIAYEAEWQ 314
Query: 638 PDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
P +EP+ +AH P YD RF +G+++
Sbjct: 315 PSWEPFYIAHTSHTPDYDERFKQYGYDR 342
>gi|158299248|ref|XP_319364.4| AGAP010187-PA [Anopheles gambiae str. PEST]
gi|157014274|gb|EAA13820.4| AGAP010187-PA [Anopheles gambiae str. PEST]
Length = 376
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 113/248 (45%), Gaps = 32/248 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + L + +D+D +F + + ++WA+F + Q G+ D G W
Sbjct: 151 RLFLPSLL-QHLDAVLYVDSDTVFLSPVQEIWAMFQQFNATQFAGMAPEHEDMNAG--WY 207
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++R+ W I ++ L +W DQDI N +
Sbjct: 208 NRFARHPYYGELGVNSGVMLMNLTRMREFRWEEHILPIYREYRLQLVW---GDQDILNVL 264
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWNSPKKLKVKNKHMEF 371
+P +Y PC WN R+D Y + D +KIIH N + ++
Sbjct: 265 FHYNPDRLYVFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN---RGYFHSRAQPI 315
Query: 372 FRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL-SNLNEEDPCYDLRRSKLTSLRTHL 430
F +Y T E+ R +++G N +T + L SN N C L L R
Sbjct: 316 FNLIYSTIEEF---TFRTDVYG-NFIRTIEESLVLPSNSN----CDKLLDKFLQDPRK-- 365
Query: 431 YFLEYEYE 438
YF E +YE
Sbjct: 366 YFKENQYE 373
Score = 39.3 bits (90), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 66/155 (42%), Gaps = 17/155 (10%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYR-KNPLHFHLITD-TVALNILQTLFSTWSVPQVE 72
I V +V +L + +I+F R ++PL F +I + + LN ++ L + W
Sbjct: 63 ISVVVVACNERHNEALNMIKSAIIFNRNQSPLRFVIIAEEKLKLNFMEKL-NDWQEVTDR 121
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFAT 126
+ Y S+ P + L K L LP +L + L + +D+D +F +
Sbjct: 122 IFTYEIHSLT-----FPRANKDEWRKLFKPCAAQRLFLPSLL-QHLDAVLYVDSDTVFLS 175
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ ++WA+F + Q G+ D G W N
Sbjct: 176 PVQEIWAMFQQFNATQFAGMAPEHEDMNAG--WYN 208
>gi|348535861|ref|XP_003455416.1| PREDICTED: uncharacterized protein C3orf21 homolog [Oreochromis
niloticus]
Length = 394
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 83/177 (46%), Gaps = 16/177 (9%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + ++PE+L + + LD D+ + T+I L+
Sbjct: 183 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTRIVQLDLDLKYRTNIRDLFQE 242
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLR 282
F R IG+ Y W + KP +G GFN+GV+LLDL +R
Sbjct: 243 FHRFPPGAVIGITREMQPVYRHTFWQYRKEKPRTRVGEPPPDGLPGFNSGVMLLDLGAMR 302
Query: 283 DISWAGFWRI-----IAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
S A + ++ +A+ R L DQD F I EHP L Y+L C WN QL
Sbjct: 303 --SSALYNKLLEPSNVAKLADQYRFRGHLGDQDFFTMIGMEHPELFYSLACGWNRQL 357
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/153 (21%), Positives = 65/153 (42%), Gaps = 16/153 (10%)
Query: 29 SLVTLIKSILFYRKNPLHFHLITDTVALNI----LQTLFSTWSVPQVEVSFY-------- 76
++++++K F L H ++D + + LQ LF + + EV F+
Sbjct: 123 AMLSMVKHGHFLEGEVLVLHFVSDQASQELGKRMLQELFLDATF-KYEVVFHDVVALTQK 181
Query: 77 ---LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
+ +++ + S +YS L + + ++PE+L + + LD D+ + T+I L+
Sbjct: 182 LFPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTRIVQLDLDLKYRTNIRDLFQ 241
Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP 166
F R IG+ Y W+ K P
Sbjct: 242 EFHRFPPGAVIGITREMQPVYRHTFWQYRKEKP 274
>gi|344295836|ref|XP_003419616.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Loxodonta
africana]
Length = 415
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPMSVSVFAATKEEAQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 SRVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREL---LDQSRQWAGTALVVPAFEIRRA 265
Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
R P S APTN+SRWVN Y + W
Sbjct: 266 RR-MPVSKTELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPSYVVPWQDP 324
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 325 WEPFYVAGGKVPPFDERFRQYGFNR 349
>gi|72006259|ref|XP_782394.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Strongylocentrotus purpuratus]
Length = 441
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 105/271 (38%), Gaps = 84/271 (30%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA--LRSRRNIG 535
D+TL+ SM+ L V L W+GPISL ++ S ++ + +SY + + L RRN+
Sbjct: 103 DITLITHCSMNNLHHVPDLVDRWKGPISLAVFGSLSDVKTIVSYIMSLKECILGVRRNVT 162
Query: 536 YHVVY----------------------------------------------KEGNFYPIN 549
+H+V+ +G YP N
Sbjct: 163 FHLVHPVNSVSEEAEIFGSVNNKRSIRVPLIHCSKLMEAIKASYSSVSNYAADGILYPNN 222
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYP----YLKSSIRSMDMHGHGGKKVLVVPAFE 605
LRNV + T + F+LDID LP GL ++K S + V V+PAFE
Sbjct: 223 LLRNVGRANILTEFHFVLDIDMLPSVGLRENFLNFVKGSSSLWLDEDMVQQMVFVLPAFE 282
Query: 606 T-------------------------QRYRTAFPASHAPTNFSRWVNATT-------PYQ 633
Q Y A T++ RW + + Y
Sbjct: 283 MDDRAVSKVPGDKKELLSYLEKDRVRQFYVKACLYCQNHTDYRRWTSLQSFTGQLGVAYT 342
Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EW P +EP+ ++ D P YD RF +G+N+
Sbjct: 343 VEWKPMWEPFYISRSDAPLYDERFRQYGYNR 373
>gi|291237248|ref|XP_002738549.1| PREDICTED: like-glycosyltransferase-like [Saccoglossus kowalevskii]
Length = 715
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 84/171 (49%), Gaps = 19/171 (11%)
Query: 180 EDVSWIPNKHYSGVYGLLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
E +S +K+Y L + L K +LP + K I++D+D+ F TDI +L+ F
Sbjct: 511 EHISTGKHKYYMEAIFFLSVALHKNILPSEVKKLIMIDSDLKFRTDIKKLFQHFDNFETG 570
Query: 239 QTIGLVENQSDWYLG--KLWKNHKPWPALG-------RGFNTGVILLDLTKLRD-ISWAG 288
IG+ Q Y L+++ P +G GFN+GV+LLDL +++ IS+
Sbjct: 571 NVIGIAHEQQPVYRHTFSLYRSQNPGTLVGDPPPNGLSGFNSGVMLLDLERMQSSISYNA 630
Query: 289 FW-----RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
F R +++K+ L DQD F I E+ L Y LPC WN QL
Sbjct: 631 FLDVDVVRALSDKY---SFKGHLGDQDFFTLISLENKNLFYVLPCTWNRQL 678
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 3/90 (3%)
Query: 83 EDVSWIPNKHYSGVYGLLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
E +S +K+Y L + L K +LP + K I++D+D+ F TDI +L+ F
Sbjct: 511 EHISTGKHKYYMEAIFFLSVALHKNILPSEVKKLIMIDSDLKFRTDIKKLFQHFDNFETG 570
Query: 142 QTIGLVENQSDWYLG--KLWKNHKPWPALG 169
IG+ Q Y L+++ P +G
Sbjct: 571 NVIGIAHEQQPVYRHTFSLYRSQNPGTLVG 600
>gi|417400521|gb|JAA47198.1| Putative glycosyltransferase [Desmodus rotundus]
Length = 415
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|311247200|ref|XP_003122534.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Sus
scrofa]
Length = 415
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPSYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|338712409|ref|XP_001916917.2| PREDICTED: LOW QUALITY PROTEIN: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Equus
caballus]
Length = 396
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 72 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 131
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 132 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 191
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 192 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 246
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 247 RRMPTDKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDPW 306
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 307 EPFYVAGGKVPTFDERFRQYGFNR 330
>gi|391339560|ref|XP_003744116.1| PREDICTED: glycosyltransferase-like protein LARGE2-like
[Metaseiulus occidentalis]
Length = 286
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 97/217 (44%), Gaps = 41/217 (18%)
Query: 487 MDRLQMVEMLFKHWEGPISLTLY-----MSDAEAQ-QFLSYAGNSEALRSRRNIGYHVVY 540
MDRL ++E W+ P+SL +Y + +A+ Q L + A + + +VY
Sbjct: 1 MDRLGVLEHTLDQWDMPVSLVIYAPLKALRNADDDWQRLYISKKVTAFKLHPSSVVCLVY 60
Query: 541 KEG--NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
YPIN+LRN+A+ Q + ++ L D DF P L P + IR+ K
Sbjct: 61 ASSVSEQYPINSLRNIAIKQAHSQFLLLADADFQPSPDLAPKVLPLIRNRT--KKFLKSA 118
Query: 599 LVVPAFETQRY----------------------------RTAFPASHAPTNFSRWVNATT 630
+V+PAFE R P SHA T++ RW A
Sbjct: 119 VVIPAFEYADVPHKSDVVAANKHELMQLIHAQGIVCPFRRQDAPQSHAATDYWRWYTARE 178
Query: 631 PYQIEWAPD-FEPYIVAHRD--LPRYDTRFVGFGWNK 664
PY++ D FEPY+V ++ LP Y +FVG+G NK
Sbjct: 179 PYEVHLFSDKFEPYLVIEKNAHLPLYWEKFVGYGMNK 215
>gi|301762478|ref|XP_002916660.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Ailuropoda melanoleuca]
gi|281350599|gb|EFB26183.1| hypothetical protein PANDA_004757 [Ailuropoda melanoleuca]
Length = 415
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVALHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|327286348|ref|XP_003227892.1| PREDICTED: uncharacterized protein C3orf21 homolog, partial [Anolis
carolinensis]
Length = 236
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 75/164 (45%), Gaps = 18/164 (10%)
Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN 246
+YS L + + +++P +++ I++D DV + I +L+ F R + IG+
Sbjct: 29 GTYYSDAIFFLSVAMHRIMPAEISRLILVDLDVEYRASIQELFEEFDRFPEGAVIGIAHE 88
Query: 247 QSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR------DISWAGFWR 291
Y W+ K P P GFN+GV+LLDL +R + G R
Sbjct: 89 MQPVYRHLFWQFRKEHPGTRVGDPPPEGLPGFNSGVLLLDLPAMRASGLYNGLLEEGPLR 148
Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
+A++F R L DQD F EHP L + LPC WN QL
Sbjct: 149 ALAQRF---RFRGHLGDQDFFTLAGMEHPGLFHRLPCAWNRQLC 189
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 39/174 (22%), Positives = 67/174 (38%), Gaps = 20/174 (11%)
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
P VE + + E S +YS L + + +++P +++ I++D DV + I
Sbjct: 8 PLVERLLPVIGPMQELFSAGAGTYYSDAIFFLSVAMHRIMPAEISRLILVDLDVEYRASI 67
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNK 188
+L+ F R + IG+ Y W+ K P G + D E + +
Sbjct: 68 QELFEEFDRFPEGAVIGIAHEMQPVYRHLFWQFRKEHP----GTRVGDPPPEGLPGFNS- 122
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
G+L L LP + L ++ + L AL R R R +G
Sbjct: 123 ------GVLLLDLPAMRASGLYNGLLEE---------GPLRALAQRFRFRGHLG 161
>gi|355733055|gb|AES10900.1| N-acetyllactosaminide beta-1 [Mustela putorius furo]
Length = 427
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 105/265 (39%), Gaps = 81/265 (30%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 104 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 163
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 164 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 223
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 224 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 278
Query: 610 RTAFPAS------------------------HAPTNFSRWVNAT------TPYQIEWAPD 639
R P + APTN+SRWVN Y + W
Sbjct: 279 RR-MPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 337
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 338 WEPFYVAGGKVPTFDERFRQYGFNR 362
>gi|345783078|ref|XP_533222.3| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Canis lupus
familiaris]
Length = 415
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 102/264 (38%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D NDV L S+D L + L + WEGP+S+ ++ + E Q L+YA +S R
Sbjct: 91 DPNDVILATHASVDNLLHLSGLLERWEGPLSVAVFAATKEEAQLATVLTYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|384501051|gb|EIE91542.1| hypothetical protein RO3G_16253 [Rhizopus delemar RA 99-880]
Length = 417
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/231 (29%), Positives = 111/231 (48%), Gaps = 40/231 (17%)
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS-DAEAQQFLSYAGNSEALR 529
E + D+T+V ++ +R+ + L K ++GPIS T+++S D E + L +
Sbjct: 98 EMDMESKDITIVTLVTRNRIPNLARLAKKYKGPISTTIHISDDKEGEYTLEILAQTMEKN 157
Query: 530 S--RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
R+ + H+V E + +N RNVA T YV LLDIDF P + + ++ ++
Sbjct: 158 PDMRKYVDVHIVRDEFD-RELNLWRNVAKLFARTEYVMLLDIDFYPCTNIRKNVLNNPKA 216
Query: 588 MDMHGHGGKKVLVVPAFETQR------YRTAFPAS-----------------------HA 618
M + G+ LV+PAFE + YRT FP + HA
Sbjct: 217 MTLL-ESGEAALVIPAFEFSKQEDGLDYRT-FPKNKPDLIKAYKENKIEMFHSFWLPGHA 274
Query: 619 PTNFSRWVNAT---TPYQI-EWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
PT++ RW++ + YQ+ + +EPY++ ++ P D RF G+G NK
Sbjct: 275 PTDYDRWISVEQDDSIYQVTTYQHSYEPYVIFKNQGSPWCDERFAGYGSNK 325
>gi|313232704|emb|CBY19374.1| unnamed protein product [Oikopleura dioica]
Length = 578
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 108/267 (40%), Gaps = 74/267 (27%)
Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL 528
E E+ D DVTLV+Q S D L +E L K W+GPIS+ ++ D +A +S + L
Sbjct: 73 ETESGVDAKDVTLVSQCSPDNLHHIEGLMKAWKGPISIAVFSPDRDA---VSAMFGIQIL 129
Query: 529 RS-----RRNIGYHVVY-----------------------------KEG-NF------YP 547
R RR++ +H++Y EG N+ YP
Sbjct: 130 RKCNADIRRHVTFHIMYPSSHPPRFDETVEDFTDDECSIAIEQLKESEGKNYARGELPYP 189
Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
N+LRN+A ++ T +L+DID +P L + + + + PAFE +
Sbjct: 190 QNSLRNIARVKIKTDLFYLVDIDTVPSNDLRYQFTKFAKKRGLLDQTRLEAFITPAFEIR 249
Query: 608 RYRT-----------------------AFPASHAPTNFSRWV------NATTPYQIEWAP 638
+ P H PT F W ++IE++
Sbjct: 250 SHHNLPANKDNVIRLIDAGEMRPFHYDTCPYCHKPTEFDHWYGYRGSDELEDAFEIEYSN 309
Query: 639 DFEPYIVAH-RDLPRYDTRFVGFGWNK 664
FEP+ +A+ P +D RF +G+++
Sbjct: 310 SFEPFYLANTHTAPLHDERFKAYGFDR 336
>gi|195504526|ref|XP_002099117.1| GE10743 [Drosophila yakuba]
gi|194185218|gb|EDW98829.1| GE10743 [Drosophila yakuba]
Length = 340
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKVIHGN 268
>gi|195354760|ref|XP_002043864.1| GM17754 [Drosophila sechellia]
gi|194129102|gb|EDW51145.1| GM17754 [Drosophila sechellia]
Length = 340
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSHAGVKVIHGN 268
>gi|348565043|ref|XP_003468313.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Cavia
porcellus]
Length = 415
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D +DV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPSDVILATHASVDNLLHLSGLLERWEGPLSISVFAATKEEVQLATVLAYALSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEASVPDPREPGEFALLRSCQEVFDKLAKVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDADMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPTNKAELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPAFDERFRQYGFNR 349
>gi|302565470|ref|NP_001181411.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Macaca mulatta]
Length = 415
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
+ NDV L S+D L + L + WEGP+S++++ + E Q L+Y +S R
Sbjct: 91 EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYVLSSHCPDMR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
LRN+A + Y ++D+D +P GL+ L+ +D G LVVPAFE +R
Sbjct: 211 LLRNLA--REGAHYALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265
Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
R APTN+SRWVN Y + W +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325
Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ VA +P +D RF +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349
>gi|194909001|ref|XP_001981876.1| GG12290 [Drosophila erecta]
gi|190656514|gb|EDV53746.1| GG12290 [Drosophila erecta]
Length = 340
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKVIHGN 268
>gi|195453679|ref|XP_002073892.1| GK12907 [Drosophila willistoni]
gi|194169977|gb|EDW84878.1| GK12907 [Drosophila willistoni]
Length = 361
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 83/162 (51%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTDV+F + I+ LW F + + Q L +G W
Sbjct: 130 RLFLPSLL-TNVDSLLYVDTDVLFLSPISDLWTFFKKFNETQMAALTPEHESENIG--WY 186
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ +W I +++ L W DQDI N +
Sbjct: 187 NRFARHPFYGKLGVNSGVMLMNLTRMREFNWEKQILPIHKEYKLRITW---GDQDIINIL 243
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C T +K+IH N
Sbjct: 244 FYYHPDKLYVMPCEYNYR-PDHCMYMSVCSTSHLGIKLIHGN 284
>gi|397571327|gb|EJK47739.1| hypothetical protein THAOC_33522, partial [Thalassiosira oceanica]
Length = 1892
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/313 (27%), Positives = 135/313 (43%), Gaps = 75/313 (23%)
Query: 397 TQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
+ +QQ E+S+ +D D+ S T ++ HL E ASSD L+ + M +
Sbjct: 1192 SSPQMQQVEMSDAWPKD-IEDIVTSWKTQVQNHLI----EAMASSD-----LINYIPMQQ 1241
Query: 457 LQMVEINPNNVDEYEASS--DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE 514
++ + E+ +S +G DV +V Q S+DRL +E + W G ++ +Y+S E
Sbjct: 1242 SHVL------LQEWTSSKARNGIDVAIVTQCSLDRLASLEAQLRSWPGNSAVAVYVSSTE 1295
Query: 515 AQQFLSYAGNSEALRSRRNIGYHVVYKEG-----NFYPINTLRNVALNQVSTPYVFLLDI 569
++ A +++ SR N VV EG YPIN LRNVAL + + +
Sbjct: 1296 SKSDAIVA--IQSMVSRGNFNCAVVLVEGAPFDDEPYPINYLRNVALLESQRQH-----L 1348
Query: 570 DFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR--------------------- 608
F P L +I + + KV+V+PAFE
Sbjct: 1349 RFHPTLEQSAALLDAILNQN-------KVVVIPAFECSEGVDCPETLPELHELVRDGRAD 1401
Query: 609 --YRTAFPASHAPTNFSR-WVNATT-------------PYQIEWAPDFEPYIVAHR-DLP 651
+ FP H PTNF W+ + + Y +E+ FEPY++ ++P
Sbjct: 1402 GFHLAHFPQGHGPTNFEEFWLVSQSLETSQDCAQAWQKSYSVEYERCFEPYVIMKSANVP 1461
Query: 652 RYDTRFVGFGWNK 664
YD RF G+G NK
Sbjct: 1462 LYDERFTGYGLNK 1474
>gi|195573727|ref|XP_002104843.1| GD18236 [Drosophila simulans]
gi|194200770|gb|EDX14346.1| GD18236 [Drosophila simulans]
Length = 511
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 285 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 341
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 342 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQQIVSIHKEYKLRIIW---GDQDIINIL 398
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C +K+IH N
Sbjct: 399 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSHAGVKVIHGN 439
>gi|194742393|ref|XP_001953687.1| GF17104 [Drosophila ananassae]
gi|190626724|gb|EDV42248.1| GF17104 [Drosophila ananassae]
Length = 361
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + I+ +W F + + Q L + +G W
Sbjct: 137 RLFLPSLLTK-VDSLLYVDTDILFLSPISDIWQYFKKFNESQISALTPEHENENIG--WY 193
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N +
Sbjct: 194 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHILSIHKEYKLRIIW---GDQDIINIL 250
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
HP +Y +PC++N + D+ +C +K+IH N
Sbjct: 251 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKLIHGN 291
>gi|390338379|ref|XP_781007.3| PREDICTED: xyloside xylosyltransferase 1-like [Strongylocentrotus
purpuratus]
Length = 467
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 16/157 (10%)
Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
++ L + ++LPE + + I+LDTD++F +DI +L+ F + +G+V Q Y
Sbjct: 277 AIFFLSTMLHKELLPEYVHRVIMLDTDLVFMSDIKELFEHFDHFQGDNIMGVVHEQQPVY 336
Query: 252 --LGKLWKNHKPWPALGR-------GFNTGVILLDLTK-LRDISWAGFWRIIAEKFLLTR 301
+ L+++ P +G GFN+GV+LLDL + L+ + + + R EK + R
Sbjct: 337 RHIFSLYRSQNPGTRVGNPPPDGLTGFNSGVLLLDLDRMLQSVHFEDYMR--PEKVVELR 394
Query: 302 ----LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
L DQD + + E L Y LPC WN QL
Sbjct: 395 DKFHFKGHLGDQDFYTLVSMEKEDLFYNLPCSWNKQL 431
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 95 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
++ L + ++LPE + + I+LDTD++F +DI +L+ F + +G+V Q Y
Sbjct: 277 AIFFLSTMLHKELLPEYVHRVIMLDTDLVFMSDIKELFEHFDHFQGDNIMGVVHEQQPVY 336
Query: 155 --LGKLWKNHKPWPALG 169
+ L+++ P +G
Sbjct: 337 RHIFSLYRSQNPGTRVG 353
>gi|72079522|ref|XP_780812.1| PREDICTED: xyloside xylosyltransferase 1-like [Strongylocentrotus
purpuratus]
Length = 400
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 18/146 (12%)
Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KLWKNHKP 261
+LP+ + + I+LDTD+ F +DI+ L+A F + + +G+ +Q Y K ++ P
Sbjct: 223 ILPDFVHQIIMLDTDLKFLSDISGLFAHFDKFQGSNVMGIAHDQQPTYRDAFKAFRARNP 282
Query: 262 WPALGR-------GFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLAD 308
+G GFN+GV+LLD+ ++R ++ +I+ K+L SL D
Sbjct: 283 TTRVGSPAPDGLPGFNSGVVLLDIARMRSSQLYNYYLNPPVAELISRKYLFK---GSLGD 339
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQL 334
QD ++ I +E L Y LPC WN L
Sbjct: 340 QDFYSLIGNERENLFYVLPCSWNRHL 365
Score = 45.4 bits (106), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 19/155 (12%)
Query: 34 IKSILFYRKNPLHFHLITDTVA---LNILQTLFSTWSVPQVEVSFYLAD---------SV 81
+KSIL PL F + D + L T+FS +VE+ + D S
Sbjct: 133 MKSILEKSSIPLFFFFVVDEPSRLYLEETMTMFSRKYRSKVEIQYTFLDQETIAKKLASS 192
Query: 82 VEDVSWIPNKHYSGVYG----LLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFS 136
V V I + + G Y L + + + +LP+ + + I+LDTD+ F +DI+ L+A F
Sbjct: 193 VSVVQQIVSGNTQGYYNQSIFFLSVAIHRGILPDFVHQIIMLDTDLKFLSDISGLFAHFD 252
Query: 137 RLRQRQTIGLVENQSDWYLG--KLWKNHKPWPALG 169
+ + +G+ +Q Y K ++ P +G
Sbjct: 253 KFQGSNVMGIAHDQQPTYRDAFKAFRARNPTTRVG 287
>gi|307106498|gb|EFN54743.1| hypothetical protein CHLNCDRAFT_58024 [Chlorella variabilis]
Length = 492
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 113/279 (40%), Gaps = 68/279 (24%)
Query: 447 TLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISL 506
T+V QL ++Q V +D + S DVTLV QLS +RL M+E W G IS
Sbjct: 132 TIVEQLPQLKIQQV------LDSNKPRSP-QDVTLVTQLSFERLYMLEGQCDVWNGVISA 184
Query: 507 TLYMSDAEAQQFLSYAGNSEALR-------SRRNIGYHVVYKEGNF-------------- 545
+Y++ + +++ R R+ +H + +
Sbjct: 185 AVYIALLNGKVVTVELNSNQDPRLTDITEVERKFKQFHQLAEAKGLCKLDLQLVSQTVES 244
Query: 546 ------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD-MHGHGGKKV 598
YP+N +RN AL T V LLD+DF P L ++ + M
Sbjct: 245 IWLSALYPVNAMRNRALANARTDAVLLLDVDFWPSAELSELMQKPAKYESLMKTLQAGAA 304
Query: 599 LVVPAFET--------QRYRTA------------------------FPASHAPTNFSRWV 626
+V+PA+ET + R A + A H T++ +W+
Sbjct: 305 VVLPAYETGDPGDIGVEVAREAVLEGKDAAVKMFWDGRIKAFHTDRYEAGHRATDYKKWL 364
Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
A+ PY I + FEPY++ R +P YD RFVG+ NK
Sbjct: 365 QASRPYNIMYEEGFEPYVIISRKFVPWYDERFVGYRKNK 403
>gi|443723788|gb|ELU12058.1| hypothetical protein CAPTEDRAFT_219773 [Capitella teleta]
Length = 341
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 92/210 (43%), Gaps = 24/210 (11%)
Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
+ + + E S+ +Y+ L + +V+P T+ + I++D D+ F DI QL+ FS
Sbjct: 132 IVEPMQERFSYKKGSYYNHALFFLSTAIHRVMPSTMHRAIMMDVDLKFRGDIKQLYDQFS 191
Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLRDI 284
+ +GL Y W + + P P GFN+GV+LLDL ++R
Sbjct: 192 LFDETHVMGLAREMQPVYRHLFWYHRRENPGTRVGDPPPNGLTGFNSGVLLLDLDRMRSS 251
Query: 285 SWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS--- 335
+A++F R+ L DQD F + H L Y LPC +N QLS
Sbjct: 252 QLYNSMLERKNIESLADEF---RIKGHLGDQDFFTLLGMVHEELFYVLPCVFNRQLSSWW 308
Query: 336 -DNTRSD--ELCYTELTDLKIIHWNSPKKL 362
D+ D L + D+ I H N K+
Sbjct: 309 KDHGYKDVFHLYHDCEGDVLIYHGNGKTKI 338
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/157 (21%), Positives = 67/157 (42%), Gaps = 13/157 (8%)
Query: 28 RSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST-------WSVPQVEVSFY---L 77
+ + + S+ + PL+ H+I + + N+ + + + + + ++V L
Sbjct: 70 KQFILTVTSLFRHATIPLNIHIIGEEASQNLAEKIMAKNVPSTAKYKIVHLDVDHLATEL 129
Query: 78 ADSV---VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
AD V E S+ +Y+ L + +V+P T+ + I++D D+ F DI QL+
Sbjct: 130 ADIVEPMQERFSYKKGSYYNHALFFLSTAIHRVMPSTMHRAIMMDVDLKFRGDIKQLYDQ 189
Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
FS + +GL Y W + + P G
Sbjct: 190 FSLFDETHVMGLAREMQPVYRHLFWYHRRENPGTRVG 226
>gi|443710195|gb|ELU04493.1| hypothetical protein CAPTEDRAFT_192232 [Capitella teleta]
Length = 320
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 85/188 (45%), Gaps = 35/188 (18%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVTL+ L+ DR++ + L ++W GPIS +Y+ + + N +SR NI H
Sbjct: 123 DVTLMVHLTSDRIEKLLRLTENWSGPISAAIYVDNITDSLMIE---NICHFQSRPNIFIH 179
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
+++K G YP + +RN+AL +V T Y FL D+DF+P +Y I+S +
Sbjct: 180 ILHKVGGIYPSSIIRNLALEEVFTEYAFLSDVDFMPAPSVYKTSVDLIKSKN-RLLSKTT 238
Query: 598 VLVVPAFETQRYRT------AFPAS-------------------------HAPTNFSRWV 626
VV AFE +R T FP + H TN+ W
Sbjct: 239 AFVVAAFELKRNITPDGDAVKFPRNKSDLLALQNSTLIQPFHYHPKNHFGHFSTNYEAWN 298
Query: 627 NATTPYQI 634
A T YQ+
Sbjct: 299 IANTTYQV 306
>gi|410921614|ref|XP_003974278.1| PREDICTED: xyloside xylosyltransferase 1-like [Takifugu rubripes]
Length = 389
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P++L + + LD D+ + +I +L+
Sbjct: 178 FPIVEAMQKHFSSGSGAYYSDAIFFLSVAMHRIMPDSLTRMVQLDLDLQYRVNIRELFQE 237
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F + IG+ Y W+ K P P GFN+GV+LLDL +R
Sbjct: 238 FHQFPPGAVIGIAREMQPVYRHTFWQYRKENPQTKVGEPHPDGLPGFNSGVMLLDLASMR 297
Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
S +A+++ R L DQD F I EHP L Y+L C WN QL
Sbjct: 298 ASSLYNQLLEPDHVAKLADQY---RFRGHLGDQDFFTMIGMEHPELFYSLACGWNRQL 352
>gi|289742285|gb|ADD19890.1| putative glycosyltransferase [Glossina morsitans morsitans]
Length = 361
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 80/162 (49%), Gaps = 9/162 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L ET+ + +DTD++F + +A +W F Q L D +G W
Sbjct: 134 RLFLPSIL-ETVDAALYVDTDILFLSPVADIWKFFKSFNGTQIAALSPEHEDANVG--WY 190
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++R+ W I +++ L W DQDI N +
Sbjct: 191 NRFARHPYYGPLGVNSGVMLMNLTRMREFKWEEKVLPIYKEYKLRLTW---GDQDIINIL 247
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
H +Y +PC++N + D+ +C +K+IH N
Sbjct: 248 FYYHSEKLYLMPCEYNYR-PDHCMYMNICNVSADGIKVIHGN 288
Score = 40.0 bits (92), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 60/150 (40%), Gaps = 10/150 (6%)
Query: 17 VAIVCAGYNSTRSLVTLIKSILF-YRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSF 75
VA+V G LV + ++LF + L+F + + + + W + E F
Sbjct: 49 VAVVVCGQRMQEVLVMIKSALLFNSSREKLYFLIFAEERLMISFNEKLNDWRAIRSEFDF 108
Query: 76 YLAD---SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
L V + W ++ +L LP +L ET+ + +DTD++F + +A +W
Sbjct: 109 RLLPLQFPVNSEREW---RNLFKPCAAQRLFLPSIL-ETVDAALYVDTDILFLSPVADIW 164
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNH 162
F Q L D +G W N
Sbjct: 165 KFFKSFNGTQIAALSPEHEDANVG--WYNR 192
>gi|301616898|ref|XP_002937879.1| PREDICTED: uncharacterized protein C3orf21 homolog [Xenopus
(Silurana) tropicalis]
Length = 376
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 94/207 (45%), Gaps = 24/207 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + L +++P+ + + I LD D+ F T++ +L+
Sbjct: 165 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIIQLDLDLKFKTNVRELFEE 224
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F IG+ + Y W+ K P P GFN+GV+LL+L +R
Sbjct: 225 FDNFMHMSIIGIAQEMQPVYRHTFWQYRKENPSTKVGEPPPDGMPGFNSGVMLLNLEAMR 284
Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
++ + +AEK+ L DQD F+ I EHP L Y L C WN QL
Sbjct: 285 QSRLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQLCT 341
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWN 357
D+ D + Y ++KI+H N
Sbjct: 342 WWKDHGYMDVFDDYYRCEGNIKILHGN 368
Score = 43.1 bits (100), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 37/158 (23%), Positives = 67/158 (42%), Gaps = 16/158 (10%)
Query: 29 SLVTLIKSILFYRKNPLHFHLITDT----VALNILQTLFSTWSVPQVEVSFYLADSVVED 84
+L +LIK F L H ++D + ++L+ L S + +V F+ + E
Sbjct: 105 ALSSLIKYGKFESNEVLCLHFVSDEPSKEIGKSLLKDLLQGASF-KYKVIFHDVGMLTEK 163
Query: 85 VSWIP---NKHYSGVYG--------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
+ I KH+S G L + L +++P+ + + I LD D+ F T++ +L+
Sbjct: 164 LFPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIIQLDLDLKFKTNVRELFE 223
Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
F IG+ + Y W+ K P+ G
Sbjct: 224 EFDNFMHMSIIGIAQEMQPVYRHTFWQYRKENPSTKVG 261
>gi|390597315|gb|EIN06715.1| hypothetical protein PUNSTDRAFT_72884 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 382
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 70/130 (53%), Gaps = 7/130 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ-- 70
E + ++V +S LIKSIL Y P FH++ D A ++L++ S P+
Sbjct: 4 EEVHFSLVMFSADSGTEGAMLIKSILMYTTRPTAFHIVCDEGAQSVLESRISLVRRPRYD 63
Query: 71 VEVSFY-----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
+ V FY ++ + I +HYSG+YGL+KL + ++LP T+ ++I +DTD IF
Sbjct: 64 IRVVFYRVAREAMEARIRREGGIYTEHYSGIYGLMKLFIHEILPPTVKRSIFVDTDAIFI 123
Query: 126 TDIAQLWALF 135
+D LW F
Sbjct: 124 SDPTLLWRRF 133
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 34/48 (70%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 232
I +HYSG+YGL+KL + ++LP T+ ++I +DTD IF +D LW F
Sbjct: 86 IYTEHYSGIYGLMKLFIHEILPPTVKRSIFVDTDAIFISDPTLLWRRF 133
>gi|432091091|gb|ELK24303.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Myotis davidii]
Length = 415
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 107/268 (39%), Gaps = 87/268 (32%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
D +DV L S+D L + L + WEGP+S++++ + E Q L+YA +S R
Sbjct: 91 DPDDVILATHASVDNLVHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDIR 150
Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
+ H+V + G F YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQQVFDKLARVAQPGINYALGTNVSYPNN 210
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK---KVLVVPAFET 606
LRN+A + Y ++D+D +P GL+ L+ +M G + LVVPAFE
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLR------EMLGQSKQWAGTALVVPAFEI 262
Query: 607 QRYRTAFPAS------------------------HAPTNFSRWVNATT------PYQIEW 636
+R R P + APTN+SRWVN Y + W
Sbjct: 263 RRARR-IPTNKNELLKLYQVDEVQPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPW 321
Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 322 QDPWEPFYVAGGKVPAFDERFRQYGFNR 349
>gi|358339090|dbj|GAA47215.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 419
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 70/139 (50%), Gaps = 2/139 (1%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
L L P ++ +++ K IVL D++F ++A+LW F Q +G V Q D
Sbjct: 190 ALQNLMAPYIVDQSVDKIIVLSPDLLFNENVAELWTHFEHFSSTQAVGAVCEQVDDCDQC 249
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K+ + +P G NTG++LL L +LR W G ++ + +Q I N
Sbjct: 250 CPKDDEQFPLY--GLNTGLVLLHLDRLRKADWRGLYKAELSSNQYVQESPGDLNQGILNT 307
Query: 315 IISEHPYLVYTLPCQWNVQ 333
++S+ ++Y LPC+WN+Q
Sbjct: 308 VLSKRMRMLYRLPCEWNIQ 326
Score = 43.1 bits (100), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 57/147 (38%), Gaps = 15/147 (10%)
Query: 9 IPTCEV---IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
+ TCE+ + I+C N+T P+ HL+ A N F
Sbjct: 110 MKTCELDWSVPSGILCENINATPI------------AQPIMLHLVVHQKAWNKTLRRFQR 157
Query: 66 WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
W+ + V Y + + + L L P ++ +++ K IVL D++F
Sbjct: 158 WNSTRFLVKLYEFKNELALRGTEAERSPRVQLALQNLMAPYIVDQSVDKIIVLSPDLLFN 217
Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSD 152
++A+LW F Q +G V Q D
Sbjct: 218 ENVAELWTHFEHFSSTQAVGAVCEQVD 244
>gi|170038922|ref|XP_001847296.1| GLT8D3 protein [Culex quinquefasciatus]
gi|167862537|gb|EDS25920.1| GLT8D3 protein [Culex quinquefasciatus]
Length = 375
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 77/166 (46%), Gaps = 18/166 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +D+D IF + +LW+LF Q G+ D G W
Sbjct: 150 RLFLPSLLTH-IDSVLYVDSDTIFLAPVQELWSLFKTFNASQFAGMAPEHEDKNAG--WY 206
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++R+ W I +F L +W DQDI N +
Sbjct: 207 NRFARHPYYGDLGVNSGVMLMNLTRMREFRWEQHIMPIYREFRLKLVW---GDQDIINIL 263
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
HP + PC WN R+D Y + + +KI+H N
Sbjct: 264 FHHHPDRLLVFPCDWNY------RADHCMYMSVCEAPDGVKIVHGN 303
>gi|71834366|ref|NP_001025270.1| glucoside xylosyltransferase 1 precursor [Danio rerio]
gi|82079018|sp|Q5SP46.1|GXLT1_DANRE RecName: Full=Glucoside xylosyltransferase 1; AltName:
Full=Glycosyltransferase 8 domain-containing protein 3
gi|190337605|gb|AAI63763.1| Glycosyltransferase 8 domain containing 3 [Danio rerio]
gi|190339952|gb|AAI63768.1| Glycosyltransferase 8 domain containing 3 [Danio rerio]
Length = 405
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 118/271 (43%), Gaps = 40/271 (14%)
Query: 165 WPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTD 218
WPA +G + V+ +S+ P++H+ L K L LP +L E L + +DTD
Sbjct: 129 WPAFIQGKF--SYVLHPISF-PHEHHEEWSQLFKPCASQRLFLPMILRE-LDSLLYVDTD 184
Query: 219 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW-PALGR-GFNTGVILL 276
V+F + +W + Q I + + + W + W P G+ G N+GV+L+
Sbjct: 185 VLFLQPVELIWDMLMLFNSTQLIAMAPEHEEPRIA--WYSRFSWHPYYGKMGINSGVMLM 242
Query: 277 DLTKLR-----------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
+LT++R + W + +K+ L W DQD+ N I +P +VYT
Sbjct: 243 NLTRMRITQFKNDMTPVGLHWDELLMPLLQKYKLNITW---GDQDLINIIFHYNPEMVYT 299
Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGN 385
LPC WN + E + ++H N + + FR +Y F +Y
Sbjct: 300 LPCHWNYRPDHCIYGSNCAPAEDEGVFVLHGN---RGVFHSDKQPAFRAVYEAFEQYT-- 354
Query: 386 LLRRELFGCNLTQTTLQQAELSNLNEEDPCY 416
FG +L Q+ L + E++ LNE Y
Sbjct: 355 ------FGEDLRQSLLSRLEVA-LNETTHTY 378
>gi|428166538|gb|EKX35512.1| hypothetical protein GUITHDRAFT_118323 [Guillardia theta CCMP2712]
Length = 1696
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 93/207 (44%), Gaps = 44/207 (21%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS-EALRSR--- 531
+ V LV Q S+DRL +E W G S+ LY+ ++ +Q NS + R
Sbjct: 1380 ASGVVLVTQASLDRLPRLEAQLLAWRGAASVALYVPWSDQRQETEDRLNSIRDMYGRLLD 1439
Query: 532 RNIGYHVV----------YKEGNFYPINTLRNVALNQ---VSTPYVFLLDIDFLPMFGLY 578
+G VV ++ N YPIN LRN+ALN + V L+D+DFLP L
Sbjct: 1440 AGVGGVVVSILFGNSPSEHEYNNLYPINALRNLALNTSKFCAAELVLLVDVDFLPSKAL- 1498
Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAP 638
V + + Y A + A ++T PY++E+
Sbjct: 1499 ----------------------VDKCQDEEYLAAMRQASA---IGSMFSSTEPYEVEYQS 1533
Query: 639 DFEPYIVAH-RDLPRYDTRFVGFGWNK 664
++EPY++ + + +P+YD RF G+G NK
Sbjct: 1534 NYEPYVIMNVKTVPKYDERFRGYGMNK 1560
>gi|156373903|ref|XP_001629549.1| predicted protein [Nematostella vectensis]
gi|156216552|gb|EDO37486.1| predicted protein [Nematostella vectensis]
Length = 304
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 83/183 (45%), Gaps = 18/183 (9%)
Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
S+ +YS L L KV PE L K I+LD+D+ F DI L+ F + +G
Sbjct: 104 SYKKGAYYSDPIFFLSTALHKVFPENLTKIIMLDSDLRFRNDINLLFQHFKKFESSNIMG 163
Query: 243 LVENQSDWY--LGKLWKNHKPWPALGR-------GFNTGVILLDLTKLRDISWAGFWRII 293
L Q Y + ++++ P +G GFN+GV LL L +R + +II
Sbjct: 164 LATEQQPVYRHIFDTYRHNNPGTMVGEPPPNGITGFNSGVALLHLENMR--RSKTYNQII 221
Query: 294 AEKFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTEL 348
+ + + L DQD ++ + EH L Y LPC WN QL R + Y ++
Sbjct: 222 SSEAAIDMAKKYNFKGHLGDQDFYSLLSVEHKDLFYILPCGWNRQLCSYWRDNG--YAKV 279
Query: 349 TDL 351
DL
Sbjct: 280 FDL 282
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 78/172 (45%), Gaps = 17/172 (9%)
Query: 15 IQVAIVCAGYNSTRSLVT--LIKSILFYRKNPLHFHLITDTVALNILQTLF--STWSVPQ 70
I + +V A + S ++ T IKSI + P+ FH I D ++++ +F ++
Sbjct: 18 ILMTLVKAYHGSPLAVSTEKCIKSICKHSIIPITFHFIVDKKGTDVVKGIFMRTSGHCTN 77
Query: 71 VEVSFYLADSVVEDV-----------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
++ F+ AD + E + S+ +YS L L KV PE L K I+LD
Sbjct: 78 PDIVFHDADKLAERLKGIVQVLQRPFSYKKGAYYSDPIFFLSTALHKVFPENLTKIIMLD 137
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY--LGKLWKNHKPWPALG 169
+D+ F DI L+ F + +GL Q Y + ++++ P +G
Sbjct: 138 SDLRFRNDINLLFQHFKKFESSNIMGLATEQQPVYRHIFDTYRHNNPGTMVG 189
>gi|321452095|gb|EFX63567.1| hypothetical protein DAPPUDRAFT_204739 [Daphnia pulex]
Length = 326
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 99/217 (45%), Gaps = 27/217 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD IF T +W F+++ + Q + D+ G W
Sbjct: 111 RLFLPSLL-QGVDSVLYMDTDTIFLTSPLNVWEHFNKMDEMQMAAVAPEHEDFSTG--WY 167
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++R W + I ++ +W DQDI N +
Sbjct: 168 NRFARHPYYGNLGVNSGVMLMNLTRMRHFGWEKYVVPIYHEYKSQIVW---GDQDIINIV 224
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYT-ELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
HP +Y PC +N + D+ +C T E + ++H N + NK F+
Sbjct: 225 FHFHPEKLYVYPCHYNYR-PDHCMYMSVCKTAEEQGIHVLHGN---RGSFHNKKQPAFKA 280
Query: 375 LYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNE 411
+Y + +Y FG NL Q L+N+ E
Sbjct: 281 VYASIEQYS--------FGENL------QTLLTNMKE 303
>gi|326426427|gb|EGD71997.1| hypothetical protein PTSG_00013 [Salpingoeca sp. ATCC 50818]
Length = 458
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 25/160 (15%)
Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY---------LGK 254
+LP+ + K IV+D DV+F D+ LW +F + + IGLV Q + L K
Sbjct: 271 LLPD-VDKAIVVDADVVFRVDVTLLWQIFRAFKTEELIGLVYEQQPTFRHTFAAYRKLNK 329
Query: 255 LWKNHKPWPALGR------GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRL 302
P ++G GFNTGV LL+L +R ++ + + K++++
Sbjct: 330 RTHIGSPAASVGGVRAGLPGFNTGVALLNLAHMRASDLISELVYTDLVDHMTRKYMVSER 389
Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
T +F + +EHP+L + LPC+WN QL R E
Sbjct: 390 LTPAT---LFTLMAAEHPFLFHVLPCKWNRQLCQKMRQSE 426
>gi|196016771|ref|XP_002118236.1| hypothetical protein TRIADDRAFT_62267 [Trichoplax adhaerens]
gi|190579211|gb|EDV19312.1| hypothetical protein TRIADDRAFT_62267 [Trichoplax adhaerens]
Length = 1053
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 108/237 (45%), Gaps = 46/237 (19%)
Query: 474 SDGND--VTLVAQLSMDRLQMVEMLFKHWEGPISLTLY----MSDAEAQQFLSYA-GNSE 526
+ G D +T+ Q+++DRL +E L + W+GPIS+ L+ +S Q L G
Sbjct: 253 TSGKDPFITIATQMTIDRLPTLERLVQRWQGPISVVLFVLNDISGVTVQDHLQQVLGKYH 312
Query: 527 ALRS-RRNIGYHVVY------KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
+ R+ I H+V+ K+ YP+N LRN+A+ T +V+LLDID +P
Sbjct: 313 QVDDFRKFIDLHIVFEDALNGKDSALYPVNFLRNLAIKFSKTSHVYLLDIDIIPSVSHEQ 372
Query: 580 YLKSSIR-SMDMHGH------GGKKVLVVPAFE-----------------TQRYRTAFP- 614
+ + ++ S D++ K + P F+ ++ R P
Sbjct: 373 AVAAIMKASHDINNREHNIKVCQKCAFITPLFKLKTDAVDNDIPREKNVLLEQIRKDNPP 432
Query: 615 ------ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
SHA T F W ++++ ++ + + EPY + +R L P + F G+G +K
Sbjct: 433 IEPFHAVSHAMTQFGIWFSSSSAFKSNYIENNEPYFIINRKLSPNINNIFSGYGRDK 489
>gi|410929415|ref|XP_003978095.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Takifugu
rubripes]
Length = 431
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 102/262 (38%), Gaps = 77/262 (29%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--SDAEAQQFLSYAGNSEALRSRRNI 534
N +TL +++ L +E L + W+ P+S+ ++ D L YA + + + +
Sbjct: 107 NILTLATHTTINNLHHLESLLERWQNPLSVAIFAHGEDVRFATALVYALSFFCPQIQALV 166
Query: 535 GYHVV------------------------------------YK----EGNF-YPINTLRN 553
+H+V YK GN YP N LRN
Sbjct: 167 DFHLVCLSGETASFPEKDREHFAGLEDCASVFLRLEKHRDKYKNYDISGNVSYPNNLLRN 226
Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----- 608
VA + + YV ++DID +P L+ + +R H G +V V+PAFE +
Sbjct: 227 VARSGTESAYVLVIDIDMMPSADLHQQFLAMVRK---HKPAGDEVFVLPAFEIRHARKIP 283
Query: 609 ------------------YRTAFPASHAPTNFSRWVNA--------TTPYQIEWAPDFEP 642
Y P APTN+SRWVN+ Y I W +EP
Sbjct: 284 ANKAELVQLYQVGEVRPFYEELCPRCQAPTNYSRWVNSHVRETATLEVAYTITWVDPWEP 343
Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
+ + +P YD F +G+N+
Sbjct: 344 FYIGPNSVPLYDENFKQYGFNR 365
>gi|320170259|gb|EFW47158.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
[Capsaspora owczarzaki ATCC 30864]
Length = 448
Score = 75.9 bits (185), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 72/299 (24%), Positives = 121/299 (40%), Gaps = 87/299 (29%)
Query: 448 LVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLT 507
+++QL +D I+ + V + + + DVTLV+Q D L L ++W+GP+S+
Sbjct: 89 VLSQLPLDESSKYRIHMHYV-RRQIAIEQPDVTLVSQCDADHLHEAIGLAQYWDGPVSVA 147
Query: 508 LYMSDAEAQQFLSYAGNSEAL--RSRRNIGYHVVYKEG-------------------NF- 545
+++ + L+ A R R + +H+VY E NF
Sbjct: 148 VFIP-GNVYKALATLRQIRACHDRVRERVSFHLVYMENESVYEEVAASELPLEGACENFQ 206
Query: 546 -------------------------------YPINTLRNVALNQVSTPYVFLLDIDFLPM 574
YP N LRNVA + VS+ ++F++DID P
Sbjct: 207 DPYEELRANNGTAVRPKLQQQKAPAKYDGVPYPNNFLRNVARSGVSSTFMFVIDIDMRPS 266
Query: 575 FGLYPYLKSSIRSMDMHGHGG-KKVLVVPAFETQR-----------------------YR 610
GL + +R D + VVP FE + Y
Sbjct: 267 RGLRLWF---LRLADKRPRASLRTAFVVPTFELKAGAPIPRIRQQLIDTYLQGNARPFYE 323
Query: 611 TAFPASHAPTNFSRWVNATTP-----YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
A PT+F +W++ T+P Y+++W +EP+ ++ +LP YD RF +G+N+
Sbjct: 324 KACWKCQKPTDFDQWLSVTSPISKPTYKVDWKDPWEPFYISWTNLPLYDERFQRYGFNR 382
>gi|195151677|ref|XP_002016765.1| GL21941 [Drosophila persimilis]
gi|194111822|gb|EDW33865.1| GL21941 [Drosophila persimilis]
Length = 318
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 81/160 (50%), Gaps = 9/160 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD +F + I+++WA F + Q L + +G W
Sbjct: 95 RLFLPSLLT-NVDSLLYVDTDTLFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WY 151
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R+ W I E++ L +W DQDI N
Sbjct: 152 NRFARHPFYGRLGVNSGVMLMNLTRMREFHWEKHILSIHEEYKLRIIW---GDQDIINIF 208
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIH 355
+P ++ +PC++N + D+ C + +K++H
Sbjct: 209 FYYYPDKLFVMPCEYNYR-PDHCMYMSTCNMTHSGVKLMH 247
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 74/169 (43%), Gaps = 16/169 (9%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILF-YRKNPLHFHLITDTVALNILQTLFSTWSV 68
P+ + + +VC G +LV + +I+F Y K L F + T+ L+ + W
Sbjct: 2 PSRTPLYIVVVCCGQRVQETLVMIKSAIVFNYEKEYLKFLIFTEKEKDEELREKLTDWK- 60
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDV 122
+ SF+ D + +++ P+++ L K L LP +L + + +DTD
Sbjct: 61 -NILTSFF--DFEILPLNF-PSQNKIEWMNLFKPCAAQRLFLPSLLT-NVDSLLYVDTDT 115
Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR 170
+F + I+++WA F + Q L + +G W N P GR
Sbjct: 116 LFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WYNRFARHPFYGR 162
>gi|49903673|gb|AAH76793.1| MGC83717 protein [Xenopus laevis]
Length = 367
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + L +++P+ + + + LD D+ F T+I +L+
Sbjct: 156 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIVQLDLDLKFKTNIRELFEE 215
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ + P P GFN+GV+LL+L +R
Sbjct: 216 FDNFMHTSVIGIAPEMQPVYRHTFWQYRRENPLTKVGDPPPDGMPGFNSGVMLLNLEAMR 275
Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
++ + +AEK+ L DQD F+ I EHP L Y L C WN QL
Sbjct: 276 LSKLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQL 330
>gi|198477802|ref|XP_002136434.1| GA22542 [Drosophila pseudoobscura pseudoobscura]
gi|198145185|gb|EDY71889.1| GA22542 [Drosophila pseudoobscura pseudoobscura]
Length = 295
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 81/160 (50%), Gaps = 9/160 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD +F + I+++WA F + Q L + +G W
Sbjct: 72 RLFLPSLLT-NVDSLLYVDTDTLFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WY 128
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P GR G N+GV+L++LT++R+ W I E++ L +W DQDI N
Sbjct: 129 NRFARHPFYGRLGVNSGVMLMNLTRMREFHWEKHILSIHEEYKLRIIW---GDQDIINIF 185
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIH 355
+P ++ +PC++N + D+ C + +K++H
Sbjct: 186 FYYYPDKLFVMPCEYNYR-PDHCMYMSTCNMTHSGVKLMH 224
>gi|260822374|ref|XP_002606577.1| hypothetical protein BRAFLDRAFT_62546 [Branchiostoma floridae]
gi|229291920|gb|EEN62587.1| hypothetical protein BRAFLDRAFT_62546 [Branchiostoma floridae]
Length = 183
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 66/150 (44%), Gaps = 18/150 (12%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK--- 257
+ KV P + LD D+ F +DI +L+ +F IG+ E Q Y W+
Sbjct: 1 MHKVFPSDTKHIVQLDCDLKFRSDIRELFDMFDHFESDNVIGIAEEQQPVYRHTFWQYRN 60
Query: 258 NHK------PWPALGRGFNTGVILLDLTKLRDISW------AGFWRIIAEKFLLTRLWTS 305
HK P P GFN+GV+LLDL K+R ++E+F
Sbjct: 61 EHKGTKVGDPPPDGQPGFNSGVLLLDLDKMRGSRLYNSLLNGTVVHAMSEEFHFK---GH 117
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
L DQD F + +P L + LPC WN QL
Sbjct: 118 LGDQDYFTLLSYRYPQLFHVLPCTWNRQLC 147
>gi|353681747|ref|NP_001086551.2| xyloside xylosyltransferase 1 [Xenopus laevis]
Length = 387
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + L +++P+ + + + LD D+ F T+I +L+
Sbjct: 176 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIVQLDLDLKFKTNIRELFEE 235
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ + P P GFN+GV+LL+L +R
Sbjct: 236 FDNFMHTSVIGIAPEMQPVYRHTFWQYRRENPLTKVGDPPPDGMPGFNSGVMLLNLEAMR 295
Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
++ + +AEK+ L DQD F+ I EHP L Y L C WN QL
Sbjct: 296 LSKLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQL 350
>gi|195341842|ref|XP_002037514.1| GM18308 [Drosophila sechellia]
gi|194132364|gb|EDW53932.1| GM18308 [Drosophila sechellia]
Length = 408
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 106/245 (43%), Gaps = 54/245 (22%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYA- 522
++Y SS+G+ V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 93 EQYVYSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 152
Query: 523 -GNSEALRSRRNIGYHVVYK--------EGNFYPINTLRNVALNQVSTPYVFLLDIDFLP 573
G + R + + ++ + N YP N +RN+A T YVFL DID +P
Sbjct: 153 LGKFDCQYPDRTLKALLKFRSLKTLQWRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVP 212
Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------- 616
P L +R+ + V+P FE R FP S
Sbjct: 213 STNSVPQLNHFLRTANCTKSCA---YVIPTFEID-VRATFPRSKNALVRLIRKGLARPFH 268
Query: 617 -------HAPTNFSRWVNATTP----------YQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
TNFS+W+++ T E+ +EP+ ++ + P +D RF+G
Sbjct: 269 EKVFIYNQYATNFSKWLSSNTNDTEVSVSHIVTNFEFL--YEPFYISVDNAPAHDERFLG 326
Query: 660 FGWNK 664
+G+ +
Sbjct: 327 YGFTR 331
>gi|449668715|ref|XP_002164899.2| PREDICTED: glucoside xylosyltransferase 2-like [Hydra
magnipapillata]
Length = 403
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 87/179 (48%), Gaps = 20/179 (11%)
Query: 163 KPWPALGRGFYLAD-------SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 215
K WPA + + S V+ W K V +L L +L ++ I +
Sbjct: 163 KTWPAWSNRWMIYSMYNIYYPSSVKKSQW---KKTFKVCATQRLFLSSILA-SIDSVIYI 218
Query: 216 DTDVIFATDIAQLWALFSRLRQRQTIGL---VENQSDWYLGKLWKNHKPWPALGRGFNTG 272
DTDV+ I +LW + S++ Q IGL E +++ + K +N P G N+G
Sbjct: 219 DTDVLMFKPIEKLWKMISQMNSNQIIGLAQECETKANCWYNKYSRNIYFNP---HGLNSG 275
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
V+L++LT++R ++W + I + + +++ + DQDI N ++HP VY L C WN
Sbjct: 276 VMLMNLTRMRSLNWE---KAIVDIYKISKSNITYGDQDILNIYFAKHPNQVYLLSCTWN 331
Score = 42.7 bits (99), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 9/140 (6%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVA--LNILQTLFSTWSVPQ 70
E I +A V G + + + + SI L+FH+ +D N L+T + WS
Sbjct: 114 ESIHIATVACGNDKIQEALVALMSITLTTDYNLYFHIFSDNSKPFENQLKT-WPAWSNRW 172
Query: 71 VEVSFY--LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
+ S Y S V+ W K V +L L +L ++ I +DTDV+ I
Sbjct: 173 MIYSMYNIYYPSSVKKSQW---KKTFKVCATQRLFLSSILA-SIDSVIYIDTDVLMFKPI 228
Query: 129 AQLWALFSRLRQRQTIGLVE 148
+LW + S++ Q IGL +
Sbjct: 229 EKLWKMISQMNSNQIIGLAQ 248
>gi|384486619|gb|EIE78799.1| hypothetical protein RO3G_03504 [Rhizopus delemar RA 99-880]
Length = 330
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 43/227 (18%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL------SYAGNSEAL 528
D DVT+ ++ +R ++ L ++GPIS+ ++++D +Q + Y N +
Sbjct: 18 DSEDVTIATLVTHNRFPVLSRLATRYQGPISVAIHINDDATKQTILKDLHAMYETNPDM- 76
Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
R + H+V + + N RNVA T Y+ +LD+DF L+ M
Sbjct: 77 --GRFVDIHLVVDKLD-RQFNMWRNVAKFFARTEYIMMLDVDFHLCTDFRQSLRKHTDLM 133
Query: 589 DMHGHGGKKVLVVPAFE------TQRYRTAFPAS-----------------------HAP 619
D+ G K +VVPAFE + +RT FP + H
Sbjct: 134 DILRQG-KGAVVVPAFEYLAEEDGEDWRT-FPTTKEALIDIVQQEKIDMFHITWTRGHGS 191
Query: 620 TNFSRWVNATTPYQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
TN+++W + PY++E + +EPY++ ++ P D RFVG+G NK
Sbjct: 192 TNYTKWYASNDPYKVEDYNYSYEPYVIYKKEGSPWCDERFVGYGANK 238
>gi|302844911|ref|XP_002953995.1| hypothetical protein VOLCADRAFT_94735 [Volvox carteri f.
nagariensis]
gi|300260807|gb|EFJ45024.1| hypothetical protein VOLCADRAFT_94735 [Volvox carteri f.
nagariensis]
Length = 360
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 71/169 (42%), Gaps = 47/169 (27%)
Query: 543 GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL------------KSSIRSMDM 590
PINTLRN AL STP V ++D+D P L + S+ +
Sbjct: 136 AGLVPINTLRNAALLAASTPLVAMIDVDLAPSESLGAEVLRPLRLGGGGGGGGSVSELLR 195
Query: 591 HGHGGKKVLVVPAFETQR----------------------------------YRTAFPAS 616
+ V V+PA+ET + + A+
Sbjct: 196 TCSQQRAVWVLPAWETHKRLGREEGTRVAQMALSGDKVTLRRFQADELIHYFAKLAYARG 255
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
HA TNFSRW+++ PY + + ++EP+ + R L P YD RF G+GWNK
Sbjct: 256 HAATNFSRWLDSDVPYTVPYEVNYEPWFIGSRALMPPYDVRFRGYGWNK 304
>gi|405969672|gb|EKC34627.1| Glycosyltransferase 8 domain-containing protein 4 [Crassostrea
gigas]
Length = 367
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 89/188 (47%), Gaps = 13/188 (6%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P +L + + +DTD++F + + ++W FSR Q I L D G W
Sbjct: 142 RLFIPSILT-NVDSLLYVDTDILFLSPLEEIWFHFSRFNSTQLIALAPESEDRQTG--WY 198
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT+LR+ +W + +++ L W DQD+ N
Sbjct: 199 NRFARHPYFGELGVNSGVMLMNLTRLRNSTWLSSIKNYYKEYKLKLTW---GDQDLINVY 255
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCY-TELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
HP +Y PC+WN + D+ +C E ++H + + + N F+
Sbjct: 256 FHFHPDELYVYPCEWNYR-PDHCMYMSVCKGAERGGAYVLHGS---RRAMHNDKQPAFKA 311
Query: 375 LYLTFLEY 382
+YL F Y
Sbjct: 312 VYLAFNAY 319
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 72/148 (48%), Gaps = 14/148 (9%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEV 73
I +++V G S +TLIKS + + K+ + H+ + + LQ ++W S+ + +
Sbjct: 57 IHLSLVVCGDRGNES-ITLIKSAVLFTKSYIIIHIFAEEELHSALQYQLNSWPSLVRKKF 115
Query: 74 SFYLADSVVEDVSWIPNKHYSGVY---GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
++L + D N+ + ++ +L +P +L + + +DTD++F + + +
Sbjct: 116 EYHLYGIIYPD----SNQEWKMLFKPCASQRLFIPSILT-NVDSLLYVDTDILFLSPLEE 170
Query: 131 LWALFSRLRQRQTIGLV----ENQSDWY 154
+W FSR Q I L + Q+ WY
Sbjct: 171 IWFHFSRFNSTQLIALAPESEDRQTGWY 198
>gi|358339091|dbj|GAA47216.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 308
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 4/103 (3%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KHY+G ++KL P ++ + K IVLD D++F DI +LW FS+ + IG V Q
Sbjct: 148 KHYAGYSAMMKLMAPYIVDSNVQKVIVLDVDLLFNDDIKELWKYFSQFEPGEAIGAVCEQ 207
Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
S G+ + +G NTGV+LLD+++LR W W
Sbjct: 208 S----GECQASCSSPAVPPQGINTGVMLLDVSELRAKEWRKMW 246
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 77/174 (44%), Gaps = 11/174 (6%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHL---ITDTVALNILQTLFSTWSVP 69
E+I V G ++ VTL+KSIL Y + HL +T + + + + S
Sbjct: 61 EIIHTFSVLHGASAVYQAVTLLKSIL-YSQGQFKDHLPECLTGWIRHKDINCVHHSPSPR 119
Query: 70 QVEVSFYLAD----SVVEDVSWIPN---KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
Q ++ D ++ D+ I + KHY+G ++KL P ++ + K IVLD D+
Sbjct: 120 QYLNMHWVVDQGSRGLLVDLKLIGHARTKHYAGYSAMMKLMAPYIVDSNVQKVIVLDVDL 179
Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLAD 176
+F DI +LW FS+ + IG V QS P + G L D
Sbjct: 180 LFNDDIKELWKYFSQFEPGEAIGAVCEQSGECQASCSSPAVPPQGINTGVMLLD 233
>gi|332375717|gb|AEE62999.1| unknown [Dendroctonus ponderosae]
Length = 366
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 78/161 (48%), Gaps = 10/161 (6%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VLP+ A + +DTD +F T + +W F++ Q L D + W
Sbjct: 141 RLFLPDVLPDVDA-LLYMDTDTLFLTPVESIWRYFNKFNASQMAALAPEHEDPAIA--WY 197
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R W + I +K+ L W DQDI N I
Sbjct: 198 NRFAKHPYYGKLGVNSGVMLMNLTRMRLFKWTEYVVPIYKKYKLHITW---GDQDIINII 254
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIH 355
HP ++ C++N + D+ +C E + +IH
Sbjct: 255 FHYHPGKLFIYSCRYNYR-PDHCMYSSICKAAEKNGVAVIH 294
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 62/148 (41%), Gaps = 17/148 (11%)
Query: 17 VAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY 76
VA+V G + +TLIKS L + K L+F +I + L S W E +
Sbjct: 57 VAVVACG-DRLNETLTLIKSALMFSKGYLNFIVIAEDKLKENLNEKLSEWK----EFTHN 111
Query: 77 LADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
+ +++ P K S L K L LP VLP+ A + +DTD +F T +
Sbjct: 112 GFSYRILPLNF-PTKDGSEWMKLFKPCAAQRLFLPDVLPDVDA-LLYMDTDTLFLTPVES 169
Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
+W F++ Q L D WY
Sbjct: 170 IWRYFNKFNASQMAALAPEHEDPAIAWY 197
>gi|196011271|ref|XP_002115499.1| hypothetical protein TRIADDRAFT_29667 [Trichoplax adhaerens]
gi|190581787|gb|EDV21862.1| hypothetical protein TRIADDRAFT_29667 [Trichoplax adhaerens]
Length = 365
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 102/266 (38%), Gaps = 79/266 (29%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--SDAEAQQFL---------------- 519
DVTLVA ++++L ++ L + W GP+S+ ++ +DA +F
Sbjct: 34 DVTLVAHCTVNKLHLLLPLAERWNGPMSIVVFAPGNDARMTRFAIDGLRQCSTYIRDYAT 93
Query: 520 -----------SYAGNSEALRSRRNIGY-----------------HVVYKEGNF------ 545
+ G + L + R + Y H Y +G
Sbjct: 94 FHLVYPTALLPPFDGGTGELFTEREMEYANFCPEVIPNRIKALSSHSSYDDGYVRATRYI 153
Query: 546 -YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
YP N LRN+ + + YVF++DID +P GLY L + + + K VVPAF
Sbjct: 154 PYPSNLLRNIGRDFAKSTYVFVIDIDMVPNKGLYHELINFLEEKKLFDSIEKNAYVVPAF 213
Query: 605 ETQRYR------------TAFPASHA-------PTNFSRWVNATT-------PYQIEWAP 638
E R A P H P N+ RW A + Y + W
Sbjct: 214 EINEKRDIPKNKAALREAAARPFYHEVCEKCQRPLNYDRWWKANSNKGVLGPAYTLNWKD 273
Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ + P +D RF +G+N+
Sbjct: 274 PWEPFYIIDSKSPSFDERFKQYGFNR 299
>gi|313232041|emb|CBY09152.1| unnamed protein product [Oikopleura dioica]
Length = 510
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 101/255 (39%), Gaps = 72/255 (28%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS-EALRS----- 530
NDVTL Q S+D+L + + ++GP+S++++ +A +YA ++ LR
Sbjct: 81 NDVTLATQGSLDKLSYLTGIADRFDGPLSVSIFCPGYDA----AYADDAIHMLRKCYPKI 136
Query: 531 RRNIGYHVVY---------------------------------KEGNFYPINTLRNVALN 557
N+ +H+VY E +P N LRNVA +
Sbjct: 137 SENVMFHLVYPAEQPADLAYTTGWRDLECADFIYQLKAFTSTENEAIIFPHNVLRNVARS 196
Query: 558 QVSTPYVFLLDIDFLPMFGL-YPYLKSSIRSMDMHGHGGKKVLVVPAFETQ--------- 607
V + + LLDID P L ++ + + V V+PAFE +
Sbjct: 197 AVHSEFFLLLDIDLAPSLNLRQSFVDYANENNLWEEENDLVVYVLPAFEQKAGTKMPKDK 256
Query: 608 ---------------RYRTAFPASHAPTNFSRWVNATT---PYQIEWAPDFEPYIVAHRD 649
T FP H P + W N + ++ W +EP+ + R
Sbjct: 257 DELINAIEKGNIRPFHNETCFPC-HRPEKYEHWKNTSEKFIAFEAPWEEAWEPFYIGRRS 315
Query: 650 LPRYDTRFVGFGWNK 664
LP YD RF GFG+++
Sbjct: 316 LPYYDERFKGFGFDR 330
>gi|321473905|gb|EFX84871.1| hypothetical protein DAPPUDRAFT_46504 [Daphnia pulex]
Length = 405
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 102/273 (37%), Gaps = 77/273 (28%)
Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL 528
++ +++ ++V L Q S+DRL + HW G ISL +Y+ D E F +
Sbjct: 23 QFHSTAVSHNVCLATQSSLDRLHWLGESSDHWRGAISLAIYLDDEEVGLFYKVLTHMRRC 82
Query: 529 RSR--RNIGYHVVYK--------------------------------------------- 541
S NI +HVV K
Sbjct: 83 HSDLLANIAFHVVSKTETDDGDEGTSASAGHQSTPVNCTGPSEALTWLLKERPSLSAKWE 142
Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
G YP N +RNVA + YV LLD+D +P + + L + + H K VV
Sbjct: 143 SGRPYPQNLMRNVARKGCPSHYVLLLDVDVIPSYNMAESLAEFLDRPSVVAHCTKCAFVV 202
Query: 602 PAFE-----------TQR------------YRTAFPASHAPTNFSRWVNATTPYQIEWAP 638
P +E TQ + AF + +N +RW + Q+ +
Sbjct: 203 PTYELDYSADFPQNKTQLLTLENQGRAQPFHHKAFRYNQFASNLTRWRQTSETDQVVISH 262
Query: 639 D-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ +A + P +D RF+G+G+ +
Sbjct: 263 NVTNYEFFYEPFYIARDEAPPHDERFLGYGFTR 295
>gi|346474050|gb|AEO36869.1| hypothetical protein [Amblyomma maculatum]
Length = 388
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 67/138 (48%), Gaps = 12/138 (8%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +LP+ A I +D DV+F + I +LW F+ + L D WY
Sbjct: 155 RLFLPSMLPDVDA-VIYVDADVLFLSPIEKLWEHFASMNSSHLAALAPESEDYATNWY-- 211
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ + H + LG N+GV+L++LT++RD W + + W DQD+ N
Sbjct: 212 RRFARHPFYQPLG--VNSGVMLMNLTRMRDFGWESRLGPLLHNYSRAISW---GDQDLLN 266
Query: 314 AIISEHPYLVYTLPCQWN 331
+ S HP + PCQWN
Sbjct: 267 ILFSAHPERLLLFPCQWN 284
>gi|348520260|ref|XP_003447646.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Oreochromis niloticus]
Length = 431
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 67/154 (43%), Gaps = 35/154 (22%)
Query: 543 GNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
GN YP N LRN+A + Y+ ++D+D +P L+ + I H +V V+
Sbjct: 215 GNISYPNNLLRNIARGGTESSYILVIDMDMMPSADLHEQFMAMIAK---HEPASDEVFVL 271
Query: 602 PAFETQR-----------------------YRTAFPASHAPTNFSRWVNA--------TT 630
PAFE + Y P APTN+S+WVN+
Sbjct: 272 PAFEIRHARKMPATKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNSHVRETGTLEV 331
Query: 631 PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
Y + W +EP+ + HR +P YD F +G+N+
Sbjct: 332 AYTLTWVDPWEPFYIGHRSVPLYDENFKQYGFNR 365
>gi|384499295|gb|EIE89786.1| hypothetical protein RO3G_14497 [Rhizopus delemar RA 99-880]
Length = 351
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 100/227 (44%), Gaps = 45/227 (19%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL------SYAGNSEALR 529
D+TL ++ DR ++ L H++GPIS T+Y++ + + + +Y N +
Sbjct: 96 SKDITLSTIVTNDRFPVLSRLASHYKGPISATVYVNGNDNKDTILQELSKTYRENPDM-- 153
Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
++ + H++ + N RNVA T Y+ +LDIDF P + + + D
Sbjct: 154 -KKYVDIHLILDAYD-RQFNMWRNVARLYARTDYIMMLDIDFFPCTD---FRQEILNDAD 208
Query: 590 MHGH--GGKKVLVVPAFETQR----------------------------YRTAFPASHAP 619
+ G+ LV+PAFE + A+ H
Sbjct: 209 LLNRLDSGRTALVIPAFEYTHLEDGMDYRHFPRTKADLIQQVQKGKLGMFHRAWQKGHGS 268
Query: 620 TNFSRWVNATTPYQ-IEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
T++ +W NAT Y+ +++ +EPY++ R P D RF+G+G NK
Sbjct: 269 TDYVKWYNATAHYRVVDYNYSYEPYVIFKRQGAPWCDERFIGYGANK 315
>gi|66820721|ref|XP_643936.1| hypothetical protein DDB_G0274741 [Dictyostelium discoideum AX4]
gi|74857595|sp|Q555X4.1|GNT15_DICDI RecName: Full=Glycosyltransferase-like protein gnt15
gi|60472310|gb|EAL70263.1| hypothetical protein DDB_G0274741 [Dictyostelium discoideum AX4]
Length = 516
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 5/123 (4%)
Query: 470 YEASSD--GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-AEAQQFLSYAGNSE 526
Y A+ D D++++ Q ++DR + M+ W PIS +Y++ + + NS
Sbjct: 79 YRANGDLKKYDISIITQFTVDRFDRIAMMADKWRAPISAAVYITSFKDIDEVFKLVRNSF 138
Query: 527 ALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
A+ + H ++ YP+N LRN+AL T + LLD+DF+ G+Y YL S++
Sbjct: 139 AVTEF--VDLHFLFANKTRYPVNNLRNLALRNARTEWCLLLDVDFISPLGMYDYLHSTLE 196
Query: 587 SMD 589
+D
Sbjct: 197 KLD 199
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
P H PTN+SRW ++ PY +++ +EP+++ +R + YD R G+G++K
Sbjct: 379 PKCHGPTNYSRWYLSSEPYLVQYKWIYEPFLLYNRSQIHDYDERLKGYGFDK 430
>gi|384490378|gb|EIE81600.1| hypothetical protein RO3G_06305 [Rhizopus delemar RA 99-880]
Length = 260
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 96/217 (44%), Gaps = 43/217 (19%)
Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS------YAGNSEALRSRRNIGYHV 538
++ +R +++ L H++GPIS ++++D ++ + YA N + R + H+
Sbjct: 5 VTYNRFKVLSRLASHYKGPISAAIHVNDDHTKEHIIKDLHQLYADNPDM---GRYVDLHL 61
Query: 539 VYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
V NF N RNVA T Y+ +LD+DF LK M++ GK
Sbjct: 62 VI--DNFDRQFNMWRNVAKFFARTNYIMMLDVDFYLCTDFRKSLKKQPHLMEL-LRSGKG 118
Query: 598 VLVVPAFE----------------------------TQRYRTAFPASHAPTNFSRWVNAT 629
+VVPAFE + ++ H TN+ +W T
Sbjct: 119 AIVVPAFEYIDESDGEDWNVFPKTKQELLVDVEAEKLDMFHRSWKRGHGSTNYEKWYKQT 178
Query: 630 TPYQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
PY++ E+ +EPY++ ++ P D RFVG+G NK
Sbjct: 179 KPYKVTEYNYSYEPYVIFKKEGTPWCDERFVGYGANK 215
>gi|149411403|ref|XP_001511493.1| PREDICTED: uncharacterized protein C3orf21 homolog [Ornithorhynchus
anatinus]
Length = 276
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 81/179 (45%), Gaps = 18/179 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +S+ + S +YS L + + +++P+ +++ I LD D+ + T+I +L+
Sbjct: 65 FPIVESMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEISQIIQLDLDLKYKTNILELFEE 124
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F + IG+ Y W+ + P P GFN+GV+LLDL +R
Sbjct: 125 FDNFSEGNVIGIAREMQPVYRHTFWQFRQENPKTRVGDPPPDGLPGFNSGVMLLDLEAMR 184
Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
+ + + E + +L L DQD F I EHP L + L C WN QL
Sbjct: 185 Q---SQLYNQLLEPARIQQLTEKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 240
>gi|324518766|gb|ADY47199.1| Glycosyltransferase-like protein [Ascaris suum]
Length = 189
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 59/118 (50%), Gaps = 31/118 (26%)
Query: 574 MFGLYPYLKSSIRS-MDMHGHGGKKVLVVPAFETQRYRTAF-PAS--------------- 616
M GLY ++ + S + + G K VVPAFE++ YR++ PAS
Sbjct: 1 MPGLYELIRKRLSSGLPLKG----KAFVVPAFESKGYRSSVVPASKAELLVMLDTGQVQI 56
Query: 617 ---------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
HA T++ RW T Y + W D+EPY+V R + P YD RFVGFGWNK
Sbjct: 57 FRQDVWIQGHASTDYDRWRTTQTEYSVAWRTDYEPYVVVARNETPPYDNRFVGFGWNK 114
>gi|91090075|ref|XP_969798.1| PREDICTED: similar to GLT8D3 protein [Tribolium castaneum]
Length = 370
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 70/138 (50%), Gaps = 8/138 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD +F T + +W+ F ++ Q L D +G W
Sbjct: 146 RLFLPNILKD-VDSLLYMDTDTLFLTPVENIWSYFHKMNSSQMAALAPEHEDPNVG--WY 202
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R W+ + I K+ L W DQDI N I
Sbjct: 203 NRFAKHPFYGKLGVNSGVMLMNLTRMRAFQWSEYVVPIYRKYKLKITW---GDQDIINII 259
Query: 316 ISEHPYLVYTLPCQWNVQ 333
HP +Y C++N +
Sbjct: 260 FHYHPGKLYLYSCRFNFR 277
>gi|389749357|gb|EIM90534.1| hypothetical protein STEHIDRAFT_50943 [Stereum hirsutum FP-91666
SS1]
Length = 468
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 76/156 (48%), Gaps = 13/156 (8%)
Query: 11 TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP- 69
T E + +++ +S LIKS + Y PL FH+I D A N L+ F P
Sbjct: 104 TLEPVVFSLIMWSESSAIEGALLIKSAIMYTTRPLEFHIICDEEAQNHLEKKFLLLEHPA 163
Query: 70 -QVEVSFY------LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
+ + FY + D + + I H +GV GL+KL + ++LP+++ K+I +DTD
Sbjct: 164 QNILLRFYRVSRQSMTDRIQRE-GGIHTDHSAGVPGLMKLFIHEILPDSVRKSIYVDTDA 222
Query: 123 IFATDIAQLWALFSRLRQRQTIGLV----ENQSDWY 154
F TD A LW F LR I + +N +W+
Sbjct: 223 FFITDPALLWQTFDTLRPSTAISMPTHYDQNSPEWH 258
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 73/173 (42%), Gaps = 38/173 (21%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
I H +GV GL+KL + ++LP+++ K+I +DTD F TD A LW F LR I +
Sbjct: 188 IHTDHSAGVPGLMKLFIHEILPDSVRKSIYVDTDAFFITDPALLWQTFDTLRPSTAISMP 247
Query: 245 ----ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISW--AGFWRIIAEKFL 298
+N +W+ + V+LLDL+KLR + + +R
Sbjct: 248 THYDQNSPEWHHAS-------------KICSCVMLLDLSKLRSLRLMDSSIYRADPTSRY 294
Query: 299 LTRL-------------------WTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
+ L +L DQ F AI+S+ P + L W V
Sbjct: 295 PSALSPPLFRSLFGPPGPEDHYEGVALGDQGYFWAIVSQRPDIFEHLSFDWEV 347
>gi|219115597|ref|XP_002178594.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410329|gb|EEC50259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 417
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 101/242 (41%), Gaps = 64/242 (26%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNSEALRSRRN 533
D TLV QLS DRL M+ K W G +++L + E+ +Q LS +G AL
Sbjct: 111 DFTLVTQLSDDRLAMMRPHCKRW-GKHTISLAIGTNESRDTVEQALSKSGCDTAL----- 164
Query: 534 IGYHVV--YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH 591
I +V + YP+N LRNVA++QV T + ++D DF+ LY L +++
Sbjct: 165 ITLSIVRDFDSDQKYPVNKLRNVAMSQVRTSHAVIIDADFVLSPNLYETLHLHNKTL--- 221
Query: 592 GHGGKKVLVVPAFETQR------------YRTAFPAS----------------------- 616
LV+P+FE ++ Y P +
Sbjct: 222 AADSTNALVIPSFELRKACRRRNRRCITMYSAMVPRNKDGLLELYDPMTEDSAGYGIAQF 281
Query: 617 --------HAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGW 662
HA T ++ W + P + + +EPY+V RDLP + FVG+G
Sbjct: 282 DIRGNYHGHASTRYADWASQPAEQLLPIECVTSDRYEPYLVVRHCRDLPPFQEAFVGYGQ 341
Query: 663 NK 664
NK
Sbjct: 342 NK 343
>gi|323450957|gb|EGB06836.1| hypothetical protein AURANDRAFT_65363 [Aureococcus anophagefferens]
Length = 2492
Score = 72.0 bits (175), Expect = 8e-10, Method: Composition-based stats.
Identities = 52/153 (33%), Positives = 79/153 (51%), Gaps = 11/153 (7%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
D +++GN VT+V Q ++DRL V + + W+GP+SL + + + A A
Sbjct: 427 DAPSTTANGN-VTIVTQSTLDRLWNVGHICERWKGPLSLGMMVKHEDPSAVKRDAKRIMA 485
Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP--YLKSSI 585
+ VV K + YP+N LRN+A ++ +TP++FLLD DF P G Y +L+ +I
Sbjct: 486 ACPEVRVSV-VVSKPEDKYPVNRLRNLAWSKATTPWIFLLDADFWP--GAYARFFLEQAI 542
Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPASHA 618
G + LVVPAF Q + F HA
Sbjct: 543 ARSSF---GPRTALVVPAFAFQPHE--FLPGHA 570
>gi|432861231|ref|XP_004069565.1| PREDICTED: glucoside xylosyltransferase 1-like [Oryzias latipes]
Length = 443
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 95/223 (42%), Gaps = 30/223 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +WAL S Q + + +G W
Sbjct: 208 RLFLPLILKE-VDSLLYVDTDILFLQPVEDIWALLSLFDSSQLAAMAPEHEEPRIG--WY 264
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T+LR+ + W + +K+ L W
Sbjct: 265 NRFARHPYYGKTGVNSGVMLMNMTRLREKYFKNDMTAVALKWEEMLMPLLQKYKLNITW- 323
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N I +P +Y PCQWN + E + I+H N +
Sbjct: 324 --GDQDLLNIIFHHNPESLYVFPCQWNYRPDHCMYGSNCAQAEREGVFILHGN---RGVY 378
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
N F+ +Y T +Y FG NL + L+ E +
Sbjct: 379 HNDKQPAFQAVYKTIQKYQ--------FGENLETSLLKPLEAA 413
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 15/152 (9%)
Query: 16 QVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSF 75
VA+V G +L T++KS + + K P+HF++ + + + TW +V+ F
Sbjct: 123 HVAVVACGPRLEETL-TMVKSAVLFSKKPVHFYIFAEDELHDSFRDALITWPR-EVKAKF 180
Query: 76 YLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATDIA 129
+ P ++ + L K L LP +L E + + +DTD++F +
Sbjct: 181 TFTIYPIT----FPKENANEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLQPVE 235
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+WAL S Q + + +G W N
Sbjct: 236 DIWALLSLFDSSQLAAMAPEHEEPRIG--WYN 265
>gi|242001678|ref|XP_002435482.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
gi|215498818|gb|EEC08312.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
Length = 269
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 79/158 (50%), Gaps = 7/158 (4%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
K G +L LP +L E A + +D+D++F + +LW++F + Q GL +
Sbjct: 85 KELFGPCASQRLFLPSMLSEEDA-VLYVDSDIVFFRPVEELWSIFDNMDDMQLAGLAPDV 143
Query: 248 SDWYLGKLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL 306
D Y G ++ ++ GR G N+GVIL++LT++R G I+ L
Sbjct: 144 ED-YNGSVYMHNWKTRYYGRYGLNSGVILMNLTRMRAY---GLESIVTNLMNKYHSVMKL 199
Query: 307 ADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
DQD+ N + + P +Y LPC+W+V L + R+ C
Sbjct: 200 PDQDLLNMVFHDDPGRLYELPCRWDV-LPPDCRAPASC 236
Score = 38.9 bits (89), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 14/147 (9%)
Query: 14 VIQVAIVCAGYNSTRSLVTL--IKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
V V + CA TR +T+ +KS + + + PL L D + LQ P
Sbjct: 7 VTMVVVTCA----TRLELTMNNLKSAVSFSRAPLRLLLYADVENIKRLQDRVRIREWPSS 62
Query: 72 ---EVSFYLADSVVEDVSWIPN--KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
V++ L +VE P+ K G +L LP +L E A + +D+D++F
Sbjct: 63 VLQRVTYSL--RLVEFPKKSPHNWKELFGPCASQRLFLPSMLSEEDA-VLYVDSDIVFFR 119
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDW 153
+ +LW++F + Q GL + D+
Sbjct: 120 PVEELWSIFDNMDDMQLAGLAPDVEDY 146
>gi|47210788|emb|CAF91098.1| unnamed protein product [Tetraodon nigroviridis]
Length = 457
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 67/150 (44%), Gaps = 34/150 (22%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP N LRNVA + + YV ++DID +P L+ + + I + GH +V V+PAFE
Sbjct: 219 YPNNLLRNVARSGTESTYVLVIDIDMVPSADLHQHFLAMIARHEPAGH---EVFVLPAFE 275
Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNA--------TTPYQI 634
+ Y P APTN+S+WVN+ Y +
Sbjct: 276 IRHARKIPASKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNSHVRETGTLEVAYTV 335
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W +EP+ + +P YD F +G+N+
Sbjct: 336 PWTDPWEPFYIGPNSVPLYDESFKQYGFNR 365
>gi|327289393|ref|XP_003229409.1| PREDICTED: LOW QUALITY PROTEIN: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Anolis
carolinensis]
Length = 407
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 96/256 (37%), Gaps = 75/256 (29%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------------ 513
D+ LV S+ L + L + W+GP S+ ++ A
Sbjct: 92 DMVLVTHTSLGNLHHAQRLAERWQGPFSVAVFAPGAAEVRLAVTMIYAIAALCSPLRQLL 151
Query: 514 ---------EAQQFLS-------YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALN 557
+A F S G+ A SR +H Y YP N LRNVA
Sbjct: 152 RLHLVCHADQAATFFSGDVARLETCGDVFAKLSRIGSRWHN-YGANASYPNNLLRNVAKK 210
Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR------ 608
V + ++D+D +P GL + M G V VVPAFE T+R
Sbjct: 211 AVGDHWALVVDMDMVPSEGLREAFLALRNDETMPG-----VFVVPAFEIRHTRRIPGTKA 265
Query: 609 --------------YRTAFPASHAPTNFSRWVN--ATTP----YQIEWAPDFEPYIVAHR 648
Y P APTN+S+WVN A P Y +EW +EP+ V
Sbjct: 266 ELLRLYQVGETRPFYEELCPRCQAPTNYSQWVNLPAGGPLRVAYDVEWRDPWEPFYVGPP 325
Query: 649 DLPRYDTRFVGFGWNK 664
P YD RF +G+N+
Sbjct: 326 SAPPYDERFKQYGFNR 341
>gi|402225614|gb|EJU05675.1| hypothetical protein DACRYDRAFT_113733 [Dacryopinax sp. DJM-731
SS1]
Length = 455
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 105/227 (46%), Gaps = 42/227 (18%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
S D + +T+ ++ R ++ L H+ GPIS+ + Q LS E S
Sbjct: 132 SPDPSAITITTLVTPSRYPVLSRLASHYNGPISVAVPF-----QSPLSLPSLHEFYTSNP 186
Query: 533 NIGYHV-VYKEGNFYP--INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
I HV ++ N P NT RNVA T + +LD+DF P ++ S +++
Sbjct: 187 AIVEHVDIHLILNSPPRAFNTWRNVARAFARTDTILMLDVDFYPCTPFAQRIQGSQIALN 246
Query: 590 MHGHGGKKVLVVPAFE-----TQRYRTAFPAS-----------------------HAPTN 621
+ + +LV+PAFE +Y + FP + HA TN
Sbjct: 247 LLENNA--MLVLPAFEYASIDAGKYTSLFPVAKRDLLPLLENGEIGQFHGAWRPGHAATN 304
Query: 622 FSRWVNATTP--YQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
+SR+ NAT+ Y++ ++ P +EPY++ R+ P D RF+G+G NK
Sbjct: 305 YSRFANATSGEIYKVKDYHPSYEPYVIFQRENAPWCDERFIGYGANK 351
>gi|301772162|ref|XP_002921506.1| PREDICTED: uncharacterized protein C3orf21 homolog [Ailuropoda
melanoleuca]
Length = 339
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 128 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 187
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 188 FDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 247
Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ G R +A+K+ R L DQD F I EHP L + L C WN QL
Sbjct: 248 QSALYSRLLEPGQVRQLADKY---RFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 304
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 305 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 332
>gi|403418633|emb|CCM05333.1| predicted protein [Fibroporia radiculosa]
Length = 491
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 7/129 (5%)
Query: 25 NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVEVSFYLAD--- 79
+S L+KSI+ Y P+ FH+I D VA L+ + P + V FY
Sbjct: 124 DSAAEGAVLMKSIIMYSSVPIEFHVICDDVAQTFLEKRLRLVTRPARNILVRFYRLSWSS 183
Query: 80 --SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
+ +E I H +G GL+KL + ++LP ++ + I +DTD F TD AQLW F+
Sbjct: 184 MVARIEREGAITTDHSAGTPGLMKLFIHEILPPSVPRAIFIDTDAFFITDPAQLWDAFTH 243
Query: 138 LRQRQTIGL 146
L+ I +
Sbjct: 244 LKPTTAIAM 252
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 37/65 (56%)
Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
+E I H +G GL+KL + ++LP ++ + I +DTD F TD AQLW F+ L+
Sbjct: 188 IEREGAITTDHSAGTPGLMKLFIHEILPPSVPRAIFIDTDAFFITDPAQLWDAFTHLKPT 247
Query: 239 QTIGL 243
I +
Sbjct: 248 TAIAM 252
>gi|443704974|gb|ELU01746.1| hypothetical protein CAPTEDRAFT_195175 [Capitella teleta]
Length = 346
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 21/180 (11%)
Query: 174 LADSVV---EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
LAD+V E S + ++ + + + +V+P + ++I++D D+ F DI L+
Sbjct: 135 LADTVYSMQELFSSVELPYFQNILFHISPAIHRVMPTFVHRSIMIDVDLKFLGDIRLLYD 194
Query: 231 LFSRLRQRQTIGLVENQSDWY--LGKLWKNHKPWPALGR-------GFNTGVILLDLTKL 281
F IG+ Q Y L K +++ P +G GFN+GVILLDL K+
Sbjct: 195 QFRHFNDTHLIGIANEQQPTYRHLFKEYRHQNPRTIIGDPPPNGLPGFNSGVILLDLEKM 254
Query: 282 RDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
R+ + + + EK + L L DQD + + H + Y LPC++N QLS
Sbjct: 255 RN---SHLYNTLLEKEKIEELAKHFMLSGYLGDQDFYTLVNLVHREMFYQLPCEFNRQLS 311
>gi|195996077|ref|XP_002107907.1| hypothetical protein TRIADDRAFT_18847 [Trichoplax adhaerens]
gi|190588683|gb|EDV28705.1| hypothetical protein TRIADDRAFT_18847 [Trichoplax adhaerens]
Length = 329
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 68/136 (50%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L +T I LDTD++F LW F++ Q + + D + W
Sbjct: 105 RLFLPYLLKKT-DSLIYLDTDILFMRPPEDLWNFFNKFNATQLVAVSPEGED--IRTSWY 161
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+ VIL++LT++R +W + +K+ R L DQDI NAI
Sbjct: 162 NKFAQHPYYGKLGINSDVILMNLTRMRKANWNDKVTKVYQKY---RSSIVLGDQDILNAI 218
Query: 316 ISEHPYLVYTLPCQWN 331
HP +VY L C WN
Sbjct: 219 FHFHPEMVYVLTCAWN 234
>gi|391325616|ref|XP_003737327.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Metaseiulus occidentalis]
Length = 381
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/261 (22%), Positives = 94/261 (36%), Gaps = 74/261 (28%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVTLV Q +++ L +E L W GP+S+ ++ +D + + + R N+ +H
Sbjct: 54 DVTLVTQTTLNHLGHIESLAGGWGGPLSVAIFATDLPDVVEAIIQLRTCSSQVRHNVSFH 113
Query: 538 VVY----------------------------------------KEGNFYPINTLRNVALN 557
+VY G+ +P+N LRNV
Sbjct: 114 LVYPLRSKSLSGHVHPSGASYLKDGGCATLVERLSDLSRKTNYDHGDEFPVNLLRNVGRK 173
Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM----HGHGGKKVLVVPAFETQR----- 608
T +VF++D+D GL R + + G +V VVPAFE
Sbjct: 174 NSRTQFVFVVDVDLKVNAGLRDAFVHFARVNSLFTQQYAEGPNEVFVVPAFEALETIQMP 233
Query: 609 ------------------YRTAFPASHAPTNFSRWVNATTP-------YQIEWAPDFEPY 643
Y H T++ W+ A Y + W +EP+
Sbjct: 234 KDKYELVDLLSNNQARPFYTELCHKCHKHTDYETWLKAPVDGNRLEVLYHVTWKDPWEPF 293
Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
AH P YD RF +G+N+
Sbjct: 294 YFAHNSAPLYDERFRQYGFNR 314
>gi|410970721|ref|XP_003991826.1| PREDICTED: xyloside xylosyltransferase 1, partial [Felis catus]
Length = 259
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 48 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 107
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 108 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 167
Query: 283 DISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
G + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 168 QSPLYGRLLEPSQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 224
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 225 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 252
>gi|390597324|gb|EIN06724.1| hypothetical protein PUNSTDRAFT_71944 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 385
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/231 (28%), Positives = 103/231 (44%), Gaps = 42/231 (18%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS----DAEAQQFLSYAGNSEAL 528
S +D+T+ ++ +RLQ + L H++GP+S +++S A A+ + G +
Sbjct: 56 SPQKDDITITTIVTPNRLQRLADLVDHYQGPVSAAVHISTSNATACAEMLTALHGIYRSG 115
Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF--LPMFGLYPYLKSSIR 586
RR + H+V + NT RN+A T YV LLD+DF FG K + R
Sbjct: 116 SLRRYLDVHLVLDPFD-RQFNTWRNIARLFARTDYVMLLDVDFWLCTDFGS-ALKKRASR 173
Query: 587 SMDMHGHGGKKVLVVPAFETQRYRTA-----FP-----------------------ASHA 618
+ GG LV+PAFE + FP H
Sbjct: 174 EVVAKLDGGNAALVIPAFEYTKQEEGRDPSLFPRDKETLLSLVRQERLGMFHRSWAPGHN 233
Query: 619 PTNFSRWVNATTPYQI----EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
T+++R+ A P Q+ ++ +EPY++ +D P + D RFVG+G NK
Sbjct: 234 STDYNRYY-AAPPGQVYRVTQYQAAYEPYVIFKKDGPPWCDERFVGYGGNK 283
>gi|195997643|ref|XP_002108690.1| hypothetical protein TRIADDRAFT_18350 [Trichoplax adhaerens]
gi|190589466|gb|EDV29488.1| hypothetical protein TRIADDRAFT_18350, partial [Trichoplax
adhaerens]
Length = 323
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 80/164 (48%), Gaps = 11/164 (6%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L +T A I LDTD++F LW F +L Q + D G W
Sbjct: 95 RLFLPDILVKTDA-LIYLDTDILFMRPPEDLWQFFDKLNSTQLAAVSPEGEDAPTG--WY 151
Query: 258 N-HKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N + P G G N+G++L++LT++R +W I+ + + + DQDI N +
Sbjct: 152 NKYARHPYYGELGINSGIMLMNLTRMRKANWND---IVLKLYQEYKSGIVFGDQDILNVV 208
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR--SDELCYTELTDLKIIHWN 357
HP ++YT C WN + D+ R ++ + + + ++H N
Sbjct: 209 FHFHPEMLYTFTCDWNYR-PDHCRYGRNDCKHADKNGISLLHGN 251
>gi|281339157|gb|EFB14741.1| hypothetical protein PANDA_010392 [Ailuropoda melanoleuca]
Length = 308
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 98 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 157
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 158 FDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 217
Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ G R +A+K+ R L DQD F I EHP L + L C WN QL
Sbjct: 218 QSALYSRLLEPGQVRQLADKY---RFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 274
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 275 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 302
>gi|410908409|ref|XP_003967683.1| PREDICTED: glucoside xylosyltransferase 1-like [Takifugu rubripes]
Length = 408
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 88/198 (44%), Gaps = 22/198 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD+IF + ++WAL SR + + +G W
Sbjct: 169 RLFLPLILKD-VDSLLYVDTDIIFLQPVEEIWALLSRFNSSHLAAMAPEHEEPRIG--WY 225
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L++LT++R+ + W + +K+ L W
Sbjct: 226 NRFARHPYYGKTGINSGVMLMNLTRIREKLFKNDMTTMLLRWEEILMPLLQKYKLNITW- 284
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N I +P ++ PCQWN + E + I+H N +
Sbjct: 285 --GDQDLLNIIFHHNPESLHVFPCQWNYRPDHCIYGSNCREAEHDGVFILHGN---RGVF 339
Query: 365 KNKHMEFFRNLYLTFLEY 382
+ FR +Y FL++
Sbjct: 340 HDDKQPAFRAVYEAFLKF 357
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 5/149 (3%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
E +A+V G +L T++KS L + PLHF++ + + Q ++W
Sbjct: 83 ERSHLAVVACGPRLEETL-TMLKSALLLSRKPLHFYIFAEEDLHDSFQNTLNSWPR-TTR 140
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
SF + + K +L LP +L + + + +DTD+IF + ++W
Sbjct: 141 FSFTIYPITFPSENGKEWKKLFKPCASQRLFLPLILKD-VDSLLYVDTDIIFLQPVEEIW 199
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
AL SR + + +G W N
Sbjct: 200 ALLSRFNSSHLAAMAPEHEEPRIG--WYN 226
>gi|348513043|ref|XP_003444052.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
[Oreochromis niloticus]
Length = 442
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 93/223 (41%), Gaps = 30/223 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD+IF + +WAL S Q + + +G W
Sbjct: 201 RLFLPLILKE-VDSLLYVDTDIIFLQPVEDMWALLSHFNSSQLAAMAPEHEEPRIG--WY 257
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T+LR+ + W + +K+ L W
Sbjct: 258 NRFARHPYYGKTGINSGVMLMNMTRLREKFFKNDMTTVVLKWEEMLMPLLQKYKLNITW- 316
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N I +P +Y PCQWN + E + I+H N +
Sbjct: 317 --GDQDLLNIIFHHNPESLYVFPCQWNYRPDHCIYGSNCQQAEQEGVFILHGN---RGVY 371
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
+ FR +Y +Y FG NL + L E S
Sbjct: 372 HDDKQPAFRAVYEAIQKYQ--------FGDNLETSLLGPLETS 406
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 67/155 (43%), Gaps = 15/155 (9%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
E +A+V G +T++KS + + K PLHF++ + + +W +V
Sbjct: 113 ERSHLAVVACG-PRLEETITMLKSAVLFSKKPLHFYIFAEDELHKSFRNTLESWPW-RVR 170
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFAT 126
F +S + +++ P ++ L K L LP +L E + + +DTD+IF
Sbjct: 171 ARF---NSTIYPITF-PTENAREWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDIIFLQ 225
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +WAL S Q + + +G W N
Sbjct: 226 PVEDMWALLSHFNSSQLAAMAPEHEEPRIG--WYN 258
>gi|323451610|gb|EGB07487.1| hypothetical protein AURANDRAFT_71805 [Aureococcus anophagefferens]
Length = 529
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 86/179 (48%), Gaps = 11/179 (6%)
Query: 442 DGNDVTLVAQLSMDRLQMVEINPNNV------DEYEASSDGNDVTLVAQLSMDRLQMVEM 495
+G LVA + + L+ + P V D +++GN VT+V Q + DRL +
Sbjct: 21 EGGRPALVASAAPENLRALSEWPGGVPPGCEPDAPSTTANGN-VTIVTQSTTDRLWNLGH 79
Query: 496 LFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVA 555
+ + W+GP+SL + ++ + A A ++ +V + YP+N LRN+A
Sbjct: 80 VCERWKGPLSLGMTVTPEDPGGVTQDAKRIMAACPEVHVSV-LVREPDQKYPVNRLRNLA 138
Query: 556 LNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFP 614
+ +TP++FLLD DF P L+ +I G + LVVPAF + A+P
Sbjct: 139 WAKATTPWIFLLDADFWPGADARFLLEQTIARSSF---GPRTALVVPAFALDLGKPAWP 194
>gi|195028392|ref|XP_001987060.1| GH21705 [Drosophila grimshawi]
gi|193903060|gb|EDW01927.1| GH21705 [Drosophila grimshawi]
Length = 463
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 85/199 (42%), Gaps = 40/199 (20%)
Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
+ D ++ S PN +YS L L L ++ ++L + I+ D D++F +DI L+A F
Sbjct: 232 ITDKLLPYFSSSPNSYYSDSLFFLSLGLHRIADKSLDRAILFDCDIVFRSDIRLLFAEFD 291
Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHKPWPA-----------LGR--------------- 267
R Q GL + Y L++ +P LG
Sbjct: 292 RFLPHQLFGLAPELTPVYRHILYRYRTRFPKSNFGNPYNPIQLGGKKNNENKLVSSRNRD 351
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAII 316
G N+GV+L+ L ++RD + + + +F + RL L DQD F +
Sbjct: 352 RHGYPGLNSGVVLMLLNRIRD---SKTYIEMLTQFEVNRLVAKYSFKGHLGDQDFFTLLG 408
Query: 317 SEHPYLVYTLPCQWNVQLS 335
E+P L+Y L C WN QL
Sbjct: 409 YEYPNLIYRLDCVWNRQLC 427
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 68/150 (45%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQT---------LFSTWSVPQV-EVSFYLADSV 81
S+L + LH H+ITD + L ILQ +++ + V E+ + D +
Sbjct: 177 SLLKHTNAQLHLHIITDKKSEVSVLEILQRKIRKFKRILIYTMYDVKICSEIIQDITDKL 236
Query: 82 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
+ S PN +YS L L L ++ ++L + I+ D D++F +DI L+A F R
Sbjct: 237 LPYFSSSPNSYYSDSLFFLSLGLHRIADKSLDRAILFDCDIVFRSDIRLLFAEFDRFLPH 296
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 297 QLFGLAPELTPVYRHILYRYRTRFPKSNFG 326
>gi|326925929|ref|XP_003209159.1| PREDICTED: uncharacterized protein C3orf21 homolog [Meleagris
gallopavo]
Length = 266
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I +D D+ + +I L+
Sbjct: 55 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEITQIIQVDLDLKYKANIRDLFEE 114
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F ++ IG+ Y W+ + P P GFN+GV+LL+L +R
Sbjct: 115 FDNFQEGAVIGIAREMQPVYRHTFWQYRRENPQTRVGEPPPDGLPGFNSGVLLLNLEAMR 174
Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ G + + EK+ L DQD F + EHP L + L C WN QL
Sbjct: 175 RSKLYNQLLEPGMVQKLTEKY---HFKGHLGDQDFFTMVGMEHPELFHVLDCTWNRQLCT 231
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD + + ++KI H N
Sbjct: 232 WWKDHGYSDVFDQYFRCDGEVKIYHGNC 259
>gi|449509892|ref|XP_002189200.2| PREDICTED: xyloside xylosyltransferase 1 [Taeniopygia guttata]
Length = 313
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 84/200 (42%), Gaps = 36/200 (18%)
Query: 169 GRGFYLADSVVEDVSWIP----------NKHYSGVYG--------LLKLTLPKVLPETLA 210
GRG +V DV+ + KH+S G L + + +++P+ +
Sbjct: 81 GRGLLRHPVIVHDVNELTEKLFPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEIT 140
Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKP 261
+ I +D D+ + T+I L+ F + IG+ Y W K +P
Sbjct: 141 RIIQVDLDLKYKTNIRDLFDEFDNFPEGAVIGIAREMQPVYRHTFWQYRRENPQTKVGEP 200
Query: 262 WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAI 315
P GFN+GV+LL+L +R + + + E ++ +L L DQD F +
Sbjct: 201 PPDGLPGFNSGVLLLNLEAMRQ---SKLYNQLLEPTMVQKLTEKYHFKGHLGDQDFFTMV 257
Query: 316 ISEHPYLVYTLPCQWNVQLS 335
EHP L + L C WN QL
Sbjct: 258 GMEHPELFHVLDCTWNRQLC 277
>gi|156379954|ref|XP_001631720.1| predicted protein [Nematostella vectensis]
gi|156218765|gb|EDO39657.1| predicted protein [Nematostella vectensis]
Length = 389
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 85/185 (45%), Gaps = 19/185 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+L LP +L E + I +D D +F + + LW FS Q + + + WY
Sbjct: 170 RLFLPDILTE-VDSLIYMDIDTLFLSPVQWLWDQFSNFNSSQLASMTPEGEVSATGWY-- 226
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ H P G+ G N+GV+L++LT++R W I K+ W DQDI
Sbjct: 227 NRFARH---PYYGKLGLNSGVMLMNLTRMRSFGWQEKILPIYNKYRFDITW---GDQDIL 280
Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD-LKIIHWNSPKKLKVKNKHMEF 371
N + HP LVY L C+WN + +D+ C T T+ + I+H N + N
Sbjct: 281 NILFHYHPELVYVLSCEWNYR-NDHCIYGNNCKTADTNGIYILHGN---RGVFTNNKQPV 336
Query: 372 FRNLY 376
FR +Y
Sbjct: 337 FRAVY 341
>gi|47224410|emb|CAG08660.1| unnamed protein product [Tetraodon nigroviridis]
Length = 319
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD+IF + ++WAL S Q + + +G W
Sbjct: 89 RLFLPMILKE-VDSLLYVDTDIIFLQPVEEIWALLSHFNSSQLAAMAPEHEEPRIG--WY 145
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L++LT+LR+ + W + +K+ L W
Sbjct: 146 NRFARHPYFGKTGINSGVMLMNLTRLREKLFKNDMTTVLLRWDEILMPLLQKYKLNITW- 204
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 205 --GDQDLLNIIFYHNPESLHVFPCQWNYRPDHCIYGSNCKEAEHEGVFILHGN 255
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 5/150 (3%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQV 71
E VA+V G +L T++KS + PLHF++ + + Q ++W Q
Sbjct: 1 ERSHVAVVACGPRLEETL-TMLKSAVLLSVKPLHFYIFAEDDLHDSFQNSLNSWPRTVQT 59
Query: 72 EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
SF + + K +L LP +L E + + +DTD+IF + ++
Sbjct: 60 RFSFTIYPITFPSENGKEWKKLFKPCASQRLFLPMILKE-VDSLLYVDTDIIFLQPVEEI 118
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
WAL S Q + + +G W N
Sbjct: 119 WALLSHFNSSQLAAMAPEHEEPRIG--WYN 146
>gi|405972582|gb|EKC37344.1| Uncharacterized protein C3orf21-like protein [Crassostrea gigas]
Length = 371
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 21/156 (13%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLG 253
L + L +V+P L K I+LD D+ F DI L+ +F +R IG+ ENQ + +L
Sbjct: 180 FLSIVLHRVIP--LDKVIMLDVDLKFDEDIRLLYNIFDEFSERSIIGIARENQPVYRHLF 237
Query: 254 KLWKNHKPWPALG-------RGFNTGVILLDLTKLRD-------ISWAGFWRIIAEKFLL 299
++ P +G GFN+GV+LLDL K+R I + +K+L
Sbjct: 238 HQYRGKNPSTRVGAPPPDGLTGFNSGVLLLDLEKMRKSQVYNSLIDSPKTLETLTQKYLF 297
Query: 300 TRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
L DQD + + EH L + L C WN QL
Sbjct: 298 K---GHLGDQDFYTLVSMEHEDLFHILLCSWNRQLC 330
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 67/150 (44%), Gaps = 15/150 (10%)
Query: 33 LIKSILFYRKNPLHFHLITD----TVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWI 88
L++SI L FH+I D T+A +I++ T + + + V E +S +
Sbjct: 103 LVESISRKSSISLCFHVICDEDGKTIATSIIERAAPTETQIIIHDITIMTQKVKEQISQM 162
Query: 89 -------PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
+Y L + L +V+P L K I+LD D+ F DI L+ +F +R
Sbjct: 163 QTFFSHSSKSYYGDPLFFLSIVLHRVIP--LDKVIMLDVDLKFDEDIRLLYNIFDEFSER 220
Query: 142 QTIGLV-ENQSDW-YLGKLWKNHKPWPALG 169
IG+ ENQ + +L ++ P +G
Sbjct: 221 SIIGIARENQPVYRHLFHQYRGKNPSTRVG 250
>gi|195122692|ref|XP_002005845.1| GI20695 [Drosophila mojavensis]
gi|193910913|gb|EDW09780.1| GI20695 [Drosophila mojavensis]
Length = 463
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 86/204 (42%), Gaps = 45/204 (22%)
Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
+ D + S PN +YS + L L ++ +L++ I+ D D++F +DI L+A F
Sbjct: 228 ITDKLSPYFSSAPNSYYSDSLFFMSLGLHRIADASLSRAILFDCDIVFRSDIRLLFAEFD 287
Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHKPWP-----------ALGR--------------- 267
R Q GL + Y L++ +P +G+
Sbjct: 288 RFLPNQLFGLAPELTPVYRHILYRYRTQFPESNFGNPYQPIQIGKNKNNNNKNKNNKLLT 347
Query: 268 ----------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDI 311
G N+GV+L+ L ++RD S + I ++ + RL T L DQD
Sbjct: 348 SRSQERHGYPGLNSGVVLILLNRIRDSST---YIEILTQYEINRLVTKYSFKGHLGDQDF 404
Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
F + E+P L+Y L C WN QL
Sbjct: 405 FTLLGYEYPTLIYRLDCVWNRQLC 428
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 69/160 (43%), Gaps = 28/160 (17%)
Query: 33 LIKSILFYRKNPLHFHLITD----TVALNILQT---------LFSTWSVPQVEVSFYLAD 79
I S+ + LH H+ITD L ILQ ++S +++ +
Sbjct: 170 FIHSLFKHTNAQLHLHVITDKESEVAVLEILQRKVRRFKRVLIYSMYNIQ-------ICS 222
Query: 80 SVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
++++D+ S PN +YS + L L ++ +L++ I+ D D++F +DI L
Sbjct: 223 NIIQDITDKLSPYFSSAPNSYYSDSLFFMSLGLHRIADASLSRAILFDCDIVFRSDIRLL 282
Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
+A F R Q GL + Y L++ +P G
Sbjct: 283 FAEFDRFLPNQLFGLAPELTPVYRHILYRYRTQFPESNFG 322
>gi|363737325|ref|XP_422707.3| PREDICTED: uncharacterized protein C3orf21 homolog [Gallus gallus]
Length = 339
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I +D D+ + +I L+
Sbjct: 128 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEITQIIQVDLDLKYKANIRDLFEE 187
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F ++ IG+ Y W+ + P P GFN+GV+LL+L +R
Sbjct: 188 FDNFQEGAVIGIAREMQPVYRHTFWQYRRENPQTRVGEPPPDGLPGFNSGVLLLNLEAMR 247
Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ G + + EK+ L DQD F + EHP L + L C WN QL
Sbjct: 248 RSKLYNQLLEPGMVQKLTEKYHFK---GHLGDQDFFTMVGMEHPELFHVLDCTWNRQLCT 304
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD + + ++KI H N
Sbjct: 305 WWKDHGYSDVFDQYFRCEGEVKIYHGNC 332
>gi|21594165|gb|AAH31419.1| Expressed sequence AI480653 [Mus musculus]
Length = 379
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 168 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 227
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 228 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 287
Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
+ SW + +A+K+ L DQD F I EHP L + L C WN Q
Sbjct: 288 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 341
Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
L D+ SD + + +KI H N
Sbjct: 342 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 372
>gi|12805437|gb|AAH02191.1| B3gnt1 protein, partial [Mus musculus]
Length = 241
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 67/148 (45%), Gaps = 34/148 (22%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP N LRN+A + + Y ++D+D +P GL+ L+ +D H LVVPAFE
Sbjct: 33 YPNNLLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFE 87
Query: 606 TQRYRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEW 636
+R R HAPTN+SRWVN Y + W
Sbjct: 88 IRRSRRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPW 147
Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 148 RDPWEPFYVAGGKVPTFDERFRQYGFNR 175
>gi|390349079|ref|XP_789869.3| PREDICTED: glucoside xylosyltransferase 1-like [Strongylocentrotus
purpuratus]
Length = 582
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 92/207 (44%), Gaps = 19/207 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +LP + + +DTD++F + ++++ F + Q L +G W
Sbjct: 351 RLFLPDLLP-NVDSLLYVDTDILFIRPVEEIYSFFKQFNSTQMAALAPEHEAKNIG--WY 407
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G G N+GV+L++LT++R W + ++ L W DQD+ N +
Sbjct: 408 NRFARHPYYGETGMNSGVMLMNLTRMRRFGWTQTIVPLYHEYKLKITW---GDQDLINIL 464
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNL 375
++P ++ C+WN + S+ T + +IH N + N FR +
Sbjct: 465 FHDYPQGLFPYACEWNYRPDHCMYSNNCHRTNEHGIAVIHGN---RGVYHNMKQPAFRAI 521
Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTLQ 402
Y +F +Y FG +L + ++
Sbjct: 522 YESFRDY--------TFGADLEKGLIE 540
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 69/155 (44%), Gaps = 16/155 (10%)
Query: 14 VIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLIT-DTVALNILQTLFSTWSVPQVE 72
VI +++VC G + +LV ++KS + +HFH+ D + I Q L + SV + +
Sbjct: 263 VINLSVVCCGDRTNETLV-MLKSATVLTQQLMHFHIFAEDELQAGIAQRLNAWPSVYRRK 321
Query: 73 VSFYLADSVVEDVSWIPN------KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
+F L +S+ P K +L LP +LP + + +DTD++F
Sbjct: 322 FAFSL-----HPISFPPGEDPAKWKKVFKTCATQRLFLPDLLP-NVDSLLYVDTDILFIR 375
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ ++++ F + Q L +G W N
Sbjct: 376 PVEEIYSFFKQFNSTQMAALAPEHEAKNIG--WYN 408
>gi|256355176|ref|NP_941028.2| xyloside xylosyltransferase 1 [Mus musculus]
gi|341940318|sp|Q3U4G3.2|XXLT1_MOUSE RecName: Full=Xyloside xylosyltransferase 1; AltName:
Full=UDP-xylose:alpha-xyloside
alpha-1,3-xylosyltransferase
Length = 392
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300
Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
+ SW + +A+K+ L DQD F I EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354
Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
L D+ SD + + +KI H N
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385
>gi|125829752|ref|XP_694206.2| PREDICTED: uncharacterized protein C3orf21 homolog [Danio rerio]
gi|326680849|ref|XP_002663856.2| PREDICTED: uncharacterized protein C3orf21 homolog [Danio rerio]
Length = 387
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 18/178 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P L + + LD D+ F ++I L+
Sbjct: 176 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHRIMPAELKRIVQLDLDLKFRSNIRDLFQE 235
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
F+ IG+ Y W+ K P P GFN+GV+LL+L +R
Sbjct: 236 FNHFPSDAVIGIAREMQPVYRHTFWQYRKENPKSRVGDPPPDGLPGFNSGVMLLNLDSMR 295
Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + + E + RL L DQD F I EH L Y L C WN QL
Sbjct: 296 R---STVYNQLLEPERVARLADKYHFRGHLGDQDFFTMIGMEHTELFYPLTCGWNRQL 350
>gi|74178412|dbj|BAE32468.1| unnamed protein product [Mus musculus]
Length = 392
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300
Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
+ SW + +A+K+ L DQD F I EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354
Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
L D+ SD + + +KI H N
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385
>gi|242214080|ref|XP_002472865.1| predicted protein [Postia placenta Mad-698-R]
gi|220728071|gb|EED81973.1| predicted protein [Postia placenta Mad-698-R]
Length = 492
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 76/150 (50%), Gaps = 10/150 (6%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
+EP P E + +++ +S L+KSIL Y P+ FH++ D A L+
Sbjct: 123 KEPPKPP-EAVVFSLIMFSEDSAVEGAMLMKSILMYTSMPVEFHIVCDEAAQAYLENRVR 181
Query: 65 TWSVP--QVEVSFYL------ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 116
+ P + V FYL AD + + + I H +GV GL+KL + ++LP ++ + I
Sbjct: 182 LITRPAHDILVRFYLLSWQSMADRIDREGT-IMTDHSAGVPGLMKLFIHEILPPSVPRAI 240
Query: 117 VLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
+DTD +F +D A LW F++L I +
Sbjct: 241 FIDTDALFISDPALLWDRFAQLAPGAAIAI 270
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 36/59 (61%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 243
I H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW F++L I +
Sbjct: 212 IMTDHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFAQLAPGAAIAI 270
>gi|270013726|gb|EFA10174.1| hypothetical protein TcasGA2_TC012364 [Tribolium castaneum]
Length = 326
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 7/121 (5%)
Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFNTG 272
+DTD +F T + +W+ F ++ Q L D +G W N P G+ G N+G
Sbjct: 118 MDTDTLFLTPVENIWSYFHKMNSSQMAALAPEHEDPNVG--WYNRFAKHPFYGKLGVNSG 175
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
V+L++LT++R W+ + I K+ L W DQDI N I HP +Y C++N
Sbjct: 176 VMLMNLTRMRAFQWSEYVVPIYRKYKLKITW---GDQDIINIIFHYHPGKLYLYSCRFNF 232
Query: 333 Q 333
+
Sbjct: 233 R 233
>gi|322792843|gb|EFZ16676.1| hypothetical protein SINV_09024 [Solenopsis invicta]
Length = 359
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL +T A + +DTD +F ++W F R+ Q L D G W
Sbjct: 136 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKIWDEFKRMNSSQLAALSPEHEDPNTG--WY 192
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 193 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDYVIPIHKEYKLKITW---GDQDIINII 249
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 250 FHYHPEKLYVYSCRYN 265
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 65/155 (41%), Gaps = 19/155 (12%)
Query: 11 TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
T EV +VC + +T++KS L + PL F ++ + + S W +
Sbjct: 46 TSEVTICVVVCG--DRLHEALTMLKSALVFTSRPLQFIILAEQNLIPAFDEKLSEWRL-- 101
Query: 71 VEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKTIVLDTDVI 123
+S D +V +++ P+ ++ L +L LP VL +T A + +DTD +
Sbjct: 102 --ISNKTFDFLVRPITF-PDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-VLYVDTDTL 157
Query: 124 FATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
F ++W F R+ Q L D WY
Sbjct: 158 FLAPPEKIWDEFKRMNSSQLAALSPEHEDPNTGWY 192
>gi|293352262|ref|XP_344028.3| PREDICTED: xyloside xylosyltransferase 1-like [Rattus norvegicus]
Length = 392
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 81/179 (45%), Gaps = 18/179 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 240
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300
Query: 283 DISWAGF-----W-RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
G W + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 301 QSPLYGHLLEPAWVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 356
>gi|66811776|ref|XP_640067.1| hypothetical protein DDB_G0282469 [Dictyostelium discoideum AX4]
gi|74855026|sp|Q54SH2.1|GNT13_DICDI RecName: Full=Glycosyltransferase-like protein gnt13
gi|60468084|gb|EAL66094.1| hypothetical protein DDB_G0282469 [Dictyostelium discoideum AX4]
Length = 635
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 8/111 (7%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-----AEAQQFLSYAGNS-EALRSRR 532
+T+V Q ++DRL V M+ + W+ P+S+ +++ D AE ++ + N AL
Sbjct: 112 ITIVTQTTVDRLSKVSMMAEKWKAPLSVAVFIKDSENPKAEIEKLEKHIENDYPALSYYS 171
Query: 533 NIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
+I H++ YP+N LRN+A+ T VF++D DFLP GL+ Y+ S
Sbjct: 172 DI--HILISNKTRYPVNNLRNLAIEHSRTDLVFIMDADFLPPLGLHDYILS 220
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 37/50 (74%), Gaps = 3/50 (6%)
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
H+PTNF +W++A PY++E+ FEPY+V ++ ++P +D R G+G++K
Sbjct: 505 HSPTNFEKWMDAVEPYEVEYKWIFEPYLVFNKTQNIP-FDERLKGYGFDK 553
>gi|357614662|gb|EHJ69200.1| putative N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Danaus
plexippus]
Length = 459
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 100/267 (37%), Gaps = 73/267 (27%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA----- 522
D Y +S V L Q S++RL + + HW GPIS+ ++++ E + ++
Sbjct: 107 DMYADASSNRRVCLATQSSIERLHELLGIAAHWTGPISVAVFVAGDELRLLRAFEMWLFR 166
Query: 523 --------------------------------GNSEALRSRRNIGYHVVYKEGNFYPINT 550
N L + V ++ + YP N
Sbjct: 167 CQPDVYARLALHVAMPAERPGVQSNVPNWARNCNVAPLATEERKSETVAWRARHPYPQNH 226
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
LRN+A TPYVFL+D+D +P G+ L S + S+ VVP +E +
Sbjct: 227 LRNLARKNCHTPYVFLVDVDIVPSRGMAEALDSFLSSVP---KCQMCAYVVPTYELDKRV 283
Query: 611 TAFPASHA------------------------PTNFSRWV----NATTPYQIEWAPD--- 639
FP + + +NFSRW N + I
Sbjct: 284 ANFPTNKSELLRLSRNKLAIPFHRKVFIYNQYASNFSRWESSGGNESLSTHISHTVTNFE 343
Query: 640 --FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF+G+G+ +
Sbjct: 344 LLYEPFYVAPDTVPAHDERFLGYGFTR 370
>gi|293340644|ref|XP_002724719.1| PREDICTED: xyloside xylosyltransferase 1-like [Rattus norvegicus]
Length = 391
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 74/162 (45%), Gaps = 18/162 (11%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
+YS L + + +++P+ + + I LD D+ + T+I +L+ F IG+
Sbjct: 197 YYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEEFDNFLPGAVIGIAREMQ 256
Query: 249 DWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLRDISWAGF-----W-RII 293
Y W+ H+ P P GFN+GV+LL+L +R G W + +
Sbjct: 257 PVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMRQSPLYGHLLEPAWVQQL 316
Query: 294 AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 317 ADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 355
>gi|307196398|gb|EFN77987.1| Glycosyltransferase 8 domain-containing protein 3 [Harpegnathos
saltator]
Length = 367
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 68/136 (50%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL +T A + +DTD +F ++W F ++ Q L D G W
Sbjct: 145 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 201
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 202 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDYVIPIHKEYRLKITW---GDQDIINII 258
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 259 FHYHPEKLYVYSCRYN 274
>gi|354466014|ref|XP_003495471.1| PREDICTED: uncharacterized protein C3orf21 homolog, partial
[Cricetulus griseus]
Length = 254
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 18/179 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + ++LP+ + + I LD D+ + T+I +L+
Sbjct: 43 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHQILPKEILRIIQLDLDLKYKTNIRELFEE 102
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 103 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 162
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 163 QSPLYSHLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 218
>gi|307186594|gb|EFN72111.1| Glycosyltransferase 8 domain-containing protein 3 [Camponotus
floridanus]
Length = 368
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 88/196 (44%), Gaps = 23/196 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL +T A + +DTD +F ++W F ++ Q L D G W
Sbjct: 145 RLFLPTVLNDTDA-VLYVDTDTLFLAPPDKVWDEFKKMNSSQLAALSPEHEDPNTG--WY 201
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W I +++ L W DQDI N I
Sbjct: 202 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDHVIPIHKEYKLKITW---GDQDIINII 258
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII------HWNSPKKLKVKNKHM 369
HP +Y C++N + D+ +C D ++ ++SPK+
Sbjct: 259 FHHHPEKLYVYSCRYNYR-PDHCMYMSVCAEAEKDGALVLHGSRGTFHSPKQPP------ 311
Query: 370 EFFRNLYLTFLEYDGN 385
FR +Y EY N
Sbjct: 312 --FRAVYRAMQEYPLN 325
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 62/154 (40%), Gaps = 17/154 (11%)
Query: 11 TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
T EV +VC + +T++KS L + PL F ++ D + S W +
Sbjct: 55 TSEVTICVVVCG--DRLHEALTMLKSALVFANRPLQFVILADNNLIPAFSEKLSEWRL-- 110
Query: 71 VEVSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIF 124
+S D +V +++ + L K L LP VL +T A + +DTD +F
Sbjct: 111 --ISNKTFDFLVRPITFPDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-VLYVDTDTLF 167
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
++W F ++ Q L D WY
Sbjct: 168 LAPPDKVWDEFKKMNSSQLAALSPEHEDPNTGWY 201
>gi|74147544|dbj|BAE38667.1| unnamed protein product [Mus musculus]
Length = 392
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 93/211 (44%), Gaps = 30/211 (14%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300
Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
+ +W + +A+K+ L DQD F I EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPAWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354
Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
L D+ SD + + +KI H N
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385
>gi|194222714|ref|XP_001499067.2| PREDICTED: uncharacterized protein C3orf21 homolog [Equus caballus]
Length = 281
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 70 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 129
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ H+ P P GFN+GV+LL+L +R
Sbjct: 130 FDNFLPGAIIGIAHEMQPVYRHTFWQFRHENPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 189
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 190 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 246
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + ++I H N
Sbjct: 247 WWRDHGYSDVFEAYFRCEGHVRIYHGNC 274
>gi|428179147|gb|EKX48019.1| hypothetical protein GUITHDRAFT_136980 [Guillardia theta CCMP2712]
Length = 946
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 96/247 (38%), Gaps = 63/247 (25%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
VTLV QLS ++ + +L W GP + + +++ +F + G E +S +I V
Sbjct: 32 VTLVTQLSWNKFSSLLVLRPRWAGPCVAAVLVPSSQSSEF--HEGVQELQQSSHDITQIV 89
Query: 539 VYKEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS---------- 584
E YPIN +RN+AL T VFL D+D LP L+
Sbjct: 90 AVWEDELHFERYPINAMRNIALAHAGTDLVFLCDVDQLPSVTFSALLQQEDILMLLWRWC 149
Query: 585 --------IRSMDMHGHGG---------------KKVLVVPAFETQR---YRT-AFPASH 617
+ ++++ GG KK+L E R ++T FP H
Sbjct: 150 CVERNAIVVPALELKKGGGEAISLEDVLGDYEMAKKLLHEEDLEKTRVIGFQTEKFPTGH 209
Query: 618 APTNFSR-WVNATT--PYQIEWAPDFEPYIVAHRD-----------------LPRYDTRF 657
A W + Y IE+ FEPY++ R +P YD R+
Sbjct: 210 AAERVRELWTACESIEGYMIEYEEGFEPYVIVSRQSFLLFLAPSAEYCSRWMVPAYDERY 269
Query: 658 VGFGWNK 664
G+G NK
Sbjct: 270 EGYGRNK 276
>gi|344240257|gb|EGV96360.1| Arf-GAP with coiled-coil, ANK repeat and PH domain-containing
protein 2 [Cricetulus griseus]
Length = 852
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 80/178 (44%), Gaps = 18/178 (10%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + ++LP+ + + I LD D+ + T+I +L+
Sbjct: 641 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHQILPKEILRIIQLDLDLKYKTNIRELFEE 700
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLR 282
F IG+ Y W ++ P +G GFN+GV+LL+L +R
Sbjct: 701 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 760
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 761 QSPLYSHLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQL 815
>gi|302836423|ref|XP_002949772.1| hypothetical protein VOLCADRAFT_80793 [Volvox carteri f.
nagariensis]
gi|300265131|gb|EFJ49324.1| hypothetical protein VOLCADRAFT_80793 [Volvox carteri f.
nagariensis]
Length = 415
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 98/250 (39%), Gaps = 56/250 (22%)
Query: 470 YEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-------------SDAEAQ 516
+ + S G VTLV QLS +RL ++ W GP++ LY+ S + +
Sbjct: 70 WSSKSLGIPVTLVTQLSAERLDQLKAQCATWGGPLAAALYVPLHNPSATELLETSRQKLE 129
Query: 517 QFLSYA----------GNSEALRSRRNIGYHVVY--KEGNFYPINTLRNVALNQVSTPYV 564
+++A N + R + Y + + YP+N+LRN A T +
Sbjct: 130 AMIAHADELFRNSENNNNGHGCQLRLALIYELFADPRAQILYPVNSLRNYARLMADTDLI 189
Query: 565 FLLDIDFLPMFGLYPYLK--SSIRSMDMHGHGGKKVLVVPAFET---------------- 606
+D+D +P + L ++ G V V+PAFET
Sbjct: 190 ANIDVDMVPSLSISTALAEPQTLAEYTRACRSG-AVYVLPAFETHCGGTSYADSVAVDGK 248
Query: 607 -----------QRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYD 654
+R R P SH T + RW Y I + P FEP+ ++ R YD
Sbjct: 249 KAVIQGIKRCVRRMRPKAPFSHNSTKYERWFATDEAYSITYGPLFEPWFISWRWGTLWYD 308
Query: 655 TRFVGFGWNK 664
R+ G+G NK
Sbjct: 309 YRYRGYGKNK 318
>gi|353232724|emb|CCD80079.1| putative tata-binding protein-associated phosphoprotein
[Schistosoma mansoni]
Length = 116
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 7/93 (7%)
Query: 212 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNT 271
IVLDTD++ DI+ LW+LF + Q IG+ Q+ + + + WP LG G+N+
Sbjct: 4 VIVLDTDILLNDDISNLWSLFDEFNKDQCIGIAAEQNPTFYSNM--GQRFWPTLGYGYNS 61
Query: 272 GVILLDLTKLRDISWAGFW-----RIIAEKFLL 299
G++L+ LTKLR W W +I EK +L
Sbjct: 62 GILLIHLTKLRARGWDEIWMKTAFNLIEEKRIL 94
Score = 46.6 bits (109), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Query: 115 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
IVLDTD++ DI+ LW+LF + Q IG+ Q+ + + + WP LG G+
Sbjct: 4 VIVLDTDILLNDDISNLWSLFDEFNKDQCIGIAAEQNPTFYSNM--GQRFWPTLGYGY 59
>gi|428163534|gb|EKX32600.1| hypothetical protein GUITHDRAFT_148516 [Guillardia theta CCMP2712]
Length = 690
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 119/268 (44%), Gaps = 64/268 (23%)
Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDR-LQMVEMLFKHWEGPISLTLYMSDAEAQ 516
Q E+ P+ + +++L + S+DR +QM+ ++ + W+GP+SL Y E +
Sbjct: 155 QSFEVPPSLQCCSSSHEQDLNISLASIGSIDRSIQMLHVMLQ-WDGPVSLVFYARSEEEE 213
Query: 517 QFLSYAGNS----EALRSRRNIGYHVV---YKEGNF-----YPINTLRNVALNQVSTPYV 564
+F+S A+RS R + ++ + +G+ +P+N LR+VA+++ T V
Sbjct: 214 RFISAFVEETMIPAAVRSCRELNLQLLSDCHDQGSPERVYPFPVNKLRSVAVSRAKTDLV 273
Query: 565 FLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF-------------------- 604
+D+DF+P L+ ++ S G +LV+PAF
Sbjct: 274 LYIDVDFIPSADAARQLQQAMSSPSFDQAAG--LLVLPAFLTVWSSPWPSPMRVEDEGMQ 331
Query: 605 -------ETQR-------YRTAFP---ASHAPTNFSRW----VNATTPYQIEWAPDFEPY 643
E +R Y P + H T++ +W N T P+Q+ ++ +EPY
Sbjct: 332 VEPLHLDELKRRCIAGDVYPPEMPFRMSPHGATDYLKWFHMPANRTEPFQVLYSMWYEPY 391
Query: 644 IVAHRDLPR-------YDTRFVGFGWNK 664
V +R L R +D RFV G +K
Sbjct: 392 FVVNRRLWRGINHPGLFDERFVDAGPDK 419
>gi|126338874|ref|XP_001379360.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Monodelphis domestica]
Length = 441
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 106/268 (39%), Gaps = 90/268 (33%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNS---EALRS 530
+ V L S+ L V L + W GP+++ ++ + E Q L+YA + EA+R
Sbjct: 118 DAVLLATHASLSNLGHVGTLLERWAGPLAVAVFAARPEEAQLATVLTYALGALCTEAVRD 177
Query: 531 RRNIGYHVV------------YKEGNF------------------------------YPI 548
R + H+V + G F YP
Sbjct: 178 R--LTMHLVCPARTEAAVPDPREPGEFARISSCTDVFTSMGARSRSGRNYALATNVSYPN 235
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG-GKKVLVVPAFETQ 607
N LRNVA + ++D+D +P GL+P +R + G G + LVVPAFE +
Sbjct: 236 NLLRNVAREGAG--HALVVDVDMVPSEGLWP----GLRDLLAQGEGKAGRALVVPAFEIR 289
Query: 608 R-----------------------YRTAFPASHAPTNFSRWVNATTP--------YQIEW 636
+ Y P APTN++RW+ T P Y++ W
Sbjct: 290 QGVRVPRTKIELMRFLKMGHVRPFYAELCPRCQAPTNYTRWL--TLPPGGPLRPAYRVTW 347
Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P ++ RF +G+N+
Sbjct: 348 QDPWEPFYVAGGAVPSFNERFRQYGFNR 375
>gi|307104338|gb|EFN52592.1| hypothetical protein CHLNCDRAFT_138618 [Chlorella variabilis]
Length = 308
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 67/162 (41%), Gaps = 34/162 (20%)
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
H + YP N +RN AL T V LLD DF+P + + S G +
Sbjct: 68 HCDAHSASLYPANAVRNRALRLAETEAVILLDADFVPSASMAARMSDQGYSAATLGLLQQ 127
Query: 597 KV-LVVPAFET------------QRYRTA--------------------FPASHAPTNFS 623
++ LV+PAF+T Q R+ F H T+F
Sbjct: 128 RLALVLPAFQTVGGGEEDAELALQVARSGKAALLEQCEVGELESFHSGHFSKGHRATDFG 187
Query: 624 RWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
RW A Y++E+ FEPYI+ HR +P YD R G+ NK
Sbjct: 188 RWATAANVYEVEFQTGFEPYILVHRKHVPWYDERCRGYFVNK 229
>gi|66535277|ref|XP_624895.1| PREDICTED: glucoside xylosyltransferase 1-like [Apis mellifera]
Length = 362
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL T+ + +DTD +F ++W F ++ Q L D G W
Sbjct: 142 RLFLPSVL-NTIDAVLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 198
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 199 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKQYKLKITW---GDQDIINII 255
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 256 FHYHPEKLYVYSCRYN 271
>gi|115496384|ref|NP_001069467.1| uncharacterized protein C3orf21 homolog [Bos taurus]
gi|112180740|gb|AAI14705.2| C1H3ORF21 protein [Bos taurus]
Length = 393
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ N K P P GFN+GV+LL+L +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD + + +KI H N
Sbjct: 359 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 386
>gi|332022944|gb|EGI63210.1| Glycosyltransferase 8 domain-containing protein 3 [Acromyrmex
echinatior]
Length = 367
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL +T A + +DTD +F ++W F ++ Q L D G W
Sbjct: 144 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKVWDNFKKMNSSQLAALSPEHEDPNTG--WY 200
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R W + I +++ L W DQDI N I
Sbjct: 201 NRFAKHPYYGKLGVNSGVMLMNLTRMRKFRWTDYVIPIHKEYKLKITW---GDQDIINII 257
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 258 FHYHPEKLYVYSCRYN 273
Score = 43.5 bits (101), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 70/163 (42%), Gaps = 18/163 (11%)
Query: 3 SDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
S + P P+ + + +V G + +T++KS L + PL F ++T+ +
Sbjct: 45 SKKSPSTPSTSEVTICVVVCG-DRLHEALTMLKSALVFTSRPLQFIILTEQNLIPAFSEK 103
Query: 63 FSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKT 115
S W + +S D +V +++ P+ ++ L +L LP VL +T A
Sbjct: 104 LSEWRL----ISNKTFDFLVRPITF-PDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-V 157
Query: 116 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
+ +DTD +F ++W F ++ Q L D WY
Sbjct: 158 LYVDTDTLFLAPPEKVWDNFKKMNSSQLAALSPEHEDPNTGWY 200
>gi|194756854|ref|XP_001960685.1| GF11373 [Drosophila ananassae]
gi|190621983|gb|EDV37507.1| GF11373 [Drosophila ananassae]
Length = 377
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 108/278 (38%), Gaps = 87/278 (31%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y +S+ N V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 29 EQYADTSEANYVCLATQTSVERLNSLPQVSANWQGKMSVALFAAGHEEFVVLQYFVTYMR 88
Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
+ R N +H++
Sbjct: 89 LC-FVNIRENATFHILVPRDYDKLPRVEAVPYNMKGKFDCQYPDRTMKALLKFRTLKTLQ 147
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
+++ N YP N +RN+A T YVFL DID +P P L R+ +
Sbjct: 148 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCSKLCA---Y 204
Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
V+P FE R FP S + TNFS+W++ T + E
Sbjct: 205 VIPTFEID-IRAMFPRSKSGLLRLIKKGLARPFHEKVFIYNQYATNFSKWLSTNTN-ETE 262
Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ VA ++P +D RF+G+G+ +
Sbjct: 263 VSVSHIVTNFEFLYEPFYVAVDNVPAHDERFLGYGFTR 300
>gi|300120705|emb|CBK20259.2| unnamed protein product [Blastocystis hominis]
Length = 322
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 94/233 (40%), Gaps = 46/233 (19%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
+S +DVT +Q + R +L + W GP+S+TL + E ++ SR
Sbjct: 10 TSQVHDVTFTSQGTTVRYYYFPILLQRWTGPLSITLCLKPEEVSDINDQLA-ADNFDSRL 68
Query: 533 NIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
+I +V G YPIN LRN+A+ V T + +L D+D P GLY ++ +R +
Sbjct: 69 SIVQYVATNPG-IYPINKLRNLAIRNVKTDHFWLTDLDMWPSQGLY---EAILRLPNSAL 124
Query: 593 HGGKKVLVVPAFETQRYRTA-------------FPASHAPTN------------------ 621
++VPAFE + FP + N
Sbjct: 125 QDEVLAIIVPAFEIHASSCSTFQACTDLPILPQFPNTKRELNRCISTRRCSTFRPWDQLH 184
Query: 622 ---FSRWVNAT-----TPYQIEWAPDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
F W + TP + EPY+V + LP +D RFV + +NK
Sbjct: 185 DYHFPEWYRESYTAELTPVRCFKGNTQEPYVVVKKSPFLPSFDERFVNYAYNK 237
>gi|256072700|ref|XP_002572672.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 165
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 68/137 (49%), Gaps = 18/137 (13%)
Query: 266 GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
G G+N+GV+L+DL KLR + W W + + T + S +QD+ N I+ + + Y
Sbjct: 20 GYGYNSGVVLMDLAKLRSLPWNHLWMSAIKFVMKTAGYLSAGEQDVINLIMLRNNEIFYE 79
Query: 326 LPCQWNVQLSDNTRSDELCYTELT--------DL------KIIHWN---SPKKLKVKNKH 368
+PC+WN+QLS + + LT DL KI+H N P+ L +KN
Sbjct: 80 IPCEWNIQLSSGVDTHRCPVSWLTENSLLNSGDLIYGKQPKIVHLNHQVKPEDLDLKNI- 138
Query: 369 MEFFRNLYLTFLEYDGN 385
+ N+ LT +GN
Sbjct: 139 LNVDTNVSLTTYNGNGN 155
>gi|395330127|gb|EJF62511.1| hypothetical protein DICSQDRAFT_126280 [Dichomitus squalens
LYAD-421 SS1]
Length = 363
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 7/129 (5%)
Query: 25 NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ--VEVSFYL--ADS 80
+S LIKSI+ Y +PL FH+I D A L+ + PQ + V FY +S
Sbjct: 5 DSAAEGSILIKSIIMYSSSPLAFHIICDHDARQYLERRLRLLTHPQHDISVQFYRIPHES 64
Query: 81 VVEDV---SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
+V + + H +GV GL+KL + ++LP ++ + I +DTD F D QLW F
Sbjct: 65 MVARIQREGALHTDHSAGVPGLMKLFIHEILPPSVERAIFVDTDAFFIADPTQLWDRFDA 124
Query: 138 LRQRQTIGL 146
+ + +
Sbjct: 125 FKPGAAVSM 133
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 9/96 (9%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H +GV GL+KL + ++LP ++ + I +DTD F D QLW F + + +
Sbjct: 79 HSAGVPGLMKLFIHEILPPSVERAIFVDTDAFFIADPTQLWDRFDAFKPGAAVSM-PYHP 137
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
D Y + W R + V+LLDL +LR++
Sbjct: 138 DQYAPE-------WHHANR-ICSCVMLLDLKRLREL 165
>gi|198431243|ref|XP_002124020.1| PREDICTED: similar to glycosyltransferase-like 1B [Ciona
intestinalis]
Length = 406
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 104/259 (40%), Gaps = 75/259 (28%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS---- 530
D DVTLV ++DRL + + + W GPIS+ +Y A Q + + LR
Sbjct: 78 DPADVTLVMHGTVDRLYRLIEIAEVWSGPISMAVY---APMQDASFFDDAIDGLRLCWAT 134
Query: 531 -RRNIGYHVVY---------KEGNF-----------------------YPINTLRNVALN 557
R+++ +H+V+ + G+F YP N LRNV +
Sbjct: 135 LRKSVSFHLVHHVNSPANVTQLGSFVYLSCKDIIRRLRLVRPNAGRSQYPHNLLRNVGRS 194
Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH----GHGGKKVLVVPAFETQRY---- 609
T +V +D D P L K ++ ++ K + V+P FE +++
Sbjct: 195 GALTNFVLHIDADLAPSRSLRGKFKELLKLHNLESGIAARSSKLLFVLPVFELRQFQNPV 254
Query: 610 ------------RTAFP-------ASHAPTNFSRWV--------NATTPYQIEWAPDFEP 642
R A P H PT++ RW+ ++ Y++++ ++ P
Sbjct: 255 RTKQELAEAIKERNAIPFLQETCVPCHRPTDYDRWLQSLKSDSNDSEMSYKVDYVKEWTP 314
Query: 643 YIVAHRDLPRYDTRFVGFG 661
+ + + P YD RFV FG
Sbjct: 315 FFITSKHSPLYDDRFVAFG 333
>gi|302689521|ref|XP_003034440.1| glycosyltransferase family 8 protein [Schizophyllum commune H4-8]
gi|300108135|gb|EFI99537.1| glycosyltransferase family 8 protein [Schizophyllum commune H4-8]
Length = 427
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 65/146 (44%), Gaps = 8/146 (5%)
Query: 8 VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
+ P E + A + S LIKSIL Y NP H++ L++ S +
Sbjct: 58 LAPALEPVVFAFMIWSRGSAEEGAQLIKSILLYNSNPTEIHIVCGEDERVYLESRLSLVT 117
Query: 68 VPQ--VEVSFYLAD-----SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
PQ V V FY +E I H +G GL+KL L +++P T+ K I +DT
Sbjct: 118 SPQHDVRVVFYPVTLQEMLDRIEREGAIKTDHEAGTPGLMKLFLHEIMP-TVKKAIYVDT 176
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGL 146
D +F +D LW +F R I +
Sbjct: 177 DALFISDPTILWRIFDTFRPSTAIAM 202
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 10/106 (9%)
Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
+E I H +G GL+KL L +++P T+ K I +DTD +F +D LW +F R
Sbjct: 139 IEREGAIKTDHEAGTPGLMKLFLHEIMP-TVKKAIYVDTDALFISDPTILWRIFDTFRPS 197
Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
I + + + W + ++LLD KLRD+
Sbjct: 198 TAIAMGYHPEQ--MAAAWH-------YASRICSCIMLLDFEKLRDM 234
>gi|443696721|gb|ELT97358.1| hypothetical protein CAPTEDRAFT_147975 [Capitella teleta]
Length = 388
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 109/270 (40%), Gaps = 76/270 (28%)
Query: 468 DEYEASSDGN----DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-------- 515
D ++AS + DVTL++ S+ ++ + L W GP+S+++++ +E
Sbjct: 55 DVFQASGETKTRILDVTLISHCSVHLIKHIPALVNAWHGPMSISIFLVSSELHTAVDMLL 114
Query: 516 ---------QQF--------LSYAG-----------NSEALRS---RRNIGYHVVYKEGN 544
Q+F LS+ N + + S +R +G +G
Sbjct: 115 IIHQCVPLFQKFVTVHFVLPLSHQKLTISKSSLLPINCDEMNSDLQKRVLGSRNYALDGV 174
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
YP N LRNVAL T Y ++D+D LP L + ++S + V+PAF
Sbjct: 175 LYPNNLLRNVALRNALTEYHLVIDVDMLPSPNLRQLFTTFVKS--QFAPNNRTAFVIPAF 232
Query: 605 ETQRYRTAFP------------ASHAP------------TNFSRW-----VNATTP-YQI 634
E T FP S P T++ W + +P Y +
Sbjct: 233 EMVTGGT-FPQTKQHLLDLWDQGSIQPFYLKACWKCQRFTDYDSWRKLPDSDGLSPGYDV 291
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+W +EP+ +A PRYD RF +G+N+
Sbjct: 292 DWKDPWEPFFIASWQAPRYDERFKQYGFNR 321
>gi|241608406|ref|XP_002405973.1| hypothetical protein IscW_ISCW019642 [Ixodes scapularis]
gi|215500729|gb|EEC10223.1| hypothetical protein IscW_ISCW019642 [Ixodes scapularis]
Length = 345
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 64/134 (47%), Gaps = 12/134 (8%)
Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNH 259
VLPE A + +D DVIF LW+LF + Q LV D WY + + H
Sbjct: 113 VLPEEDA-VLYVDADVIFVHPPEDLWSLFGAMNDSQLAALVPESEDYATNWY--RRFARH 169
Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
+ LG N+GV+L++LT++R W + K+ W DQD+ N + +H
Sbjct: 170 PFFEPLG--VNSGVMLMNLTRMRAFGWESLMGPLLTKYERNISW---GDQDLINIVFHDH 224
Query: 320 PYLVYTLPCQWNVQ 333
P + L C+WN +
Sbjct: 225 PDKLLVLSCKWNFR 238
>gi|241672118|ref|XP_002411447.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
gi|215504098|gb|EEC13592.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
Length = 398
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 108/270 (40%), Gaps = 87/270 (32%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ----FLSYAGN-SEALRS 530
G DV L Q S+DRL + L + W GP+SL ++++++ Q +LS + SEA+++
Sbjct: 67 GGDVCLATQGSLDRLSWLVDLVQLWNGPVSLAVFVANSAQFQAVKVYLSLIRSCSEAIKA 126
Query: 531 R---------------------------------------RNIGYHVVYK---EGNFYPI 548
+N+G V K + YP
Sbjct: 127 NMRVDLVYPADENLTRKTVWSDELGSDFSCGTHAVLLKVLQNVGKTKVVKGSAKQMLYPQ 186
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH---GGKKVLVVPAFE 605
N LRNVA + + Y FL DID +P GL+ L D G K V VVP +E
Sbjct: 187 NHLRNVARDGCAQKYFFLTDIDIMPKPGLWEELS------DFFGRTLPCTKCVFVVPTYE 240
Query: 606 ---------TQR--------------YRTAFPASHAPTNFSRWVNAT-------TPYQIE 635
T+R ++ +F + TN W Y ++
Sbjct: 241 MSETAAVPRTKRELLHLVRRKEARPFHQKSFIHNQYATNHGMWEGLKQRGDGLRAAYSVK 300
Query: 636 WAPDF-EPYIVAHRDLPRYDTRFVGFGWNK 664
F EP+ VA + +PRYD RFVG+G+ +
Sbjct: 301 RYEFFYEPFYVAAKVVPRYDERFVGYGFTR 330
>gi|383864121|ref|XP_003707528.1| PREDICTED: glucoside xylosyltransferase 1-like [Megachile
rotundata]
Length = 367
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 69/139 (49%), Gaps = 14/139 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP VL ET A + +DTD +F +W F ++ Q L D WY
Sbjct: 144 RLFLPTVLNETDA-VLYMDTDTLFLAPPETIWDEFKKMNSSQLAALSPEHEDPNTAWY-N 201
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ K+ P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI
Sbjct: 202 RFAKH----PYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDII 254
Query: 313 NAIISEHPYLVYTLPCQWN 331
N I HP +Y C++N
Sbjct: 255 NIIFHYHPEKLYVYSCRYN 273
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 68/156 (43%), Gaps = 18/156 (11%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
P E + + +V G SL T++KS L + + PL F ++ +T ++ W +
Sbjct: 52 PIVEEVTICVVVCGDRLDESL-TMLKSALVFTRRPLQFIILAETDLISAFNEKLMEWKL- 109
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKTIVLDTDV 122
+S D VV+ +++ P + ++ L +L LP VL ET A + +DTD
Sbjct: 110 ---MSNKTFDFVVKPITF-PEGSDAAMWKKLFKPCAAQRLFLPTVLNETDA-VLYMDTDT 164
Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
+F +W F ++ Q L D WY
Sbjct: 165 LFLAPPETIWDEFKKMNSSQLAALSPEHEDPNTAWY 200
>gi|426217674|ref|XP_004003078.1| PREDICTED: xyloside xylosyltransferase 1 [Ovis aries]
Length = 393
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ N K P P GFN+GV+LL+L +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD + + +KI H N
Sbjct: 359 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 386
>gi|440895294|gb|ELR47525.1| hypothetical protein M91_05743, partial [Bos grunniens mutus]
Length = 226
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 15 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 74
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ N K P P GFN+GV+LL+L +R
Sbjct: 75 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 134
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 135 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 191
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD + + +KI H N
Sbjct: 192 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 219
>gi|242006902|ref|XP_002424281.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507681|gb|EEB11543.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 334
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 13/156 (8%)
Query: 205 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHK 260
LP + + LDTD + + Q+WA F + Q L D WY + H
Sbjct: 115 LPVNVDSVLYLDTDTLILGPVEQIWAHFKLMNSSQFAALAPEHEDPLTGWY--NRFARHP 172
Query: 261 PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHP 320
+ LG NTGV+L++LT++R W + I +++ L W DQDI N I HP
Sbjct: 173 FYKPLG--VNTGVMLMNLTRMRKFEWERYLLPIYKQYKLKIPW---GDQDIINIIFHFHP 227
Query: 321 YLVYTLPCQWNVQLSDNTRSDELCYT-ELTDLKIIH 355
+Y C++N + +D+ +C T E + ++H
Sbjct: 228 DKLYIYSCKYNYR-ADHCMYTSVCKTAEKEGIVVLH 262
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 11/148 (7%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
E + + +V G +LV L+KS L + K L F++ TD +N+ Q + W +
Sbjct: 23 ETMSIVVVVCGDRWKETLV-LLKSALLFTKGHLQFYIFTDLELVNVFQRELNIWK-KDFK 80
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK--VLPETLAKTIVLDTDVIFATDIAQ 130
+F D + +V + + S L K + LP + + LDTD + + Q
Sbjct: 81 KNF---DYKILEVKFPSGEFESSWRKLFKPCSSQRLFLPVNVDSVLYLDTDTLILGPVEQ 137
Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
+WA F + Q L D WY
Sbjct: 138 IWAHFKLMNSSQFAALAPEHEDPLTGWY 165
>gi|20130341|ref|NP_611886.1| CG3253, isoform A [Drosophila melanogaster]
gi|386768550|ref|NP_001246488.1| CG3253, isoform B [Drosophila melanogaster]
gi|7291745|gb|AAF47166.1| CG3253, isoform A [Drosophila melanogaster]
gi|15010436|gb|AAK77266.1| GH04269p [Drosophila melanogaster]
gi|220956256|gb|ACL90671.1| CG3253-PA [synthetic construct]
gi|383302675|gb|AFH08241.1| CG3253, isoform B [Drosophila melanogaster]
Length = 389
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 107/278 (38%), Gaps = 87/278 (31%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y SS+G+ V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 41 EQYVDSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100
Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
A R N +H++
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRVAALPLNMRGKFDCQYPDRTLKALLKFRSLKTLQ 159
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
+++ N YP N +RN+A T YVFL DID +P P L R+ +
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCTKSCA---Y 216
Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIE 635
V+P FE R FP S TNFS+W++ T + E
Sbjct: 217 VIPTFEID-VRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNTN-ETE 274
Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ +A + P +D RF+G+G+ +
Sbjct: 275 VSVSHVVTNFEFLYEPFYIAVDNAPAHDERFLGYGFTR 312
>gi|442754659|gb|JAA69489.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
unknown function [Ixodes ricinus]
Length = 318
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 84/170 (49%), Gaps = 11/170 (6%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +LP+ A + LD D++ + LW +F + I D ++
Sbjct: 87 RLFLPDMLPKEEA-VLYLDADILLLRPVEDLWRVFHSMGPSHLIAQSYEIEDMDTNP-YR 144
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
++ P + G N GV+ ++LT++R+ +W + E++ W ADQDI N I
Sbjct: 145 HYGSVPYVKPYGVNAGVMPMNLTRMRNFAWGSRMEPLLEQYGSRLKW---ADQDIVNVIF 201
Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
+E+P ++ +PC WN + DN E C E+ L +H N ++L +K+
Sbjct: 202 NENPDKLFLMPCTWNFR-RDNCEFHESCKGEVPGL--LHAN--QRLFIKD 246
>gi|395528648|ref|XP_003766439.1| PREDICTED: xyloside xylosyltransferase 1 [Sarcophilus harrisii]
Length = 368
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 74/172 (43%), Gaps = 26/172 (15%)
Query: 187 NKHYSGVYG--------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
KH+S G L + + +++P+ +++ I LD D+ T+I +L+ F
Sbjct: 164 QKHFSAGSGTYYGDSIFFLSVAMHQIMPKEISQIIQLDLDLKHKTNIRELFEEFDNFPPG 223
Query: 239 QTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLRDISWAGF 289
IG+ Y W+ + P P GFN+GV+LL+L +R +
Sbjct: 224 AVIGIAREMQPVYRHTFWQFRRENPQTRVGDPPPDGLPGFNSGVMLLNLEAMRQ---SAL 280
Query: 290 WRIIAEKFLLT------RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
+R + E L R L DQD F + E+P L + L C WN QL
Sbjct: 281 YRRLLEPATLQQLTEKYRFQGHLGDQDFFTVVGMEYPELFHVLDCTWNRQLC 332
>gi|330846633|ref|XP_003295120.1| hypothetical protein DICPUDRAFT_51901 [Dictyostelium purpureum]
gi|325074251|gb|EGC28356.1| hypothetical protein DICPUDRAFT_51901 [Dictyostelium purpureum]
Length = 387
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 63/107 (58%), Gaps = 4/107 (3%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS--DAEAQQFLSYAGNSEALRSRRNIGY 536
V++V Q ++DRL V M+ + W+ P+S+++++ + + N +L ++
Sbjct: 12 VSIVTQTTVDRLHKVSMMAEKWKAPMSVSVFIKKPSKDIPELKKKLQNYPSLVYYADL-- 69
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
H+++ YP+N LRN+A+ T +VF++D DFLP GL+ Y+++
Sbjct: 70 HLLFANNTRYPVNNLRNLAIENSRTDFVFIMDADFLPPLGLHDYIQN 116
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 35/50 (70%), Gaps = 3/50 (6%)
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
H+PTNF +W+ A PY+IE+ FEPY+V +R LP +D R G+G++K
Sbjct: 257 HSPTNFPKWIEAKEPYEIEYKWIFEPYLVFNRSESLP-FDERLKGYGFDK 305
>gi|68362842|ref|XP_708461.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like isoform 2
[Danio rerio]
gi|444246614|gb|AGD94656.1| beta-1,3-N-acetylglycosaminyltransferase 1 [Danio rerio]
Length = 431
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 65/151 (43%), Gaps = 36/151 (23%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP N LRNVA Y+ ++DID +P L+ + + + +VLV+PAFE
Sbjct: 219 YPNNLLRNVARGGTDAAYILVIDIDMIPSANLHHQFVTMLMKREP---AADEVLVLPAFE 275
Query: 606 TQRYRTAFPAS------------------------HAPTNFSRWVNATTP--------YQ 633
+ R PAS APTN+S WVN + Y
Sbjct: 276 IRHIRK-MPASKPELVQLYQVGEVRPFYDELCSRCQAPTNYSLWVNLASKSSGPLEVSYT 334
Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
I W +EP+ + R +P YD F +G+N+
Sbjct: 335 INWVDPWEPFYIGARSVPLYDESFRQYGFNR 365
>gi|328855389|gb|EGG04516.1| family 49 glycosyltransferase [Melampsora larici-populina 98AG31]
Length = 544
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 101/243 (41%), Gaps = 52/243 (21%)
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
I P++ E+ D D+T+ ++ +R ++ L +H++GPIS T+++S+
Sbjct: 221 IGPHSDGEF----DQKDITITTLVTSNRFKVFAKLVEHYQGPISATIHVSNVPETLEPLL 276
Query: 522 AGNSEALRSRRN----IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL 577
A RS + H+V + N RNVA T YV +LD+DF
Sbjct: 277 ASLETMYRSSPAMSTWVDIHLVVDSFD-RQFNMWRNVAKWFSRTSYVMMLDVDFWICTDF 335
Query: 578 Y------PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-----TAFP------------ 614
PY S + S G+ LV+PAFE + T FP
Sbjct: 336 RTRVLSNPYYLSLLES-------GQAALVIPAFEFTKQADGVNPTQFPNQKSTLKGLFQD 388
Query: 615 -----------ASHAPTNFSRWVNATTPYQ-IEWAPDFEPYIVAHRD-LPRYDTRFVGFG 661
H PT++ ++ YQ I + +EPY++ +D P D RF+G+G
Sbjct: 389 GKIDMFHKSWIPGHGPTDYKQYFQTDQSYQVISYTHSYEPYVIYKKDGTPFCDERFIGYG 448
Query: 662 WNK 664
NK
Sbjct: 449 GNK 451
>gi|302564007|ref|NP_001181005.1| xyloside xylosyltransferase 1 [Macaca mulatta]
Length = 393
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IGL Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|428180835|gb|EKX49701.1| hypothetical protein GUITHDRAFT_104664 [Guillardia theta CCMP2712]
Length = 576
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
P +HAP ++ RW+ AT PY + + DF P+++ RD+PR+DTRFVG
Sbjct: 423 PRAHAPVDYERWIGATRPYAVTYQHDFAPFLLCRRDVPRFDTRFVG 468
>gi|219123437|ref|XP_002182031.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406632|gb|EEC46571.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 527
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 103/265 (38%), Gaps = 52/265 (19%)
Query: 449 VAQLSMDRLQMVEINPNNVDE---YEASSDGNDVTLVAQLSMDRLQMVEMLFKHW--EGP 503
V+ D L + N D + S D TLV QLS DRL M+E + W P
Sbjct: 122 VSSTETDALPSIPFQTNFPDAPVCHSLSPDDVSYTLVTQLSQDRLWMMEHHCQRWGPSHP 181
Query: 504 ISLTLYMSDAEAQ---QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
+S+ ++ + A+ Q ++ E L S + + YP+N LRN+A V+
Sbjct: 182 MSIAVFTNQTVAEVRSQLVALGCAPEQLASVQTLPSTAAAVSD--YPVNVLRNLAFRAVT 239
Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY----------- 609
T ++ +D+DF P L+ L S R + LV+PAF+ QR
Sbjct: 240 TTHIVYVDVDFWPSADLHATL-SGARIRHALAQNERTALVIPAFQLQRQCRAWKECPDQN 298
Query: 610 -----------------RTAFP------ASHAPTNFSRWVNATTPYQIEWAP-----DFE 641
R AFP H T + W+ + + P +E
Sbjct: 299 VPVMPTHKAALERLSRNRQAFPFDPTNVGGHGSTKYRAWIKSQPDGVLLEIPCVLSNRYE 358
Query: 642 PYIVAHRD--LPRYDTRFVGFGWNK 664
PY+V LP + F G+G NK
Sbjct: 359 PYLVVRYCDVLPPFQEAFSGYGKNK 383
>gi|193577758|ref|XP_001943061.1| PREDICTED: uncharacterized protein C3orf21 homolog [Acyrthosiphon
pisum]
Length = 344
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 73/159 (45%), Gaps = 15/159 (9%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
+YS L L V P +L K IV+D D F + I L +F Q GL S
Sbjct: 151 YYSDDVFYLSPALHLVAPNSLDKAIVVDVDTAFQSTICDLHNVFDNFTNTQLFGLAPELS 210
Query: 249 DWYLGKLWK-------NHKPWPALG-RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
Y LWK +K P + G N+GV+L L ++R A + R+I+ +++ +
Sbjct: 211 PVYHHILWKWRNYRGDKYKTQPHINLNGLNSGVLLSYLERIRKS--AEYDRLISPEWISS 268
Query: 301 RLWT-----SLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ L DQD + + EHP L+Y L C WN QL
Sbjct: 269 VVQKYLFKGHLGDQDWYTLVSYEHPELIYRLSCGWNRQL 307
>gi|299469898|emb|CBN76752.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
Length = 1372
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 103/300 (34%), Gaps = 107/300 (35%)
Query: 461 EINPNNVD-----EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA 515
E +P +D E +D+T+ Q S+ RL +E + W G +S +
Sbjct: 440 EYHPKRIDVVARLESRHRRAESDITIFTQCSLSRLNRLERMCDLWTGVVSAAI------- 492
Query: 516 QQFLSYAGNSEALRSRRNIG--YHVVYKEG-------------------NFYPINTLRNV 554
++ G R RR + + V G YP+N+LRN
Sbjct: 493 ---INGGGGHVDQRMRRKVSALHRRVESTGKCRLDICLCQPAPGLAARDGLYPVNSLRNA 549
Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSS---IRSMDMHGHGGK--------------- 596
AL T V +D+D +PM GL+ + R ++M GG+
Sbjct: 550 ALRASRTDLVLSVDVDAMPMRGLHEVVSRPPVYARLLEM-CRGGRPPTHAAPAFPAASCS 608
Query: 597 --------------KVLVVPAFE--------TQRYRT----------------------- 611
LVVPA E R RT
Sbjct: 609 GADTEAATLDTHPPTFLVVPALEFVGASSPDDPRIRTLLQEADNKRKAADLVVAGDLQAF 668
Query: 612 ---AFPASHAPTNFSRWVN---ATTPYQIEWAPDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
FP H TN RW TPY++ +A +EPY + R P +D RF GFG +K
Sbjct: 669 HAECFPEGHRQTNLDRWTTLPVDATPYEVMYAEGYEPYGLFFRRHAPPFDERFRGFGLDK 728
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 71/179 (39%), Gaps = 56/179 (31%)
Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS--IRSMDMHGHGGKKVL 599
+ ++YPIN LRN AL Q T V + D+DF P L L++S I+S+ +
Sbjct: 112 KASWYPINQLRNRALAQADTDLVLICDVDFRPCRRLAHLLRASGEIKSIFKRVSCRLNCV 171
Query: 600 VVPAFE-----------------TQRYRTAFPA--------------------------- 615
V+PAFE QR P
Sbjct: 172 VLPAFEVSAARDDDDDGVSNEEAAQRAAIEMPEKSWWLKTLACKQELLKQWEQGRVVPFA 231
Query: 616 ------SHAPTNFSRWVNATTP---YQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
H TNF RW AT Y++ + FEP+++ +R L P +D RF G+G NK
Sbjct: 232 SRVWAQGHRATNFDRWKAATDLSGWYEVAYEEGFEPFVIMNRLLVPAFDERFEGYGRNK 290
>gi|195489451|ref|XP_002092743.1| GE14358 [Drosophila yakuba]
gi|194178844|gb|EDW92455.1| GE14358 [Drosophila yakuba]
Length = 458
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 46/202 (22%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ ++L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADQSLNRAILLDCDIVFRSDV 287
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGSNQHSRV 347
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
G N+GV+LL L ++R+ + EK T + T L DQD F
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRN------SKSYLEKLTHTEVHTLVAKYSFKGHLGDQDFFT 401
Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
+ E+P L+Y L C WN QL
Sbjct: 402 LLGYEYPNLIYRLDCIWNRQLC 423
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD+ + L ILQ + +V + S+++D+
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ ++L + I+LD D++F +D+ L+ F
Sbjct: 241 SPYFSSAPNSYYSDSLFFLSLGLHRIADQSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330
>gi|402861997|ref|XP_003895359.1| PREDICTED: xyloside xylosyltransferase 1 [Papio anubis]
Length = 393
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T++ +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNVRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IGL Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHIKIYHGNC 386
>gi|291236100|ref|XP_002737979.1| PREDICTED: UDP-GlcNAc:betaGal
beta-1,3-N-acetylglucosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 888
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 107/279 (38%), Gaps = 92/279 (32%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-----------------EAQQFL 519
+DVT+V Q S + L ++ + W GPIS+ ++ + + Q+F
Sbjct: 545 DDVTIVTQCSPNHLHLLFDQAEIWAGPISVAIFAPNPNITALYVVIAHLRHCSLKIQKFA 604
Query: 520 SY-------------AGNSEALRSRR-NIGYHVVYKEG---NF---------------YP 547
S+ N E SR I H + NF YP
Sbjct: 605 SFHLVYPLRQMSQSRGFNKENYSSRPIEIAMHCTEVKDWIKNFDNLDHENYVLAGDVPYP 664
Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS--------SIRSMDMHGHGGKKVL 599
N LRNVA T Y+ +LDID +P GL KS S+ ++M + V
Sbjct: 665 NNLLRNVAWQGSVTEYILVLDIDIIPSSGLLASFKSFDRRRKSLSLSHLEME-EPPETVY 723
Query: 600 VVPAFETQR--------------------------YRTAFPASHAPTNFSRW--VNATTP 631
V+PAFE + Y ++++RW + +P
Sbjct: 724 VLPAFEIREGIDRIFTPMKKNELLTLWNEGDVRPFYEEVCSKCQIHSSYTRWRTFESESP 783
Query: 632 ------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
Y++EW +EP+ +A R +P+YD RF +G+N+
Sbjct: 784 DMMNIAYELEWKDPWEPFYIAKRHIPKYDERFKQYGFNR 822
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 100/250 (40%), Gaps = 73/250 (29%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE---------------AQQFLSY 521
+DVT+V S+ L + + + WEGPIS+ ++ +D +Q +++
Sbjct: 89 DDVTIVTHSSVSDLHHLVGMAERWEGPISVAVFSTDISLFASSVSNLRRCFPLVKQHVTF 148
Query: 522 -------AGNSE--------------ALRSRRNIGYHVVYKEGN-FYPINTLRNVALNQV 559
G +E L RN+ + N YPIN L+N AL
Sbjct: 149 HLVTTIAPGKAEISNDETILKVPCGMVLHVTRNLEDESYSSDDNPAYPINILQNEALKGS 208
Query: 560 STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF--------ETQRYRT 611
T Y+ L++D +P L K +++ + V ++P F ET +Y++
Sbjct: 209 ITKYILTLNVDLMPNANLLKNFKLFTDAIEDDA-AIQNVYILPDFEICDDDVHETPKYKS 267
Query: 612 AF-------------------PASHAPTNFSRWV----NAT----TPYQIEWAPDFEPYI 644
+ H N+++W +AT + Y++E+ ++P+
Sbjct: 268 EVLDLWREGKDLIRPYRYAMCRSCHNHGNYTKWRQILPSATNTLDSSYEVEFRYPWQPFY 327
Query: 645 VAHRDLPRYD 654
++ RD+P YD
Sbjct: 328 ISSRDVPLYD 337
>gi|195552258|ref|XP_002076411.1| GD15463 [Drosophila simulans]
gi|194202060|gb|EDX15636.1| GD15463 [Drosophila simulans]
Length = 389
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 108/279 (38%), Gaps = 89/279 (31%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y SS+G+ V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 41 EQYVDSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100
Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
A R N +H++
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRVAALPLNMRGKFDCQYPDRTLKALLKFRSLKTLQ 159
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
+++ N YP N +RN+A T YVFL DID +P P L +R+ +
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFLRTANCTKSCA---Y 216
Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP---- 631
V+P FE R FP S TNFS+W+++ T
Sbjct: 217 VIPTFEID-IRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSSNTNDTEV 275
Query: 632 ------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
E+ +EP+ ++ + P +D RF+G+G+ +
Sbjct: 276 SVSHIVTNFEFL--YEPFYISVDNAPAHDERFLGYGFTR 312
>gi|427785201|gb|JAA58052.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
unknown function [Rhipicephalus pulchellus]
Length = 387
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 12/138 (8%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +LP+ A I +D DV+F + I +LW F+ + L D WY
Sbjct: 156 RLFLPSMLPDVDA-VIYVDADVLFLSPIEELWKHFTAMNSSHLAALAPESEDYATNWY-- 212
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ + H + LG N+GV+L++LT++R+ W + K+ W DQD+ N
Sbjct: 213 RRFARHPFYQPLG--VNSGVMLMNLTRMREFGWEKRLGPLLRKYGSNISW---GDQDLLN 267
Query: 314 AIISEHPYLVYTLPCQWN 331
+ +HP + C+WN
Sbjct: 268 ILFHDHPERLLLFSCRWN 285
>gi|358336522|dbj|GAA55009.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
Length = 230
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 43/86 (50%), Gaps = 11/86 (12%)
Query: 263 PALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQ-----------DI 311
P RG NTGV+L+ L KLR +SW W + L ADQ DI
Sbjct: 61 PMEKRGINTGVLLIHLQKLRKVSWHRMWMDTIQDLLKRTKNLKAADQVSAVRIYIHIQDI 120
Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDN 337
+NAI HP L+Y LPC WNVQ+ N
Sbjct: 121 YNAIFGIHPELLYQLPCTWNVQVIRN 146
>gi|355747173|gb|EHH51787.1| hypothetical protein EGM_11230, partial [Macaca fascicularis]
Length = 318
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 107 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 166
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IGL Y W+ P GFN+GV+LL+L +R
Sbjct: 167 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 226
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 227 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 283
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 284 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 311
>gi|355560144|gb|EHH16872.1| hypothetical protein EGK_12239, partial [Macaca mulatta]
Length = 308
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 97 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 156
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IGL Y W+ P GFN+GV+LL+L +R
Sbjct: 157 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 216
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 217 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 273
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 274 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 301
>gi|16768190|gb|AAL28314.1| GH22974p [Drosophila melanogaster]
Length = 458
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ TL + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRTLNRAILLDCDIVFRSDV 287
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
G N+GV+LL L ++R+ +E L ++ L DQD F + E+
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 407
Query: 320 PYLVYTLPCQWNVQLS 335
P L+Y L C WN QL
Sbjct: 408 PNLIYRLDCIWNRQLC 423
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD+ + L ILQ + +V + S+++D+
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ TL + I+LD D++F +D+ L+ F
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRTLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330
>gi|340726907|ref|XP_003401793.1| PREDICTED: glucoside xylosyltransferase 1-like [Bombus terrestris]
Length = 364
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL + A + +DTD +F ++W F ++ Q L D G W
Sbjct: 143 RLFLPSVLNDIDA-VLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 199
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 200 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDIINII 256
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 257 FHYHPEKLYVYSCRYN 272
>gi|427778361|gb|JAA54632.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
unknown function [Rhipicephalus pulchellus]
Length = 375
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 12/138 (8%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +LP+ A I +D DV+F + I +LW F+ + L D WY
Sbjct: 156 RLFLPSMLPDVDA-VIYVDADVLFLSPIEELWKHFTAMNSSHLAALAPESEDYATNWY-- 212
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ + H + LG N+GV+L++LT++R+ W + K+ W DQD+ N
Sbjct: 213 RRFARHPFYQPLG--VNSGVMLMNLTRMREFGWEKRLGPLLRKYGSNISW---GDQDLLN 267
Query: 314 AIISEHPYLVYTLPCQWN 331
+ +HP + C+WN
Sbjct: 268 ILFHDHPERLLLFSCRWN 285
>gi|152125897|sp|Q3UHH8.2|GXLT1_MOUSE RecName: Full=Glucoside xylosyltransferase 1; AltName:
Full=Glycosyltransferase 8 domain-containing protein 3
gi|148672339|gb|EDL04286.1| mCG19737 [Mus musculus]
gi|219521061|gb|AAI71972.1| Glt8d3 protein [Mus musculus]
gi|223461493|gb|AAI41303.1| Glt8d3 protein [Mus musculus]
Length = 404
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 164 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 220
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 221 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 279
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 280 --GDQDLLNIVFSHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 330
Score = 42.4 bits (98), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 15/154 (9%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
E + +A+V G VT++KS L + PLH H+ + + + ++WS + +
Sbjct: 78 EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLASWSFLRRF 136
Query: 72 EVSFY----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
+ S Y DS + W K +L LP +L E + + +DTD++F
Sbjct: 137 DYSLYPITFPGDSAAD---W---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRP 189
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 190 VDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 221
>gi|380020565|ref|XP_003694153.1| PREDICTED: glucoside xylosyltransferase 1-like [Apis florea]
Length = 364
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL + + +DTD +F ++W F ++ Q L D G W
Sbjct: 144 RLFLPSVL-NNIDAVLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 200
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 201 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDIINII 257
Query: 316 ISEHPYLVYTLPCQWN 331
HP +Y C++N
Sbjct: 258 FHYHPEKLYVYSCRYN 273
>gi|348582762|ref|XP_003477145.1| PREDICTED: uncharacterized protein C3orf21 homolog [Cavia
porcellus]
Length = 393
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 87/208 (41%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ + P GFN+GV+LL L +R
Sbjct: 242 FDNFLPGAVIGIAREMQPVYRHTFWQFRQENPKTRVGGPPPEGLPGFNSGVMLLSLEAMR 301
Query: 283 DISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
G + +A K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPLYGRLLEPEQVQQLANKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 359 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|332262844|ref|XP_003280468.1| PREDICTED: LOW QUALITY PROTEIN: xyloside xylosyltransferase 1
[Nomascus leucogenys]
Length = 394
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 183 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKFKTNIRELFEE 242
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ + Y W+ P GFN+GV+LL L +R
Sbjct: 243 FDSFLPGTIIGIAQEMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLSLEAMR 302
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 303 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 357
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 358 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 387
>gi|380798379|gb|AFE71065.1| xyloside xylosyltransferase 1, partial [Macaca mulatta]
Length = 272
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 61 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 120
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IGL Y W+ P GFN+GV+LL+L +R
Sbjct: 121 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 180
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 181 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 237
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 238 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 265
>gi|195430896|ref|XP_002063484.1| GK21934 [Drosophila willistoni]
gi|194159569|gb|EDW74470.1| GK21934 [Drosophila willistoni]
Length = 429
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 79/190 (41%), Gaps = 42/190 (22%)
Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
S PN +YS L L L ++ +L+K I+LD D++F +D+ L+ F Q G
Sbjct: 208 SSAPNSYYSDSLFFLSLGLHRIADTSLSKAILLDCDIVFRSDVRLLFGEFDNFLPHQLFG 267
Query: 243 LVENQSDWYLGKLWKNHKPWP--ALGR-----------------------------GFNT 271
L + Y L++ +P + G G N+
Sbjct: 268 LAPELTPVYRHILYRYRVRFPKTSFGNPYYPMRFAKNNEIGTGDQANHIHEHHGYPGLNS 327
Query: 272 GVILLDLTKLRD-------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVY 324
GV+LL L ++R+ ++ R++A+ L DQD F + E+P L+Y
Sbjct: 328 GVVLLLLNRIRESKTYVEALTSTEVHRLVAK----YSFKGHLGDQDFFTLLGYEYPNLIY 383
Query: 325 TLPCQWNVQL 334
L C WN QL
Sbjct: 384 RLDCVWNRQL 393
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITD----TVALNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD T L+ILQ + +V + +++D+
Sbjct: 144 SLLKHTNAQLHLHVITDKDSQTSVLDILQKQIRRFRRTVIYTMYDVKVCSVIIQDIAAKL 203
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ +L+K I+LD D++F +D+ L+ F
Sbjct: 204 SPYFSSAPNSYYSDSLFFLSLGLHRIADTSLSKAILLDCDIVFRSDVRLLFGEFDNFLPH 263
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 264 QLFGLAPELTPVYRHILYRYRVRFPKTSFG 293
>gi|402591667|gb|EJW85596.1| hypothetical protein WUBG_03494 [Wuchereria bancrofti]
Length = 181
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 44/96 (45%), Gaps = 26/96 (27%)
Query: 595 GKKVLVVPAFE---------------------TQRY----RTAFPASHAPTNFSRWVNAT 629
K+ VVPAFE QR R + H T++ RW NA
Sbjct: 13 NKRAFVVPAFEYMSNQVPAIPLTKNELLIELDAQRMQIFRRNIWIQGHVATDYDRWRNAD 72
Query: 630 TPYQIEWAPDFEPYIVA-HRDLPRYDTRFVGFGWNK 664
Y + W D+EPYIV R LP YD RFVGFGWNK
Sbjct: 73 QEYSVSWRTDYEPYIVVKRRGLPPYDQRFVGFGWNK 108
>gi|395839733|ref|XP_003792735.1| PREDICTED: xyloside xylosyltransferase 1 [Otolemur garnettii]
Length = 393
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 86/205 (41%), Gaps = 18/205 (8%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I +D D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSIAMHQIMPKEILRVIQVDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ + P GFN+GV+LL+L +R
Sbjct: 242 FDNFLPGAVIGIAREMQPVYRHTFWQFRRENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRIIAEKFLLT---RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS---- 335
A+ LT L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPLYSHLLEPAQVQQLTDKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCTWWR 361
Query: 336 DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 362 DHGYSDVFEAYFQCEGHVKIYHGNC 386
>gi|428175844|gb|EKX44732.1| hypothetical protein GUITHDRAFT_109510 [Guillardia theta CCMP2712]
Length = 550
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 98/261 (37%), Gaps = 72/261 (27%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR--SR 531
++ +VT V ++DRL + L HW P+S+ +Y E ++ +S L S+
Sbjct: 89 NECENVTFVTSATIDRLPTLLNLLAHWSRPVSVAIYARTHEEERQISLFVRKYVLPWCSK 148
Query: 532 RNIGYHVVYKEGNF----------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
++ + +P+N LR +A + T V DID LP L +
Sbjct: 149 KHQSISICMLTACIDESRPPFILPFPVNRLRRLATSIAKTDLVIYADIDLLPSADLGSII 208
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------- 616
S + G K VLV+P+F + PAS
Sbjct: 209 MEKFLSRQVQG--WKDVLVIPSFRSSDESWPSPASILGSDNDVWIEAEPIKTDSLLERFL 266
Query: 617 --------------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD- 649
HAPT+++RW NA + Y++++ +EPY+V +R
Sbjct: 267 NETVTIPGLKELELPRGDLWFHSSVFHAPTDYARWFNAKSLYEVDYVLGYEPYVVLNRSD 326
Query: 650 ------LPRYDTRFVGFGWNK 664
+D F +GW+K
Sbjct: 327 WHGQHGWGLWDEAFDTWGWDK 347
>gi|195347329|ref|XP_002040206.1| GM16083 [Drosophila sechellia]
gi|194135555|gb|EDW57071.1| GM16083 [Drosophila sechellia]
Length = 417
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ +L + I+LD D++F +D+
Sbjct: 187 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 246
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 247 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 306
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
G N+GV+LL L ++R+ +E L ++ L DQD F + E+
Sbjct: 307 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 366
Query: 320 PYLVYTLPCQWNVQLS 335
P L+Y L C WN QL
Sbjct: 367 PNLIYRLDCIWNRQLC 382
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 68/154 (44%), Gaps = 22/154 (14%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFS------TWSVPQVEVSFYLADSVVEDV 85
S+L + LH H+ITD+ + L ILQ +++ V+V S+++D+
Sbjct: 140 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTLIYTIYDVKV----CSSIIQDI 195
Query: 86 --------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
S PN +YS L L L ++ +L + I+LD D++F +D+ L+ F
Sbjct: 196 AAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDN 255
Query: 138 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 256 FLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFG 289
>gi|397472300|ref|XP_003807689.1| PREDICTED: xyloside xylosyltransferase 1 [Pan paniscus]
Length = 393
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|256082479|ref|XP_002577483.1| glycosyltransferase-related [Schistosoma mansoni]
gi|350644935|emb|CCD60351.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 118
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK 260
+ ++LP T++K IVLD D++ TDI +LW F Q+IG+ Q+ + + K +
Sbjct: 2 ITEILPLTISKVIVLDIDILLNTDIIELWNYFDYFTGTQSIGIGLEQNPHFQEVMTKLNS 61
Query: 261 PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK 296
W G G+N GV+L L+KLR W W I +K
Sbjct: 62 NWK--GYGYNNGVLLFHLSKLRSTKWDYLWLNITKK 95
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%)
Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK 163
+ ++LP T++K IVLD D++ TDI +LW F Q+IG+ Q+ + + K +
Sbjct: 2 ITEILPLTISKVIVLDIDILLNTDIIELWNYFDYFTGTQSIGIGLEQNPHFQEVMTKLNS 61
Query: 164 PWPALG 169
W G
Sbjct: 62 NWKGYG 67
>gi|410221668|gb|JAA08053.1| chromosome 3 open reading frame 21 [Pan troglodytes]
Length = 393
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|114591198|ref|XP_516961.2| PREDICTED: xyloside xylosyltransferase 1 isoform 2 [Pan
troglodytes]
gi|410247994|gb|JAA11964.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410247996|gb|JAA11965.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410247998|gb|JAA11966.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410294112|gb|JAA25656.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410294114|gb|JAA25657.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410294116|gb|JAA25658.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410294118|gb|JAA25659.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410294120|gb|JAA25660.1| chromosome 3 open reading frame 21 [Pan troglodytes]
gi|410335001|gb|JAA36447.1| chromosome 3 open reading frame 21 [Pan troglodytes]
Length = 393
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|321472410|gb|EFX83380.1| hypothetical protein DAPPUDRAFT_210281 [Daphnia pulex]
Length = 272
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 73/165 (44%), Gaps = 26/165 (15%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
+YS + + LP ++P + + I +D D F + +L F +Q +GL S
Sbjct: 76 YYSQALFYVSVQLPDLIPSDVTRLIFVDVDTEFRDSVEKLNQQFEYFDDQQLLGLAPELS 135
Query: 249 DWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLRDISWAGFWRIIAEKFL- 298
Y L+ +N LG G+N+GV+L+ L K+R +I + L
Sbjct: 136 PVYRHILYVYRNQHKTSLLGEPKGKGFPGYNSGVVLMKLDKMRQ-------SVIYQSLLT 188
Query: 299 ------LTRLWT---SLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
LT ++ L DQD + E+P L YTLPC WN QL
Sbjct: 189 NSSLGNLTDKYSFKGHLGDQDFYTLFALEYPQLFYTLPCNWNRQL 233
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 25/166 (15%)
Query: 27 TRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP---QVEVSFYLADSVVE 83
TRSL+ L + LHFHL+TD + I + + S ++VS Y D+ V+
Sbjct: 2 TRSLIKLCS-----KNIALHFHLVTDPSSWEIAKEIIHNESKKTQINIQVSAYFIDTFVK 56
Query: 84 DV-----------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
S +YS + + LP ++P + + I +D D F + +L
Sbjct: 57 QFVTTVQVLQQYFSVKQGTYYSQALFYVSVQLPDLIPSDVTRLIFVDVDTEFRDSVEKLN 116
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPAL----GRGF 172
F +Q +GL S Y L ++N L G+GF
Sbjct: 117 QQFEYFDDQQLLGLAPELSPVYRHILYVYRNQHKTSLLGEPKGKGF 162
>gi|326432410|gb|EGD77980.1| hypothetical protein PTSG_09613 [Salpingoeca sp. ATCC 50818]
Length = 522
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 79/168 (47%), Gaps = 32/168 (19%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVTLV Q ++DR+ +VEML + W GP + + +A + LR+ ++ ++
Sbjct: 175 DVTLVTQGTLDRVHVVEMLCRAWRGPKVVVFAVREAGTEH--------PQLRTLQSTCHN 226
Query: 538 V------VYKEGNFY---------------PINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
+ + +FY PINTLRN+A++ T ++F D+DFLP
Sbjct: 227 AKIVAVPITESSDFYTSRFRASDPAAPPVIPINTLRNLAVDLAPTNFIFTCDMDFLPSAS 286
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSR 624
LY L + R + + + LVVP +E R + P PT F +
Sbjct: 287 LYDSLVT--RYLPLLAAIDRPALVVPHWELLRCGGSAPPP-VPTTFQQ 331
>gi|219115563|ref|XP_002178577.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410312|gb|EEC50242.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 349
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 96/246 (39%), Gaps = 71/246 (28%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHW-EGPISLTLYMSD--AEAQQFLSYAGNSEALRSRRNI 534
D TLV QLS +RL M+ + W + PISL + + Q+ LS G + + I
Sbjct: 42 DFTLVTQLSFNRLSMMRQHCERWGDHPISLAIGTTKNLETVQKALSQLGCKNEMITVSLI 101
Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
+E YP+N LRN+A++ + T + ++D DF+ GL+ L H H
Sbjct: 102 SDFNSEEE---YPVNRLRNLAMSHIKTSHAVIIDADFVLSVGLFEIL---------HMHR 149
Query: 595 GK------KVLVVPAFETQR---------------------------YRTAFPA------ 615
LV+PAFE ++ YR A
Sbjct: 150 ATLAADYLNALVIPAFELRKVCEEQTLNCTALHIAMLPHNKDELLKLYRGATEVYSNSSS 209
Query: 616 -----------SHAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFV 658
HA T ++ W+ P + + +EPY+V R LP + FV
Sbjct: 210 VTQFNGRGNFHGHASTRYTDWITQPAEQLLPIECVTSDRYEPYLVVRHCRALPPFQKVFV 269
Query: 659 GFGWNK 664
G+G NK
Sbjct: 270 GYGQNK 275
>gi|194754365|ref|XP_001959466.1| GF12890 [Drosophila ananassae]
gi|190620764|gb|EDV36288.1| GF12890 [Drosophila ananassae]
Length = 457
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 84/202 (41%), Gaps = 46/202 (22%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ +L + I+LD D++F +D+
Sbjct: 227 VCSSIIQDIAARLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 286
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 287 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNNEAGNQHSHI 346
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
G N+GV+LL L ++R+ + EK T + + L DQD F
Sbjct: 347 HHGYPGLNSGVVLLLLNRIRN------SKAYLEKLTHTEVHSLVAKYSFKGHLGDQDFFT 400
Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
+ E+P L+Y L C WN QL
Sbjct: 401 LLGYEYPNLIYRLDCIWNRQLC 422
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD + + ILQ + +V + S+++D+
Sbjct: 180 SLLKHTSAQLHLHIITDNESQVSVMEILQRQVRRFRRTVIYTIYDVKVCSSIIQDIAARL 239
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ +L + I+LD D++F +D+ L+ F
Sbjct: 240 SPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 299
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329
>gi|74002978|ref|XP_852253.1| PREDICTED: uncharacterized protein C3orf21 [Canis lupus familiaris]
Length = 500
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 89/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 289 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 348
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LLDL +R
Sbjct: 349 FDSFLPGAVIGIAREMQPVYRHTFWQFRHENPGTRVGGPPPEGLPGFNSGVMLLDLEAMR 408
Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
G R++ E + RL L DQD F I EHP L + L C WN QL
Sbjct: 409 RSLLYG--RLL-EPAQVQRLADKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 465
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 466 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 493
>gi|85701786|ref|NP_001028447.1| glucoside xylosyltransferase 1 [Mus musculus]
gi|74184510|dbj|BAE27879.1| unnamed protein product [Mus musculus]
Length = 372
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 164 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 220
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 221 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 279
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 280 --GDQDLLNIVFSHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 330
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 15/154 (9%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
E + +A+V G VT++KS L + PLH H+ + + + ++WS + +
Sbjct: 78 EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLASWSFLRRF 136
Query: 72 EVSFY----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
+ S Y DS + W K +L LP +L E + + +DTD++F
Sbjct: 137 DYSLYPITFPGDSAAD---W---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRP 189
Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 190 VDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 221
>gi|449668189|ref|XP_002163320.2| PREDICTED: xyloside xylosyltransferase 1-like [Hydra
magnipapillata]
Length = 342
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 79/177 (44%), Gaps = 21/177 (11%)
Query: 178 VVEDVSWIP-NKHYSGVYGLLKLTLPKVLP-ETLAKTIVLDTDVIFATDIAQLWALFSRL 235
V++D+ P +YS + L L ++ + + I++D D+ TDI L+ +F ++
Sbjct: 134 VMQDLFSSPAGSYYSKTLFFIPLALHRINSLKNVTHLIMIDVDIELRTDIRYLYNIFLQM 193
Query: 236 RQRQTIGLVENQSDWY---LGKLWKNHK------PWPALGRGFNTGVILLDLTKLRDISW 286
Q GL Q Y K HK P P GFN+GV L++LT LR
Sbjct: 194 GPEQLYGLAREQQPTYRHITSAYRKRHKKTAVGNPPPFGLTGFNSGVKLINLTSLRQSKL 253
Query: 287 AGFW-------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD 336
+ + +A+KF L DQD F + H + Y LPCQWN QL +
Sbjct: 254 YNSYLDDPVKLKHLADKF---SFRGHLGDQDFFTLLSFLHKDMFYELPCQWNRQLCE 307
>gi|241618451|ref|XP_002406997.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
gi|215500924|gb|EEC10418.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
Length = 230
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 12/143 (8%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHKPWPALGRG 268
+ +D D +F + + W +F ++ + + L D WY + K+ P P G
Sbjct: 9 LYVDADTLFLNPVEEHWNVFEKMNESHLMALGSEGEDFAHNWYHRRA-KHPYPPPF---G 64
Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPC 328
N GV+ ++LT++R W + E++ LW DQD+ N + S HP ++ L C
Sbjct: 65 ANAGVMAMNLTRMRSFDWVSRLAPLREEYAGRLLW---GDQDLINILFSFHPDKLFMLTC 121
Query: 329 QWNVQLSDNTRSDELCYTELTDL 351
+WN + DN R + C E L
Sbjct: 122 RWNYR-QDNCRFHDSCSGETPAL 143
>gi|194885984|ref|XP_001976525.1| GG22921 [Drosophila erecta]
gi|190659712|gb|EDV56925.1| GG22921 [Drosophila erecta]
Length = 458
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 46/202 (22%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRNLNRAILLDCDIVFRSDV 287
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
G N+GV+LL L ++R+ + EK T + T L DQD F
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRN------SKSYLEKLTHTEVHTLVAKYSFKGHLGDQDFFT 401
Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
+ E+P L+Y L C WN QL
Sbjct: 402 LLGYEYPNLIYRLDCIWNRQLC 423
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 65/150 (43%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD+ + L ILQ + +V + S+++D+
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ L + I+LD D++F +D+ L+ F
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRNLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330
>gi|24762499|ref|NP_611871.2| CG11388 [Drosophila melanogaster]
gi|7291716|gb|AAF47138.1| CG11388 [Drosophila melanogaster]
gi|372266220|gb|AEX91658.1| FI19481p1 [Drosophila melanogaster]
Length = 458
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ S+++D+ S PN +YS L L L ++ +L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 287
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347
Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
G N+GV+LL L ++R+ +E L ++ L DQD F + E+
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 407
Query: 320 PYLVYTLPCQWNVQLS 335
P L+Y L C WN QL
Sbjct: 408 PNLIYRLDCIWNRQLC 423
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD+ + L ILQ + +V + S+++D+
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ +L + I+LD D++F +D+ L+ F
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330
>gi|17512112|gb|AAH19036.1| Chromosome 3 open reading frame 21 [Homo sapiens]
Length = 380
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 169 FPIVEAMQKHFSAGLGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 228
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 229 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 288
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 289 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 343
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 344 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 373
>gi|194886138|ref|XP_001976557.1| GG22943 [Drosophila erecta]
gi|190659744|gb|EDV56957.1| GG22943 [Drosophila erecta]
Length = 389
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 104/277 (37%), Gaps = 85/277 (30%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y +DG+ V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 41 EQYADLADGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100
Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
A R N +H++
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRMAALPPNMKGKFDCQYPDRTLKALLKFRSLKTLQ 159
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
+++ N YP N +RN+A T YVFL DID +P P L R+ +
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCSKSCA---Y 216
Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWV----NATTP 631
V+P FE R FP S TNFS+W+ N T
Sbjct: 217 VIPTFEID-VRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNSNETEV 275
Query: 632 YQIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
+F EP+ +A + P +D RF+G+G+ +
Sbjct: 276 SVSHIVTNFEFLYEPFYIAVDNAPAHDERFLGYGFTR 312
>gi|353238557|emb|CCA70499.1| hypothetical protein PIIN_04436 [Piriformospora indica DSM 11827]
Length = 443
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 101/240 (42%), Gaps = 57/240 (23%)
Query: 473 SSDGN----DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ----------- 517
S DG D+TL+ ++ +R ++ L K ++GPIS+ L+++++
Sbjct: 111 SMDGTLAKEDITLLTIVTSNRFSVLGRLAKRYQGPISVALHVTESPRAHRDRILDELHEL 170
Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
+LS G S + + H++ NF N RN A T YV +LD+DF G
Sbjct: 171 YLSTPGMS------KWVDVHLILD--NFDRQFNMWRNAARLFARTNYVMMLDVDFAVCTG 222
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFE----------------------------TQR 608
L +S M M G VVPAFE Q
Sbjct: 223 FRQRLLASKEMMGML-RAGDAAFVVPAFEYVVAEDGGNEASFPRTKERLRRLHDAGKIQV 281
Query: 609 YRTAFPASHAPTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
+ A+ H +++SR+ A Y++ + +EPY++ RD P D RF+G+G NK
Sbjct: 282 FHQAWEGGHNSSDYSRYFQAAPGDVYKVHAYQHAYEPYVIFKRDSAPWCDERFIGYGGNK 341
>gi|112181295|ref|NP_689744.3| xyloside xylosyltransferase 1 [Homo sapiens]
gi|74751171|sp|Q8NBI6.1|XXLT1_HUMAN RecName: Full=Xyloside xylosyltransferase 1; AltName:
Full=UDP-xylose:alpha-xyloside
alpha-1,3-xylosyltransferase
gi|22761774|dbj|BAC11694.1| unnamed protein product [Homo sapiens]
gi|112180452|gb|AAH39067.2| C3orf21 protein [Homo sapiens]
gi|119598434|gb|EAW78028.1| chromosome 3 open reading frame 21, isoform CRA_a [Homo sapiens]
gi|119598435|gb|EAW78029.1| chromosome 3 open reading frame 21, isoform CRA_a [Homo sapiens]
Length = 393
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 182 FPIVEAMQKHFSAGLGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ R++ A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356
Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386
>gi|302834571|ref|XP_002948848.1| hypothetical protein VOLCADRAFT_89141 [Volvox carteri f.
nagariensis]
gi|300266039|gb|EFJ50228.1| hypothetical protein VOLCADRAFT_89141 [Volvox carteri f.
nagariensis]
Length = 547
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 47/175 (26%)
Query: 536 YHVVYKE--GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL---KSSIRSMDM 590
Y +V E PIN LRN A+ TP V ++D+D P L +L ++ +
Sbjct: 239 YELVADEVMAALMPINALRNAAILAADTPLVAMVDVDLSPSLSLSQHLLSNRTRAAELQR 298
Query: 591 HGHGGKKVLVVPAFETQRYRTAFP------------------------------------ 614
+ V V+PAF+TQR R + P
Sbjct: 299 RAEAERTVWVIPAFDTQR-RLSLPGREAVADLAVAVPHSAKGTRLVALWKKAEKIYPFAY 357
Query: 615 ----ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
A H+ T++++W + YQ+ + P +EP+ + R LP YD RF G+ WNK
Sbjct: 358 DVYRAGHSATDYNKWFRSKLDYQVSYRPGYEPWFIGARMALPAYDARFRGYSWNK 412
>gi|328778044|ref|XP_393493.3| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Apis
mellifera]
Length = 495
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 98/272 (36%), Gaps = 91/272 (33%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSRRN 533
V L Q S+++L + HW GP+S+ LY++ E Q++L Y E +R R
Sbjct: 158 VCLATQSSVEKLHSIVQAAHHWTGPMSVALYVAGDEEFEVLQKYLIYLRKCYEPIRERVA 217
Query: 534 IGYHV---------------------VYKEG-------------------NFYPINTLRN 553
V EG N YP N +RN
Sbjct: 218 FSLAVPRVRPVKRQPREFDPPEMVDCAKPEGTLNELMNGVSNEQANWRIRNAYPQNHMRN 277
Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
+A + YVFL D+D +P F L L +R+ K V+P +E R F
Sbjct: 278 LARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRA----DTCDKCAYVIPTYELDS-RVRF 332
Query: 614 PASHA------------------------PTNFSRWVNATTPYQIEWAPD---------- 639
P + TNF+RW+ T+P ++
Sbjct: 333 PQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSPNHKKYGNMKSGKVYVSHD 392
Query: 640 -------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF+G+G+ +
Sbjct: 393 VTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 424
>gi|242012129|ref|XP_002426793.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
gi|212510975|gb|EEB14055.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
Length = 451
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 83/278 (29%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY------ 521
+EY + S V + Q S+++L + + +HW GPIS+ +Y + E L +
Sbjct: 108 EEYGSLSKKYMVCMATQSSIEKLHSIAQVSEHWTGPISVAIYAAGDEEYYLLHWYVHFLK 167
Query: 522 ---------------------------------AGNSEALRSRRNIGYHVV--------- 539
N+ R +N+ ++
Sbjct: 168 NCYPWISKRVSFHLGVPKLRIPTGKYPENLKNLKINNYKCRDYKNVLKEIIKLRPVEIVK 227
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
++ N YP N +RN+A + YVFL D+D +P F L L +++ K
Sbjct: 228 WRMKNPYPQNHMRNLARKNCQSDYVFLTDVDIVPSFKLADNLNKFLKTAKKTCK-NKCAY 286
Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVN--ATTPYQ 633
V+P +E RT FP + + TNFS+W + ++ +
Sbjct: 287 VIPTYELDS-RTDFPKNKSELLRLSQKGLARAFHQKVFIYNQFATNFSKWESFVDSSEDE 345
Query: 634 IEWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
I + D +EP+ VA +P +D RF+G+G+ +
Sbjct: 346 IIISHDVTNFEFLYEPFYVAPDTVPEHDERFLGYGYTR 383
>gi|384248607|gb|EIE22091.1| hypothetical protein COCSUDRAFT_83482 [Coccomyxa subellipsoidea
C-169]
Length = 330
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 101/244 (41%), Gaps = 24/244 (9%)
Query: 146 LVENQSDWYLGKLW--KNHKPWPALGRGFYLAD--SVVEDVSWI--PNKHYSGVYGLLKL 199
LV +D + + W + PW + F D +V+E ++ I P K Y G GL ++
Sbjct: 63 LVAADADAEIVRSWLQRLRSPWQPVTYSFVTIDDGAVLERLAAIHGPLKGYMGYCGLARM 122
Query: 200 TLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH 259
+ LP + + LDTD + D+ QLW F Q IG + + L H
Sbjct: 123 FFTEFLPHE-ERVLWLDTDTVVNGDVCQLWPHFDCFNASQIIGFPGDPTIPNL------H 175
Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL---ADQDIFNAII 316
P+ G + V L+DLT++R W G + A + L + + DQD+ +
Sbjct: 176 HPF-----GLCSCVALMDLTRMRAAGWGGSFMAAALEAHLKVNPSEVFPYGDQDVLRIVQ 230
Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD--LKIIHWNSPKKLKVKNKHMEFFRN 374
+P Y LP WNV + D + D + IIH+ K + + F R
Sbjct: 231 RHYPDSAYRLPSGWNV-MPFCPAYDAAMHETAVDAFVGIIHFTCTVKQHIYSSMASFGRW 289
Query: 375 LYLT 378
+L
Sbjct: 290 AWLV 293
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 12/122 (9%)
Query: 34 IKSILFYRKNPLHFHLITDT-----VALNILQTLFSTWSVPQVEVSFYLAD--SVVEDVS 86
+KS+L + PLH +++ + + LQ L S W V SF D +V+E ++
Sbjct: 47 LKSLLRNARCPLHLYILVAADADAEIVRSWLQRLRSPWQ--PVTYSFVTIDDGAVLERLA 104
Query: 87 WI--PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
I P K Y G GL ++ + LP + + LDTD + D+ QLW F Q I
Sbjct: 105 AIHGPLKGYMGYCGLARMFFTEFLPHE-ERVLWLDTDTVVNGDVCQLWPHFDCFNASQII 163
Query: 145 GL 146
G
Sbjct: 164 GF 165
>gi|156545014|ref|XP_001600015.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 1 [Nasonia
vitripennis]
gi|345479641|ref|XP_003423999.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2 [Nasonia
vitripennis]
Length = 385
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP VL +T A + +DTD +F +W F ++ Q L D G W
Sbjct: 157 RLFLPTVLNDTDA-ILYVDTDTLFLAPPEVIWDEFKKMNNVQLAALSPEHEDPNTG--WY 213
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R+ W + I +++ L W DQDI N I
Sbjct: 214 NRFAKHPYYGKLGVNSGVMLMNLTRMREFKWTQYVIPIHKEYRLKITW---GDQDIINII 270
Query: 316 ISEHPYLVYTLPCQWN 331
HP ++ C++N
Sbjct: 271 FHYHPEKLHVYDCRYN 286
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 61/148 (41%), Gaps = 7/148 (4%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
+V+ + +V G + +T++KS L + K PL F +I+D + + W +
Sbjct: 67 QVVVICVVACGADRLDESLTMLKSALVFAKRPLRFIVISDYGLIPAFHEKLTEWKQIVDK 126
Query: 72 EVSFYLADSVVEDVSWIPN-KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
F + D+S + + +L LP VL +T A + +DTD +F
Sbjct: 127 TFDFIVRPVTFPDISDVTMWRKLFKPCAAQRLFLPTVLNDTDA-ILYVDTDTLFLAPPEV 185
Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
+W F ++ Q L D WY
Sbjct: 186 IWDEFKKMNNVQLAALSPEHEDPNTGWY 213
>gi|313233727|emb|CBY09897.1| unnamed protein product [Oikopleura dioica]
Length = 505
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 113/290 (38%), Gaps = 95/290 (32%)
Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
++M I+P EY A+ DVTLV S + L + L + W GPIS+ ++ EA
Sbjct: 129 IRMFFISP----EYFANWKFEDVTLVTMCSANHLHHIISLAERWSGPISVGVFGPGLEA- 183
Query: 517 QFLSYAGNS-EALRS-----RRNIGYHVVY------------------------------ 540
S+A ++ E++++ R + +H+VY
Sbjct: 184 ---SFATDAIESIKNCFPAVREQVLFHLVYPISLPADLSKSSGKLRFENCDDLRKQIKSY 240
Query: 541 -----KEGNF-----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS-SIRSMD 589
N+ +P N LRN A T Y+FL+D+D +P L +IR
Sbjct: 241 GGDEDDRANYGGDIPFPHNVLRNAARRGAPTAYIFLIDVDVMPSLNLREDFNDFAIR--- 297
Query: 590 MHG-----HGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
HG V VVP FE ++ T P + H P
Sbjct: 298 -HGLYDKNENAMTVFVVPVFEKKKGSTC-PDNKRELAESCDKNEVRSFHEETCKWCHEPE 355
Query: 621 NFSRWVNATTPYQIE------WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+W+ ++E W +EP+ +A R +P +D RF+ +G+++
Sbjct: 356 QPKKWLQIQGQSRLEISFNSSWTKSWEPFYIASRQVPYFDERFMVYGFDR 405
>gi|149616439|ref|XP_001516223.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Ornithorhynchus anatinus]
Length = 412
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 101/267 (37%), Gaps = 79/267 (29%)
Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEAL 528
A D+ L S+ L +E L + W GP+++ ++ + + Q L+YA +
Sbjct: 85 AEGVAGDLVLATHASLGNLPHLEGLVERWGGPVAVAIFAAGGKEAQLATVLAYALGTLCG 144
Query: 529 RSRRNIGYHVVYKEGN------------------------------------------FY 546
R + H+V G+ Y
Sbjct: 145 AVRARVTLHLVCPAGHEASVPEPREPGEFARLRSCPEVFAKLAAWAGAGPNYALGANVSY 204
Query: 547 PINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET 606
P N LRN+A + + ++D+D +P GL+ L + LVVPAFE
Sbjct: 205 PNNLLRNLAREEAG--HALVVDVDMMPSEGLWAGLLELLALGAGRA---GTALVVPAFEI 259
Query: 607 QRYRT------------------AF-----PASHAPTNFSRWVNATT------PYQIEWA 637
+R R AF P APTN+SRW+N + Y++ W
Sbjct: 260 RRARRVPGTKAELLRLYQVGEVRAFYAELCPRCQAPTNYSRWLNLPSGGSLRPAYRVGWR 319
Query: 638 PDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF +G+N+
Sbjct: 320 DPWEPFYVAGGAVPTFDERFRQYGFNR 346
>gi|296487730|tpg|DAA29843.1| TPA: glucoside xylosyltransferase 1 isoform 1 [Bos taurus]
Length = 409
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +WAL + Q + + +G W
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 225
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 284
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 285 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335
Score = 45.4 bits (106), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
E + +A+V G VT++KS L + PL FH+ + + + + +WS Q
Sbjct: 83 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 141
Query: 72 EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
S Y E+ + W K +L LP +L E + + +DTD++F +
Sbjct: 142 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 197
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+WAL + Q + + +G W N
Sbjct: 198 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 226
>gi|296224892|ref|XP_002758256.1| PREDICTED: xyloside xylosyltransferase 1 [Callithrix jacchus]
Length = 393
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 302 QSLLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 358
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 359 WWRDHGYSDVFEAYFQCEGHVKIYHGNC 386
>gi|313238546|emb|CBY13597.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 100/265 (37%), Gaps = 75/265 (28%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---RS 530
S DVTL +Q S + L V L + W GPIS+ ++ D E+ F++ A S +
Sbjct: 94 SQYEDVTLASQCSANHLHHVINLLERWSGPISIAVFAPDKESS-FVTDAILSMRKCWPQI 152
Query: 531 RRNIGYHVVYKEG-------------------------NF-------------YPINTLR 552
R + +H+VY G NF YP N LR
Sbjct: 153 RDKVTFHLVYPIGHTSDVSASTGGLEFNDCNELVDIISNFGGDDAGNYKGDIPYPHNVLR 212
Query: 553 NVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFETQRYR 610
NVA+ +T Y FL+DID LP + + + V VVP FE ++
Sbjct: 213 NVAIKTSATKYTFLIDIDVLPNENIRERFNQMAIKKGYYDEDFDENVVFVVPCFEMKKGY 272
Query: 611 TA----------FPAS-------------HAPTNFSRWV--------NATTPYQIEWAPD 639
T F H+P W+ + + W +
Sbjct: 273 TRSLNKKHLLEDFQKGLIRPFHNESCWWCHSPEEHPHWMAIPLSRNDQSKISFNATWNKN 332
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ + +++P +D RF +G+++
Sbjct: 333 WEPFYIGSKNVPLFDERFKQYGFDR 357
>gi|300123041|emb|CBK24048.2| unnamed protein product [Blastocystis hominis]
Length = 290
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 11/152 (7%)
Query: 464 PNNVDEYEASSDGNDV----TLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
P + D S+D DV T +Q ++ R + + + W GP+S+ Y FL
Sbjct: 136 PTDFDPVFHSTDAPDVEYDVTFCSQGTITRWGFMLEVSRRWRGPMSIVYYFPKENESYFL 195
Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
S S LR + Y V+ + ++PINTLRN A++ V T + +L D+D P FG Y
Sbjct: 196 SLKAESN-LRPNIIVDY-VIQTDPFYFPINTLRNRAIDNVKTTHFWLTDMDVWPSFGSY- 252
Query: 580 YLKSSIRSMDMH-GHGGKKVLVVPAFETQRYR 610
+I + + K +VP F+ +R+R
Sbjct: 253 ---DAIMHLPLEFFQDPKNAGIVPVFQMRRHR 281
>gi|170067015|ref|XP_001868312.1| N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase
[Culex quinquefasciatus]
gi|167863173|gb|EDS26556.1| N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase
[Culex quinquefasciatus]
Length = 500
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 104/274 (37%), Gaps = 86/274 (31%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
S+ V L Q S++++ + + HW GPIS ++ + + Q +LSY N + +
Sbjct: 125 SERYSVCLATQSSLEKIYSLVQVSHHWSGPISAAIFAAGNDELYLLQIYLSYLRNCFKTI 184
Query: 529 RSRRNIGYHVVYKE------------GNF------------------------------- 545
R R + +H+ + G+F
Sbjct: 185 RER--VSFHLALPKERAPTHLKSIHVGDFTKFDCAKPEATLNDLIKLRKADTNKWRIKNP 242
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM--HGHGGKK-VLVVP 602
YP N LRNVA + YVFL D+D +P +L +R+ GHG ++ VVP
Sbjct: 243 YPQNHLRNVARKGCQSTYVFLTDVDIIPSVNFAEHLDKFLRTQRAPHGGHGAQQTAFVVP 302
Query: 603 AFETQRYRTAFPASHA------------------------PTNFSRW--------VNATT 630
+E R FP + TNFSRW + +
Sbjct: 303 TYELDE-RVRFPRNKTDLIRLANKGLARPFHHKVFIYNQFATNFSRWQADIFDGEMTRVS 361
Query: 631 PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ VA +P +D RF+G+G+ +
Sbjct: 362 HNVTNFEFLYEPFYVAPDTVPPHDERFLGYGYTR 395
>gi|440905211|gb|ELR55623.1| Glucoside xylosyltransferase 1, partial [Bos grunniens mutus]
Length = 369
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F I +WAL + Q + + +G W
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPIDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 185
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 244
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295
Score = 46.6 bits (109), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
E + +A+V G VT++KS L + PL FH+ + + + + +WS Q
Sbjct: 43 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 101
Query: 72 EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
S Y E+ + W K +L LP +L E + + +DTD++F I
Sbjct: 102 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPIDD 157
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+WAL + Q + + +G W N
Sbjct: 158 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 186
>gi|403270494|ref|XP_003927212.1| PREDICTED: xyloside xylosyltransferase 1, partial [Saimiri
boliviensis boliviensis]
Length = 328
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 117 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKDVLRIIQLDLDLKYMTNIRELFEE 176
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 177 FDSFLPGAVIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 236
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 237 QSLLYSRLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 293
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 294 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 321
>gi|224002262|ref|XP_002290803.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974225|gb|EED92555.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 575
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 480 TLVAQLSMDRLQMVEMLFKHW-EGPISLTLY---MSDAEAQQFLSYAGNSEALRSRRNIG 535
TLV QLS DRL M+ K W E PIS+ ++ +D +Q + N+E L + + +
Sbjct: 204 TLVTQLSNDRLWMIPYHCKRWGEHPISIAVFSNRRADDVKEQLVKEGCNAEYL-NVQTVS 262
Query: 536 YHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
G YP+N LRN+AL+ V T +V D+DF P L+ L M
Sbjct: 263 KTKYDPTGTEYPVNILRNLALSAVKTTHVVYADVDFWPASNLHSILTEQNVKERM-ASDN 321
Query: 596 KKVLVVPAFETQR 608
K V+PAF+ R
Sbjct: 322 KLATVIPAFQMNR 334
>gi|342321325|gb|EGU13259.1| Glycosyltransferase family 49 protein [Rhodotorula glutinis ATCC
204091]
Length = 569
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 110/254 (43%), Gaps = 76/254 (29%)
Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
+P+ +D + D+T+ ++ +R + E L + + GP+S T+++++ +A
Sbjct: 231 DPDRID-----FNKEDITITTLVTSNRFAVFERLVERYRGPMSATVHLTEGKA------- 278
Query: 523 GNSEALRS-----------RRNIGYHVVYK--EGNFYPINTLRNVALNQVSTPYVFLLDI 569
+SE LR+ ++ + H+VY E F N RNVA T YV +LD+
Sbjct: 279 -HSEMLRALETMYTSSPLMKQYVDIHLVYDPFERQF---NMWRNVAKFFARTEYVMMLDV 334
Query: 570 DFLPMFGLYPYLKSSIRSMDMHG-------HGGKKVLVVPAFETQRYRTA-----FPAS- 616
DF +L + RS + GG VVPAFE + FP++
Sbjct: 335 DF--------WLCTDFRSRMLESPEVMQRLKGGMAAFVVPAFEFHKQADGVDPNTFPSTK 386
Query: 617 ----------------------HAPTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-L 650
H TN++R+ A Y+++ + +EPY++ ++
Sbjct: 387 EGLLQLVNDDKIGMFHKSWAPGHGSTNYTRYYAAKPGEVYRVQGYTHSYEPYVIFKKEGT 446
Query: 651 PRYDTRFVGFGWNK 664
P D RF+G+G NK
Sbjct: 447 PWCDERFIGYGGNK 460
>gi|326679949|ref|XP_002667034.2| PREDICTED: glucoside xylosyltransferase 1-like, partial [Danio
rerio]
Length = 405
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + +W+ S + L + +G W
Sbjct: 218 RLFLPLILKQ-VDSVLYVDTDILFLRPVEDIWSFLSSFNRSHVAALAPEHEEPRMG--WY 274
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR-----------DISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R D+SW+ + K+ L W
Sbjct: 275 NRFARHPYYGKTGVNSGVMLINMTRIRHTHFKNDLSAMDLSWSDLLMPLLNKYKLNITW- 333
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P + PC WN + E + I+H N
Sbjct: 334 --GDQDLLNIIFHYNPESLLVFPCHWNYRPDHCIYGSNCAAAEQHGIYILHGN 384
>gi|393222913|gb|EJD08397.1| hypothetical protein FOMMEDRAFT_74587 [Fomitiporia mediterranea
MF3/22]
Length = 381
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 101/245 (41%), Gaps = 53/245 (21%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG--------- 523
S D+T+ ++ +R + + L + + GPIS+T+++S AE + S
Sbjct: 38 SHKKEDITIATLVTRNRFAVFKQLVERYRGPISVTVHVSRAELAESSSGGKGVNFLDALH 97
Query: 524 ---NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
+S L S + ++ N N RN A T YV +LD+DF+P
Sbjct: 98 ALYSSSPLMSALVDVHLILTPSRNDRQFNAWRNAARLFARTDYVMMLDVDFVPCTDFRTR 157
Query: 581 LKSSIRSMDMHG--HGGKKVLVVPAFETQRYRTA-----FPAS----------------- 616
L + ++ +H + LV+PAFE ++ FP S
Sbjct: 158 LLHTPKTSPIHELLRRDRAALVIPAFEYVEHKDGTDARTFPTSKVELIKLYNESRIGMFH 217
Query: 617 ------HAPTNFSRWVNATTPYQI------EWAPDFEPYIVAHRD-----LPRYDTRFVG 659
H T++ R++ + P ++ + +EPY++ RD +P D RFVG
Sbjct: 218 ATWSPGHNSTDYERFLYHSQPGEVYKVSPAHYQHAYEPYVIFRRDGGERTIPWCDERFVG 277
Query: 660 FGWNK 664
+G NK
Sbjct: 278 YGGNK 282
>gi|383851201|ref|XP_003701127.1| PREDICTED: xyloside xylosyltransferase 1-like [Megachile rotundata]
Length = 356
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 19/165 (11%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + D D F DI +L+ F+ GL
Sbjct: 158 PGTYYSDALFFLSLGLHRIAPPEQNVAAMFDADTKFRKDIKELFEEFNSFGNEALFGLAP 217
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDIS-WAGFWR--- 291
+ Y + L++N P G G+N+GV+L +L +LR+ S + R
Sbjct: 218 ELTPVYRHVLYLYRNKNPNTLFGEPASSGGYPGYNSGVVLFNLNRLRNSSIYHEIVRNES 277
Query: 292 --IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
I EK+ L DQD + + E P L++T+ C WN QL
Sbjct: 278 VNSITEKY---HFKGHLGDQDFYTLLGMERPELIHTIDCGWNKQL 319
>gi|219115037|ref|XP_002178314.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410049|gb|EEC49979.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 402
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 100/241 (41%), Gaps = 61/241 (25%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSD--AEAQQFLSYAGNSEALRSRRNI 534
D TLV QLS RL ++E + W PISL + ++ ++ LS G + L + +
Sbjct: 95 DFTLVTQLSPSRLAIMEQHCERWGNHPISLAIGTTEDIENIEKTLSEFGCQKNLVT---V 151
Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
+ + YP+N LRN+A+++V T + F++D DF+ LY L+ R+M H
Sbjct: 152 SFVSDFNSEGEYPVNRLRNLAMSRVQTSHAFVIDADFVLSTNLYQMLRLH-RAMLATDH- 209
Query: 595 GKKVLVVPAFETQ---------------------------------------------RY 609
LVVPAFE Q R
Sbjct: 210 -LHTLVVPAFELQAVCNEQNENCTAKHLAMLPDTKNELVKQYWTAKDHVGPNPSVTQFRE 268
Query: 610 RTAFPASHAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGWN 663
+ AF A HA T + W+ P + + +EPY+V + L + FVG+G N
Sbjct: 269 QAAFHA-HASTRYEDWMTQPAEKLLPIECVTSDSYEPYLVVRPCQFLAPFQEAFVGYGQN 327
Query: 664 K 664
K
Sbjct: 328 K 328
>gi|300798509|ref|NP_001180096.1| glucoside xylosyltransferase 1 [Bos taurus]
gi|296487731|tpg|DAA29844.1| TPA: glucoside xylosyltransferase 1 isoform 2 [Bos taurus]
Length = 440
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +WAL + Q + + +G W
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366
Score = 45.8 bits (107), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
E + +A+V G VT++KS L + PL FH+ + + + + +WS Q
Sbjct: 114 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 172
Query: 72 EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
S Y E+ + W K +L LP +L E + + +DTD++F +
Sbjct: 173 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 228
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+WAL + Q + + +G W N
Sbjct: 229 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 257
>gi|307189145|gb|EFN73593.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Camponotus floridanus]
Length = 735
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/272 (23%), Positives = 102/272 (37%), Gaps = 91/272 (33%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
V L Q S+++L + + HW GP+S+ LY + E Q++L Y E++R R
Sbjct: 154 VCLATQTSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRKCYESIRERVI 213
Query: 532 ---------------------------------RNIGYHVVYKEGNF-----YPINTLRN 553
H+ ++ N+ YP N +RN
Sbjct: 214 FSLAVPRMRPPKKQPRVFELPDIIDCAKPEATLNEFASHISSEQTNWRIRNVYPQNHMRN 273
Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
+A T YVFL D+D +P F L L +R+ + K V+P +E R F
Sbjct: 274 LARKNCQTDYVFLTDVDIIPSFNLSVALDEFLRNDNC----DKCAYVIPTYELD-TRMRF 328
Query: 614 PASHA------------------------PTNFSRWVNATTPY----------QIEWAPD 639
P + TNF+RW+ +P ++ + D
Sbjct: 329 PQNKTELIRLARKGLARPFHQKVFIHNQFATNFTRWLQDVSPTHKYHENVRTGKVYVSHD 388
Query: 640 -------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA P +D RF+G+G+ +
Sbjct: 389 VTNFEFLYEPFYVAKDIAPSHDERFMGYGYTR 420
>gi|426343379|ref|XP_004038286.1| PREDICTED: xyloside xylosyltransferase 1 [Gorilla gorilla gorilla]
Length = 247
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 87/208 (41%), Gaps = 24/208 (11%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ F T+I +L+
Sbjct: 36 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 95
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL L +R
Sbjct: 96 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLSLEAMR 155
Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + +A+K+ L DQD F I EHP L + L C WN QL
Sbjct: 156 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 212
Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 213 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 240
>gi|198458069|ref|XP_001360902.2| GA16964 [Drosophila pseudoobscura pseudoobscura]
gi|198136211|gb|EAL25477.2| GA16964 [Drosophila pseudoobscura pseudoobscura]
Length = 506
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 107/278 (38%), Gaps = 87/278 (31%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSY-- 521
++Y SS+ + V L Q S++RL + + +W+GP+S+ ++ + E Q F++Y
Sbjct: 46 EQYTESSESSLVCLATQTSVERLNSLPQVATNWQGPMSVAVFSASHEEFVVLQYFVTYMR 105
Query: 522 --------------------------AGNSEALRSRRNIGY---------------HVVY 540
S LR + + Y + +
Sbjct: 106 LCFPNIRENATFHLLTPPDYDKLPRLVSISFNLRGKFDCQYPDRTLKALLKLRSLKSLQW 165
Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLV 600
++ N YP N +RN+A T Y FL DID +P P L +++ + + V
Sbjct: 166 RQRNPYPQNHMRNLARKGCQTKYTFLTDIDIVPSTNSVPQLNQFLKTANCSKNCA---YV 222
Query: 601 VPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP----- 631
+P FE R +FP S TNFS+W+++ T
Sbjct: 223 IPTFEID-VRASFPRSKYGLQRLMKKGLARPFHEKVFIYNQYATNFSKWLSSNTNDTEVS 281
Query: 632 -----YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
E+ +EP+ +A P +D RF G+G+ +
Sbjct: 282 VSHVVTNFEFL--YEPFYIAIDSAPAHDERFTGYGFTR 317
>gi|302830866|ref|XP_002946999.1| hypothetical protein VOLCADRAFT_103232 [Volvox carteri f.
nagariensis]
gi|300268043|gb|EFJ52225.1| hypothetical protein VOLCADRAFT_103232 [Volvox carteri f.
nagariensis]
Length = 276
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 29/149 (19%)
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL-YPYLKSSIRSMDMHGHGGKKVLVVPA 603
YP+NTLRN A T + +D+D LP + Y I + G + V PA
Sbjct: 48 LYPLNTLRNYARLMADTDLITNVDVDLLPSMSISYALADPGILANYTEGCRSGRAFVWPA 107
Query: 604 FET-----------------------QRYRTAF----PASHAPTNFSRWVNATTPYQIEW 636
F+T Q+ T F P H TN+ RW+ + Y IE+
Sbjct: 108 FDTHCNGTEYADFIAMDGRPALNEGLQKCVTMFSYKNPVFHNATNYQRWLTSRETYDIEY 167
Query: 637 APDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
++EP+ ++ R D P +D R+ G+G+NK
Sbjct: 168 NLEYEPWYLSWRWDTPWFDFRYRGYGYNK 196
>gi|403269346|ref|XP_003926708.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 406
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ + Q + + +G W
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 222
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 281
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 282 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 332
>gi|328778506|ref|XP_001120832.2| PREDICTED: uncharacterized protein C3orf21 homolog [Apis mellifera]
Length = 357
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 75/164 (45%), Gaps = 17/164 (10%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + D D F TDI +L+ F+ + GL
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPIEQNVAAMFDADTKFRTDIKELFEEFNTFGNQALFGLSP 218
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
+ Y + L++N P G G+N+G++L +L KLR+ + + +I+ +
Sbjct: 219 ELTPVYRHVLYLYRNKHPNTLFGEPASSGGYPGYNSGMVLFNLDKLRNSTV--YDKIVKK 276
Query: 296 KFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + T L DQD + + E P L++T+ C WN QL
Sbjct: 277 ESVNTMTSKYHFKGHLGDQDFYTLLGMEKPELIHTIDCGWNRQL 320
>gi|357612376|gb|EHJ67946.1| putative GLT8D3 protein [Danaus plexippus]
Length = 263
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 87/191 (45%), Gaps = 18/191 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L +P ++ + + +DTD +F +LW +F ++ + Q + + WY
Sbjct: 11 RLFIPNLITH-IDSMMYVDTDTLFLGPADRLWEVFYKMNRTQISAMALEDDNPNISWY-- 67
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ H P G+ G N+GV+L++LT++RD W + I K+ +L+ DQDI
Sbjct: 68 PRFAKH---PFYGKYGLNSGVMLMNLTRMRDFGWVDYVTPIMLKW---KLYIPWGDQDII 121
Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
N I H VY + C +N + SD + C + ++H + + N F
Sbjct: 122 NIIFHYHENAVYVMTCNYNYR-SDQCVYGDACEPASHGVLVVHGS---RGVFHNNKQPAF 177
Query: 373 RNLYLTFLEYD 383
+ +Y EY+
Sbjct: 178 QAVYRAINEYE 188
>gi|351713218|gb|EHB16137.1| Glycosyltransferase 8 domain-containing protein 3, partial
[Heterocephalus glaber]
Length = 338
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 98 RLFLPSILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 154
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 155 NRFARHPYYGKTGVNSGVMLMNMTRMRKKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 213
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 214 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 264
Score = 42.4 bits (98), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS L + PL FH+ + + + +WS
Sbjct: 8 IQPIEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 66
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 67 LQTFNYTIYPITFPSENAAEW---KKLFKPCASQRLFLPSILKE-VDSLLYVDTDILFLR 122
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 123 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 155
>gi|195430906|ref|XP_002063489.1| GK21937 [Drosophila willistoni]
gi|194159574|gb|EDW74475.1| GK21937 [Drosophila willistoni]
Length = 378
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 106/282 (37%), Gaps = 94/282 (33%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
D+Y SS+ + + L Q S++RL + + +W GPIS+ L+ + E Q F++Y
Sbjct: 34 DQYTESSENSLICLATQSSVERLNSLAQVSANWYGPISVALFSASNEEFLILQYFITYMR 93
Query: 524 ----NSEALRSRRNIGYHVV---------------------------------------- 539
N +A N +H++
Sbjct: 94 LCFPNIKA-----NATFHILIPRDYEKLPKVVTLPFILKGKFDCRYPEKTLKALLKLRTV 148
Query: 540 ----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
+++ N YP N +RN+A + YVFL DID +P + L + +
Sbjct: 149 KTLQWRQRNTYPQNHMRNLARKGCQSKYVFLTDIDIIPSTNIVAQLNDFFKGERCTKNCA 208
Query: 596 KKVLVVPAFETQRYRTAFPASHA------------------------PTNFSRW----VN 627
V+P FE R FP S TNF++W +N
Sbjct: 209 ---YVIPTFEID-VRAPFPRSKKSLLRLIRKGLARPFHEKVFIYNQYATNFTKWLSTNLN 264
Query: 628 ATTPYQIEWAPD-----FEPYIVAHRDLPRYDTRFVGFGWNK 664
T +I + +EP+ +A ++P +D RF G+G+ +
Sbjct: 265 ETDVTKISHSVTNFEFLYEPFYIAMDNVPAHDERFTGYGFTR 306
>gi|296211440|ref|XP_002752411.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Callithrix
jacchus]
Length = 406
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ + Q + + +G W
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 222
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 281
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 282 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 332
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E I +A+V G +T++KS + + PL FH+ + + + +WS
Sbjct: 76 IQPVEKIHLAVVACG-ERLEETMTMLKSAIIFSIRPLQFHIFAEDQLHHSFKGRLDSWSF 134
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 135 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 190
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+ + Q + + +G W N
Sbjct: 191 PVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WYN 223
>gi|354594893|ref|ZP_09012930.1| hypothetical protein CIN_16260 [Commensalibacter intestini A911]
gi|353671732|gb|EHD13434.1| hypothetical protein CIN_16260 [Commensalibacter intestini A911]
Length = 608
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 98/207 (47%), Gaps = 31/207 (14%)
Query: 183 SWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
+++P N+ Y + +L + +LP+T+ K I +D+DVI +IA+LW Q +
Sbjct: 358 NYLPLNREYISLNTYYRLVIQDILPKTVKKIIYIDSDVIVYGNIAELW---QEPLQDMCV 414
Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
G V ++ G ++ + FN G+++ D+ K++ + ++ E F R
Sbjct: 415 GAVLDE-----GGTLQSRRLSLEDNNYFNAGIMIFDIEKIKG-EFKDIFKTYFENFYKNR 468
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS---DELCY--------TELTD 350
+L DQDI N +E +V P +WNV NTR +EL Y L +
Sbjct: 469 DIITLQDQDILNITFAEKTKIV---PLRWNV----NTRMLGYNELDYKYTLKDAEAALQN 521
Query: 351 LKIIHWNSPKK-LKVKNKHMEFFRNLY 376
+ IIH+ +K K+ H FR+LY
Sbjct: 522 IGIIHYTDKRKPWKITCNHP--FRSLY 546
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/120 (23%), Positives = 60/120 (50%), Gaps = 11/120 (9%)
Query: 20 VCAGYNSTRS------LVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEV 73
+C ++ S + +LIK+ + K ++ +++ D Q++ + V +
Sbjct: 290 ICMSFDDNYSAHGDAVITSLIKNA--HPKQQINIYILHDEKLSRTNQSILTRSENQNVRI 347
Query: 74 SFYLADSVVEDVSWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+ L D + +++P N+ Y + +L + +LP+T+ K I +D+DVI +IA+LW
Sbjct: 348 HYILIDKKL--FNYLPLNREYISLNTYYRLVIQDILPKTVKKIIYIDSDVIVYGNIAELW 405
>gi|410918973|ref|XP_003972959.1| PREDICTED: glucoside xylosyltransferase 1-like [Takifugu rubripes]
Length = 413
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 77/176 (43%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +L + + + +D+D++F + LWA S+ Q + + WY
Sbjct: 173 RLFLPLILKD-IDSIVYVDSDILFLQPMDHLWAFLSKFTPSQLAAMAPEHEEPRIAWY-- 229
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDIS-----------WAGFWRIIAEKFLLTR 301
+ H P GR G N+GV+L+++T++R +S W + +K+ L
Sbjct: 230 SRFARH---PFYGRTGVNSGVMLMNMTRMRSVSFKNDMTSVGLQWEELLMPLLQKYKLNI 286
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N + +P + PC+WN + E + + ++H N
Sbjct: 287 TW---GDQDLLNIVFHHNPERLLEFPCKWNYRPDHCIYGSNCASAEESGVYMLHGN 339
>gi|332022196|gb|EGI62513.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Acromyrmex echinatior]
Length = 486
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 62/272 (22%), Positives = 99/272 (36%), Gaps = 91/272 (33%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
V L Q S+++L + + HW GP+S+ LY + E Q++L Y E++R R
Sbjct: 126 VCLATQSSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRRCYESIRERVI 185
Query: 532 --------------------------------------RNIGYHVVYKEGNFYPINTLRN 553
R ++ N YP N +RN
Sbjct: 186 FSLAVPKMRTPKKQPRVFELPDRVDCAKPEATLNEFMSRVPNEQTNWRIRNVYPQNHMRN 245
Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
+A T YVFL D+D +P F L L +R+ + K V+P +E R F
Sbjct: 246 LARKNCQTDYVFLTDVDIVPSFNLTVVLDEFLRN----DNCDKCAYVIPTYELD-TRVRF 300
Query: 614 PASHA------------------------PTNFSRWVNATTPYQIE-------------- 635
P + TNF+RW+ +P +
Sbjct: 301 PPNKTELIRLATKGLARPFHQKVFIHNQFATNFTRWLQDVSPNHVHHENVKTGKVYISHD 360
Query: 636 ---WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ VA +P +D RF+G+G+ +
Sbjct: 361 VTNFEFLYEPFYVAKDIVPPHDERFMGYGYTR 392
>gi|426226725|ref|XP_004007489.1| PREDICTED: glucoside xylosyltransferase 1 [Ovis aries]
Length = 486
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 246 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 302
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 303 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 361
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 362 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 412
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
E + +A+V G VT++KS L + PL FH+ + + + + +WS Q
Sbjct: 160 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 218
Query: 72 EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
S Y E+ + W K +L LP +L E + + +DTD++F +
Sbjct: 219 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 274
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+W+L + Q + + +G W N
Sbjct: 275 IWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 303
>gi|48686592|gb|AAT46050.1| hypothetical protein [Rattus norvegicus]
Length = 319
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W+L R Q + + +G W
Sbjct: 79 RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 135
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 136 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 194
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 195 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 245
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 55/131 (41%), Gaps = 4/131 (3%)
Query: 31 VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPN 90
VT++KS L + PLH H+ + + + +WS Q ++ L S +
Sbjct: 10 VTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFLQ-RFNYSLYPITFPSDSAMEW 68
Query: 91 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 150
K +L LP +L + + + +DTDV+F + +W+L R Q +
Sbjct: 69 KKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEH 127
Query: 151 SDWYLGKLWKN 161
+ +G W N
Sbjct: 128 EEPRVG--WYN 136
>gi|307200359|gb|EFN80611.1| Uncharacterized protein C3orf21-like protein [Harpegnathos
saltator]
Length = 368
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 75/164 (45%), Gaps = 17/164 (10%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + D D F D+ L+ F+ + GL
Sbjct: 170 PGTYYSDALFFLSLGLHRIAPSDQKLAAMFDADTKFHRDVKDLFEEFNNFGEEALFGLAP 229
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
+ Y + L++N P G G+N+G++L +L +LR + + +I+++
Sbjct: 230 ELTPVYRHVLYLYRNKHPNTLFGEPRHKGGYPGYNSGMVLFNLERLRKSNI--YKKIVSK 287
Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
K + +T + L DQD + + E P L+YT+ C WN QL
Sbjct: 288 KNVDAMTEKYHFKGHLGDQDFYTLLGMEKPELIYTVDCGWNRQL 331
>gi|195151255|ref|XP_002016563.1| GL11650 [Drosophila persimilis]
gi|194110410|gb|EDW32453.1| GL11650 [Drosophila persimilis]
Length = 459
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 48/204 (23%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ +++ED+ S PN +YS L L L ++ +L + I+LD D++F +D+
Sbjct: 227 VCSNIIEDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDV 286
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 287 RLLFNEFDHFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNHDMGNQHNHN 346
Query: 268 -------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDI 311
G N+GV+LL ++R+ + EK + + T L DQD
Sbjct: 347 HERHGYPGLNSGVVLLLFNRIRN------SKSYLEKLTNSEVQTLVAKYSFKGHLGDQDF 400
Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
F + E+P L+Y L C WN QL
Sbjct: 401 FTLLGYEYPNLIYRLDCVWNRQLC 424
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD + + ILQ + +V + +++ED+
Sbjct: 180 SLLKHTSAQLHLHVITDKESSPSVMEILQRQIRRFRRTVIYTMYDVKVCSNIIEDIAAKL 239
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ +L + I+LD D++F +D+ L+ F
Sbjct: 240 SPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDVRLLFNEFDHFLPH 299
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329
>gi|213688380|ref|NP_001094357.1| glucoside xylosyltransferase 1 [Rattus norvegicus]
gi|152125898|sp|Q6GX83.2|GXLT1_RAT RecName: Full=Glucoside xylosyltransferase 1; AltName:
Full=Glycosyltransferase 8 domain-containing protein 3;
AltName: Full=S33-D
Length = 435
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W+L R Q + + +G W
Sbjct: 195 RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 251
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 252 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 310
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 311 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 361
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 64/149 (42%), Gaps = 8/149 (5%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
PT E + +A+V G + VT++KS L + PLH H+ + + + +WS
Sbjct: 107 PT-EKMHLAVVACG-DRLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFL 164
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
Q ++ L S + K +L LP +L + + + +DTDV+F +
Sbjct: 165 Q-RFNYSLYPITFPSDSAMEWKKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVD 222
Query: 130 QLWALFSRLRQRQTIGLV----ENQSDWY 154
+W+L R Q + E + WY
Sbjct: 223 DIWSLLERFNSTQIAAMAPEHEEPRVGWY 251
>gi|296211438|ref|XP_002752410.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Callithrix
jacchus]
Length = 437
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ + Q + + +G W
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 253
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 313 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 363
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E I +A+V G +T++KS + + PL FH+ + + + +WS
Sbjct: 107 IQPVEKIHLAVVACG-ERLEETMTMLKSAIIFSIRPLQFHIFAEDQLHHSFKGRLDSWSF 165
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+ + Q + + +G W N
Sbjct: 222 PVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WYN 254
>gi|410964153|ref|XP_003988620.1| PREDICTED: glucoside xylosyltransferase 1 [Felis catus]
Length = 476
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 236 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 292
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 293 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTAVRLRWGDILMPLLKKYKLNITW- 351
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 352 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 402
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
+ +A+V G +T++KS + + PL FH+ + + + + +WS Q
Sbjct: 152 MHLAVVACG-ERLEETITMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILESWSFLQTFNY 210
Query: 74 SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
S Y E+ + W K +L LP +L E + + +DTD++F + +W
Sbjct: 211 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 266
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+L + Q + + +G W N
Sbjct: 267 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 293
>gi|125808799|ref|XP_001360878.1| GA10968 [Drosophila pseudoobscura pseudoobscura]
gi|54636050|gb|EAL25453.1| GA10968 [Drosophila pseudoobscura pseudoobscura]
Length = 459
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 48/204 (23%)
Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
+ +++ED+ S PN +YS L L L ++ +L + I+LD D++F +D+
Sbjct: 227 VCSNIIEDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDV 286
Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
L+ F Q GL + Y L++ +P + G
Sbjct: 287 RLLFNEFDHFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNHEMGNQHNHN 346
Query: 268 -------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDI 311
G N+GV+LL ++R+ + EK + + T L DQD
Sbjct: 347 HERHGYPGLNSGVVLLLFNRIRNS------KSYLEKLTNSEVQTLVAKYSFKGHLGDQDF 400
Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
F + E+P L+Y L C WN QL
Sbjct: 401 FTLLGYEYPNLIYRLDCVWNRQLC 424
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)
Query: 36 SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
S+L + LH H+ITD + + ILQ + +V + +++ED+
Sbjct: 180 SLLKHTSAQLHLHVITDKESSPSVMEILQRQIRRFRRTVIYTMYDVKVCSNIIEDIAAKL 239
Query: 86 ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S PN +YS L L L ++ +L + I+LD D++F +D+ L+ F
Sbjct: 240 SPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDVRLLFNEFDHFLPH 299
Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
Q GL + Y L++ +P G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329
>gi|403269344|ref|XP_003926707.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 437
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ + Q + + +G W
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 253
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + S +P ++ PCQWN + E + I+H N
Sbjct: 313 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 363
>gi|55637891|ref|XP_522357.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Pan
troglodytes]
gi|410223168|gb|JAA08803.1| glucoside xylosyltransferase 1 [Pan troglodytes]
gi|410250938|gb|JAA13436.1| glucoside xylosyltransferase 1 [Pan troglodytes]
gi|410298480|gb|JAA27840.1| glucoside xylosyltransferase 1 [Pan troglodytes]
Length = 409
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 285 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335
>gi|300175417|emb|CBK20728.2| unnamed protein product [Blastocystis hominis]
Length = 311
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 5/130 (3%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVT +Q S++R + ++ + + W+GP+S+ Y+ E + + + LR ++ Y
Sbjct: 63 DVTFASQASINRWEFMKQILRFWKGPVSIVYYLKPDEVETYKRLKEKT-PLRDNVHLAYM 121
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
+ + YPIN LRN A++ ++T + + D+D P LY L RS + K
Sbjct: 122 IQHNNSQ-YPINKLRNKAISYITTSHYYFSDMDNWPSPNLYDELLRLPRSA---LNDDKL 177
Query: 598 VLVVPAFETQ 607
+++VPA++ +
Sbjct: 178 IIIVPAYQIK 187
>gi|403169096|ref|XP_003328632.2| hypothetical protein PGTG_10591 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167800|gb|EFP84213.2| hypothetical protein PGTG_10591 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 571
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 101/244 (41%), Gaps = 51/244 (20%)
Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
Q EINP D+++ ++ +R ++ L ++++GPIS T+++SD+ A
Sbjct: 240 QSGEINPK------------DISITTLVTSNRFKVFAKLVENYQGPISATIHVSDSPAIL 287
Query: 518 FLSYAGNSEALRSRRN----IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLP 573
E S ++ + H+V + N RNVA T YV +LD+DF
Sbjct: 288 KPLLESLDELYHSSKSMSTWVDIHLVVDSFD-RQFNMWRNVAKFFARTDYVMMLDVDFWI 346
Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-----TAFPA------------- 615
+ S +D G V+PAFE + FP+
Sbjct: 347 CTNFRQRILGSPELLDKLA-AGTSAFVIPAFEYTKQSDGVDPDGFPSDKQALMKLVSSGQ 405
Query: 616 ----------SHAPTNFSRWVNATTPYQI----EWAPDFEPYIVAHRD-LPRYDTRFVGF 660
H TN++R+ +A QI ++ +EPY+V +D P D RF+G+
Sbjct: 406 IGMFHKSWKPGHGSTNYTRFYHADQLDQIYPASGYSHSYEPYVVYKKDKTPFCDERFIGY 465
Query: 661 GWNK 664
G NK
Sbjct: 466 GANK 469
>gi|198416660|ref|XP_002125009.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 363
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 102/278 (36%), Gaps = 98/278 (35%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-----------QQF-------- 518
DV+L Q S++ L + L W+GP+S ++++ + +A Q F
Sbjct: 29 DVSLATQCSVNHLHHLVELLDRWDGPVSCSVFVPNQDASFADDAIKRLRQCFPKINRHVS 88
Query: 519 ------------LSYAGN------SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
LS G SE L N GY G +P N LRN A + V
Sbjct: 89 FHLVYPSTHIADLSLVGTWLELSCSELLVRLENFGYQNYDVAGISFPHNALRNAARSGVL 148
Query: 561 TPYVFLLDIDFLPMFGLY-PYLKSSIRS-------MDMHGHGGKKVLVVPAFETQRYRTA 612
T YV L+DID +P GL +++ + R+ MD+ + VVP FE
Sbjct: 149 TQYVLLVDIDVMPNVGLRNQFIEFANRNSLFNTSVMDLTAY------VVPVFEVLA-GLE 201
Query: 613 FPASH------------------------------APTNFSRWV---------------- 626
FP ++F RW
Sbjct: 202 FPKDKKQLLNGLKMVPFGLFITKLAGGVIKLKTLTGGSHFHRWKTLFLNVQLLQLQGNGE 261
Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
N + EW +EP+ +A RD+P +D RF +G+++
Sbjct: 262 NLNVAFIAEWDKSWEPFYIARRDVPMFDERFKQYGFDR 299
>gi|196002501|ref|XP_002111118.1| hypothetical protein TRIADDRAFT_50070 [Trichoplax adhaerens]
gi|190587069|gb|EDV27122.1| hypothetical protein TRIADDRAFT_50070 [Trichoplax adhaerens]
Length = 171
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 18/136 (13%)
Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPAL 265
+D+D+ F TDI L+ LF + IG+ Y W+ H+ P P
Sbjct: 1 MDSDLKFTTDIRNLFNLFHEFQPNNIIGIGRESQPVYRHVFWQYRHENPKTRVGDPPPHG 60
Query: 266 GRGFNTGVILLDLTKLRDISWAG------FWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
GFN+GV+LLDL ++R G +A+K+ L DQD++ + E+
Sbjct: 61 LTGFNSGVLLLDLERMRKSKLFGQVLDENLPENLAKKY---HFRGHLGDQDLYTLLHMEY 117
Query: 320 PYLVYTLPCQWNVQLS 335
+ Y LPC WN QL
Sbjct: 118 NDMFYVLPCTWNRQLC 133
>gi|153791276|ref|NP_001093120.1| glucoside xylosyltransferase 1 isoform 2 [Homo sapiens]
gi|24659591|gb|AAH39145.1| GLT8D3 protein [Homo sapiens]
gi|119578236|gb|EAW57832.1| hCG39367, isoform CRA_b [Homo sapiens]
gi|193785001|dbj|BAG54154.1| unnamed protein product [Homo sapiens]
Length = 409
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 285 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 335
Score = 38.9 bits (89), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G ++ T++KS + + PL FH+ + + + WS
Sbjct: 79 IQPVEKMHLAVVACGERLEETM-TMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 137
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 138 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 193
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 194 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 226
>gi|157113908|ref|XP_001657920.1| n-acetyllactosaminide beta-1,3-n-acetylglucosaminyltransferase
[Aedes aegypti]
gi|108877534|gb|EAT41759.1| AAEL006638-PA [Aedes aegypti]
Length = 482
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 62/277 (22%), Positives = 103/277 (37%), Gaps = 85/277 (30%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y S+ V L Q S++++ + + HW GPIS ++ + + Q +LSY
Sbjct: 118 EKYPELSERFSVCLATQSSLEKIYSLVQVSHHWSGPISTAIFAAGNDELYLLQIYLSYLR 177
Query: 524 NS-EALRSRRNIGYHVVYKE------------GNF------------------------- 545
N + +R R + +H+ + G+F
Sbjct: 178 NCFKTIRER--VSFHLALPKDRAPTHLKSIHVGDFTKFDCAKPEATLNDLIKLRKVETNK 235
Query: 546 ------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
YP N LRNVA + +VFL D+D +P +L +R+ H
Sbjct: 236 WRIKNPYPQNHLRNVARKGCQSTHVFLTDVDIIPSVNFAEHLDKFLRNQRPQHH--PTAY 293
Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
VVP +E R FP + TNFSRW + ++
Sbjct: 294 VVPTYELDE-RVRFPRNKTDLIRLANKGLARPFHHKVFIYNQFATNFSRWQADVSDSEVT 352
Query: 636 WAPD--------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF+G+G+ +
Sbjct: 353 HVSHNVTNFEFLYEPFYVAPDTVPPHDERFLGYGYTR 389
>gi|260822427|ref|XP_002606603.1| hypothetical protein BRAFLDRAFT_209578 [Branchiostoma floridae]
gi|229291947|gb|EEN62613.1| hypothetical protein BRAFLDRAFT_209578 [Branchiostoma floridae]
Length = 381
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 97/270 (35%), Gaps = 88/270 (32%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS-----RR 532
D TLV Q S+ L + L W+GP+S+ ++ A G LR R+
Sbjct: 30 DATLVTQCSVQNLYHIVELSVRWDGPVSVAIFAPGENAAFADDIVG---GLRRCYPLIRK 86
Query: 533 NIGYHVVY--------------------------------------KEGNFYPINTLRNV 554
+ +H+VY E YP N LRNV
Sbjct: 87 YVSFHLVYPTSREAIIYEESPTLLNTPCQDILHLAKYHKSATLNYGAEKEPYPNNLLRNV 146
Query: 555 ALNQVSTPYVFLLDIDFLP---------MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
A ST ++F++DID LP F L L S S+ + H V VVPAFE
Sbjct: 147 ARQGSSTEFIFVVDIDMLPSEKLRTDFVHFALENRLFDSRTSL-VSDH-LDTVYVVPAFE 204
Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNATT--------PYQI 634
Q YR + T++ +W A + Y +
Sbjct: 205 LQLNASIPKDKTELLEALQQGQARPFYRDICWKCQSSTDYDKWTAAKSSSKRRLDVAYNV 264
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W +EP+ + +P YD RF +G+N+
Sbjct: 265 TWKDPWEPFYIGRWSVPLYDERFKQYGFNR 294
>gi|395841533|ref|XP_003793589.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Otolemur
garnettii]
Length = 406
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 222
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTSVRLRWGDILMPLLKKYKLNITW- 281
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 282 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCHEAEEEGIFILHGN 332
>gi|426372204|ref|XP_004053018.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Gorilla
gorilla gorilla]
Length = 409
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 169 RLFLPLILKE-IDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 285 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335
Score = 39.3 bits (90), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + WS
Sbjct: 79 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 137
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 138 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-IDSLLYVDTDILFLR 193
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 194 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 226
>gi|350411357|ref|XP_003489320.1| PREDICTED: uncharacterized protein C3orf21 homolog [Bombus
impatiens]
Length = 357
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 17/164 (10%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + + D D F DI +L+ F+ + GL
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPAEQSVAAMFDVDTKFRKDIKELFEEFNSFGSQALFGLAP 218
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
+ Y + L+++ P G G+N+GV+L +L KLR+ S + I+
Sbjct: 219 ELTPVYRHVLYLYRSKHPNTLFGEPASSGGYCGYNSGVVLFNLDKLRNSSV--YDAIVKR 276
Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + +T+ + L DQD + + E P L++T+ C WN QL
Sbjct: 277 ESVDAMTKKYHFKGHLGDQDFYTILGMEKPELIHTIDCGWNRQL 320
>gi|301772784|ref|XP_002921812.1| PREDICTED: glucoside xylosyltransferase 1-like [Ailuropoda
melanoleuca]
Length = 458
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 78/173 (45%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 218 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 274
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR------DIS-----WAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R D++ W + +K+ L W
Sbjct: 275 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLLWGDILMPLLKKYKLNITW- 333
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 334 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEAEGIFILHGN 384
Score = 42.0 bits (97), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
+ +A+V G VT++KS + + PL FH+ + + + + +WS Q
Sbjct: 134 MHLAVVACG-ERLEETVTMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 192
Query: 74 SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
S Y E+ + W K +L LP +L E + + +DTD++F + +W
Sbjct: 193 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 248
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+L + Q + + +G W N
Sbjct: 249 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 275
>gi|42415433|gb|AAS15686.1| GH27387p [Drosophila melanogaster]
Length = 186
Score = 62.4 bits (150), Expect = 9e-07, Method: Composition-based stats.
Identities = 33/96 (34%), Positives = 55/96 (57%), Gaps = 5/96 (5%)
Query: 263 PALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
P GR G N+GV+L++LT++R++ W I +++ L +W DQDI N + HP
Sbjct: 23 PFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINILFYYHPD 79
Query: 322 LVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+Y +PC++N + D+ +C T +K+IH N
Sbjct: 80 KLYIMPCEYNYR-PDHCMYMSICNMSQTGVKVIHGN 114
>gi|156367099|ref|XP_001627257.1| predicted protein [Nematostella vectensis]
gi|156214161|gb|EDO35157.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 98/265 (36%), Gaps = 83/265 (31%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLT-----LYMSDAEAQQFLSYAGNSEALRSR 531
+DV LV +++ L + + +HW+GPIS+ L++ A + N + +
Sbjct: 3 DDVVLVTHCTVNHLHYLLDIVEHWQGPISVAVVVPGLHIVSAYKSMIGLHLCNKDV---Q 59
Query: 532 RNIGYHVVY--------------------------------------KEGNF-----YPI 548
+ + +HVVY N+ YP
Sbjct: 60 QRVTFHVVYPLSHPALDNRLSNYVRLARAELPLTCEEFMVTLQTGLANSANYGGEVPYPS 119
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
N LRNV +VF++DID + LY + R ++ V VVPAFE+Q
Sbjct: 120 NVLRNVGREHSKAKFVFVVDIDMMCNGNLYKDFLNFSRENNLF---SMTVYVVPAFESQE 176
Query: 609 -----------------------YRTAFPASHAPTNFSRWVNATT------PYQIEWAPD 639
Y A +PTN+ +W + + + W
Sbjct: 177 NLPPNTNKMELIKMWENGKVLPFYYQACRKCQSPTNYDKWKRSRSSSGLHVSFTRSWTDP 236
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ ++ +P YD RF +G+N+
Sbjct: 237 WEPFYISLNSVPLYDGRFKQYGFNR 261
>gi|397511420|ref|XP_003826072.1| PREDICTED: glucoside xylosyltransferase 1 [Pan paniscus]
Length = 394
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 154 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 210
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 211 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 269
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 270 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 320
Score = 39.3 bits (90), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + WS
Sbjct: 64 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 122
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 123 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 178
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 179 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 211
>gi|380796193|gb|AFE69972.1| glucoside xylosyltransferase 1 isoform 1, partial [Macaca mulatta]
Length = 375
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 135 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 191
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 192 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 250
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 251 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 301
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 33/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + +W+
Sbjct: 45 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWAF 103
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 104 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 159
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 160 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 192
>gi|340729673|ref|XP_003403121.1| PREDICTED: uncharacterized protein C3orf21 homolog [Bombus
terrestris]
Length = 357
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 17/164 (10%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + + D D F DI +L+ F+ + GL
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPAEQSVAAMFDVDTKFRKDIKELFEEFNTFGNQALFGLAP 218
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
+ Y + L+++ P G G+N+GV+L +L KLR+ S + I+
Sbjct: 219 ELTPVYRHVLYLYRSKHPNTLFGEPASSGGYHGYNSGVVLFNLDKLRNSSV--YDAIVKR 276
Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + +T+ + L DQD + + E P L++T+ C WN QL
Sbjct: 277 ESVDAMTKKYHFKGHLGDQDFYTILGMEKPELIHTIDCGWNRQL 320
>gi|187469806|gb|AAI67068.1| Glt8d3 protein [Rattus norvegicus]
Length = 359
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W+L R Q + + +G W
Sbjct: 119 RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 175
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 176 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 234
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 235 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 285
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 62/146 (42%), Gaps = 7/146 (4%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
E + +A+V G + VT++KS L + PLH H+ + + + +WS Q
Sbjct: 33 EKMHLAVVACG-DRLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFLQ-R 90
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
++ L S + K +L LP +L + + + +DTDV+F + +W
Sbjct: 91 FNYSLYPITFPSDSAMEWKKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIW 149
Query: 133 ALFSRLRQRQTIGLV----ENQSDWY 154
+L R Q + E + WY
Sbjct: 150 SLLERFNSTQIAAMAPEHEEPRVGWY 175
>gi|302564381|ref|NP_001180786.1| glucoside xylosyltransferase 1 [Macaca mulatta]
Length = 440
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + +WS
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWSF 168
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 224
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257
>gi|395541130|ref|XP_003772500.1| PREDICTED: glucoside xylosyltransferase 1-like [Sarcophilus
harrisii]
Length = 410
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 170 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 226
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 227 NRFARHPYYGKTGINSGVMLMNMTRMRRKYFKNDMTTARLQWGEILMPLLKKYKLNITW- 285
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 286 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 336
Score = 39.3 bits (90), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 9/153 (5%)
Query: 11 TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
+ + + +A+V G +T+IKS + + PL FH+ + + + + W Q
Sbjct: 82 SSDTMHLAVVACG-ERLEETITMIKSAIIFSIQPLQFHIFAEDQLHHSFKNILDGWKFLQ 140
Query: 71 V--EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
+ + + W K +L LP +L E + + +DTD++F +
Sbjct: 141 TFNYIIYPITFPTENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPV 196
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+W+L + Q + + +G W N
Sbjct: 197 DDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 227
>gi|384489769|gb|EIE80991.1| hypothetical protein RO3G_05696 [Rhizopus delemar RA 99-880]
Length = 392
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 99/226 (43%), Gaps = 47/226 (20%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY-MSDAEAQQFLS-----YAGNSEALRSR 531
D+TL ++ DR ++ L +++GPIS ++ M + E + + Y N + +
Sbjct: 44 DITLATLVTRDRFHVLSRLATNYQGPISAAIHVMDNKEKTKTIGELNRIYKSNPDM---Q 100
Query: 532 RNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI---RSM 588
+ H++ + + N RN A T Y+ +LD+D F L + SI +
Sbjct: 101 AFVDVHLIVDKYD-RQFNMWRNTAKLFTRTDYLMMLDVD----FHLCTDFRKSIWENAEL 155
Query: 589 DMHGHGGKKVLVVPAFE--------------TQR--------------YRTAFPASHAPT 620
GK VVPAFE T++ + +A+ + H T
Sbjct: 156 AAKLKSGKTAFVVPAFEYLDQKDGLDWRTFPTEKKKVLKEVEEAKLDSFHSAWVSGHGAT 215
Query: 621 NFSRWVNATTPYQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
N++RW + Y + E+ +EPY++ ++ P D RF+G+G NK
Sbjct: 216 NYTRWYSTNQAYPVTEYEYSYEPYVIYKKEGTPWCDERFIGYGANK 261
>gi|167525928|ref|XP_001747298.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774133|gb|EDQ87765.1| predicted protein [Monosiga brevicollis MX1]
Length = 434
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 64/150 (42%), Gaps = 33/150 (22%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP N LRN+A +Q +T + +LD+D +P + ++ ++ + HG + VVPAFE
Sbjct: 222 YPNNLLRNLARDQATTLFTMVLDVDIVPSHNIAKAVEQAVAA--HHGSYTRLAFVVPAFE 279
Query: 606 -TQRYRTAFPA------------------------SHAPTNFSRWVNATT-----PYQIE 635
T PA TN+ +W Y++
Sbjct: 280 LDTSTATTMPAHRDELLQMLEAGTARPFYQEVCWKCQRHTNYEKWQQGPMQITELEYEVS 339
Query: 636 WAPDFEP-YIVAHRDLPRYDTRFVGFGWNK 664
W +EP YI R PRYD RF +G+N+
Sbjct: 340 WHDPWEPFYIARERSCPRYDVRFRQYGFNR 369
>gi|195489522|ref|XP_002092774.1| GE14380 [Drosophila yakuba]
gi|194178875|gb|EDW92486.1| GE14380 [Drosophila yakuba]
Length = 389
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 87/278 (31%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
++Y +++ + V L Q S++RL + + +W+G +S+ L+ + E Q F++Y
Sbjct: 41 EQYADTTESSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100
Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
A R N +H++
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRMSALPPNMKGKFDCQYPDRTLKALLKFRSLKTLQ 159
Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
+++ N YP N +RN+A T YVFL DID +P P L R+ +
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCTKSCA---Y 216
Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
V+P FE R FP S TNFS+W++ T + E
Sbjct: 217 VIPTFEID-VRATFPRSKTALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNTN-ETE 274
Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +EP+ ++ + P +D RF+G+G+ +
Sbjct: 275 VSVSHIVTNFEFLYEPFYISVDNAPAHDERFLGYGFTR 312
>gi|348580283|ref|XP_003475908.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2 [Cavia
porcellus]
Length = 406
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 222
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 281
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 282 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 332
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS L + PL FH+ + + + +WS
Sbjct: 76 IQPVEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 134
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 135 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 190
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 191 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 223
>gi|332206948|ref|XP_003252557.1| PREDICTED: glucoside xylosyltransferase 1, partial [Nomascus
leucogenys]
Length = 417
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F I +W+L + Q + + +G W
Sbjct: 177 RLFLPLILKE-VDSLLYVDTDILFLRPIDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 233
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 234 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 292
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 293 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 343
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + WS
Sbjct: 87 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDDWSF 145
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 146 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 201
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
I +W+L + Q + + +G W N
Sbjct: 202 PIDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 234
>gi|432114562|gb|ELK36410.1| Glucoside xylosyltransferase 1 [Myotis davidii]
Length = 392
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 152 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 208
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 209 NRFARHPYYGKTGVNSGVMLMNMTRIRGKYFKNDMTTVRMRWGDILMPLLKKYKLNITW- 267
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 268 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 318
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 37/157 (23%), Positives = 66/157 (42%), Gaps = 9/157 (5%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P + E + +A+V G +T++KS + + PL FH+ + + + + TW
Sbjct: 60 PKVQPVERMHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDSFKGILDTW 118
Query: 67 SVPQV-EVSFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
S Q S Y E+ V W K +L LP +L E + + +DTD++F
Sbjct: 119 SFLQTFNYSLYPISFPSENAVEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILF 174
Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 175 LRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 209
>gi|281341850|gb|EFB17434.1| hypothetical protein PANDA_010726 [Ailuropoda melanoleuca]
Length = 369
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 78/173 (45%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR------DIS-----WAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R D++ W + +K+ L W
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLLWGDILMPLLKKYKLNITW- 244
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEAEGIFILHGN 295
Score = 42.0 bits (97), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
+ +A+V G VT++KS + + PL FH+ + + + + +WS Q
Sbjct: 45 MHLAVVACG-ERLEETVTMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 103
Query: 74 SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
S Y E+ + W K +L LP +L E + + +DTD++F + +W
Sbjct: 104 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 159
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+L + Q + + +G W N
Sbjct: 160 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 186
>gi|71051513|gb|AAH30023.1| Glycosyltransferase 8 domain containing 3 [Homo sapiens]
Length = 440
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 200 RLFLPLILKE-VNSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 366
Score = 38.9 bits (89), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + WS
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 168
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VNSLLYVDTDILFLR 224
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257
>gi|114645289|ref|XP_001167787.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Pan
troglodytes]
gi|410223170|gb|JAA08804.1| glucoside xylosyltransferase 1 [Pan troglodytes]
gi|410298482|gb|JAA27841.1| glucoside xylosyltransferase 1 [Pan troglodytes]
Length = 440
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366
>gi|355564140|gb|EHH20640.1| Glucoside xylosyltransferase 1, partial [Macaca mulatta]
gi|355786013|gb|EHH66196.1| Glucoside xylosyltransferase 1, partial [Macaca fascicularis]
Length = 369
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 244
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 245 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + +WS
Sbjct: 39 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWSF 97
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 98 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 153
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 154 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 186
>gi|73997151|ref|XP_543730.2| PREDICTED: glucoside xylosyltransferase 1 [Canis lupus familiaris]
Length = 410
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 170 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 226
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 227 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDMLMPLLKKYKLNITW- 285
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 286 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 336
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
+ +A+V G +T++KS + + PL FH+ + + + + +WS Q
Sbjct: 86 MHLAVVACG-ERLEETITMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 144
Query: 74 SFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
S Y E+ + W K +L LP +L E + + +DTD++F + +W
Sbjct: 145 SLYPITFPSENALEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 200
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+L + Q + + +G W N
Sbjct: 201 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 227
>gi|431899796|gb|ELK07743.1| Glycosyltransferase 8 domain-containing protein 4 [Pteropus alecto]
Length = 317
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 70 RLFLPMILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W ++ +K+ W
Sbjct: 128 RFAQHPFYGSSGINSGVMLMNLTRIRSAQFKNSVIPTGLAWEDMLYLLYQKYKSAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + EL + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAELEGVSVLHGN 236
>gi|432920667|ref|XP_004079976.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Oryzias
latipes]
Length = 431
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 63/150 (42%), Gaps = 34/150 (22%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP N LRN+A + + Y+ ++DID +P L+ + + +V V+PAFE
Sbjct: 219 YPNNLLRNIARSGTESSYILVIDIDMMPSADLHQQFLNLVMKRQP---TSDEVFVLPAFE 275
Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNA--------TTPYQI 634
+ Y P APTN+S+WVN Y +
Sbjct: 276 IRHARKIPANKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNNHVRGIGTLEVAYTL 335
Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W +EP+ + R +P Y+ F +G+N+
Sbjct: 336 TWVDPWEPFYIGPRTVPLYNENFKQYGFNR 365
>gi|153791362|ref|NP_775872.1| glucoside xylosyltransferase 1 isoform 1 [Homo sapiens]
gi|269849602|sp|Q4G148.2|GXLT1_HUMAN RecName: Full=Glucoside xylosyltransferase 1; AltName:
Full=Glycosyltransferase 8 domain-containing protein 3
gi|119578237|gb|EAW57833.1| hCG39367, isoform CRA_c [Homo sapiens]
Length = 440
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 366
>gi|313213019|emb|CBY36900.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/265 (23%), Positives = 100/265 (37%), Gaps = 75/265 (28%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---RS 530
S DVTL +Q S + L V L + W GP+S+ ++ D ++ F++ A S +
Sbjct: 94 SQYEDVTLASQCSANHLHHVINLLERWSGPMSIAVFAPDKDSS-FVTDAILSMRKCWPQI 152
Query: 531 RRNIGYHVVYKEG-------------------------NF-------------YPINTLR 552
R + +H+VY G NF YP N LR
Sbjct: 153 RDKVTFHLVYPIGHTSDVSASTGGLEFNDCNELVDIISNFGGDDAGNYKGDIPYPHNVLR 212
Query: 553 NVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFETQRYR 610
NVA+ +T Y FL+DID LP + + + V VVP FE ++
Sbjct: 213 NVAIKTSATKYTFLIDIDVLPNENIRERFNQMAIKKGYYDEDFDENVVFVVPCFEMKKGY 272
Query: 611 TA----------FPAS-------------HAPTNFSRWV--------NATTPYQIEWAPD 639
T F H+P W+ + + W +
Sbjct: 273 TRSLNKKHLLEDFQKGLIRPFHNESCWWCHSPEEHPHWMAIPLSRNDQSKISFNATWNKN 332
Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ + +++P +D RF +G+++
Sbjct: 333 WEPFYIGSKNVPLFDERFKQYGFDR 357
>gi|149017615|gb|EDL76619.1| similar to CG9996-PA, isoform CRA_b [Rattus norvegicus]
Length = 362
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 68/158 (43%), Gaps = 18/158 (11%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFN 270
+ +DTDV+F + +W+L R Q + + +G W N P GR G N
Sbjct: 136 LYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WYNRFARHPYYGRTGVN 193
Query: 271 TGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
+GV+L+++T++R + W + +K+ L W DQD+ N + +
Sbjct: 194 SGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GDQDLLNIMFYHN 250
Query: 320 PYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
P ++ PCQWN + E + I+H N
Sbjct: 251 PESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 288
>gi|426372202|ref|XP_004053017.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Gorilla
gorilla gorilla]
Length = 440
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 200 RLFLPLILKE-IDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 316 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366
Score = 38.9 bits (89), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + WS
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 168
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-IDSLLYVDTDILFLR 224
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257
>gi|431901432|gb|ELK08458.1| Glycosyltransferase 8 domain-containing protein 3, partial
[Pteropus alecto]
Length = 369
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTTARLRWGDILMPLLKKYKLNITW- 244
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/153 (20%), Positives = 63/153 (41%), Gaps = 5/153 (3%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I + I +A+V G +T++KS + + PL FH+ + + + + WS
Sbjct: 39 IQPVQRIHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDSFRGILDNWSF 97
Query: 69 PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
Q ++ L + + K +L LP +L E + + +DTD++F +
Sbjct: 98 LQT-FNYSLYPIIFPSENAAEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPV 155
Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+W+L + Q + + +G W N
Sbjct: 156 DDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 186
>gi|355736744|gb|AES12096.1| hypothetical protein [Mustela putorius furo]
Length = 226
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 88/210 (41%), Gaps = 26/210 (12%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 14 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 73
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
F IG+ Y W+ P GFN+GV+LL+L +R
Sbjct: 74 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 133
Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
+ + + E + +L L DQD F I EHP L + L C WN QL
Sbjct: 134 Q---SPLYSRLLEPTQVQQLADKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 190
Query: 336 -----DNTRSD--ELCYTELTDLKIIHWNS 358
D+ SD E + +KI H N
Sbjct: 191 WTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 220
>gi|395841531|ref|XP_003793588.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Otolemur
garnettii]
Length = 437
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTSVRLRWGDILMPLLKKYKLNITW- 312
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 313 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCHEAEEEGIFILHGN 363
>gi|393242337|gb|EJD49855.1| hypothetical protein AURDEDRAFT_112338 [Auricularia delicata
TFB-10046 SS5]
Length = 442
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 71/136 (52%), Gaps = 9/136 (6%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS--TWS 67
P E I +++V G +S ++KSIL P+ H++ A+ +LQ T
Sbjct: 60 PEEEKIVISMVMFGRSSAEEGQHMLKSILMRASAPVELHIVCSDDAVPVLQRRLDLVTRP 119
Query: 68 VPQVEVSFY--LADSVVE--DVSWIPNK-HYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
V V+V FY D+VV + + I NK H++GV GL K+ + +++P LA + +DTD
Sbjct: 120 VHDVDVVFYPLTEDAVVTRGERAGIGNKRHHAGVGGLAKMFIHELVPAPLA--LYVDTDA 177
Query: 123 IFATDIAQLWALFSRL 138
+F TD LW RL
Sbjct: 178 MFLTDPLHLWHTARRL 193
>gi|241848448|ref|XP_002415636.1| conserved hypothetical protein [Ixodes scapularis]
gi|215509850|gb|EEC19303.1| conserved hypothetical protein [Ixodes scapularis]
Length = 387
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 75/161 (46%), Gaps = 22/161 (13%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
YS L L L +VL +L + I+LD D+ +D+ L F+ + +G+ Q
Sbjct: 192 YSNALFFLSLGLHRVL--SLRRLILLDIDLRLESDVWLLHRHFALFPETALVGMAHEQQP 249
Query: 250 WYLGKLWKNHK----------PWPALGRGFNTGVILLDLTKL------RDISWAGFWRII 293
YL L+ H+ P P GFN+GV+LLDL ++ R++S +
Sbjct: 250 TYL-HLFHEHRKRHPETRCSLPLPQGNPGFNSGVLLLDLHRMALSSAFRNLSSREGVEYL 308
Query: 294 AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
K+ R L DQD + + E P LVY LPC WN QL
Sbjct: 309 TSKY---RFQGHLGDQDFYTVLSCERPELVYALPCTWNRQL 346
>gi|344255262|gb|EGW11366.1| Glycosyltransferase 8 domain-containing protein 3 [Cricetulus
griseus]
Length = 345
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 105 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 161
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 162 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 220
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 221 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 271
Score = 46.6 bits (109), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 37/152 (24%), Positives = 64/152 (42%), Gaps = 6/152 (3%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
PT E + +A+V G VT++KS L + PL FH+ + N + +WS
Sbjct: 17 PT-EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHNSFKERLDSWSFL 74
Query: 70 QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
Q ++ L S + K +L LP +L E + + +DTD++F +
Sbjct: 75 Q-RFNYTLYSITFPSDSAVEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVD 132
Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+W+L + Q + + +G W N
Sbjct: 133 DIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 162
>gi|297691568|ref|XP_002823152.1| PREDICTED: glucoside xylosyltransferase 1 [Pongo abelii]
Length = 437
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 313 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 363
Score = 39.3 bits (90), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS + + PL FH+ + + + +WS
Sbjct: 107 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDSWSF 165
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 222 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 254
>gi|348580281|ref|XP_003475907.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 1 [Cavia
porcellus]
Length = 437
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 312
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 313 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 363
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I E + +A+V G +T++KS L + PL FH+ + + + +WS
Sbjct: 107 IQPVEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 165
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+ +W+L + Q + + +G W N
Sbjct: 222 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 254
>gi|147906982|ref|NP_001089592.1| glucoside xylosyltransferase 1 [Xenopus laevis]
gi|68533770|gb|AAH99025.1| MGC115517 protein [Xenopus laevis]
Length = 324
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 31/223 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ Q + + +G W
Sbjct: 85 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNATQIAAMAPEHEEPRIG--WY 141
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 142 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTTVRLHWGDLLLPLLKKYKLNITW- 200
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N + S +P ++ PCQWN + D+ C+ + I+H N +
Sbjct: 201 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEDGIFILHGN---RGVY 254
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
++ FR +Y F Y FG +L Q+ L EL
Sbjct: 255 HDRKQPAFRAVYEAFRNYS--------FGHDLRQSLLYPLELE 289
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/180 (20%), Positives = 78/180 (43%), Gaps = 10/180 (5%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ-VEV 73
+ +A+V G + +T++KS L + K L FH+ + + ++W+ P+ +
Sbjct: 1 MHLAVVACG-DRLEETITMLKSALLFSKKLLQFHIFAEDQLHYDFKAALNSWAFPEKFKY 59
Query: 74 SFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+ Y +E+ V W K + +L LP +L E + + +DTD++F + +W
Sbjct: 60 TLYPITFPLENAVEW---KKLFKLCASQRLFLPLILTE-VDSLLYVDTDILFLRPLDDIW 115
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGRGFYLADSVVEDVSWIPNKHYS 191
+ Q + + +G W N P G+ + ++ +++ I KH+
Sbjct: 116 SFLGGFNATQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGINSGVMLMNMTRIRKKHFK 173
>gi|291392421|ref|XP_002712730.1| PREDICTED: glycosyltransferase 8 domain containing 3 [Oryctolagus
cuniculus]
Length = 680
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 292 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 348
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G G N+GV+L+++T++R + W + +K+ L W
Sbjct: 349 NRFARHPYYGETGVNSGVMLMNMTRMRRKYFKNDMTTARLRWGDILVPLLKKYKLNITW- 407
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + ++H N
Sbjct: 408 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGISVLHGN 458
>gi|156537600|ref|XP_001607722.1| PREDICTED: uncharacterized protein C3orf21 homolog [Nasonia
vitripennis]
Length = 372
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 74/163 (45%), Gaps = 13/163 (7%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P+ +++D D F D+ +L+ F ++ GL
Sbjct: 174 PGTYYSDALFFLSLGLHRIAPQDQDTAVMIDADTKFRVDVKKLFKEFDHFGKQALFGLGP 233
Query: 246 NQSDWYLGKL--WKNHKPWPALGR--------GFNTGVILLDLTKLRD-ISWAGFWRIIA 294
+ Y L +++ P +G G+N+GV+LL+L LR+ + + + +
Sbjct: 234 ELTPVYRHVLYVYRSKNPKTIVGEPGYLGGYPGYNSGVVLLNLKGLRESLEYDQIVSMDS 293
Query: 295 EKFLLTR--LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
+ L+ + L DQD + + E P ++ + C WN QL
Sbjct: 294 VEHLVEKYEFKGHLGDQDFYTLLGMERPEFIHAIDCGWNRQLC 336
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 70/165 (42%), Gaps = 18/165 (10%)
Query: 28 RSLVTLIKSILFYRKNPLHFHLITD----TVALNILQTLFSTWSVPQVEVSFYLADSV-- 81
R + S+L Y + FH+ITD +A +++ F + +++V +Y +
Sbjct: 99 RKFRMFVDSLLKYSTGGVVFHVITDEESKIIAEKVIEYNFLV-NKKKMKVQYYNVHKLAI 157
Query: 82 -VEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+ED+ S P +YS L L L ++ P+ +++D D F D+ +L+
Sbjct: 158 EIEDIVSVMSPHFSSKPGTYYSDALFFLSLGLHRIAPQDQDTAVMIDADTKFRVDVKKLF 217
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFYLA 175
F ++ GL + Y L +++ P +G YL
Sbjct: 218 KEFDHFGKQALFGLGPELTPVYRHVLYVYRSKNPKTIVGEPGYLG 262
>gi|380017376|ref|XP_003692633.1| PREDICTED: xyloside xylosyltransferase 1-like [Apis florea]
Length = 357
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 74/164 (45%), Gaps = 17/164 (10%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L L ++ P + D D F DI +L+ F+ + GL
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPIEQKVAAMFDADTKFRKDIKELFEEFNTFGNQALFGLSP 218
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
+ Y + L++N P G G+N+G++L +L KLR+ + + +I+ +
Sbjct: 219 ELTPVYRHVLYLYRNKHPNTLFGEPASSGGYPGYNSGMVLFNLDKLRNSTV--YDKIVKK 276
Query: 296 KFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ + T L DQD + + E P L++T+ C WN QL
Sbjct: 277 ESVDTMTSKYHFKGHLGDQDFYTLLGMEKPELIHTIDCGWNRQL 320
>gi|149017614|gb|EDL76618.1| similar to CG9996-PA, isoform CRA_a [Rattus norvegicus]
Length = 393
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 68/158 (43%), Gaps = 18/158 (11%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFN 270
+ +DTDV+F + +W+L R Q + + +G W N P GR G N
Sbjct: 167 LYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WYNRFARHPYYGRTGVN 224
Query: 271 TGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
+GV+L+++T++R + W + +K+ L W DQD+ N + +
Sbjct: 225 SGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GDQDLLNIMFYHN 281
Query: 320 PYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
P ++ PCQWN + E + I+H N
Sbjct: 282 PESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 319
>gi|344266729|ref|XP_003405432.1| PREDICTED: glucoside xylosyltransferase 1-like [Loxodonta africana]
Length = 580
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 340 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 396
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 397 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 455
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 456 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEKEGIFILHGN 506
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/150 (21%), Positives = 64/150 (42%), Gaps = 7/150 (4%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
I T E + +A+V G +T++KS + + PL FH+ + + + + +W +
Sbjct: 250 IHTVERMHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDNFKGILDSWLL 308
Query: 69 PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
Q + Y E+ + W K +L LP +L E + + +DTD++F
Sbjct: 309 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 364
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLG 156
+ +W+L + Q + + +G
Sbjct: 365 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG 394
>gi|159155696|gb|AAI54725.1| LOC100127708 protein [Xenopus (Silurana) tropicalis]
Length = 366
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 95/223 (42%), Gaps = 31/223 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ Q + + +G W
Sbjct: 127 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 183
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 184 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 242
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N + S +P ++ PCQWN + D+ C+ + I+H N +
Sbjct: 243 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 296
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
++ FR +Y F Y FG NL + L EL
Sbjct: 297 HDRKQPAFRAVYDAFRNYT--------FGDNLFHSLLYPLELE 331
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 36/186 (19%), Positives = 74/186 (39%), Gaps = 6/186 (3%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
PV E + +A+V G + +T++KS L + L FH+ + + + W
Sbjct: 35 PVKHGTETMHLAVVACG-DRLEETITMLKSALLFSTKFLQFHIFAEDQLHYEFKATLNRW 93
Query: 67 SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
+ P+ + + L + K +L LP +L E + + +DTD++F
Sbjct: 94 AFPE-KFKYILYPITFPSENAAEWKKLFKPCASQRLFLPLILTE-VDSLLYVDTDILFLR 151
Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGRGFYLADSVVEDVSWI 185
+ +W+ Q + + +G W N P G+ + ++ +++ I
Sbjct: 152 PLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGINSGVMLMNMTRI 209
Query: 186 PNKHYS 191
KH+
Sbjct: 210 RKKHFK 215
>gi|256082475|ref|XP_002577481.1| glycosyltransferase-related [Schistosoma mansoni]
gi|350644937|emb|CCD60353.1| glycosyltransferase-related [Schistosoma mansoni]
Length = 237
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
IVLDTD++ TDI +LW F Q Q+IG+ Q+ + + K W G G+N G
Sbjct: 142 IVLDTDILLNTDIIELWNHFENFSQTQSIGIGLEQNPHFQDVMPKLESDWE--GYGYNNG 199
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFL 298
V+L DL+K+R W W ++ L
Sbjct: 200 VLLFDLSKIRLTLWNELWIVMTRNVL 225
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 116 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALG 169
IVLDTD++ TDI +LW F Q Q+IG+ Q+ + + K W G
Sbjct: 142 IVLDTDILLNTDIIELWNHFENFSQTQSIGIGLEQNPHFQDVMPKLESDWEGYG 195
>gi|345319859|ref|XP_001507088.2| PREDICTED: glucoside xylosyltransferase 1-like, partial
[Ornithorhynchus anatinus]
Length = 253
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 72/169 (42%), Gaps = 18/169 (10%)
Query: 202 PKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-K 260
PK++ + + + +DTD++F + +W+ Q + + +G W N
Sbjct: 16 PKLILKEVDSLLYVDTDILFLRPVDDIWSFLKEFNSTQIAAMAPEHEEPRIG--WYNRFA 73
Query: 261 PWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLAD 308
P G+ G N+GV+L+++T++R + W + +K+ L W D
Sbjct: 74 RHPYYGKTGINSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GD 130
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
QD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 131 QDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 179
>gi|354504228|ref|XP_003514179.1| PREDICTED: glucoside xylosyltransferase 1-like [Cricetulus griseus]
Length = 486
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 246 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 302
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 303 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 361
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 362 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 412
Score = 45.4 bits (106), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 5/149 (3%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
E + +A+V G VT++KS L + PL FH+ + N + +WS Q
Sbjct: 160 EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHNSFKERLDSWSFLQ-R 217
Query: 73 VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
++ L S + K +L LP +L E + + +DTD++F + +W
Sbjct: 218 FNYTLYSITFPSDSAVEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 276
Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
+L + Q + + +G W N
Sbjct: 277 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 303
>gi|313231957|emb|CBY09069.1| unnamed protein product [Oikopleura dioica]
Length = 384
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 105/277 (37%), Gaps = 96/277 (34%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS------ 530
NDVT+ Q S + L V L++ W GPIS+ ++ + EA S+A ++A+ S
Sbjct: 18 NDVTVATQCSANHLHHVVKLWERWSGPISVAVFAPNLEA----SFA--TDAILSMQKCWP 71
Query: 531 --RRNIGYHVVYK-------------------------------------EGNF-YPINT 550
++ + +H+VY +G+ YP N
Sbjct: 72 DMKKRVTFHLVYPTQHKADVSESTGTLDYNSCEELRQIIHDYGGDDSQNYKGDIPYPHNV 131
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG----HGGKKVLVVPAFE- 605
LRN A +T +VFL+D+D +P + HG G K V VVP FE
Sbjct: 132 LRNAARKGAATEFVFLIDVDVMPSLHMREGFNEFANKFK-HGLYEEKGDKAVFVVPVFEI 190
Query: 606 ------------------TQRYRTAFPAS-----HAPTNFSRWVNATTPYQIEWAPD--- 639
T++ F + H P + W+ +E A +
Sbjct: 191 LKGHQCPDDKADLKDAQMTRKEVRPFHNATCRWCHEPEDHDNWLMKPKNEHMEVAFNATW 250
Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ +A R +P +D RF +G+++
Sbjct: 251 NKIGFDCLYLLSWEPFYIARRSVPLFDERFKQYGFDR 287
>gi|242012431|ref|XP_002426936.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
gi|212511165|gb|EEB14198.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
Length = 305
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 33/149 (22%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY-PYLKSSIRSMDMHGHGGKKVLVVPAF 604
YP+N LRN+A +Q T Y+ +DID LP L +LKS + V VVP F
Sbjct: 94 YPVNFLRNLAKSQSVTDYILTIDIDMLPSENLRNDFLKSKLVE---KYQWNDVVFVVPIF 150
Query: 605 ETQR-----------------------YRTAFPASHAPTNFSRWV--NATTP----YQIE 635
E + Y + TN++ W+ N T+ Y++E
Sbjct: 151 EIEHDVQMPQSKNDLLECVKMGKARPFYLEPCEKCQSLTNYTEWLKNNKTSGIEILYEVE 210
Query: 636 WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W +EP+ ++++ +P +D RF +G N+
Sbjct: 211 WKYPWEPFFISNKKVPNFDERFKQYGMNR 239
>gi|195122672|ref|XP_002005835.1| GI18863 [Drosophila mojavensis]
gi|193910903|gb|EDW09770.1| GI18863 [Drosophila mojavensis]
Length = 385
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 103/276 (37%), Gaps = 83/276 (30%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
++Y SS + + L Q S++RL + + +W GP+S+ L+ + E L Y
Sbjct: 46 EQYTESSKISLICLATQSSVERLYSLPQVAANWIGPMSVALFSAGNEEYIVLKYFVTYMR 105
Query: 528 L---RSRRNIGYHVVY-------------------------------------------- 540
L + N +H+++
Sbjct: 106 LCFPNVKANATFHILFPQEYESLPGHLTEPLNIQKKFNCRNPEKTLKFLLKLRTAKTLEW 165
Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLV 600
++ N YP N +RN+A + YVFL DID +P L + +++D + V
Sbjct: 166 RKRNVYPQNHMRNLARKGCQSKYVFLTDIDIIPSVNSVVQLNNFFKTVDCEKNCA---YV 222
Query: 601 VPAFETQRYRTAFPASHA------------------------PTNFSRWV----NATTPY 632
+P FE R +FP S TNFS+W+ N T+
Sbjct: 223 IPTFEID-VRASFPQSKESLLALTKKGLARPFHEKVFIYNQYATNFSKWLTLKGNETSVS 281
Query: 633 QIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
+F EP+ +A +P +D RF G+G+ +
Sbjct: 282 ISHIVTNFEFLYEPFYIALDTVPIHDERFTGYGFTR 317
>gi|432942747|ref|XP_004083054.1| PREDICTED: glucoside xylosyltransferase 1-like [Oryzias latipes]
Length = 420
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/198 (22%), Positives = 84/198 (42%), Gaps = 22/198 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +D+D++F + +LW +R + Q + + + W
Sbjct: 178 RLFLPLILKD-VDSVVYVDSDILFLRPVDELWGFLTRFDRSQLAAMAPEHEEPRIA--WY 234
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P GR G N+GV+L+++T++R + W + +K+ L W
Sbjct: 235 NRFAMHPFYGRTGVNSGVMLMNMTRMRSAYFKNDMTPVGLLWEELLMPLLQKYKLNITW- 293
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N I +P + PC+WN + E + ++H N +
Sbjct: 294 --GDQDLLNIIFHHNPERLLEFPCKWNYRPDHCIYGSNCASAEEDGVCVMHGN---RGVY 348
Query: 365 KNKHMEFFRNLYLTFLEY 382
++ FR +Y +Y
Sbjct: 349 HDQKQPAFRAVYQAIKKY 366
>gi|426200366|gb|EKV50290.1| hypothetical protein AGABI2DRAFT_115347 [Agaricus bisporus var.
bisporus H97]
Length = 487
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 16/152 (10%)
Query: 1 MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
+S ++EP E + +V S L+KSIL Y +P FH++ D A L+
Sbjct: 116 LSENQEP-----EPVVFVLVVWSRPSAIEGALLVKSILMYNTSPSEFHIVCDDDAEEFLR 170
Query: 61 TLFSTWSVPQ--VEVSFY------LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
+ F+ PQ V V FY + D + + + I H +G+ GL+KL L +++P +
Sbjct: 171 SRFALVHHPQRDVLVRFYKLTWQSMLDRIQREGT-ISTDHSAGLPGLMKLFLHEIVP--V 227
Query: 113 AKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
+ I +DTD F +D LW +F + I
Sbjct: 228 RRGIYIDTDAFFISDPTLLWGVFKDFKASTAI 259
>gi|412992029|emb|CCO20755.1| glycosyltransferase family 8 [Bathycoccus prasinos]
Length = 369
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 68/166 (40%), Gaps = 34/166 (20%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLWKNHKPWPALGR- 267
I +DTDV+F LW +FS Q G+ +Y +L KNH P
Sbjct: 159 IYVDTDVLFLKSPLLLWNIFSCFSPLQKFGMANETNGGTGGYYRKRLSKNHVPCSQGSCK 218
Query: 268 --------------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
G N+GV L++LT++R + RI F DQD++N
Sbjct: 219 CKNGLRCKRFYGLYGLNSGVALMNLTRIRRSDFQSRIRIAMTSFKF------YGDQDVYN 272
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD--LKIIHWN 357
+ +E V+ LPC WN+ RSD C D L I+H N
Sbjct: 273 HLFAERND-VHVLPCIWNI------RSDSRCDFAKNDESLGILHGN 311
>gi|391343843|ref|XP_003746215.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like
[Metaseiulus occidentalis]
Length = 453
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/286 (23%), Positives = 108/286 (37%), Gaps = 94/286 (32%)
Query: 467 VDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYA 522
V E + S+ + V L SMDRL + + +HW+GP+S+ +++ E +QF+
Sbjct: 108 VAEDQESTSRSQVCLATLSSMDRLLGLMDIAQHWDGPVSIAVFIRTTEELQALRQFIRIL 167
Query: 523 G-NSEALRSRRNIGYHVVYKEGN------------------------------------- 544
G S+A+R N +H+V N
Sbjct: 168 GICSDAVRL--NFAFHLVLPISNRSRIDLHAKDIVSRSSESTSCADQLKSLLPAPPPVPM 225
Query: 545 -----------FYPINTLRNVALN---QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
+P N LRNVA + ++ + + FL DID +P P L S +
Sbjct: 226 KSLKNVDLLKLLFPQNHLRNVARDGCERMGSRFFFLTDIDIIPK----PALASQLNEFLQ 281
Query: 591 HGHGGKKVLVVPAFETQRY-----------------------RTAFPASHAPTNFSRW-- 625
K++ VVP +E + + F + TN W
Sbjct: 282 PRQASKEIFVVPTYEMHDHAEMPQSKTDLLKMVSVGYARPFHKIVFIHNQYATNHDLWER 341
Query: 626 VNATTPYQIEWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ +P +I A +EP+ VA +PR+D RFVG+G+ +
Sbjct: 342 IPRKSPDKIHVAYKINNYEFFYEPFYVAPFSVPRHDERFVGYGFTR 387
>gi|449481334|ref|XP_002193848.2| PREDICTED: glucoside xylosyltransferase 1 [Taeniopygia guttata]
Length = 629
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 89/221 (40%), Gaps = 30/221 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + +W+ + Q + + +G W
Sbjct: 389 RLFLPLIL-KNVDSLLYVDTDILFLRPVDDIWSFLRKFNSTQIAAMAPEHEEPRIG--WY 445
Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G G N+GV+L+++T++R + W + +K+ L W
Sbjct: 446 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 504
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N + +P +Y PCQWN + E + I+H N +
Sbjct: 505 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKAAEEEGIFILHGN---RGVY 559
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
+ FR +Y Y FG +L + LQ E
Sbjct: 560 HDDKQPTFRAVYEAIKNYS--------FGDDLVHSLLQPLE 592
>gi|170027891|ref|XP_001841830.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167868300|gb|EDS31683.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 412
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 74/323 (22%), Positives = 119/323 (36%), Gaps = 86/323 (26%)
Query: 33 LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY--------------LA 78
+KS+L Y P+H H+ITD + + L +V FY +
Sbjct: 121 FLKSLLKYTTIPMHLHVITDEQSEGSAEELIREQIAHYRKVVFYTLYDVKDCAEKISDIV 180
Query: 79 DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
S++ ++ +YS LL L L +++ + + I++D DV+F + + +L+ F
Sbjct: 181 KSMLPLFNYKTGSYYSDALFLLSLGLHRIVDVNMKRAILIDCDVVFRSSVKELFDQFDHF 240
Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
GL + YL L+K Y A N H ++G
Sbjct: 241 GPEHLFGLAPELTPVYLHILYK------------YRAQ----------NSH--TLFG--- 273
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
P L +T TI + FSR ++ G
Sbjct: 274 --TPYYLNKTTTTTIG-----------NSIGGFFSRTAKKTKHG---------------- 304
Query: 259 HKPWPALGRGFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+P G N+GV++L L ++R +I + I EK+ L DQD +
Sbjct: 305 ---YP----GLNSGVVMLQLDRIRQSRLYEEIIKPDSVKNIVEKY---SFKGHLGDQDFY 354
Query: 313 NAIISEHPYLVYTLPCQWNVQLS 335
+ E P L+Y L C WN QL
Sbjct: 355 TLMGFEFPGLIYRLDCVWNRQLC 377
>gi|449272005|gb|EMC82135.1| Glycosyltransferase 8 domain-containing protein 3, partial [Columba
livia]
Length = 368
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 74/173 (42%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W+ + Q + + +G W
Sbjct: 128 RLFLPLIL-KNVDSLLYVDTDVLFLRPVDDIWSFLGKFNSTQIAAMAPEHEEPRIG--WY 184
Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G G N+GV+L+++T++R + W + +K+ L W
Sbjct: 185 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 243
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P +Y PCQWN + E + I+H N
Sbjct: 244 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 294
>gi|432858866|ref|XP_004068977.1| PREDICTED: glucoside xylosyltransferase 2-like [Oryzias latipes]
Length = 429
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W + Q + +G +
Sbjct: 184 RLFLPVILKD-VDSLLYVDTDVLFLRPMDDIWGFLKSFNETQLAAMAPEHEVPKIG-WYS 241
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LTK+R +SW + +K+ W
Sbjct: 242 RFARHPFYGVTGVNSGVMLMNLTKIRSTLFKNSMISSGLSWGDLLHPLYQKYKSHITW-- 299
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I ++P ++ PCQWN + E + I+H N
Sbjct: 300 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCKAAEEEGVSILHGN 350
>gi|340729238|ref|XP_003402913.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
terrestris]
Length = 492
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 96/277 (34%), Gaps = 91/277 (32%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
S V L Q S+++L V HW GP+S+ LY + E Q++L Y E +
Sbjct: 150 SQAYRVCLATQSSVEKLHSVVQAAHHWTGPMSVALYAAGDEEFEVLQKYLIYLRKCYEPI 209
Query: 529 RSRRNIGYHV---------------------VYKEG-------------------NFYPI 548
R R V EG N YP
Sbjct: 210 RERVAFSLAVPRVRPPKKQPREFELPEVVDCAKPEGTLNELMNGISNEQTNWRIRNVYPQ 269
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
N +RN+A + YVFL D+D +P F L L +R+ K V+P +E
Sbjct: 270 NHMRNLARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRT----DTCDKCAYVIPTYELDS 325
Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTPYQIEWAPD----- 639
R FP + TNF+RW+ T+ +
Sbjct: 326 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSVNHKNFGNVKTGKV 384
Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF+G+G+ +
Sbjct: 385 YISHDVTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 421
>gi|307174885|gb|EFN65151.1| Uncharacterized protein C3orf21-like protein [Camponotus
floridanus]
Length = 371
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 19/166 (11%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS L L + K+ P + D D F D+ L+ F+ + GL
Sbjct: 173 PGTYYSDALFFLSLGMHKIAPSHQKLAAMFDADTKFRRDVKDLFEEFNNFGKEALFGLAP 232
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLR------DISWAGF 289
+ Y + L++ P G G+N+G++L +L +LR +I
Sbjct: 233 ELTPVYRHVLYLYRTKHPTTMFGEPRHKGGYPGYNSGMVLFNLERLRKSLEYNEILTKNS 292
Query: 290 WRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
I+ EK+ L DQD + + E P L++T+ C WN QL
Sbjct: 293 VDIMTEKY---HFKGHLGDQDFYTLLGMERPELIHTVDCGWNRQLC 335
>gi|157120271|ref|XP_001653581.1| hypothetical protein AaeL_AAEL001590 [Aedes aegypti]
gi|108883094|gb|EAT47319.1| AAEL001590-PA [Aedes aegypti]
Length = 418
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 120/322 (37%), Gaps = 89/322 (27%)
Query: 33 LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL----------ADSVV 82
+KS+L Y P+H H+ITD + + L +V FY +V
Sbjct: 130 FLKSLLKYTTIPIHLHVITDEQSETSAEELIREQIAHYRKVVFYTLYDVKDCAEKTSDIV 189
Query: 83 EDV----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
+++ ++ +YS L+ L L +++ + + I++D DV+F + + +L+ F+R
Sbjct: 190 KNMLPLFNYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRF 249
Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
GL + YL L+K P ++G
Sbjct: 250 SSDHLFGLAPELTPVYLHILFKYRAQNPHT------------------------IFG--- 282
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
P L KT + ++ + FSR ++ G
Sbjct: 283 ------TPYFLNKTAIGNS----------VGGFFSRTAKKTKHG---------------- 310
Query: 259 HKPWPALGRGFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+P L N+GV++L L ++R +I + + EK+ L DQD +
Sbjct: 311 ---YPGL----NSGVVMLRLDRIRQSRLYEEIIKQESVKNMVEKY---SFKGHLGDQDFY 360
Query: 313 NAIISEHPYLVYTLPCQWNVQL 334
+ E P L+Y L C WN QL
Sbjct: 361 TLMGFEFPGLIYRLDCVWNRQL 382
>gi|443710199|gb|ELU04497.1| hypothetical protein CAPTEDRAFT_192236 [Capitella teleta]
Length = 225
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 33/151 (21%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP + +RN+AL +V T Y FL D+DF+P +Y I+S + VV AFE
Sbjct: 38 YPSSIIRNLALEEVFTEYAFLSDVDFMPAPSVYKTSVDLIKSKN-RLLSKTTAFVVAAFE 96
Query: 606 TQRYRT------AFPAS-------------------------HAPTNFSRWVNATTPYQI 634
+R T FP + H TN+ W A T Y+
Sbjct: 97 LKRNTTPDGDAVKFPRNKSDLLALQNSTLIQPFHYHRERHYGHFSTNYEAWNIAKTTYKA 156
Query: 635 EWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
+W +EPY++ + +PRY+ FVG ++K
Sbjct: 157 KWVGLYEPYVIVNTCAVPRYNKAFVGRYFDK 187
>gi|349604448|gb|AEP99996.1| Glycosyltransferase 8 domain-containing protein 3-like protein,
partial [Equus caballus]
Length = 257
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 17 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 73
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 74 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITWG 133
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
QD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 134 C---QDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 183
>gi|348526548|ref|XP_003450781.1| PREDICTED: glucoside xylosyltransferase 1-like [Oreochromis
niloticus]
Length = 409
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 74/176 (42%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +L + + + +D+D++F + LW L S Q + + WY
Sbjct: 170 RLFLPLILKD-VDSIVYVDSDILFLQPVDLLWRLLSDFNSSQLAAMAPEHEEPRIAWY-- 226
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P GR G N+GV+L++LT++R + W + +K+ L
Sbjct: 227 NRFARH---PFYGRTGINSGVMLMNLTRMRSMFFKNDMTSVGLRWEELLMPLLQKYKLNI 283
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P + PCQWN + E + I+H N
Sbjct: 284 TW---GDQDLLNIIFHYNPESLLEFPCQWNYRPDHCIYGSNCASAEEDGIYILHGN 336
>gi|423295529|ref|ZP_17273656.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
CL03T12C18]
gi|392672453|gb|EIY65921.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
CL03T12C18]
Length = 324
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 67/135 (49%), Gaps = 11/135 (8%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP+VLP TL K + LD D+I I LW + + TIG VE+ + +
Sbjct: 95 RLMLPEVLPVTLDKILYLDCDIIVNGRIESLWNIDLKY---YTIGAVED--NIVISSEAP 149
Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
+P FN GV+L++L+ +RDI + + E+ L ++ DQDI N ++
Sbjct: 150 RRLGYPVQSSYFNAGVMLMNLSLMRDIQFTKNAFVYIEQHLKEIVY---HDQDILNVLLY 206
Query: 318 EHPYLVYTLPCQWNV 332
+ LP +WNV
Sbjct: 207 DQKLF---LPIKWNV 218
>gi|71896989|ref|NP_001025907.1| glucoside xylosyltransferase 1 [Gallus gallus]
gi|82082155|sp|Q5ZKI6.1|GXLT1_CHICK RecName: Full=Glucoside xylosyltransferase 1; AltName:
Full=Glycosyltransferase 8 domain-containing protein 3
gi|53130856|emb|CAG31757.1| hypothetical protein RCJMB04_10i5 [Gallus gallus]
Length = 433
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + +W+ + Q + + +G W
Sbjct: 193 RLFLPLILKD-VDSLLYVDTDILFLRPVDDIWSFLRKFDSTQIAAMAPEHEEPRIG--WY 249
Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G G N+GV+L+++T++R + WA + +K+ L W
Sbjct: 250 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTSVRLRWAEILMPLLKKYKLNITW- 308
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P +Y PCQWN + E + I+H N
Sbjct: 309 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 359
>gi|198424870|ref|XP_002119916.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
[Ciona intestinalis]
Length = 401
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 68/154 (44%), Gaps = 25/154 (16%)
Query: 13 EVIQVAIVCAG---YNSTRSL-------VTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
EV+ +A V G Y R + +T+IKS + NPLH H+ T+ +++ QT
Sbjct: 49 EVMHLAAVSCGRPNYQGIRDIKKPMDQTLTMIKSASLFAHNPLHIHIFTENDMISMFQTE 108
Query: 63 FSTW----------SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
W S+ ++ YL +++ W K + G +L LP VL +
Sbjct: 109 IDAWPARVRSRVTYSIRPIDYERYLPPTLI--TQW---KKWYKPCGSFRLLLPMVLKDVT 163
Query: 113 AKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
I D+DVIF I +LWA + +RQ +
Sbjct: 164 DAAIYSDSDVIFIKPIDELWAQLKEMDERQVAAI 197
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 81/207 (39%), Gaps = 21/207 (10%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG- 253
G +L LP VL + I D+DVIF I +LWA + +RQ + G
Sbjct: 149 GSFRLLLPMVLKDVTDAAIYSDSDVIFIKPIDELWAQLKEMDERQVAAISPTAGHPLGGS 208
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW-----------RIIAEKFLLTRL 302
KL +N NTGV++L+ T++ W F R+ LL
Sbjct: 209 KLNENFISHSTGLFQINTGVMVLNFTRMLSSDWLLFHDEHKTDQSELRRVKYGIDLLLPY 268
Query: 303 WTSLA-----DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIHW 356
+ DQ + N + +P L+ LPC WN + + D C E +H
Sbjct: 269 YDEYKEKAEHDQKLLNIMFHFNPSLLRQLPCSWNFKNNFCQDEDNTCSVAERGGAGAVHG 328
Query: 357 NSPKKLKVKNKHMEFFRNLYLTFLEYD 383
+ N FR LY +FL+YD
Sbjct: 329 ITSAFFNDVNPT---FRALYESFLQYD 352
>gi|159466980|ref|XP_001691676.1| hypothetical protein CHLREDRAFT_206250 [Chlamydomonas reinhardtii]
gi|158279022|gb|EDP04784.1| hypothetical protein CHLREDRAFT_206250 [Chlamydomonas reinhardtii]
Length = 496
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 613 FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
FPA H+ TNFSRW A PY + + +FEP+ VA RD +P +D R G+G NK
Sbjct: 84 FPAGHSATNFSRWFRAAEPYTVRYVRNFEPWFVAGRDSVPWHDVRLRGYGQNK 136
>gi|47227367|emb|CAF96916.1| unnamed protein product [Tetraodon nigroviridis]
Length = 322
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + A + +DTDV+F + +W+LF Q + +G +
Sbjct: 92 RLFLPVILKDVDA-LLYVDTDVLFLRPMDDIWSLFRSFNSTQLAAMAPEHEVPKIG-WYS 149
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R +SW + +K+ W
Sbjct: 150 RFARHPFYGVTGVNSGVMLMNLTRIRRATFKNSMIPARLSWEDLLHPLYQKYKNHITW-- 207
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I ++P ++ PCQWN + E + I+H N
Sbjct: 208 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCRGAEKDGVSILHGN 258
>gi|55727735|emb|CAH90618.1| hypothetical protein [Pongo abelii]
Length = 164
Score = 59.3 bits (142), Expect = 6e-06, Method: Composition-based stats.
Identities = 23/37 (62%), Positives = 32/37 (86%)
Query: 5 EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
++PV+ CE I VAIVCAGYN++R +VTL+KS+LF+R
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHR 164
>gi|383859939|ref|XP_003705449.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Megachile
rotundata]
Length = 489
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 99/277 (35%), Gaps = 91/277 (32%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
S V L Q S+++L + HW GP+S+ LY + E Q++L Y E +
Sbjct: 147 SQAYRVCLATQSSVEKLHSLVQAAHHWTGPMSVALYAAGDEEFDVLQKYLVYLRRCYEPI 206
Query: 529 RSRRNIGYHV---------------------VYKEG-------------------NFYPI 548
R R + V EG N YP
Sbjct: 207 RERVSFSLAVPRNRPPKKQPVDHELPDTVDCAKPEGTLNQLMNGISNEQTNWRIHNVYPQ 266
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
N +RN+A T YVFL D+D +P + L L +R+ K V+P +E
Sbjct: 267 NHMRNLARKNCQTDYVFLTDVDIVPSYNLTGALDEFLRT----DSCDKCAYVIPTYELDS 322
Query: 609 YRTAFPASHA------------------------PTNFSRWVNATT----------PYQI 634
R FP + TNF+RW+ T ++
Sbjct: 323 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTALNHRNFENVRSGKV 381
Query: 635 EWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+ D +EP+ VA +P +D RF+G+G+ +
Sbjct: 382 YISHDVTNFEFLYEPFYVARDIVPPHDERFMGYGYTR 418
>gi|327273109|ref|XP_003221325.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
[Anolis carolinensis]
Length = 442
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + +W+ ++ Q + + +G W
Sbjct: 202 RLFLPLILSD-VDSLLYVDTDILFLRPVDDIWSFLKKMNSSQIAAMAPEHEEPRIG--WY 258
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 259 NRFARHPYYGKTGINSGVMLMNMTRIRRKYFKNDMTTVRLRWEQILMPLLKKYKLNITW- 317
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P ++ PCQWN + E + I+H N
Sbjct: 318 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQQAEKEGIFILHGN 368
>gi|427783557|gb|JAA57230.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
unknown function [Rhipicephalus pulchellus]
Length = 384
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 90/202 (44%), Gaps = 30/202 (14%)
Query: 157 KLWKNHKPWPALGRGFYLADSVVEDVS------WIPNKHYSGVYGLLK------LTLPKV 204
++ + + WP DSV + VS W P ++Y + + + L L ++
Sbjct: 109 RMRREYASWP---------DSVKKRVSCDVMPVWFPKENYYQWWAMFRPCTTQRLFLTRM 159
Query: 205 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA 264
LP A + +DTDVIF I W F + + Q G+ ++ + + K
Sbjct: 160 LPNEDA-VLYVDTDVIFVHPIEDFWRKFYAMNEWQMAGMAPETENYTMN--YYTTKALHP 216
Query: 265 LGRGF--NTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYL 322
+ F N G+++++LT++R + ++F W ADQD+ N + S +P
Sbjct: 217 FVQPFALNAGLLMMNLTRMRAFDLENRVMRLKQEFEGRIPW---ADQDLLNIVFSRYPQG 273
Query: 323 VYTLPCQWNVQLSDNTRSDELC 344
++T C+WN + ++ + LC
Sbjct: 274 IFTFTCRWNFR-GEHCHGEALC 294
>gi|402221737|gb|EJU01805.1| hypothetical protein DACRYDRAFT_116245 [Dacryopinax sp. DJM-731
SS1]
Length = 876
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 66/139 (47%), Gaps = 7/139 (5%)
Query: 14 VIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QV 71
I+ A+V G ++ VT+IK IL + P+H H++ D A+ + + P +
Sbjct: 167 TIEFALVIMGESAAVEAVTMIKGILMHISTPVHLHIVCDEAAMYYIDDRLTLVHRPAYDI 226
Query: 72 EVSFYLADS----VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
VS+ L S + I + +++G GL K+ L ++L ET + I +D D +F D
Sbjct: 227 HVSYNLLSSSQIWARGARAGIGSTYHAGAGGLAKVFLHEILTET-KRAIFIDIDTLFTVD 285
Query: 128 IAQLWALFSRLRQRQTIGL 146
LW F + + Q L
Sbjct: 286 PLLLWKEFDKFKPGQVFAL 304
>gi|410920623|ref|XP_003973783.1| PREDICTED: glucoside xylosyltransferase 2-like [Takifugu rubripes]
Length = 442
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 73/172 (42%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTDV+F + +W+LF Q + +G +
Sbjct: 197 RLFLPVIL-RDVDSLLYVDTDVLFLRPMDDIWSLFRSFNATQLAAMAPEHEVPKIG-WYS 254
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R +SW + +K+ W
Sbjct: 255 RFARHPFYGVTGVNSGVMLMNLTRIRSAAFKNSMIPARLSWEDLLHPLYQKYKNHITW-- 312
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I ++P ++ PCQWN + E + I+H N
Sbjct: 313 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCRGAEKDGVSILHGN 363
>gi|301618797|ref|XP_002938793.1| PREDICTED: glucoside xylosyltransferase 2-like [Xenopus (Silurana)
tropicalis]
Length = 422
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +WA R Q + +G +
Sbjct: 177 RLFLPMLL-QDVDSLLYVDTDVLFLRPLDHIWAFLRRFNDTQLAAMAPEHEISKIG-WYS 234
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 235 RFARHPFYGTTGVNSGVMLMNLTRIRNQQFKNNMIPAGLTWEEMLHPLYQKY---KNYIT 291
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++Y PC WN + E + I+H N
Sbjct: 292 WGDQDLLNIIFYFNPEMLYVFPCHWNYRPDHCMYGSNCKAAEEEGVSILHGN 343
>gi|443717909|gb|ELU08746.1| hypothetical protein CAPTEDRAFT_217021 [Capitella teleta]
Length = 365
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 84/206 (40%), Gaps = 19/206 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P +L + + +DTD++F I +W F Q L G W
Sbjct: 145 RLFIPDIL-RDVDSVLYVDTDILFMRPIEDVWDHFKHFNSSQIAALSPEHESPNAG--WY 201
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
N P G+ G N+GV+L++LT++R + W +++ L W DQD+ N
Sbjct: 202 NRFARHPYYGKLGVNSGVMLMNLTRMRQVGWLDHIIPYYKEYKLKITW---GDQDLINIF 258
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNL 375
P +Y C WN + E + ++H + N+ F+++
Sbjct: 259 FHYFPDKLYIYNCDWNYRPDHCMYMSNCKQAESNGVSVLHGC---RGVYHNEKQPVFKSI 315
Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTL 401
Y F +Y FG +L++T L
Sbjct: 316 YEAFRDYK--------FGDDLSKTLL 333
>gi|409080739|gb|EKM81099.1| hypothetical protein AGABI1DRAFT_71889 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 430
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 115/260 (44%), Gaps = 64/260 (24%)
Query: 462 INPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
+ P+ + Y + G +D+T+ + ++ +R +++ L + ++GPIS+TL++++A
Sbjct: 72 MKPSQIIPYYLRASGVFHKHDITITSIITSNRFEILARLAQRYQGPISVTLHINNATE-- 129
Query: 518 FLSYAGNSEALRSRRN--------IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
+ E+L + N + H+V + NT RN+A T YV +LD+
Sbjct: 130 --NVPHLLESLHNLYNTTPNMAVYVDVHLVLDTFD-RQFNTWRNIARLFARTEYVMMLDV 186
Query: 570 DFLPMFGLYPYLKSSIRSMDMHG------------HGGKKVLVVPAFETQRYRTA----- 612
DF L++ +R +D +G + G VVPAFE Y+
Sbjct: 187 DFYICTDFRETLRNLLR-VDGNGEIGTGENIREMFNNGVAAFVVPAFEYISYKEGREYKT 245
Query: 613 FPAS-----------------------HAPTNFSRWVNATTPYQI----EWAPDFEPYIV 645
FP + H T++ R+ +A P +I ++ +EPY +
Sbjct: 246 FPKTKEHLMQLVQQRRIGMFHRVWAPGHNSTDYDRFYSA-KPGEIYKVTQFHSAYEPYTI 304
Query: 646 AHRDLPRY-DTRFVGFGWNK 664
+D P + D RF+G+G NK
Sbjct: 305 FRKDGPPWCDERFIGYGGNK 324
>gi|395544504|ref|XP_003774150.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase [Sarcophilus
harrisii]
Length = 358
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 48/220 (21%)
Query: 480 TLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSE---ALRSRRNIG 535
L Q DRL M H P + D E +F + S+ +L +R G
Sbjct: 86 ALCTQAVRDRLTM------HLVCPARTEAAVPDPREPGEFARISSCSDVFTSLGARSRSG 139
Query: 536 YHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
+ YP N LRNVA + ++D D +P GL+ L + +
Sbjct: 140 RNYALAANVSYPNNLLRNVAREGAG--HALVVDADMVPSEGLWTGLLELLALGEGKA--- 194
Query: 596 KKVLVVPAFETQR-----------------------YRTAFPASHAPTNFSRWVNATTP- 631
+ LVVPAFE ++ Y P APTN++RW+ T P
Sbjct: 195 GRALVVPAFEIRQGVRVPRTKMELMRFLKMGHVRPFYAELCPRCQAPTNYTRWL--TLPP 252
Query: 632 -------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
Y++ W +EP+ VA +P YD RF +G+N+
Sbjct: 253 GGPLRPAYRVTWQDPWEPFYVAGGAVPAYDERFRQYGFNR 292
>gi|148222987|ref|NP_001086684.1| glucoside xylosyltransferase 2 precursor [Xenopus laevis]
gi|82182587|sp|Q6DE37.1|GXLT2_XENLA RecName: Full=Glucoside xylosyltransferase 2; AltName:
Full=Glycosyltransferase 8 domain-containing protein 4
gi|50416401|gb|AAH77307.1| MGC80221 protein [Xenopus laevis]
Length = 423
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +WA R Q + +G +
Sbjct: 178 RLFLPMLL-QDVDSLLYVDTDVLFLRPLDHVWAFLRRFNDTQLAAMAPEHEISKIG-WYS 235
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 236 RFARHPFYGTTGVNSGVMLMNLTRIRNQHFKNNMIPAGLTWEEMLHPLYQKY---KNYIT 292
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++Y PC WN + E + I+H N
Sbjct: 293 WGDQDLLNIIFYFNPEMLYVFPCHWNYRPDHCMYGSNCKAAEEEGVSILHGN 344
>gi|393229277|gb|EJD36903.1| hypothetical protein AURDEDRAFT_174070 [Auricularia delicata
TFB-10046 SS5]
Length = 412
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 63/127 (49%), Gaps = 8/127 (6%)
Query: 18 AIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL 77
A+V G S ++KSIL P+ H+I + ++Q + + P+ +V
Sbjct: 67 AMVMFGAGSAEEGQHMLKSILMQSSRPIEVHIICSNDSAPVIQQRLALVTRPRHDVRAVF 126
Query: 78 ADSVVEDV------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
E + + I +KH++GV GL K+ + +++P + + +DTD +FATD L
Sbjct: 127 YPVTPEAILARAARAGIGSKHHAGVGGLAKMLIHELIPAE--RALYVDTDAMFATDPVLL 184
Query: 132 WALFSRL 138
W F+RL
Sbjct: 185 WRAFTRL 191
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
I +KH++GV GL K+ + +++P + + +DTD +FATD LW F+RL
Sbjct: 143 IGSKHHAGVGGLAKMLIHELIPAE--RALYVDTDAMFATDPVLLWRAFTRL 191
>gi|350417157|ref|XP_003491284.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
impatiens]
Length = 492
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 95/277 (34%), Gaps = 91/277 (32%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
S V L Q S+++L V HW GP+S+ LY + E Q++L Y E +
Sbjct: 150 SQAYRVCLATQSSVEKLHSVVQAAHHWTGPMSVALYAAGDEEFEVLQKYLIYLRKCYEPI 209
Query: 529 RSRRNIG---------------------YHVVYKEG-------------------NFYPI 548
R R EG N YP
Sbjct: 210 RERVAFSLAXXXVRPPKRQPREFELPEVVDCAKPEGTLNELMNGISNEQTNWRIRNVYPQ 269
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
N +RN+A + YVFL D+D +P F L L +R+ K V+P +E
Sbjct: 270 NHMRNLARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRT----DTCDKCAYVIPTYELDS 325
Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTPYQIEWAPD----- 639
R FP + TNF+RW+ T+ +
Sbjct: 326 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSVNHKNFGNVKTGKV 384
Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ VA +P +D RF+G+G+ +
Sbjct: 385 YISHDVTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 421
>gi|47204580|emb|CAG04707.1| unnamed protein product [Tetraodon nigroviridis]
Length = 184
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 63/150 (42%), Gaps = 25/150 (16%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK---------LWKNH 259
L + + LD D+ + +I +L+ F R +G+ Y + W+
Sbjct: 1 LTRIVQLDLDLKYRENIRELFQEFHRFPPGAVMGIAREMQPVYSTEESATPDTHTFWQYR 60
Query: 260 K---------PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-----RLWTS 305
K P P GFN+GV+LLDL +R S + +++ + L R
Sbjct: 61 KENPQTKVGEPHPDGLPGFNSGVMLLDLAAMRASSL--YNQLLEPEHLAQLAAQYRFRGH 118
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
L DQD F I EHP L Y L C WN QL
Sbjct: 119 LGDQDFFTMIGMEHPELFYPLACGWNRQLC 148
>gi|301606488|ref|XP_002932855.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 456
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 95/222 (42%), Gaps = 31/222 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ Q + + +G W
Sbjct: 217 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 273
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 274 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 332
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N + S +P ++ PCQWN + D+ C+ + I+H N +
Sbjct: 333 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 386
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL 406
++ FR +Y F Y FG +L + L EL
Sbjct: 387 HDRKQPAFRAVYDAFRNYR--------FGDDLFHSLLYPLEL 420
>gi|38348470|ref|NP_941014.1| glucoside xylosyltransferase 2 precursor [Mus musculus]
gi|81873556|sp|Q810K9.1|GXLT2_MOUSE RecName: Full=Glucoside xylosyltransferase 2; AltName:
Full=Glycosyltransferase 8 domain-containing protein 4
gi|29437118|gb|AAH49816.1| Glycosyltransferase 8 domain containing 4 [Mus musculus]
Length = 444
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 74/176 (42%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L + Q + WY
Sbjct: 197 RLFLPAIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 253
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R+ ++W + +K+
Sbjct: 254 SRFARH---PFYGSAGVNSGVMLMNLTRIRNTQFKNSLIPAGLAWEEMLLPLYQKYKSAI 310
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 311 TW---GDQDLLNIIFYYNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 363
>gi|157120962|ref|XP_001653726.1| hypothetical protein AaeL_AAEL009190 [Aedes aegypti]
gi|108874756|gb|EAT38981.1| AAEL009190-PA [Aedes aegypti]
Length = 403
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 78/189 (41%), Gaps = 40/189 (21%)
Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
++ +YS L+ L L +++ + + I++D DV+F + + +L+ F+R G
Sbjct: 182 NYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRFSSDHLFG 241
Query: 243 LVENQSDWYLGKLWKNHKPWP-------------ALGR------------------GFNT 271
L + YL L+K P A+G G N+
Sbjct: 242 LAPELTPVYLHILFKYRAQNPHTLFGTPYFLNKTAIGNSVGGFFSRTAKKTKHGYPGLNS 301
Query: 272 GVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
GV++L L ++R +I + + EK+ L DQD + + E P L+Y
Sbjct: 302 GVVMLRLDRIRQSRLYEEIIKQESVKNMVEKY---SFKGHLGDQDFYTLMGFEFPGLIYR 358
Query: 326 LPCQWNVQL 334
L C WN QL
Sbjct: 359 LDCVWNRQL 367
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 72/165 (43%), Gaps = 16/165 (9%)
Query: 33 LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL----------ADSVV 82
+KS+L Y P+H H+ITD + + L +V FY +V
Sbjct: 115 FLKSLLKYTTIPIHLHVITDEQSETSAEELIREQIAHYRKVVFYTLYDVKDCAEKTSDIV 174
Query: 83 EDV----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
+++ ++ +YS L+ L L +++ + + I++D DV+F + + +L+ F+R
Sbjct: 175 KNMLPLFNYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRF 234
Query: 139 RQRQTIGLVENQSDWYLGKLWK--NHKPWPALGRGFYLADSVVED 181
GL + YL L+K P G ++L + + +
Sbjct: 235 SSDHLFGLAPELTPVYLHILFKYRAQNPHTLFGTPYFLNKTAIGN 279
>gi|66828487|ref|XP_647598.1| hypothetical protein DDB_G0268160 [Dictyostelium discoideum AX4]
gi|74859220|sp|Q55FD5.1|GNT12_DICDI RecName: Full=Glycosyltransferase-like protein gnt12
gi|60475598|gb|EAL73533.1| hypothetical protein DDB_G0268160 [Dictyostelium discoideum AX4]
Length = 550
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 84/187 (44%), Gaps = 44/187 (23%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
+ DG ++++V + ++ ++ + M+ W PIS ++ + + + Y +L
Sbjct: 125 NEDGFEISIVTHVPIENMEKIAMIADIWRAPISASVLIKNKNDIDSV-YKLIRNSLSVSE 183
Query: 533 NIGYHVVY-------------------------KEGN----FYPINTLRNVALNQVSTPY 563
+ +H +Y +E N +YPIN+LRN++L T +
Sbjct: 184 FVDFHFLYWNEDSDSIINNNNNNIINKNKINTNEEDNLNYYYYPINSLRNLSLKNSKTDW 243
Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL-------VVPAFETQRYRTAFPAS 616
+ +DID+LP +G+Y YL+ R++ KK++ VVP+F+ T+
Sbjct: 244 ILTIDIDYLPNYGIYQYLE---RTLYTSLQPSKKLINSDLVSFVVPSFQL----TSISTK 296
Query: 617 HAPTNFS 623
APT S
Sbjct: 297 TAPTTIS 303
>gi|397567044|gb|EJK45361.1| hypothetical protein THAOC_36024 [Thalassiosira oceanica]
Length = 735
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 95/240 (39%), Gaps = 57/240 (23%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS------DAEAQQFLSYAGNSEA 527
+D T+V QLS +R+ ++E W+GP+SL ++ E ++
Sbjct: 431 ADSISYTMVTQLSTNRMWLMERQCSVWKGPMSLGIFTDRNMTSLQDELEKLGCDVDRIAI 490
Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
R RR+ YK + YP+N LRN+AL+ V T + F DIDF+ + L SS +
Sbjct: 491 ERVRRD-----AYKSED-YPVNILRNLALSNVKTSHFFYTDIDFMLSRDVESSL-SSPEA 543
Query: 588 MDMHGHGGKKVLVVPAFE-----TQRYRTAF----------------------------- 613
+ + K+ +VVPAF + Y+
Sbjct: 544 LAILSKDPKQTIVVPAFMANAHCSDDYKDCIRPKASEIPSTKPELLRGMLKGDVKVFDVE 603
Query: 614 --PASHAPTNFSRWVNATTPYQIEWAP-----DFEPYIVAH--RDLPRYDTRFVGFGWNK 664
P H T + W+ P I P +EPY+ A ++P + F G+ +K
Sbjct: 604 TNPGGHGSTRYDVWMEQ-GPNTIATIPCLSSDRYEPYVAAQYCHEMPPFQESFQGYSSDK 662
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 100/234 (42%), Gaps = 53/234 (22%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
D+TL QL +RL +++ W+ ISL + +SD + S + + +++ R NI
Sbjct: 111 DITLTVQLDEERLWVMKHHCLAWKRQISLAV-LSD---RMKSSISNDLKSMGCRGNIILS 166
Query: 538 VV----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
+ YK YP+N LRN+A+ +T ++ +D DF G L + +
Sbjct: 167 TLEASKYKRDE-YPVNLLRNIAIQASTTSHIVFVDADFFMPKGTDEVLMARTIKEKL-AR 224
Query: 594 GGKKVLVVPAFE---------------TQR----------------------YRTAFPAS 616
K+ LV+PAF+ +Q+ + P
Sbjct: 225 DAKQALVLPAFQLNYKCSDTTSADQCRSQKLEAMPQNIIDLHRGLDDGSVSMFDPKNPHG 284
Query: 617 HAPTNFSRW--VNATTPYQIEW--APDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
H T +++W +A++ I + +EPY+V R D+P + F G+GWNK
Sbjct: 285 HDSTGYNKWRKQSASSVEDISCINSERYEPYLVVRRCEDMPPFPEAFAGYGWNK 338
>gi|326911414|ref|XP_003202054.1| PREDICTED: glucoside xylosyltransferase 1-like [Meleagris
gallopavo]
Length = 382
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 74/173 (42%), Gaps = 19/173 (10%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTD++F + +W+ + Q + + +G W
Sbjct: 142 RLFLPLILKD-VDSLLYVDTDILFLRPVDDIWSFLRKFNSTQIAAMAPEHEEPRIG--WY 198
Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G G N+GV+L+++T++R + W + +K+ L W
Sbjct: 199 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTTVRLRWGEILMPLLKKYKLNITW- 257
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N + +P +Y PCQWN + E + I+H N
Sbjct: 258 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 308
>gi|301606486|ref|XP_002932854.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 487
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 95/222 (42%), Gaps = 31/222 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+ Q + + +G W
Sbjct: 248 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 304
Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
N P G+ G N+GV+L+++T++R + W + +K+ L W
Sbjct: 305 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 363
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
DQD+ N + S +P ++ PCQWN + D+ C+ + I+H N +
Sbjct: 364 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 417
Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL 406
++ FR +Y F Y FG +L + L EL
Sbjct: 418 HDRKQPAFRAVYDAFRNYR--------FGDDLFHSLLYPLEL 451
>gi|300175814|emb|CBK21357.2| unnamed protein product [Blastocystis hominis]
Length = 615
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 119/291 (40%), Gaps = 51/291 (17%)
Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
+RRS + R +E +E G ++ + + +R + N + + A
Sbjct: 251 VRRSADSPARCDGCKVETPFELRVSGGSALILRKFAGNREEFTIYN-STFYRFPAEDAPF 309
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVT V + DRL ++ L WEGP+ + +E + F+ Y N+ L +R +
Sbjct: 310 DVTGVCVTTRDRLTVLTQLLYRWEGPLVIVFNGVVSEEKAFIDYL-NTHFLPARLTTLLY 368
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
++ + G F+P+N LRN+A+ + T + +LD+D P Y L S+ G+
Sbjct: 369 LMPQRG-FFPVNLLRNLAIRNIRTTHFQVLDMDLWPSPNTYRELLRLPPSL----RDGRV 423
Query: 598 VLVVPAFETQR----------YRTAFPA-SHAPTN------------------------- 621
++P F R Y A A H P+N
Sbjct: 424 AAILPVFFFDRKQVLNRCKSFYDCALLALEHLPSNKTALEACLYAKACLSSKPGIRTHLY 483
Query: 622 -FSRW--VNATTPY-QIE-WAPDF-EPYIVAH--RDLPRYDTRFVGFGWNK 664
W + A P+ Q+ + DF EPY++ P +D RFV +G+NK
Sbjct: 484 VLPDWFTIPADIPFVQVRCFLTDFQEPYVLLRYAPTTPLFDERFVNYGYNK 534
>gi|358060807|dbj|GAA93578.1| hypothetical protein E5Q_00222 [Mixia osmundae IAM 14324]
Length = 565
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 104/230 (45%), Gaps = 40/230 (17%)
Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGN-SEAL 528
A + D+T+ ++ +R +++ L + ++GP+S+ +++ + AE ++ L+ + E+
Sbjct: 243 AEDNKRDITITTLVTANRFKVLTDLVEKYQGPVSVCIHVVNIPAEREELLTSLRDIYESH 302
Query: 529 RSRRN-IGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
+ +N + H+V NF N RNVA T YV +LD+DF + +S
Sbjct: 303 EAMKNWVDVHLVID--NFDRQFNMWRNVAKYFARTDYVMMLDVDFWICTDFRHRILASPD 360
Query: 587 SMDMHGHGGKKVLVVPAFETQR----------------------------YRTAFPASHA 618
+D G VVPAFE + + ++ H
Sbjct: 361 IIDKLS-AGTAAFVVPAFEFTKQVDGLDSKTFPTEKKDLLELVEGGKIGMFHKSWAPGHG 419
Query: 619 PTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
TN++R+ + Y++ + +EPYI+ R+ P D RF+G+G NK
Sbjct: 420 STNYTRFYESKEGEIYRVHGYTHSYEPYIIYKREGTPYCDERFIGYGGNK 469
>gi|291394020|ref|XP_002713234.1| PREDICTED: glucoside xylosyltransferase 2 [Oryctolagus cuniculus]
Length = 501
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L + Q + WY
Sbjct: 253 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRKFNSTQLAAMAPEHEIPKIGWY-- 309
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 310 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSVIPTGLAWGDMLYPLYQKYKNAI 366
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 367 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 419
>gi|395824579|ref|XP_003785540.1| PREDICTED: glucoside xylosyltransferase 2 [Otolemur garnettii]
Length = 319
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 72 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHETPKIG-WYS 129
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 130 RFARHPFYGSAGLNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 187
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 188 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 238
>gi|189237113|ref|XP_971629.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270007231|gb|EFA03679.1| hypothetical protein TcasGA2_TC013781 [Tribolium castaneum]
Length = 351
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 82/184 (44%), Gaps = 22/184 (11%)
Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
+ D + S P +YS + L L ++ P + ++LD D+ F DI ++ F
Sbjct: 133 IVDVMTPHFSSKPGTYYSDALFYISLGLYRIAPPSQRTALLLDCDLYFKKDIRLVFEEFD 192
Query: 234 RLRQRQTIGL----------------VENQS---DWYLGKLWKNHKPWPALGRGFNTGVI 274
+ + GL V+ S ++Y K ++ P +G+N+GV+
Sbjct: 193 KFKPTALFGLAPELTPVYRHVLQTYKVKKNSTFGEYYHSKAISSNMQHPRGFQGYNSGVV 252
Query: 275 LLDLTKLRDIS-WAGFWRIIAEKFLLT--RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
L++L+ +R ++ R ++L + + L DQD + I E+P L+ T+ C +N
Sbjct: 253 LINLSAIRKSKEYSQIIRNETVQYLTSKYKFRGHLGDQDFYTLIGFEYPDLIQTINCGFN 312
Query: 332 VQLS 335
QL
Sbjct: 313 RQLC 316
>gi|452823675|gb|EME30683.1| glycosyltransferase-like protein LARGE [Galdieria sulphuraria]
Length = 380
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 6/147 (4%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL-VENQ 247
HY G L KL LP L +VLDTD+I D+ +LW + RQ +G+ ++
Sbjct: 168 HYHGKEALWKLLLPSFLQSD--NLLVLDTDMIILRDVLELWHRWRLTRQSVHLGMPCMSR 225
Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-RLWTSL 306
+ +L LW+ P +G +L+ K+R +W K L WT++
Sbjct: 226 DNSWLYPLWREIFPENGTLHCM-SGTLLMRPRKMRQENWINVVASSTNKILEDFPGWTAI 284
Query: 307 -ADQDIFNAIISEHPYLVYTLPCQWNV 332
ADQDIFN + P V + C+WN
Sbjct: 285 SADQDIFNLVAKSIPEKVTNIGCEWNC 311
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 76/165 (46%), Gaps = 18/165 (10%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNP---LHFHLITDTVALNILQTLFSTWSVPQV 71
I VAIV G N ++ +I S++ +P HFHLI+D LN + ++ W+
Sbjct: 78 IDVAIVSIGENDILNMKRMISSLVM-SSSPNVKFHFHLISDKKGLNAAKNWYTLWNSACA 136
Query: 72 EVSFY------LADSVVEDVSWIPNK-----HYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
+ Y L+ ++ + S K HY G L KL LP L +VLDT
Sbjct: 137 VWNIYDIESSKLSKALDQFHSLTGGKMRTAVHYHGKEALWKLLLPSFLQSD--NLLVLDT 194
Query: 121 DVIFATDIAQLWALFSRLRQRQTIGL-VENQSDWYLGKLWKNHKP 164
D+I D+ +LW + RQ +G+ ++ + +L LW+ P
Sbjct: 195 DMIILRDVLELWHRWRLTRQSVHLGMPCMSRDNSWLYPLWREIFP 239
>gi|312373082|gb|EFR20905.1| hypothetical protein AND_18321 [Anopheles darlingi]
Length = 410
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 73/184 (39%), Gaps = 41/184 (22%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
+YS LL L L +++ + + I++D DV+F + +L+ F Q GL +
Sbjct: 194 YYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDNFSPDQLFGLAPELT 253
Query: 249 DWYLGKL--WKNHKPWPALGR------------------------------GFNTGVILL 276
Y L ++ + P G G N+GV++L
Sbjct: 254 PVYRHVLSRYRMNNPHTIFGSPYYLDSAGLSEAASVVSPPSGEKARRHGYPGLNSGVVML 313
Query: 277 DLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQW 330
L ++R +I R +AEK+ L DQD + + E P L+Y L C W
Sbjct: 314 HLDRIRRSRLYEEIIKESTVRNMAEKY---SFQGHLGDQDFYTLMGFEFPGLIYRLDCVW 370
Query: 331 NVQL 334
N QL
Sbjct: 371 NRQL 374
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 48/216 (22%), Positives = 91/216 (42%), Gaps = 26/216 (12%)
Query: 30 LVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY-------LADSVV 82
L ++S+L Y LH H+ITD + + L +FY A+ +
Sbjct: 118 LELFLRSLLKYSTIELHLHIITDEQSERSAEELIRQQIERFHRTAFYTLYDVRDCAEKIS 177
Query: 83 EDV-SWIP------NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 135
+ V S +P +YS LL L L +++ + + I++D DV+F + +L+ F
Sbjct: 178 DIVHSMMPLFNYRTGSYYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQF 237
Query: 136 SRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFYLADSVVEDVSWI-------- 185
Q GL + Y L ++ + P G +YL + + + + +
Sbjct: 238 DNFSPDQLFGLAPELTPVYRHVLSRYRMNNPHTIFGSPYYLDSAGLSEAASVVSPPSGEK 297
Query: 186 PNKH-YSGV-YGLLKLTLPKVLPETLAKTIVLDTDV 219
+H Y G+ G++ L L ++ L + I+ ++ V
Sbjct: 298 ARRHGYPGLNSGVVMLHLDRIRRSRLYEEIIKESTV 333
>gi|242219718|ref|XP_002475635.1| predicted protein [Postia placenta Mad-698-R]
gi|220725156|gb|EED79156.1| predicted protein [Postia placenta Mad-698-R]
Length = 337
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 9/108 (8%)
Query: 39 FYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVEVSFYL------ADSVVEDVSWIPN 90
Y P+ FH++ D A L+ + P + V FYL AD + + + I
Sbjct: 1 MYTSMPVEFHIVCDEAAQAYLENRVRLITRPAHDILVRFYLLSWQSMADRIDREGT-IMT 59
Query: 91 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW F++L
Sbjct: 60 DHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFAQL 107
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
+AD + + + I H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW F+
Sbjct: 47 MADRIDREGT-IMTDHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFA 105
Query: 234 RL 235
+L
Sbjct: 106 QL 107
>gi|158854046|ref|NP_001103670.1| glucoside xylosyltransferase 2 precursor [Danio rerio]
gi|141795891|gb|AAI34948.1| Glt8d4 protein [Danio rerio]
Length = 441
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W Q + +G +
Sbjct: 195 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWKFLKAFNSTQLAAMAPEHEIPKIG-WYS 252
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ +SW + +K+ W
Sbjct: 253 RFARHPYYGVTGVNSGVMLMNLTRIRNTLFRNSMIASGLSWENLLHPLYQKYKNHITW-- 310
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 311 -GDQDLLNIIFHYNPECLYIFPCQWNYRPDHCMYGSNCKAAEEEGVSILHGN 361
>gi|428181726|gb|EKX50589.1| hypothetical protein GUITHDRAFT_103814 [Guillardia theta CCMP2712]
Length = 294
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 56/122 (45%), Gaps = 19/122 (15%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS-RRNIGY 536
DVTLV Q S+DRL + W+G +S +Y E +Q + N EALR R I
Sbjct: 77 DVTLVTQGSLDRLSRLRDQALSWKGSLSAAIYFRHHE-RQGENLEKNLEALRELHREIEM 135
Query: 537 ----------------HVVYKE-GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
KE YPIN LRNVA+ T + ++DIDF+P G +
Sbjct: 136 LGSCRLRISLLFALDPETADKEYDTLYPINALRNVAVRMSYTDLIMVIDIDFVPSKGFWE 195
Query: 580 YL 581
YL
Sbjct: 196 YL 197
>gi|336414250|ref|ZP_08594596.1| hypothetical protein HMPREF1017_01704 [Bacteroides ovatus
3_8_47FAA]
gi|335933362|gb|EGM95364.1| hypothetical protein HMPREF1017_01704 [Bacteroides ovatus
3_8_47FAA]
Length = 324
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 66/135 (48%), Gaps = 11/135 (8%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP+VLP TL K + LD D+I + I LW + + IG VE+ + +
Sbjct: 95 RLMLPEVLPVTLDKILYLDCDIIVNSKIESLWNIDLKY---YAIGAVED--NIVISSEAP 149
Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
+P FN GV+L++L+ +RD + + E+ L ++ DQDI N ++
Sbjct: 150 RRLGYPVQSSYFNAGVMLMNLSLMRDTQFTKNAFVYIEQHLKEIVY---HDQDILNVLLY 206
Query: 318 EHPYLVYTLPCQWNV 332
+ LP +WNV
Sbjct: 207 DQKLF---LPIKWNV 218
>gi|395516491|ref|XP_003762421.1| PREDICTED: glucoside xylosyltransferase 2 [Sarcophilus harrisii]
Length = 452
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F I +W L + Q + WY
Sbjct: 204 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWRLLKQFNSTQLAAMAPEHEIPKIGWY-- 260
Query: 254 KLWKNHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R+ ++W + +K+ +
Sbjct: 261 SRFARH---PYYGMTGVNSGVMLMNLTRIRNTQFKNSMIPTGLAWEDMLYPLYQKY---K 314
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ + DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 315 NYITWGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEQEGVSVLHGN 370
>gi|302680991|ref|XP_003030177.1| glycosyltransferase family 49 protein [Schizophyllum commune H4-8]
gi|300103868|gb|EFI95274.1| glycosyltransferase family 49 protein [Schizophyllum commune H4-8]
Length = 446
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 98/229 (42%), Gaps = 47/229 (20%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ------QFLSYAGNSEALR 529
ND+T+ ++ +R + L + ++GPIS+ +++ + + + +S +
Sbjct: 88 ANDITMTTLITSNRFHVFSRLVERYQGPISVAIHVKNVTSHVDDLLASLHTLYASSPLMA 147
Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR-SM 588
+ ++ V + F NT RNVA T YV +LDID F L + IR S
Sbjct: 148 AHVDVHLIVDVFDRQF---NTWRNVARFFARTDYVMMLDID----FSLCTDFRRVIRESA 200
Query: 589 DMHGHGGKKV-LVVPAFETQRYRTA-----FPAS-----------------------HAP 619
D+ G + V V+ AFE FP + H
Sbjct: 201 DLRGRLDEGVAFVITAFEYTNLEEGTNQEVFPRTKEDLLDLVNVGRIGMFHASWQPGHNS 260
Query: 620 TNFSRWVNATTP--YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
T ++++ A Y++ + P +EPYIV +D P + D RF+G+G NK
Sbjct: 261 TEYAKYYGAAPGEVYEVTRYQPAYEPYIVMKKDGPPWCDERFIGYGGNK 309
>gi|428179611|gb|EKX48481.1| hypothetical protein GUITHDRAFT_105628 [Guillardia theta CCMP2712]
Length = 953
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 70/178 (39%), Gaps = 64/178 (35%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAF 604
+PIN LR ++ T V D+D L GL S+IR G G ++LV+P+F
Sbjct: 571 FPINMLRKISCAIARTDLVLYADVDMLASDGL----ASNIRQAYQDGKVGPTELLVIPSF 626
Query: 605 ET--------------QRYRTAFPAS---------------------------------- 616
+T QR R PAS
Sbjct: 627 KTSGSWPREAGISLEEQRIRVE-PASISMEALRLNFEGRQVWIPGLDMGMWSTTGRWHEA 685
Query: 617 ---HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPR-------YDTRFVGFGWNK 664
HA T + RW+ A YQ+++ +EPY+V +R R YD RFV +GW+K
Sbjct: 686 GVFHAGTEYDRWMEAEETYQVKYILGYEPYVVVNRSAWRGSHGVGLYDDRFVHWGWDK 743
>gi|397629521|gb|EJK69396.1| hypothetical protein THAOC_09354 [Thalassiosira oceanica]
Length = 489
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 94/235 (40%), Gaps = 46/235 (19%)
Query: 475 DGNDV--TLVAQLSMDRLQMVEMLFKHW-EGPISLTLY--MSDAEAQQFLSYAGNSEALR 529
D DV TLV+QLS DRL MV + W + PIS+ ++ S + + L G E
Sbjct: 102 DAADVSYTLVSQLSNDRLWMVRYHCERWGDNPISIVVFSDRSSDDIKTQLIREGCYEPGL 161
Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
+ + + G YP+N LRN+A++ V T ++ DIDF P L+ ++ S +
Sbjct: 162 TVQVVSKSRYDPTGKEYPVNILRNMAISAVKTSHIVYADIDFWPSADLH-HILSKDEMKE 220
Query: 590 MHGHGGKKVLVVPAFE------------------------------TQRYRTAFP----A 615
K ++P F+ +R + F
Sbjct: 221 RFAKDAKLATIIPVFQMFRRCKEYKDCQEKNIPVMPKDKDSLINLIKKREASTFDPTNIG 280
Query: 616 SHAPTNFSRWVNATTPYQIEW----APDFEPYIVAH--RDLPRYDTRFVGFGWNK 664
H T + W + + ++ + +EPY+ DLP + F G+G NK
Sbjct: 281 GHGSTKYITWRDQSPATFVDLPCIKSNRYEPYLTFRYCSDLPPFQEGFTGYGKNK 335
>gi|449474226|ref|XP_002189241.2| PREDICTED: glucoside xylosyltransferase 2 [Taeniopygia guttata]
Length = 334
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F I +W L Q + +G +
Sbjct: 87 RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHLLREFNSTQLAAMAPEHEIPKIG-WYS 144
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ + + +
Sbjct: 145 RFARHPYYGTTGLNSGVMLMNLTRIRSARFKNSLIPGGLTWEEMLYPLYQKY---KSYIT 201
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 202 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCRAAEEEGVSILHGN 253
>gi|149022671|gb|EDL79565.1| glycosyltransferase-like 1B, isoform CRA_c [Rattus norvegicus]
Length = 123
Score = 57.0 bits (136), Expect = 3e-05, Method: Composition-based stats.
Identities = 22/32 (68%), Positives = 29/32 (90%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
P CE++ VAIVCAGYNS+R ++TL+KS+LFYR
Sbjct: 62 PKCEMLHVAIVCAGYNSSREIITLMKSVLFYR 93
>gi|338714503|ref|XP_001493752.2| PREDICTED: glucoside xylosyltransferase 2 [Equus caballus]
Length = 317
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSTGLNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236
>gi|426197653|gb|EKV47580.1| hypothetical protein AGABI2DRAFT_204862 [Agaricus bisporus var.
bisporus H97]
Length = 430
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 115/260 (44%), Gaps = 64/260 (24%)
Query: 462 INPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
+ P+ + Y + G +D+T+ + ++ +R +++ L + ++GPIS+TL++++A
Sbjct: 72 MKPSQIIPYYLRASGVFHKHDITITSIITSNRFEILARLAQRYQGPISVTLHINNATE-- 129
Query: 518 FLSYAGNSEALRSRRN--------IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
+ E+L + N + H+V + NT RN+A T YV +LD+
Sbjct: 130 --NVPHLLESLHNLYNTTPNMAVYVDVHLVLDTFD-RQFNTWRNIARLFARTEYVMMLDV 186
Query: 570 DFLPMFGLYPYLKSSIRSMDMHG------------HGGKKVLVVPAFETQRYRTA----- 612
DF L++ ++ +D +G + G VVPAFE Y+
Sbjct: 187 DFYICTDFRETLQNLLQ-VDGNGEIGEGENIREMFNNGVAAFVVPAFEYINYKEGREYKT 245
Query: 613 FPAS-----------------------HAPTNFSRWVNATTPYQI----EWAPDFEPYIV 645
FP + H T++ R+ +A P +I ++ +EPY +
Sbjct: 246 FPKTKEHLMQLVQQRRIGMFHRVWAPGHNSTDYDRFYSA-KPGEIYKVTQFHSAYEPYTI 304
Query: 646 AHRDLPRY-DTRFVGFGWNK 664
+D P + D RF+G+G NK
Sbjct: 305 FRKDGPPWCDERFIGYGGNK 324
>gi|301757713|ref|XP_002914707.1| PREDICTED: glucoside xylosyltransferase 2-like [Ailuropoda
melanoleuca]
Length = 431
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 184 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 241
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 242 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 299
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 300 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKAAEREGVSVLHGN 350
>gi|354465598|ref|XP_003495266.1| PREDICTED: glucoside xylosyltransferase 2 [Cricetulus griseus]
Length = 376
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 129 RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 186
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ W
Sbjct: 187 RFARHPFYGSTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLLPLYQKYKNAITW-- 244
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 245 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 295
>gi|317418931|emb|CBN80969.1| Glycosyltransferase 8 domain-containing protein 4 [Dicentrarchus
labrax]
Length = 448
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W+L Q + +G +
Sbjct: 203 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWSLLKSFNSTQLAAMAPEHEVPKIG-WYS 260
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R +SW + +K+ W
Sbjct: 261 RFARHPFYGVTGVNSGVMLMNLTRIRSTMFKNSMIPSGLSWEDLLHPLYQKYKNHITW-- 318
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 319 -GDQDLLNIIFHYNPECLFIFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 369
>gi|348575275|ref|XP_003473415.1| PREDICTED: glucoside xylosyltransferase 2-like [Cavia porcellus]
Length = 478
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L + Q + WY
Sbjct: 231 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 287
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 288 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMGPTGLAWEDMLYPLYQKYKNAI 344
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 345 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 397
>gi|109472410|ref|XP_001066466.1| PREDICTED: glucoside xylosyltransferase 2-like [Rattus norvegicus]
gi|392340057|ref|XP_003753976.1| PREDICTED: glucoside xylosyltransferase 2-like [Rattus norvegicus]
Length = 444
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L + Q + WY
Sbjct: 197 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 253
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 254 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSLIPTGLAWEEMLLPLYQKYKNAI 310
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 311 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 363
>gi|397610368|gb|EJK60794.1| hypothetical protein THAOC_18795 [Thalassiosira oceanica]
Length = 565
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 73/162 (45%), Gaps = 18/162 (11%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM------SDAEAQQFLSYAGNSEALR 529
G VTLV Q S+DR+ ++ W PI L +Y+ A+ +S +
Sbjct: 217 GVSVTLVVQSSIDRIWLLSETCSRWTEPIVLVVYLQWTMLNEKAQKSTAISQIVDVNMAC 276
Query: 530 SRRNIGYHVVYKEGN-----FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
+ + HV N YP+N +RN L+ V T +V +LD+D +P L K +
Sbjct: 277 PQMTVVPHVHDDRDNGEGTTTYPVNIMRNRGLDVVDTSHVLILDVDLIPSGDLSHVAKDN 336
Query: 585 I-RSMDMHGHGGK-----KVLVVPAFETQRYRTAFPASHAPT 620
+ + M G K LVVPAFE + +R P ++A +
Sbjct: 337 LMDEVSMTETNGDFQVPLKALVVPAFERKVHRDP-PCTNAAS 377
>gi|221194679|ref|ZP_03567736.1| glycosyl transferase family 8 [Atopobium rimae ATCC 49626]
gi|221185583|gb|EEE17973.1| glycosyl transferase family 8 [Atopobium rimae ATCC 49626]
Length = 358
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 107/245 (43%), Gaps = 41/245 (16%)
Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
R I L + S + L + + P +G GF D+ + D+ +P+ + +
Sbjct: 45 RNYDVIVLTRDLSPTSMITLTRQCQVTPNVGIGFLDVDAALGDIE-LPHHGHFRPETYFR 103
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWYL 252
L P +LP +AK + LD+D++ D+A+L+ L + R TIG ++ D +
Sbjct: 104 LLAPSLLP-NVAKAVYLDSDLVVNHDVAELFDIDVTGYLVAATRDADTIGQIDGY-DSTV 161
Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFL---LTRLWTSLADQ 309
G K+ F GVIL++L ++R I E+FL R W L DQ
Sbjct: 162 GPYLKDELGMNDPHDYFQAGVILMNLAEIRT-------SIAPEEFLKVATMRNWRWL-DQ 213
Query: 310 DIFNAIISEHPYLVYTLPCQWN--------------VQLSDNTRSDELCYTEL-TDLKII 354
D+ N +I+ H YL +P +WN Q + R + Y E +DL I+
Sbjct: 214 DVLNRLINGH-YL--RIPMKWNYLADWQFLRRTHIVAQAPKDVREE---YDEARSDLCIV 267
Query: 355 HWNSP 359
H+ P
Sbjct: 268 HYAGP 272
>gi|335299327|ref|XP_003132373.2| PREDICTED: glucoside xylosyltransferase 2 [Sus scrofa]
Length = 317
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEQEGVSVLHGN 236
>gi|384248337|gb|EIE21821.1| hypothetical protein COCSUDRAFT_56271 [Coccomyxa subellipsoidea
C-169]
Length = 359
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 60/133 (45%), Gaps = 15/133 (11%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRL-RQRQTIGLVEN----QSDWYLGKLWKNHKPWP 263
L K I +DTD ++ D W F + R GL EN WY+ + P P
Sbjct: 182 LDKVIYVDTDTLWLDDAVFWWTHFKHVGTMRAAFGLAENAGSGNGSWYI------NAPIP 235
Query: 264 ALG-RGFNTGVILLDLTKLRDISW-AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
G RG N G ++ L +R ++ A IIA L +L L DQD+ N +HP
Sbjct: 236 HHGERGVNAGTMMASLAAIRASNFTAERDAIIANYLPLNQL--PLGDQDVLNIYGHDHPD 293
Query: 322 LVYTLPCQWNVQL 334
+Y +PC +N +
Sbjct: 294 QIYVMPCIFNFRF 306
>gi|426249850|ref|XP_004018661.1| PREDICTED: glucoside xylosyltransferase 2 [Ovis aries]
Length = 454
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L + Q + WY
Sbjct: 207 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 263
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 264 SRFARH---PFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAI 320
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 321 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 373
>gi|334335508|ref|XP_001373334.2| PREDICTED: glucoside xylosyltransferase 2-like [Monodelphis
domestica]
Length = 704
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F I +W L + Q + WY
Sbjct: 456 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWRLLKQFNSTQLAAMAPEHEIPKIGWY-- 512
Query: 254 KLWKNHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R+ ++W + +K+ +
Sbjct: 513 SRFARH---PYYGMTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLYPLYQKY---K 566
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ + DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 567 NYITWGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEQEGVSVLHGN 622
>gi|281353070|gb|EFB28654.1| hypothetical protein PANDA_002618 [Ailuropoda melanoleuca]
Length = 354
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 107 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 164
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 222
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKAAEREGVSVLHGN 273
>gi|393235330|gb|EJD42886.1| hypothetical protein AURDEDRAFT_152720 [Auricularia delicata
TFB-10046 SS5]
Length = 412
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 6/130 (4%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVE 72
+ + +V G S R ++KS L PL H++ A+ L + P V
Sbjct: 58 VVLVLVMFGLASAREGQHMLKSALMRSSRPLDVHIVCSPDAVPFLHARLDLVTRPARDVF 117
Query: 73 VSFY--LADSVVEDV--SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
V FY D + + + +KH++G+ GL+K+ L ++LP ++ + + +DTD FA+D
Sbjct: 118 VQFYPITPDKIATRAQRAGVSSKHHAGLGGLVKIFLHELLPPSVPRALYVDTDAFFASDP 177
Query: 129 AQLWALFSRL 138
LW RL
Sbjct: 178 YLLWQTLLRL 187
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
+ +KH++G+ GL+K+ L ++LP ++ + + +DTD FA+D LW RL
Sbjct: 137 VSSKHHAGLGGLVKIFLHELLPPSVPRALYVDTDAFFASDPYLLWQTLLRL 187
>gi|409041947|gb|EKM51432.1| glycosyltransferase family 8 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 459
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 12/159 (7%)
Query: 7 PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
P P+ + + A++ +S L+KSI+ Y P+H ++I D A ++ +
Sbjct: 103 PTAPS-QPVTFALIMWSEDSASEGAILLKSIMMYTSVPVHVYIICDEQAQAHIEKRLALV 161
Query: 67 SVPQ--VEVSFYLAD-----SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
P+ + V FY VE I H +G GL+KL + ++LP + + I +D
Sbjct: 162 RRPRHDLHVFFYRLSWQAMLGRVEREGSISTVHAAGTPGLMKLFIHEILPTHVRRAIFVD 221
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVEN----QSDWY 154
TD F TD LW F I L + +WY
Sbjct: 222 TDAFFTTDPLLLWRQFGTFNATTAIALPSHPEMFAPEWY 260
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 68/172 (39%), Gaps = 27/172 (15%)
Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
VE I H +G GL+KL + ++LP + + I +DTD F TD LW F
Sbjct: 184 VEREGSISTVHAAGTPGLMKLFIHEILPTHVRRAIFVDTDAFFTTDPLLLWRQFGTFNAT 243
Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD--------------- 283
I L + + + + + +K + V+LLDL + R
Sbjct: 244 TAIAL-PSHPEMFAPEWYDANK--------ICSCVMLLDLERFRTLRLMDSTHYRTAGLP 294
Query: 284 -ISWAGFWRIIAEKFLLTRLW--TSLADQDIFNAIISEHPYLVYTLPCQWNV 332
++ A F + E T + +L DQ + AII P L L W V
Sbjct: 295 ALAPATFRALFGEPDAATGHFADAALGDQTFWWAIIQGRPELFKHLHYDWEV 346
>gi|270001550|gb|EEZ97997.1| hypothetical protein TcasGA2_TC000395 [Tribolium castaneum]
Length = 432
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 100/279 (35%), Gaps = 92/279 (32%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISL-TLYMSDAEAQQFLSYAG--N 524
D + A SD + L Q S++++ + + +W GPISL T SD E L Y
Sbjct: 92 DHFAALSDVHKTCLATQSSLEKIASIVEVTANWNGPISLATFAASDDELNSLLLYILFLR 151
Query: 525 SEALRSRRNIGYHVVY----------------KEGNF----------------------- 545
+ R + +H Y KE N
Sbjct: 152 DCFAKIREQVSFHFAYPKDHRPRNIDIDFDKLKEMNCANPSGVLQQLVKQYLKGNKWRTK 211
Query: 546 --YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH----GHGGKKVL 599
YP N LRN+A + +VFL D+D +P K + +D+ G
Sbjct: 212 LPYPQNHLRNLARKNCQSKFVFLTDVDIIPS-------KDMVEKLDLFLGRVKCSGLCAY 264
Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRW---------V 626
V+P +E R FP + TNF++W V
Sbjct: 265 VIPTYEIDE-RVRFPPNKTELIRLANKGLARPFHHKVFIYNQFATNFTKWQSTSNESPEV 323
Query: 627 NATTPY-QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
+ + P E+ +EP+ VA +P +D RFVG+G+ +
Sbjct: 324 HISHPVTNFEFL--YEPFYVAPDTVPPHDERFVGYGYTR 360
>gi|391229413|ref|ZP_10265619.1| LPS:glycosyltransferase [Opitutaceae bacterium TAV1]
gi|391219074|gb|EIP97494.1| LPS:glycosyltransferase [Opitutaceae bacterium TAV1]
Length = 323
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 67/137 (48%), Gaps = 15/137 (10%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
+ +P++ PE K I LD D++ D+ +LW + + G+VE D K +
Sbjct: 88 FRFLVPQIRPE-FRKAIYLDCDLVVQDDLKKLWDVEP---GQNYAGVVE---DLLPRKDY 140
Query: 257 KNHKPWPALGRGFNTGVILLDLTKLR-DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
++HK + R FN G++LL+L ++R D S F I + R W ADQD+ N
Sbjct: 141 RSHKARIGIRRYFNAGMLLLNLEQIRQDFSVETFLSIERK----NRAWFLFADQDVLNFA 196
Query: 316 ISEHPYLVYTLPCQWNV 332
+ V LP +WNV
Sbjct: 197 FANR---VIYLPLRWNV 210
>gi|359078230|ref|XP_002697017.2| PREDICTED: glucoside xylosyltransferase 2 [Bos taurus]
Length = 317
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTDV+F + +W L + Q + +G +
Sbjct: 70 RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 236
>gi|363738785|ref|XP_414430.2| PREDICTED: glucoside xylosyltransferase 2 [Gallus gallus]
Length = 451
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F I +W + Q + +G +
Sbjct: 204 RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 261
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 262 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFRNSMIPSGLTWEEMLYPLYQKY---KNYIT 318
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 319 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 370
>gi|449283522|gb|EMC90138.1| Glycosyltransferase 8 domain-containing protein 4, partial [Columba
livia]
Length = 350
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F I +W + Q + +G +
Sbjct: 103 RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 160
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 161 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFKNSIIPSGLTWEEMLYPLYQKY---KNYIT 217
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 218 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 269
>gi|384248338|gb|EIE21822.1| hypothetical protein COCSUDRAFT_56272 [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)
Query: 207 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT-IGLVENQSDWYLGKLWKNHKPWPAL 265
+ L K I +DTD ++ D W F+ ++ + G+ E + G W + P
Sbjct: 151 QDLDKVIYVDTDTLWLDDPVWWWTHFAHMKTKHAAFGMAEESNS---GGSWYTNAQIPHH 207
Query: 266 G-RGFNTGVILLDLTKLRDISW-AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLV 323
G RG N GV++ L +R ++ A IIA L L L DQDI N HP
Sbjct: 208 GARGLNAGVLMASLATVRASNFTAEREAIIAHYLPLGHL--PLGDQDILNIYGHNHPEQF 265
Query: 324 YTLPCQWNVQ 333
Y +PC +N +
Sbjct: 266 YAMPCMFNFR 275
>gi|296475003|tpg|DAA17118.1| TPA: hypothetical protein BOS_20881 [Bos taurus]
Length = 354
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTDV+F + +W L + Q + +G +
Sbjct: 107 RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 164
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 222
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 273
>gi|345786246|ref|XP_853462.2| PREDICTED: glucoside xylosyltransferase 2 [Canis lupus familiaris]
Length = 317
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPEYLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236
>gi|410951581|ref|XP_003982473.1| PREDICTED: glucoside xylosyltransferase 2 [Felis catus]
Length = 433
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 186 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 243
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 244 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 301
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 302 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 352
>gi|167518187|ref|XP_001743434.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778533|gb|EDQ92148.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 21/127 (16%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL-RSRRN-- 533
+D+TLV Q + DRL + ML W GP + D + EA+ R+ R+
Sbjct: 223 DDITLVTQATFDRLDVFGMLANAWPGPKVAVFAVFDFDEASHAKAQVQIEAIIRTARDWT 282
Query: 534 ---IGYHVVYKEGNFY---------------PINTLRNVALNQVSTPYVFLLDIDFLPMF 575
I + V + ++Y PIN RN+A++ T +VF D+DF+P
Sbjct: 283 NVKILVYPVTTKDDYYTDRYRRHDPASEPQLPINVFRNLAVDHARTNFVFTCDMDFVPSS 342
Query: 576 GLYPYLK 582
LYP L+
Sbjct: 343 TLYPKLR 349
>gi|432119874|gb|ELK38654.1| Glucoside xylosyltransferase 2 [Myotis davidii]
Length = 392
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/157 (23%), Positives = 64/157 (40%), Gaps = 16/157 (10%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGR-GFNT 271
+ +DTDV+F + +W L + Q + +G + P G G N+
Sbjct: 159 LYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYSRFAQHPFYGSAGVNS 217
Query: 272 GVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHP 320
GV+L++LT++R ++W + +K+ W DQD+ N I +P
Sbjct: 218 GVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW---GDQDLLNIIFYFNP 274
Query: 321 YLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+Y PC WN + EL + ++H N
Sbjct: 275 ERLYVFPCHWNYRPDHCMYGSNCKRAELEGVSVLHGN 311
>gi|344248898|gb|EGW05002.1| Glycosyltransferase 8 domain-containing protein 4 [Cricetulus
griseus]
Length = 539
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 74/174 (42%), Gaps = 21/174 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KL 255
+L LP +L + + + +DTDV+F + +W L + Q + +G
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWYSR 128
Query: 256 WKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLW 303
+ H P G G N+GV+L++LT++R+ ++W + +K+ W
Sbjct: 129 FARH---PFYGSTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLLPLYQKYKNAITW 185
Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 ---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236
>gi|440891388|gb|ELR45103.1| Glucoside xylosyltransferase 2, partial [Bos grunniens mutus]
Length = 354
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTDV+F + +W L + Q + +G +
Sbjct: 107 RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 164
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 222
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 273
>gi|326928149|ref|XP_003210244.1| PREDICTED: glucoside xylosyltransferase 2-like [Meleagris
gallopavo]
Length = 463
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F I +W + Q + +G +
Sbjct: 216 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 273
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 274 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFRNSVIPSGLTWEEMLYPLYQKY---KNYIT 330
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 331 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 382
>gi|355694022|gb|AER99529.1| glucoside xylosyltransferase 2 [Mustela putorius furo]
Length = 334
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 88 RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 145
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 146 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 203
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 204 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 254
>gi|308494925|ref|XP_003109651.1| hypothetical protein CRE_07260 [Caenorhabditis remanei]
gi|308245841|gb|EFO89793.1| hypothetical protein CRE_07260 [Caenorhabditis remanei]
Length = 164
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 51/108 (47%), Gaps = 1/108 (0%)
Query: 9 IPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
P + + +A V G TR + + KS+L Y + + I D + L ++W+
Sbjct: 55 FPEEDHVSLAYVLGGNFMTRLMFMQHFKSVLNYSDHYFRLYFIVDDENREDVNDLMTSWN 114
Query: 68 VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 115
+ E FY V WIPN HYS YGL KL +P+++ + + K
Sbjct: 115 ITNCEWFFYNLTQYDSRVKWIPNSHYSKYYGLSKLLIPEIISDNVGKV 162
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 212
V WIPN HYS YGL KL +P+++ + + K
Sbjct: 132 VKWIPNSHYSKYYGLSKLLIPEIISDNVGKV 162
>gi|327266150|ref|XP_003217869.1| PREDICTED: glucoside xylosyltransferase 2-like [Anolis
carolinensis]
Length = 334
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F I +W + Q + +G +
Sbjct: 87 RLFLPMILKD-VDSLLYVDTDVLFLRPIDDIWRFLTAFNSTQLAAMAPEHEIPKIG-WYS 144
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R+ ++W + +K+ + + +
Sbjct: 145 RFARHPYYGTTGVNSGVMLMNLTRIRNAQFKNSMIPTGLTWEEMLYPLYQKY---KNYIT 201
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 202 WGDQDLLNIIFYFNPECLYLFPCQWNYRPDHCMYGSNCKGAEEEGISILHGN 253
>gi|158285361|ref|XP_308268.3| AGAP007604-PA [Anopheles gambiae str. PEST]
gi|157019955|gb|EAA03928.3| AGAP007604-PA [Anopheles gambiae str. PEST]
Length = 416
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 78/198 (39%), Gaps = 50/198 (25%)
Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
++ +YS LL L L +++ + + I++D DV+F + +L+ F + Q G
Sbjct: 187 NYRSGSYYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDQFAPDQLFG 246
Query: 243 LVEN---------------------QSDWYLGKLWKNHKPWPALGR-------------- 267
L S +YL K+ P LG+
Sbjct: 247 LAPELTPVYRHVLSRYRMNNPQTSFGSPYYLDKVPAAKDMQP-LGKDVPRGKKAPASEPR 305
Query: 268 -----GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
G N+GV++L L ++R +I + +AEK+ L DQD + +
Sbjct: 306 HHGYPGLNSGVVMLHLDRIRRSRIYEEIIKESTVKNMAEKY---SFQGHLGDQDFYTLMG 362
Query: 317 SEHPYLVYTLPCQWNVQL 334
E P L+Y L C WN QL
Sbjct: 363 FEFPGLIYRLDCVWNRQL 380
Score = 45.8 bits (107), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 44/203 (21%), Positives = 79/203 (38%), Gaps = 34/203 (16%)
Query: 6 EPVIPTCEVIQVAIVCAGYNSTRSLVTLI------------------KSILFYRKNPLHF 47
+P ++ +V V G N T + LI +S+L Y LH
Sbjct: 75 DPAAAASKLKRVKYVSKGTNDTEYNIFLIYTKESQNMILHSQLELFLRSLLKYSTIELHL 134
Query: 48 HLITDTVALNILQTLFSTWSVPQVEVSFY-------LADSVVEDV-------SWIPNKHY 93
H+ITD + + L +FY A+ + + V ++ +Y
Sbjct: 135 HVITDEQSERSAEELIKQQIERFHRTAFYTLYDVRDCAEKISDIVHGMMPLFNYRSGSYY 194
Query: 94 SGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDW 153
S LL L L +++ + + I++D DV+F + +L+ F + Q GL +
Sbjct: 195 SDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDQFAPDQLFGLAPELTPV 254
Query: 154 YLGKL--WKNHKPWPALGRGFYL 174
Y L ++ + P + G +YL
Sbjct: 255 YRHVLSRYRMNNPQTSFGSPYYL 277
>gi|443711810|gb|ELU05398.1| hypothetical protein CAPTEDRAFT_177574 [Capitella teleta]
Length = 221
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 1/53 (1%)
Query: 613 FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
FPA+H PTNF++W A+ PY+I + P FEPY+V +P Y F+G NK
Sbjct: 78 FPAAHMPTNFTKWRTASIPYRIHYGPYFEPYVVVSVAAIPPYVEEFLGRQMNK 130
>gi|296225683|ref|XP_002758628.1| PREDICTED: glucoside xylosyltransferase 2 [Callithrix jacchus]
Length = 443
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSILHGN 362
>gi|402859592|ref|XP_003894235.1| PREDICTED: glucoside xylosyltransferase 2, partial [Papio anubis]
Length = 405
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 158 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSSQLAAMAPEHEIPKIGWY-- 214
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 215 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 271
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 272 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 324
>gi|390605133|gb|EIN14524.1| hypothetical protein PUNSTDRAFT_140795 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 519
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 11/162 (6%)
Query: 3 SDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
S E+ P + I +++ G + R ++KSIL + P+ FH++ A+ +L++
Sbjct: 42 STEQAGGPDPDTIVFSLLMLGATTAREGSQMMKSILMHATRPVDFHIVCTPDAVPVLESP 101
Query: 63 FSTWSVPQ--VEVSFYLADS--VVEDV--SWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 116
S P + V Y+ D E V S + N + +Y L+KL L +++P K I
Sbjct: 102 LSLIHNPAHGIRVRLYVVDEQYAFERVRRSGVGNPN---MYQLVKLFLHEIIPAE--KAI 156
Query: 117 VLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 158
+DTD +F+ D A LW F R + + + G +
Sbjct: 157 YVDTDALFSYDPAALWREFERFTPTTIAAITAKEEAGHAGSI 198
>gi|26343315|dbj|BAC35314.1| unnamed protein product [Mus musculus]
gi|148695647|gb|EDL27594.1| glycosyltransferase-like 1B, isoform CRA_b [Mus musculus]
Length = 118
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 32/40 (80%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
SS + + P CE++ VAIVCAGYNS+R ++TL KS+LFYR
Sbjct: 55 SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYR 94
>gi|348510584|ref|XP_003442825.1| PREDICTED: glucoside xylosyltransferase 2-like [Oreochromis
niloticus]
Length = 450
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L + Q + +G +
Sbjct: 205 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWNLLKSFNRTQLAAMAPEHEVPKIG-WYS 262
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R +SW + +K+ W
Sbjct: 263 RFARHPFYGVTGVNSGVMLMNLTRIRSTLFKNSMIPTGLSWEDLLHPLYQKYKNHITW-- 320
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P ++ PCQWN + E + I+H N
Sbjct: 321 -GDQDLLNIIFHYNPERLFIFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 371
>gi|332231527|ref|XP_003264947.1| PREDICTED: glucoside xylosyltransferase 2 [Nomascus leucogenys]
Length = 443
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362
>gi|122937187|ref|NP_001073862.1| glucoside xylosyltransferase 2 precursor [Homo sapiens]
gi|152125899|sp|A0PJZ3.2|GXLT2_HUMAN RecName: Full=Glucoside xylosyltransferase 2; AltName:
Full=Glycosyltransferase 8 domain-containing protein 4
Length = 443
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362
>gi|194677265|ref|XP_583887.4| PREDICTED: glucoside xylosyltransferase 2 [Bos taurus]
Length = 251
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 68/167 (40%), Gaps = 16/167 (9%)
Query: 203 KVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW 262
+V+ + + +DTDV+F + +W L + Q + +G +
Sbjct: 8 RVILRDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYSRFARH 66
Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
P G G N+GV+L++LT++R ++W + +K+ W DQD
Sbjct: 67 PFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW---GDQD 123
Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ N I +P +Y PCQWN + E + ++H N
Sbjct: 124 LLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 170
>gi|389741822|gb|EIM83010.1| hypothetical protein STEHIDRAFT_84404, partial [Stereum hirsutum
FP-91666 SS1]
Length = 340
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 98/228 (42%), Gaps = 60/228 (26%)
Query: 488 DRLQMVEMLFKHWEGPISLTLYM----------SDAEAQQFLSYAGNSEALRSRRNIGYH 537
+RLQ++E L ++GP+S+TL++ ++ S +S AL +R +I H
Sbjct: 31 NRLQVLEKLAARYQGPLSVTLHLYLHPNGSLSSLNSSILSLHSLIRSSPALAARADI--H 88
Query: 538 VVYKEGNFYP--INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG--- 592
+V +P NT RN+A T Y+ LLD+D F + + +R+M G
Sbjct: 89 LVVSP---FPRAFNTWRNIARFLARTSYILLLDVD----FAMCTAWRDPVRAMLQSGGEM 141
Query: 593 ----HGGKKVLVVPAFETQRYRTA-----FPA-----------------------SHAPT 620
G+ LV+PAFE R R FP+ H T
Sbjct: 142 AKRMREGRAALVLPAFEYARARDGKDQMLFPSDKQALLEQVHSHKILPFHASWAQGHNST 201
Query: 621 NFSRWVNATTP--YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
++ R+ Y++ + +EPY++ +D + D RF G+G NK
Sbjct: 202 DYDRFYRTQLGEIYRVARYQSAYEPYVIVRKDAGGWCDERFNGYGANK 249
>gi|426341237|ref|XP_004035955.1| PREDICTED: glucoside xylosyltransferase 2, partial [Gorilla gorilla
gorilla]
Length = 403
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 156 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 212
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 213 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 269
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 270 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 322
>gi|403297281|ref|XP_003939503.1| PREDICTED: glucoside xylosyltransferase 2 [Saimiri boliviensis
boliviensis]
Length = 317
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 70 RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + I+H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYIFPCQWNYRPDHCMYGSNCREAEHEGVSILHGN 236
>gi|358057892|dbj|GAA96137.1| hypothetical protein E5Q_02798 [Mixia osmundae IAM 14324]
Length = 443
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 102/260 (39%), Gaps = 63/260 (24%)
Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEM-LFKHWEGPISLTLYMSDAEAQQFLS 520
+ P V YEA VTLV+ ++MDRL+ + + W G IS+++Y++ + L
Sbjct: 65 VMPALVPTYEAPR----VTLVSHMTMDRLKAYGVSVVSRWAGSISISVYLTQQTDELDLV 120
Query: 521 YAGNSEALRSRRNIGYHVVY-----KEGNF-----YPINTLRNVALNQVSTPYVFLLDID 570
+E + N+ +V E + YPIN LRN+A++ YV + D D
Sbjct: 121 EFFRNEQKTAWENVTITIVRPSYAPDEASLLHRLRYPINRLRNLAISVAPAAYVLMTDAD 180
Query: 571 FLPMFGLY--------PYLKSSIRSMDMHGHGG----KKVLVVPAFETQRYRTAFPAS-- 616
F+P ++ P L S I + H K +V+P F T F +S
Sbjct: 181 FVPSPNMHEIITRHGLPLLDSKITAWPS-AHDPRMIIKHAIVIPCFVASPGATQFASSQD 239
Query: 617 ---------------------HAPTNFSRWVNATT---------PYQIEWAPDFEPY--I 644
H PT R + Y + + P++EPY +
Sbjct: 240 ELKAHLGARPALLSLTDPTIGHGPTAPHRLIKTQAYEDSPISAWSYSVGFEPEWEPYYLL 299
Query: 645 VAHRDLPRYDTRFVGFGWNK 664
H L YD RF G +K
Sbjct: 300 EKHSHL-LYDERFTDQGGDK 318
>gi|344276496|ref|XP_003410044.1| PREDICTED: glucoside xylosyltransferase 2-like [Loxodonta africana]
Length = 442
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 23/175 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 195 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRHFNSTQLAAMAPEHEIPKIGWY-- 251
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRL 302
+ H + A G N+GV+L++LT++R ++W + +K+
Sbjct: 252 SRFARHPFYGAAG--VNSGVMLMNLTRIRSTQFKNSLISTGLAWEDMLYPLYQKYKNAIT 309
Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PC WN + E + ++H N
Sbjct: 310 W---GDQDLLNIIFYFNPECLYVFPCHWNYRPDHCMYGSNCKEAEREGVSVLHGN 361
>gi|242000568|ref|XP_002434927.1| conserved hypothetical protein [Ixodes scapularis]
gi|215498257|gb|EEC07751.1| conserved hypothetical protein [Ixodes scapularis]
Length = 173
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/122 (22%), Positives = 57/122 (46%), Gaps = 5/122 (4%)
Query: 212 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNT 271
+ +D+D++F + +LW+ + +Q + + + D + + K L G N
Sbjct: 8 VLYVDSDIVFLRPVEELWSYLGAMDSQQMVAMAPHAEDANVSAYTRYAKHPFVLPYGVNA 67
Query: 272 GVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
G++L++LT++R+ +W + EK+ DQ + N + P + L C+WN
Sbjct: 68 GLMLMNLTRMRNFAWESRLEPMYEKY-----NRPFGDQALVNIVFHFFPEKLMMLSCKWN 122
Query: 332 VQ 333
+
Sbjct: 123 FR 124
>gi|355746568|gb|EHH51182.1| hypothetical protein EGM_10516, partial [Macaca fascicularis]
Length = 354
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 107 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 164
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 222
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 273
>gi|119585929|gb|EAW65525.1| hCG1998746, isoform CRA_b [Homo sapiens]
Length = 364
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 117 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 174
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 175 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 232
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 233 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 283
>gi|302565234|ref|NP_001180620.1| glucoside xylosyltransferase 2 precursor [Macaca mulatta]
Length = 443
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 253
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 254 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 311
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 312 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362
>gi|355559502|gb|EHH16230.1| hypothetical protein EGK_11484, partial [Macaca mulatta]
Length = 366
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 119 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 176
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 177 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 234
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 235 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 285
>gi|332817310|ref|XP_516584.3| PREDICTED: glucoside xylosyltransferase 2 [Pan troglodytes]
gi|397489677|ref|XP_003815849.1| PREDICTED: glucoside xylosyltransferase 2 [Pan paniscus]
gi|118341485|gb|AAI27734.1| GLT8D4 protein [Homo sapiens]
Length = 317
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 236
>gi|395733536|ref|XP_002813566.2| PREDICTED: glucoside xylosyltransferase 2 [Pongo abelii]
Length = 317
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + + +DTDV+F + +W L Q + +G +
Sbjct: 70 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127
Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L++LT++R ++W + +K+ W
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 236
>gi|345482774|ref|XP_001599812.2| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Nasonia
vitripennis]
Length = 518
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 97/277 (35%), Gaps = 91/277 (32%)
Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
S+ + V L Q S+++L + + W G IS+ LY + E Q++L Y + E +
Sbjct: 176 SESSRVCLATQTSLEKLSSLVQVVHQWTGTISVALYAAGDEEFEVLQRYLEYMRHCYEPI 235
Query: 529 RSRRNIGYHVVYK---------------------EG-------------------NFYPI 548
R R V EG N YP
Sbjct: 236 RERVTFSLAVPRNRPPRAQPRYYPPDGLLDCQKPEGTLNQMTRAISNEQTNWRIRNVYPQ 295
Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
N +RN+A T YVFL D+D +P L L + K V+P +E
Sbjct: 296 NHMRNLARKNCQTDYVFLTDVDIVPSTNLSRVLDEFL----GQEKCDKCAYVIPTYELD- 350
Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTP------------- 631
R FP + + TNF+RW+ TP
Sbjct: 351 ARVRFPQNKSELIRLAKKGLARPFHQKVFIHNQFATNFTRWMLDATPGYKAHENLKMGKA 410
Query: 632 YQIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
Y +F EP+ VA +P +D RF+G+G+ +
Sbjct: 411 YVSHDVTNFEFLYEPFYVARDLVPPHDERFMGYGYTR 447
>gi|386265848|ref|YP_005829340.1| glycosyltransferase [Haemophilus influenzae R2846]
gi|309973084|gb|ADO96285.1| Probable glycosyltransferase [Haemophilus influenzae R2846]
Length = 301
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 72/146 (49%), Gaps = 13/146 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSTASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D+++ LW++ P+ + N+GV+L++L K R+ + ++ A K+ S D
Sbjct: 131 DFFIEYLWEH--PFCETQQYINSGVMLINLNKWRENNIEQYFIEFAAKYGKN---LSFGD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQL 334
QD+ N I + L+ L ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLSVKFNIQV 209
Score = 39.7 bits (91), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 62/140 (44%), Gaps = 12/140 (8%)
Query: 25 NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
N L T++KS L Y N ++ ++I D + L+ L + + + Y +
Sbjct: 10 NFASHLDTVLKS-LCYHHNNINIYVIHDGIPAESLEKL--KMHCAKFDNTLYYIQFNINQ 66
Query: 85 VSW---IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
S+ + H L +L L ++LP+ + + I LD D+I I +LW +
Sbjct: 67 FSFPTVMSPAHIQSTASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------N 120
Query: 142 QTIGLVENQSDWYLGKLWKN 161
L+ SD+++ LW++
Sbjct: 121 LEDSLIAGVSDFFIEYLWEH 140
>gi|443682654|gb|ELT87167.1| hypothetical protein CAPTEDRAFT_200999 [Capitella teleta]
Length = 177
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 18/97 (18%)
Query: 517 QFLSYA--------GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLD 568
QFL+Y N RNIG+ V F+P+NTLRN+AL+ T +VFL D
Sbjct: 12 QFLTYKYHRRRSPLTNRLCFDHLRNIGWGHV-----FFPVNTLRNIALDMAVTSHVFLTD 66
Query: 569 IDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
+DF+P LY + S+ + + LV+PAFE
Sbjct: 67 VDFIPDQNLYENALQQLHSLQV-----MQSLVIPAFE 98
>gi|313232027|emb|CBY09138.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 15/153 (9%)
Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
G +L LP+VL + + + +D+DV+F LW+ F + + Q GL + W
Sbjct: 125 GPCASFRLFLPEVLSD-VDSVVYVDSDVLFLAPPQDLWSHFEKFTEHQVGGLAP-RVGWD 182
Query: 252 LGKLWKNHKPWPALGRGF---NTGVILLDLTKLRDISWAGFWRIIAEKF-----LLTRLW 303
N G G N+GV L++LT++R+ + I F LL L+
Sbjct: 183 FKVPGSNPNFILRAGAGMTQVNSGVFLMNLTRMREPVFNTAESISLNSFKWDKDLLLPLY 242
Query: 304 -----TSLADQDIFNAIISEHPYLVYTLPCQWN 331
DQ++ N +P L+Y LPC+WN
Sbjct: 243 YRNKKDMYGDQNLINTAFHYNPELIYFLPCRWN 275
>gi|428163092|gb|EKX32183.1| hypothetical protein GUITHDRAFT_121643 [Guillardia theta CCMP2712]
Length = 1349
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 100/260 (38%), Gaps = 82/260 (31%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEAL-RSRRNIGY 536
VTLV ++DR ++ + HW+ +++ Y S ++ QQ + +S S R
Sbjct: 887 VTLVTVATVDRAVVLVNVAAHWKEDMAVAFYARSQSDEQQIRRFVRDSLGPWFSSRGRQL 946
Query: 537 HVVY--------KEGNF--YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP-----YL 581
+V E F +PIN LR ++ + V D+D +P L Y
Sbjct: 947 EIVMLTLCPDADAEARFAVFPINMLRKISCAIAKSDLVLYSDVDMIPSDSLAHQIRLLYQ 1006
Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQR----------------YRT-------------- 611
S++ S +++LVVP+F++ RT
Sbjct: 1007 TSTVTS--------QQLLVVPSFKSNGAWPPYAHVELHNKSLLVRTHAVTFLDLVTSFDA 1058
Query: 612 ------AFPAS--------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-- 649
A P HAPT +SRW N + Y++++ +EPY+V +R
Sbjct: 1059 CQVNIPAMPCGLWSPSHRWTEAGVYHAPTEYSRWFNTSVAYEVDYITGYEPYVVVNRSSW 1118
Query: 650 -----LPRYDTRFVGFGWNK 664
YD ++ +GW+K
Sbjct: 1119 MGRHGAGMYDDGYISWGWDK 1138
>gi|241784637|ref|XP_002414405.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
gi|215508616|gb|EEC18070.1| glycosyltransferase domain-containing protein, putative [Ixodes
scapularis]
Length = 231
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 60/130 (46%), Gaps = 6/130 (4%)
Query: 203 KVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW 262
++LP+ + +D+D +F + LW F ++ +Q + + D L N
Sbjct: 1 RLLPQE-DGVLYVDSDAVFFLSVEDLWNYFGQMNDQQLAVVFSDVEDEALTIY--NEWEM 57
Query: 263 PALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
P R G N GV+L++LT++R+ +K+ L DQDI N + HP
Sbjct: 58 PHYKRYGINAGVMLMNLTRMRNSGLDALLLSWMDKYHHKVL--DFHDQDIINTVFHHHPE 115
Query: 322 LVYTLPCQWN 331
V+ PC+WN
Sbjct: 116 KVFHGPCKWN 125
>gi|119585931|gb|EAW65527.1| hCG2042744 [Homo sapiens]
Length = 324
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
+L LP +L + + + +DTDV+F + +W L Q + WY
Sbjct: 117 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 173
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
+ H P G G N+GV+L++LT++R ++W + +K+
Sbjct: 174 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 230
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
W DQD+ N I +P +Y PCQWN + E + ++H N
Sbjct: 231 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 283
>gi|145632851|ref|ZP_01788584.1| fumarate hydratase [Haemophilus influenzae 3655]
gi|144986507|gb|EDJ93073.1| fumarate hydratase [Haemophilus influenzae 3655]
Length = 277
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + NTGV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + L+ LP ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210
>gi|299751516|ref|XP_001830315.2| hypothetical protein CC1G_01951 [Coprinopsis cinerea okayama7#130]
gi|298409410|gb|EAU91462.2| hypothetical protein CC1G_01951 [Coprinopsis cinerea okayama7#130]
Length = 499
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 119/275 (43%), Gaps = 64/275 (23%)
Query: 444 NDVTLVAQLSMD-----RLQMVEINPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVE 494
+D L+ S+D + + P+ V Y + G ++VT+ + ++ DRL +
Sbjct: 141 HDTGLLRSTSVDDVLFSKAFATSMRPSQVVPYFYRATGPFSEDEVTITSIITSDRLDVFA 200
Query: 495 MLFKHWEGPISLTLYMSDAEAQ---QFLSY----AGNSEALRSRRNIGYHVVYKEGNFYP 547
L + ++GPIS+T ++ ++ Q Q L Y SE + +I H+V +
Sbjct: 201 RLVERYQGPISVTFHVKNSTQQQVSQILDYLQKIVTASETVVRFADI--HLVLDSFD-RQ 257
Query: 548 INTLRNVALNQVSTPYVFLLDIDF--LPMFGLY----PYLKSSIRSMDMHGHGGKKVLVV 601
NT RN+A T +V +LDIDF F L P++++ + G V+
Sbjct: 258 FNTWRNIARLFARTNFVMMLDIDFYLCTDFRLTLRNNPWVRAKLEE-------GLSAFVI 310
Query: 602 PAFETQRYR-----TAFPAS-----------------------HAPTNFSRWVNATTP-- 631
PAFE R + FP + H T++ ++ NA
Sbjct: 311 PAFEYTDPRQEKGFSTFPTTKQDLLSKIKAGTISMFHASWKPGHNSTDYPKFYNAPAGDV 370
Query: 632 YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
Y++ + +EPY++ + P + + RFVG+G NK
Sbjct: 371 YKVTTYQSAYEPYVIFKKAGPPWCEERFVGYGGNK 405
>gi|159485548|ref|XP_001700806.1| hypothetical protein CHLREDRAFT_187466 [Chlamydomonas reinhardtii]
gi|158281305|gb|EDP07060.1| predicted protein [Chlamydomonas reinhardtii]
Length = 363
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 63/153 (41%), Gaps = 39/153 (25%)
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
PIN+LRNVAL TP V ++D+D P GL L + + + V+PA+
Sbjct: 117 LVPINSLRNVALLAARTPLVAMIDVDLAPCEGLAGRLLRDVSRVQGLQAERDAMWVLPAW 176
Query: 605 E------------------------------TQRYRT----AFPASHAPTNFSRWVNATT 630
E + R RT + H T+FSRW+ +
Sbjct: 177 ESSPQLAQAEAEAVVQAARTGDKPRLAALWSSHRIRTFSEDVYARGHNATDFSRWLRSPK 236
Query: 631 PYQIEWAPDFEPYIVAHRDL----PRYDTRFVG 659
PY +++ +EP+ + R+ P +D RF G
Sbjct: 237 PYVVQYETGYEPWGIVAREHYLQWP-FDERFRG 268
>gi|422293685|gb|EKU20985.1| hypothetical protein NGA_2018100, partial [Nannochloropsis gaditana
CCMP526]
Length = 262
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 11/104 (10%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGNSE--------A 527
D TLV+Q S DR+ M+ + + WEGPI + ++M AE F++ + +
Sbjct: 160 DFTLVSQSSPDRIWMLPHICERWEGPIIVAIFMHGFTAEHDAFMANLKCPQLQCLPVHPS 219
Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
+ + +I V+ N YP+N LRN+A+++V T + L D+D
Sbjct: 220 VPASSSIPAS-VWAAPNLYPVNALRNLAISEVRTTHFLLTDMDL 262
>gi|242020666|ref|XP_002430773.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515970|gb|EEB18035.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 356
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 80/203 (39%), Gaps = 34/203 (16%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P+ +YS + L L K+ + I++D D D+ L+ F+ + IG+
Sbjct: 161 PDSYYSDALFFISLGLHKIADDQ-NYGIMIDVDTKIQIDVKLLFDEFNNFSSNELIGIAP 219
Query: 246 NQSDWYLGKLWKNH---------KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAE 295
QS Y L+ KP G G N+GV+LL +LR ++
Sbjct: 220 EQSPVYRHVLYMYRAKYRDTNFGKPLSQKGNPGVNSGVLLLHFERLRK-------SLLYN 272
Query: 296 KFLLTR----------LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL----SDNTRSD 341
K LL L DQD + + EH L++ L C WN QL S + D
Sbjct: 273 KLLLPEQVKRLVDKYMFQGHLGDQDWYTLVSQEHKKLIHLLDCTWNRQLCQWWSHHGYED 332
Query: 342 --ELCYTELTDLKIIHWNSPKKL 362
+ Y D+KI H N K+
Sbjct: 333 IFNMFYHCDGDVKIYHGNCNTKI 355
>gi|358332613|dbj|GAA51247.1| glycosyltransferase-like protein LARGE, partial [Clonorchis
sinensis]
Length = 159
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 31/128 (24%)
Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR---------------------------- 41
PT ++ V A + +R TL+KS+ +Y+
Sbjct: 32 PTVHLVHVLKGEAAFKRSR---TLLKSVFYYQGRLRSNRSECSLPRTPSPKPCAQQRKPE 88
Query: 42 KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
+ P+H HLI + + + S W V Q + + Y D + VSWI N HY+GV L K
Sbjct: 89 RTPIHLHLIAERKLWSTINDSVSQWLVQQFQWTLYDLDDYMSRVSWISNGHYAGVVTLAK 148
Query: 102 LTLPKVLP 109
+ +P +LP
Sbjct: 149 MIVPDILP 156
>gi|219115613|ref|XP_002178602.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410337|gb|EEC50267.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 293
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 69/171 (40%), Gaps = 57/171 (33%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP--YLKSSIRSMDMHGHGGKKVLVVPA 603
YP+N LRN+A++QV T + +D DF+ GL+ YL ++ + D + +V+PA
Sbjct: 54 YPVNKLRNLAMSQVKTSHAVFIDADFVLSAGLFETLYLHRALLAAD-----ERNAIVIPA 108
Query: 604 FETQRY------------------------------------RTAFPA--------SHAP 619
FE ++ R+A HA
Sbjct: 109 FELRKVCETTSYECLTTHLAMLPRDKEELLHLHKKSIESNGGRSAILQFNGLSNIHGHAS 168
Query: 620 TNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGWNK 664
T ++ WV P + + +EPY+V R+LP + F G+G NK
Sbjct: 169 TRYNDWVTQPAEQLLPIECVTSDRYEPYLVVRQCRNLPPFQEAFAGYGQNK 219
>gi|302847664|ref|XP_002955366.1| hypothetical protein VOLCADRAFT_96251 [Volvox carteri f.
nagariensis]
gi|300259438|gb|EFJ43666.1| hypothetical protein VOLCADRAFT_96251 [Volvox carteri f.
nagariensis]
Length = 464
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 58/155 (37%), Gaps = 35/155 (22%)
Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG-HGGKKVLVVPA 603
YP+N LRN A QV T + ++D+D L + R ++ V+PA
Sbjct: 200 LYPVNYLRNYARMQVRTRLLAMIDVDMYVSRTLSEEMAREGRIAHYESLCAERRATVLPA 259
Query: 604 FETQR---------------------------------YRTAFPASHAPTNFSRWVNATT 630
FE R FP H PT++ RW +
Sbjct: 260 FEPTRPGDKGRGMALKITQVTKSELASMHGRNKEAIQFKLRVFPRGHTPTDYVRWFTESQ 319
Query: 631 PYQIEWAPDFEP-YIVAHRDLPRYDTRFVGFGWNK 664
PY + + +EP +I +P YD F G+G NK
Sbjct: 320 PYPVSYKRFYEPWFITCDEVMPWYDVDFRGYGMNK 354
>gi|145634288|ref|ZP_01789998.1| fumarate hydratase [Haemophilus influenzae PittAA]
gi|145268268|gb|EDK08262.1| fumarate hydratase [Haemophilus influenzae PittAA]
Length = 310
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + NTGV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + L+ LP ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210
>gi|387199040|gb|AFJ68883.1| hypothetical protein NGATSA_2018100, partial [Nannochloropsis
gaditana CCMP526]
Length = 216
Score = 53.5 bits (127), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS--DAEAQQFLSYAGNSE--------A 527
D TLV+Q S DR+ M+ + + WEGPI + ++M AE F++ + +
Sbjct: 114 DFTLVSQSSPDRIWMLPHICERWEGPIIVAIFMHGFTAEHDAFMANLKCPQLQCLPVHPS 173
Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
+ + +I V+ N YP+N LRN+A+++V T + L D+D
Sbjct: 174 VPASSSIPAS-VWAAPNLYPVNALRNLAISEVRTTHFLLTDMD 215
>gi|428174702|gb|EKX43596.1| hypothetical protein GUITHDRAFT_140359 [Guillardia theta CCMP2712]
Length = 882
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 68/178 (38%), Gaps = 64/178 (35%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAF 604
+PIN LR ++ T V D+D L +L S+IR G G ++LV+P+F
Sbjct: 498 FPINMLRKISCAIARTDLVLYADVDMLA----SDWLASNIRQAYQDGKVGPTELLVIPSF 553
Query: 605 --------------ETQRYRTAFPAS---------------------------------- 616
E QR R P S
Sbjct: 554 KSSEAWPMEAGISLEQQRMRVE-PVSISMDALRLNFQRRQVWIPGLVGGMWSTAGRWHES 612
Query: 617 ---HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPR-------YDTRFVGFGWNK 664
HA T + RW+ A YQ+++ +EPY+V +R R YD RFV +GW+K
Sbjct: 613 GVFHAGTEYDRWMEAEETYQVKYILGYEPYVVVNRTAWRGSHGVGLYDDRFVHWGWDK 670
>gi|167517295|ref|XP_001742988.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778087|gb|EDQ91702.1| predicted protein [Monosiga brevicollis MX1]
Length = 408
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 41/159 (25%)
Query: 546 YPINTLRNVALN-QVSTP---YVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL-V 600
YPIN LRN A+ S+P ++F LD DFLP L L +++ + + G ++ V
Sbjct: 52 YPINALRNAAMQLGPSSPAADFMFYLDADFLPSPDLPEALAAAVAAAETDGEDVARIAWV 111
Query: 601 VPAFETQ------------------------------RYRTAFPASHAPTNFSRWVNATT 630
VPAFE + R R + P SH TN+++++N
Sbjct: 112 VPAFELKGQSGPTVDRLLPASMDALRSQYAADQVVPFRSRES-PLSHRATNYAQFLNPEN 170
Query: 631 P---YQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
Y IE+ FEPY+ +LP + +F G+G NK
Sbjct: 171 RQPYYLIEYEDKFEPYVALRNNPELPLFAEQFEGYGLNK 209
>gi|229844138|ref|ZP_04464279.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
6P18H1]
gi|229813132|gb|EEP48820.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
6P18H1]
Length = 293
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + NTGV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + L+ LP ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210
>gi|148825938|ref|YP_001290691.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
PittEE]
gi|148716098|gb|ABQ98308.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
PittEE]
Length = 300
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + NTGV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + L+ LP ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210
>gi|223993881|ref|XP_002286624.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977939|gb|EED96265.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 378
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 75/170 (44%), Gaps = 22/170 (12%)
Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS----- 530
G VTLV Q S+DR+ ++ W PI L +Y+ + A E++
Sbjct: 22 GASVTLVIQCSLDRVWLLSETCSRWPDPIVLVVYLPLDTVLDAVDRATVIESIAEVMTGC 81
Query: 531 -RRNIGYHVV-----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
+ + H+ + + YP+N +RN L+ V+T +V ++D+D +P L K +
Sbjct: 82 PQMKVVLHISDTSNGKDDTSTYPVNIMRNRGLDAVTTSHVLIMDVDLIPSADLSHVAKDN 141
Query: 585 -IRSMDMHGHGGKK---------VLVVPAFETQRYRTAFPASHAPTNFSR 624
+ + M KK +VVPAFE ++ T P + ++ R
Sbjct: 142 LVDQITMLNQTQKKGGEMEIPMNAIVVPAFE-RKIETPCPDIESCKSYLR 190
>gi|153807295|ref|ZP_01959963.1| hypothetical protein BACCAC_01573 [Bacteroides caccae ATCC 43185]
gi|149130415|gb|EDM21625.1| glycosyltransferase, family 8 [Bacteroides caccae ATCC 43185]
Length = 310
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 72/149 (48%), Gaps = 26/149 (17%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE--NQ 247
Y + L+L +P++LP + K + +D+D+IF DI+ L+ S + G+ + NQ
Sbjct: 79 YPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYD--SDITNYALAGMEDAPNQ 136
Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLW 303
+ LG +P FN G +LL++ LRD+ + + R EK +L
Sbjct: 137 NALRLG--------FPESDLYFNAGFVLLNVKYLRDMDFTNKAMAYIRDCREKIVLH--- 185
Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNV 332
DQD+ NA++ V +P +WN+
Sbjct: 186 ----DQDVLNALLHGK---VLFVPIKWNM 207
>gi|290975672|ref|XP_002670566.1| predicted protein [Naegleria gruberi]
gi|284084126|gb|EFC37822.1| predicted protein [Naegleria gruberi]
Length = 378
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 73/139 (52%), Gaps = 11/139 (7%)
Query: 13 EVIQVAIVCAGYNSTRSLVTL-IKSILFYR--KNPLHFHLITDTVALNIL-QTLFSTWSV 68
+ + + IV G +T TL IKSIL + ++ TD +N L + +F+ V
Sbjct: 73 KTLSITIVSGGNRTTSDETTLLIKSILLNHVGNTQIELNIATDLYNMNFLIENVFNRIKV 132
Query: 69 --PQVEVSFYLADSVVEDVSW----IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
Q+ V+F + + D IP H+SG +G++KL L KV +T+ +TI +DTD+
Sbjct: 133 YRTQLNVNFKIVNLTRIDEECNELKIPIVHHSGKWGMVKLFLDKVF-DTVKRTIFVDTDM 191
Query: 123 IFATDIAQLWALFSRLRQR 141
+F T+ L++ F + ++
Sbjct: 192 VFGTNPDLLFSEFDKFKEE 210
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 24/163 (14%)
Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
IP H+SG +G++KL L KV +T+ +TI +DTD++F T+ L++ F + ++
Sbjct: 158 IPIVHHSGKWGMVKLFLDKVF-DTVKRTIFVDTDMVFGTNPDLLFSEFDKFKEETLFS-- 214
Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILL-DLTKLRDISWAGFWRIIA--EKF---- 297
W W + G I L D+ K R +++ + I A E F
Sbjct: 215 -----------WTREPNWESKGPNHVCSCIFLWDMKKTRKVNYLKEFAIPASIEIFGFDE 263
Query: 298 ---LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDN 337
+++ + +DQD A+ P L WN+ N
Sbjct: 264 KTQQISKEKRNGSDQDFLFALNKMKPNFFNELDPSWNLANCQN 306
>gi|219121206|ref|XP_002185831.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582680|gb|ACI65301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1347
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 15/173 (8%)
Query: 448 LVAQLSMDRLQMVEINPNNVDEYEASS----DGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
LV + + DR++ + V+E + + D TLV Q S +RL ++ K W P
Sbjct: 835 LVVRKAGDRIRHLGNTDLFVNETKCPANLIPDDVQTTLVIQSSSNRLWILAETCKRWNDP 894
Query: 504 ISLTLYMSDAEAQQFLS--YAGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQV 559
I + ++ E +S +G + + I Y + E YP+N LRN+ L+ V
Sbjct: 895 IIAVIALTSTEDHDEVSALLSGWDDKCPHLQVIVYQMDTDEEKPEMYPVNRLRNIGLDAV 954
Query: 560 STPYVFLLDIDFLPMFGLYPYLKSSIRS-------MDMHGHGGKKVLVVPAFE 605
T +V + D+DF+P L +++++ D + +VVPAFE
Sbjct: 955 HTSHVLVADVDFIPAQDLARLIRTTLLDRARLRVLKDDMIPKDQDAIVVPAFE 1007
>gi|300122415|emb|CBK22986.2| unnamed protein product [Blastocystis hominis]
Length = 251
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 8/129 (6%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRNIGYH 537
+ LV + R+ L + W GP+S+ +Y+ S E +Q + RS Y
Sbjct: 11 IPLVFPFRIQRVFFFPHLVQRWSGPMSVAVYLKSHIEKEQVTQLLAQTSLPRSITLSFY- 69
Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG-HGGK 596
+ K+ YP N LRN+AL+++ T + +++D+D P GLY S+ +D +
Sbjct: 70 -ITKDPTEYPYNKLRNIALSKIVTSHFWVMDMDMWPCDGLY----ESLHQLDSKFLNDDH 124
Query: 597 KVLVVPAFE 605
++VP+FE
Sbjct: 125 LAVIVPSFE 133
>gi|229846746|ref|ZP_04466853.1| putative glycosyl transferase, glycosyl transferase family 8
protein [Haemophilus influenzae 7P49H1]
gi|229810235|gb|EEP45954.1| putative glycosyl transferase, glycosyl transferase family 8
protein [Haemophilus influenzae 7P49H1]
Length = 312
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 72/147 (48%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + NTGV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGKNFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + + LP ++N+Q+
Sbjct: 186 QDVINFSIPTNR--IKLLPVKFNIQVK 210
>gi|336372368|gb|EGO00707.1| glycosyltransferase family 49 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 337
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/246 (21%), Positives = 112/246 (45%), Gaps = 48/246 (19%)
Query: 462 INPNNVDE--YEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ- 516
+ PN + Y AS N D+T+ ++ +R ++ L + ++GPIS+T+++ +
Sbjct: 15 MQPNKIVPFFYRASGTFNPEDITITTLVTSNRFKVFAQLVERYQGPISVTVHIKSDSSHT 74
Query: 517 -----QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
+ +S ++ + ++ H+V + +NT RN+A T +V +LD+DF
Sbjct: 75 TALLNELHKLYASSPSMATFVDV--HLVLDPFD-RQLNTWRNIARFFARTDFVMMLDVDF 131
Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----------------------- 608
+ +++S +++ G LVVPAFE +
Sbjct: 132 VVCTDFRAAVRAS-KAVMARLSEGYTALVVPAFEYLKQADGADQKKFPRDKNSLLSLVKA 190
Query: 609 -----YRTAFPASHAPTNFSRWVNATTPYQI----EWAPDFEPYIVAHRDLPRY-DTRFV 658
+ ++ H T+++R+ ++ P ++ ++ +EPY++ + P + D RF+
Sbjct: 191 KKIDMFHRSWAPGHNSTDYARFY-SSQPGEVYKVTQYHSAYEPYVIFKKTGPPWCDERFI 249
Query: 659 GFGWNK 664
G+G NK
Sbjct: 250 GYGGNK 255
>gi|409350114|ref|ZP_11233352.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Lactobacillus equicursoris CIP 110162]
gi|407877661|emb|CCK85410.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Lactobacillus equicursoris CIP 110162]
Length = 574
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 81/176 (46%), Gaps = 31/176 (17%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
GL ++ LP +L E + + I LD D++ D+ +LW ++ ++IG V +
Sbjct: 263 GLFRMLLPDLLLE-VKQVIYLDADIMVNCDLVELW---NQHLYGKSIGAVVDSG------ 312
Query: 255 LWKNHKPWPALGRG-------FNTGVILLDLTKLRDIS--WAGFWRIIAEKFLLTRLWTS 305
+ K +GRG FN+GV+L++L ++R+ W F I + + W
Sbjct: 313 IKKGRYTSKIIGRGLVEKDKYFNSGVLLMNLNRIREKGNLWEKFLSFIEKN---SNAW-- 367
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNV----QLSDNTRSDELCYTELTDLKIIHWN 357
L DQD N I + +Y L +WN + +DN + Y D +I ++N
Sbjct: 368 LIDQDALNCIFKDD---IYFLNSKWNTLIIDERADNPKLQPAIYHFAGDKRIYYYN 420
>gi|432118136|gb|ELK38021.1| hypothetical protein MDA_GLEAN10011661 [Myotis davidii]
Length = 241
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 69/171 (40%), Gaps = 24/171 (14%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---- 264
+ + I LD D+ + T+I +L+ F IG+ Y W+ P
Sbjct: 67 ILQIIQLDLDLKYKTNIRELFEEFDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVG 126
Query: 265 -----LGRGFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLADQDIFN 313
GFN+GV+LL L +R G + +A+K+ L DQD F
Sbjct: 127 GPPPEGLPGFNSGVMLLHLEAMRQSPLYGRLLEPAQVQQLADKY---HFRGHLGDQDFFT 183
Query: 314 AIISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
I EHP L + L C WN QL D+ SD E + ++I H N
Sbjct: 184 MIGMEHPELFHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVRIYHGNC 234
>gi|198432242|ref|XP_002119989.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
[Ciona intestinalis]
Length = 396
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 29/223 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L + + +DTD++F + + +WA F + Q + + +G +
Sbjct: 164 RLFLPNIL-TNVDSLLYVDTDILFLSPVDDIWAKFKKFNSTQLSAMSPEHENPRVG-WYS 221
Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
P G G N+GV+L+++T+LR + W + K+ W
Sbjct: 222 RFARHPYYGDTGINSGVMLINMTRLRSADFKNDVQKSKLKWDNLLLPLFRKYSSNLTW-- 279
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD-LKIIH--------- 355
DQDI N I +P ++Y + C N + D+ ++C + + + + ++H
Sbjct: 280 -GDQDILNIIFHHNPEMLYVMSCYLNYR-PDHCMYMQMCGSAVENGVSVMHGCRRSFHND 337
Query: 356 -WNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT 397
W S K + K F+ L E NL + C L+
Sbjct: 338 KWPSFKAVYKAFKDYSFYNPLSSLLQEIKKNLDKSPHSQCKLS 380
>gi|16552621|dbj|BAB71355.1| unnamed protein product [Homo sapiens]
gi|119598436|gb|EAW78030.1| chromosome 3 open reading frame 21, isoform CRA_b [Homo sapiens]
Length = 190
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 70/171 (40%), Gaps = 24/171 (14%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---- 264
+ + I LD D+ F T+I +L+ F IG+ Y W+ P
Sbjct: 16 ILQIIQLDLDLKFKTNIRELFEEFDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVG 75
Query: 265 -----LGRGFNTGVILLDLTKLRD------ISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
GFN+GV+LL+L +R + + +A+K+ L DQD F
Sbjct: 76 GPPPEGLPGFNSGVMLLNLEAMRQSPLYSRLLEPAQVQQLADKYHFR---GHLGDQDFFT 132
Query: 314 AIISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
I EHP L + L C WN QL D+ SD E + +KI H N
Sbjct: 133 MIGMEHPKLFHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 183
>gi|428162720|gb|EKX31837.1| hypothetical protein GUITHDRAFT_149039 [Guillardia theta CCMP2712]
Length = 937
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 102/270 (37%), Gaps = 72/270 (26%)
Query: 463 NPNNVDEYEASSDGN-DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----EAQQ 517
+P N+ + G DVTL ++DRL + HW +++ +Y A E ++
Sbjct: 466 HPVNLTQQHLGKAGALDVTLTTTATLDRLVTLISTAAHWGDNMAVAMYARSAVEQGEIRR 525
Query: 518 FLS------YAGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQVSTPYVFLLDI 569
F+ +A + L GN +PIN LR + T + D+
Sbjct: 526 FVMEILGPWFATRGKTLEVVLLSACPDPLNTGNDLVFPINMLRKICCAIAKTELILFTDV 585
Query: 570 DFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAFETQ--------------------- 607
D P GL +SI+ G G +LV+PAF +
Sbjct: 586 DMQPSDGL----AASIQLAYSRGQLGPMDLLVLPAFRSNELWPLDVEVDVNEGNLSVNTF 641
Query: 608 -----------------------RYRTAFPAS---HAPTNFSRWVNATTPYQIEWAPDFE 641
R T++ + H PT++ RW++A Y+ + +E
Sbjct: 642 SILFRDLRQHFATCSVYVPGIDCRGTTSWSGAGVFHMPTDYDRWLDAEDLYEASYLLGYE 701
Query: 642 PYIVAHRD--LPR-----YDTRFVGFGWNK 664
PY+V +R L R YD R+V +GW+K
Sbjct: 702 PYVVLNRSSWLGRHGAGVYDDRYVFWGWDK 731
>gi|47205429|emb|CAG01190.1| unnamed protein product [Tetraodon nigroviridis]
Length = 192
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-----RLWTSLADQDIFNA 314
+P P GFN+GV+LLDL +R S + +++ + L R L DQD F
Sbjct: 78 EPHPDGLPGFNSGVMLLDLAAMRASSL--YNQLLEPEHLAQLAAQYRFRGHLGDQDFFTM 135
Query: 315 IISEHPYLVYTLPCQWNVQLS 335
I EHP L Y L C WN QL
Sbjct: 136 IGMEHPELFYPLACGWNRQLC 156
>gi|68250084|ref|YP_249196.1| glycosyl transferase family protein [Haemophilus influenzae
86-028NP]
gi|68058283|gb|AAX88536.1| putative glycosyl transferase, glycosyl transferase family 8
protein [Haemophilus influenzae 86-028NP]
Length = 312
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 72/147 (48%), Gaps = 13/147 (8%)
Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
H L +L L ++LP+ + + I LD D+I I +LW + L+ S
Sbjct: 77 HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130
Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
D++ LW++ P+ + N GV+L++L K R+ + ++ A K+ ++ D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINAGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD+ N I + L+ LP ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210
>gi|198417085|ref|XP_002124537.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
[Ciona intestinalis]
Length = 390
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 87/187 (46%), Gaps = 19/187 (10%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
K++ G +L LP VL +T + +DTDVIF T +LW F RQ L +
Sbjct: 135 KNWWGPCASFRLFLPTVLDKT-DSVLYVDTDVIFLTPPEELWRHFYLFNDRQVAALAP-R 192
Query: 248 SDWYLGKLWKNHKPWPALGRG----FNTGVILLDLTKLRDISWA----GFWRIIAEKFLL 299
W ++ ++ + + G N+GV L++LT++R +A +I K LL
Sbjct: 193 VGWSF-QVPNDNANFIRMQDGKKTQVNSGVFLMNLTRMRQPVFATESESRQKISWNKKLL 251
Query: 300 TRLWTS-----LADQDIFNAIISEHPYLVYTLPCQWNV--QLSDNTRSDELCYTELTD-L 351
L+ DQ++ N + +P L+Y LPC++N + + + C + +D
Sbjct: 252 FPLYRKHKEDMYGDQNLINLVFHYNPDLIYFLPCKYNYHHKFCFDAYRERWCTSAESDGA 311
Query: 352 KIIHWNS 358
+IH N+
Sbjct: 312 AVIHGNA 318
Score = 45.4 bits (106), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 12/148 (8%)
Query: 8 VIPTCEVIQVAIV-CAGYNST--RSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
+ T E + +A+ C G + T + T+IKS + + ++ +H H+ T+ + + + F+
Sbjct: 47 IFQTDETMHLAMAACRGKSLTIVKEATTMIKSAIIFSRSRIHVHIFTENLE-DEFRKEFA 105
Query: 65 TWSVPQVEVSFYLADSVV-----EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
TW + +V E++ + N + G +L LP VL +T + +D
Sbjct: 106 TWPLSAKRKFLLTIHPLVYPLDPEELKVMKN--WWGPCASFRLFLPTVLDKT-DSVLYVD 162
Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLV 147
TDVIF T +LW F RQ L
Sbjct: 163 TDVIFLTPPEELWRHFYLFNDRQVAALA 190
>gi|240995558|ref|XP_002404618.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
gi|215491620|gb|EEC01261.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
Length = 315
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 65/169 (38%), Gaps = 40/169 (23%)
Query: 532 RNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS---- 587
R++ + Y G YP+N LRNVA T +V +LD+D LP GL+ R
Sbjct: 133 RSLRFPENYALGVSYPVNLLRNVARRNSQTEFVLVLDVDLLPNDGLHESFVDFARENRLF 192
Query: 588 MDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNA--TTPYQIE---------- 635
++ H K V V PAFE R P+ + + V A P+ +E
Sbjct: 193 VESH-RDDKTVYVAPAFEV---RDGVPSPRDKASLLQRVEAMDARPFYLELCWKCQKHTD 248
Query: 636 --------------------WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
W +EP+ P YD RF +G+N+
Sbjct: 249 YEAWQREPPGQRLAVLFEVLWRDPWEPFYFGRNSAPLYDERFRQYGFNR 297
>gi|195586289|ref|XP_002082910.1| GD11824 [Drosophila simulans]
gi|194194919|gb|EDX08495.1| GD11824 [Drosophila simulans]
Length = 978
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 26/158 (16%)
Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP 263
+ +L + I+LD D++F +D+ L+ F Q GL + Y L++ +P
Sbjct: 786 IADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYP 845
Query: 264 --ALGR---------------------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
+ G G N+GV+LL L ++R+ +E L
Sbjct: 846 KTSFGNPYYPINNEGGNQHSRVHHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLV 905
Query: 301 RLWT---SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
++ L DQD F + E+P L+Y L C WN QL
Sbjct: 906 AKYSFKGHLGDQDFFTLLGYEYPNLIYRLDCIWNRQLC 943
>gi|424766329|ref|ZP_18193682.1| glycosyltransferase, family 8 [Enterococcus faecium TX1337RF]
gi|402411068|gb|EJV43448.1| glycosyltransferase, family 8 [Enterococcus faecium TX1337RF]
Length = 302
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 102/205 (49%), Gaps = 33/205 (16%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
K++ +K+ L +L + + K I LD+DV+ DI +LW+ +++ +G + N
Sbjct: 77 KNHVSKAAYIKINLDTILSD-IDKVIYLDSDVLINDDIKELWSYSNKI---HYLGAIINP 132
Query: 248 SDWYLGKLW---KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRI-IAEKFLLTRLW 303
Y +++ K+HK FN+GV++L+L +R ++ ++ + EK LTR
Sbjct: 133 GYDYDNRVFGVSKDHKT-------FNSGVMMLNLKNIRKHEFSKKLKLFLDEKGHLTR-- 183
Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTRSDELCYTELTDLK----IIHW 356
L DQ FNA+ + LP +WNVQ +++ ++ L DL+ IIH+
Sbjct: 184 --LNDQAAFNAVFLDWQ----LLPEKWNVQYVFYMKSSKELDIDSIHLKDLRKNPSIIHF 237
Query: 357 NSPKK-LKVKNKHMEFFRNLYLTFL 380
S K + +N H F+ Y ++L
Sbjct: 238 TSNSKPWQYRNAHP--FKKKYYSYL 260
>gi|391338768|ref|XP_003743727.1| PREDICTED: xyloside xylosyltransferase 1-like [Metaseiulus
occidentalis]
Length = 295
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 17/118 (14%)
Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALG 266
L + I LDTD+ F DI +LW +F + IG+ ++ Y K ++ P +G
Sbjct: 171 LDRAIFLDTDLRFHDDIRKLWDIFPEFTEENIIGIADDLQPMYREKFADYREKNPETDVG 230
Query: 267 ------RGFNTGVILLDLTKLRDIS---W---AGFWRIIAEKFLLTRLWTSLADQDIF 312
+GFNTGV+L+ L ++R+ S W R + EK+ L QD F
Sbjct: 231 SPHPGKQGFNTGVVLMHLGRMRNSSKYNWNFDEDHLRSLQEKYFFN---GDLGHQDFF 285
>gi|241248532|ref|XP_002402932.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
gi|215496429|gb|EEC06069.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
putative [Ixodes scapularis]
Length = 456
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 65/174 (37%), Gaps = 42/174 (24%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL---SYAGNSEALRSRRNIG 535
VTL Q + + LQ L HW+GP+S+ Y + + L +Y R N+
Sbjct: 131 VTLTTQATHEFLQHAPTLCSHWQGPVSVAAYSPGTDYEIVLNKIAYLRQCGDPCVRANMT 190
Query: 536 YHVVYKEGN-------------------------------------FYPINTLRNVALNQ 558
+H+VY + YPIN LRN A +
Sbjct: 191 WHIVYDREHRPARLQNATDFLASWNGSCSSDVMTREYAEYRKAHKMTYPINVLRNAARTR 250
Query: 559 VSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA 612
T YV DI+ P + P ++ ++ +V ++P FE Q +TA
Sbjct: 251 AKTRYVLASDIELYPSGNIIPRFMRLLKQPNL--TQPHQVFILPIFEVQAKQTA 302
>gi|194378082|dbj|BAG57791.1| unnamed protein product [Homo sapiens]
Length = 186
Score = 51.6 bits (122), Expect = 0.001, Method: Composition-based stats.
Identities = 37/123 (30%), Positives = 52/123 (42%), Gaps = 34/123 (27%)
Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS--------------- 616
+P GL+ L+ +D G LVVPAFE +R R P +
Sbjct: 2 VPSEGLWRGLR---EMLDQSNQWGGTALVVPAFEIRRARR-MPMNKNELVQLYQVGEVRP 57
Query: 617 ---------HAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFG 661
APTN+SRWVN Y + W +EP+ VA +P +D RF +G
Sbjct: 58 FYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYG 117
Query: 662 WNK 664
+N+
Sbjct: 118 FNR 120
>gi|322787502|gb|EFZ13590.1| hypothetical protein SINV_13647 [Solenopsis invicta]
Length = 324
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 69/181 (38%), Gaps = 50/181 (27%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
V L Q S+++L + + HW GP+S+ LY + E Q++L Y E++R R
Sbjct: 115 VCLATQSSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRKCYESIRERVI 174
Query: 532 --------------------------------------RNIGYHVVYKEGNFYPINTLRN 553
R ++ N YP N +RN
Sbjct: 175 FSLAVPKMRTPKKQPRVFELPDIVDCAKPEATLNEFMTRVPTEQTNWRIRNVYPQNHMRN 234
Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
+A T YVFL D+D +P F L L +R + + K V+P +E R F
Sbjct: 235 LARKNCQTDYVFLTDVDIVPSFNLTVVLDEFLR----NDNCDKCAYVIPTYEID-TRVRF 289
Query: 614 P 614
P
Sbjct: 290 P 290
>gi|430860484|ref|ZP_19478083.1| hypothetical protein OI5_03257 [Enterococcus faecium E1573]
gi|430551882|gb|ELA91632.1| hypothetical protein OI5_03257 [Enterococcus faecium E1573]
Length = 302
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 101/205 (49%), Gaps = 33/205 (16%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
K++ +K+ L +L + + K I LD+DV+ DI +LW+ +++ +G + N
Sbjct: 77 KNHVSKAAYIKINLDTILSD-IDKVIYLDSDVLINDDIKELWSYSNKI---HYLGAIINP 132
Query: 248 SDWYLGKLW---KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR-IIAEKFLLTRLW 303
Y +++ K+HK FN+GV++L+L +R ++ + + EK LTR
Sbjct: 133 GYDYDNRVFGVSKDHKT-------FNSGVMMLNLKNIRKHEFSKKLKFFLDEKGHLTR-- 183
Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTRSDELCYTELTDLK----IIHW 356
L DQ FNA+ + LP +WNVQ +++ ++ L DL+ IIH+
Sbjct: 184 --LNDQAAFNAVFLDWQ----LLPEKWNVQYVFYMKSSKELDIDSIHLKDLRKNPSIIHF 237
Query: 357 NSPKK-LKVKNKHMEFFRNLYLTFL 380
S K + +N H F+ Y ++L
Sbjct: 238 TSNSKPWQYRNAHP--FKKKYYSYL 260
>gi|330996534|ref|ZP_08320416.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
11841]
gi|329573090|gb|EGG54709.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
11841]
Length = 309
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 18/140 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+ L +LPE++ + + LD D++ DI LW + L + +VE+ L +
Sbjct: 90 RCFLSALLPESIDRVLYLDCDIVILGDITPLWR--TPLDAHTGVAVVEDTGCKELQRY-- 145
Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTSLADQDIFN 313
+PA FN+GV+L++L R+ A ++R E+ + DQD+ N
Sbjct: 146 EILQYPAEDSYFNSGVLLINLAYWREHHIAQACVDYYRAYPERIIFN-------DQDLLN 198
Query: 314 AIISEHPYLVYTLPCQWNVQ 333
++ H LV QWNVQ
Sbjct: 199 CVLHRHKTLV---GLQWNVQ 215
>gi|326202604|ref|ZP_08192472.1| glycosyl transferase family 8 [Clostridium papyrosolvens DSM 2782]
gi|325987188|gb|EGD48016.1| glycosyl transferase family 8 [Clostridium papyrosolvens DSM 2782]
Length = 502
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 63/149 (42%), Gaps = 16/149 (10%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW-----ALFSRLRQRQTIGLV 244
Y G ++ +P+++ ++ K I LD D++ DI +LW F + I +
Sbjct: 81 YFGYVTYFRIFIPEIVEASVRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIG 140
Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
N + K H P G+ FN GV+L++L K R I + + R
Sbjct: 141 GN-----FATMVKKHIGIPRKGKYFNAGVLLINLDKWRADKTT---ETIRKYLIENREKI 192
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
ADQD NA+ + LP +WN Q
Sbjct: 193 YFADQDGLNAVFKDR---WLKLPIEWNQQ 218
>gi|392591311|gb|EIW80639.1| glycosyltransferase family 49 protein, partial [Coniophora puteana
RWD-64-598 SS2]
Length = 355
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 96/228 (42%), Gaps = 45/228 (19%)
Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS------YAGNSEALRS 530
D+TL ++ R + + L + ++GPIS+ ++ + A ++ + + + R
Sbjct: 49 EDITLATLVTHSRFTVFKRLVERYDGPISVAVHHEWSHAHPAVASQLRALHVLYTSSPRF 108
Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
+ H+V +NT RNVA T V +LD+DF + + ++ ++ +
Sbjct: 109 AERVDVHLVLAPHE-RALNTWRNVARQLARTGLVMMLDVDF----AVCTDFRGALLNLGV 163
Query: 591 --HGHGGKKVLVVPAFETQRYRTA-----FP-----------------------ASHAPT 620
+VPAFE R R FP H T
Sbjct: 164 VGARDAAGAGAIVPAFEYTRLRDGMDSRTFPRDKQSLLALVHSGKLDMFHRSWQPGHNST 223
Query: 621 NFSRWVNATTP--YQIE-WAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
+++R+ A Y+++ + +EPY+V RD P + D RFVG+G NK
Sbjct: 224 DYTRYYAAAPGEVYKVDRYQAAYEPYVVFRRDGPPWCDERFVGYGGNK 271
>gi|298706849|emb|CBJ25813.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
Length = 1541
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 84/248 (33%), Gaps = 84/248 (33%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----------EAQQFLSYAGNS--E 526
VTLV Q S DRL V W G SL +++ A E ++
Sbjct: 1289 VTLVTQCSTDRLPAVLEQALRWGGDTSLAVHVPSAPLSAKDETMSEIRRLCQRVDQEVRA 1348
Query: 527 ALRSRRNIGYHVVYKEGN------------FYPINTLRNVALNQV------STPYVFLLD 568
A R+ + V EG YP+NTLRN+AL Q VFL+D
Sbjct: 1349 APNRRKQLSVDVAILEGAEADPARHDHCGPLYPVNTLRNLALIQARDDSLHPAQAVFLVD 1408
Query: 569 IDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFE------TQRYRTAFPAS---- 616
D LP L L S ++ + +V+P E + R+ + P+S
Sbjct: 1409 CDCLPSEHLLEELHSQEVQDRLNAESSARPAAVVIPCLEFTPGSVSARHDKSIPSSVQKV 1468
Query: 617 --------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTR 656
H PT+F R DLP YD R
Sbjct: 1469 VEMLAEGTAQGFHVDYFFKGHGPTDFLRKT----------------------DLPMYDER 1506
Query: 657 FVGFGWNK 664
F G+G NK
Sbjct: 1507 FRGYGMNK 1514
>gi|227872037|ref|ZP_03990417.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
[Oribacterium sinus F0268]
gi|227842114|gb|EEJ52364.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
[Oribacterium sinus F0268]
Length = 324
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 65/139 (46%), Gaps = 18/139 (12%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
LL+L +P LPET+ K + LD D + I L+ L RL + +G+ S +
Sbjct: 88 LLRLFIPSTLPETITKALYLDADTVVLQSILSLYHL--RLGDK-LLGMAPEPS------I 138
Query: 256 WKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+K HK + +L +N GV+L++L+ LR+ G + + DQDI
Sbjct: 139 YKKHKEFLSLAEESPYYNAGVMLMNLSLLRE---EGMEEKCLRYYQMKEGQLPFNDQDIL 195
Query: 313 NAIISEHPYLVYTLPCQWN 331
N + + +LP ++N
Sbjct: 196 NMVCKGR---IRSLPQRFN 211
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 61/129 (47%), Gaps = 14/129 (10%)
Query: 45 LHFHLITDTVALNILQTLFSTWSVPQVEVSFY-LADSVVEDVSWIPNKHYSGVYG---LL 100
LHFH+++ + + L ++SFY L + + E +P ++G + LL
Sbjct: 31 LHFHILSLGIEEESKEKLRQYVETEGQKISFYDLEEKLSEWKEKLPAL-FTGKFSKATLL 89
Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
+L +P LPET+ K + LD D + I L+ L RL + +G+ S ++K
Sbjct: 90 RLFIPSTLPETITKALYLDADTVVLQSILSLYHL--RLGDK-LLGMAPEPS------IYK 140
Query: 161 NHKPWPALG 169
HK + +L
Sbjct: 141 KHKEFLSLA 149
>gi|300122829|emb|CBK23836.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 70/148 (47%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W+ I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGYH---VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I YH VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYHGKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|300176055|emb|CBK23366.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 70/148 (47%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W+ I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGYH---VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I YH VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYHGKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|332881420|ref|ZP_08449070.1| glycosyltransferase, family 8 [Capnocytophaga sp. oral taxon 329
str. F0087]
gi|357045165|ref|ZP_09106802.1| glycosyltransferase, family 8 [Paraprevotella clara YIT 11840]
gi|332680796|gb|EGJ53743.1| glycosyltransferase, family 8 [Capnocytophaga sp. oral taxon 329
str. F0087]
gi|355531748|gb|EHH01144.1| glycosyltransferase, family 8 [Paraprevotella clara YIT 11840]
Length = 309
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 64/140 (45%), Gaps = 18/140 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+ L +LPE + + + LD D++ DI LW + L + +VE+ L +
Sbjct: 90 RCFLSALLPEDIDRVLYLDCDIVILGDITPLWR--TPLDAHTGVAVVEDTGCKELQRY-- 145
Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTSLADQDIFN 313
+PA FN+GV+L++L R+ A ++R E+ + DQD+ N
Sbjct: 146 EILQYPAEDSYFNSGVLLINLVYWREHHIAQACVDYYRTYPERIIFN-------DQDLLN 198
Query: 314 AIISEHPYLVYTLPCQWNVQ 333
++ H LV QWNVQ
Sbjct: 199 CVLHRHKTLV---GLQWNVQ 215
>gi|347532720|ref|YP_004839483.1| glycosyl transferase [Roseburia hominis A2-183]
gi|345502868|gb|AEN97551.1| glycosyl transferase [Roseburia hominis A2-183]
Length = 333
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 69/164 (42%), Gaps = 30/164 (18%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE---- 245
Y V +L +P++ E + + LD D++ +DIAQL F Q + +G V
Sbjct: 91 YLSVMTYARLLIPQIF-ENFERVVYLDCDMVCNSDIAQL---FHADMQDKPLGAVADTVL 146
Query: 246 NQSDWY------LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLL 299
N W+ + K GR FN GVIL D+ +LR+ EK L
Sbjct: 147 NMEAWHNPNSEETKQYLKETVGITTEGRYFNGGVILFDIDRLRE---------DGEKLLA 197
Query: 300 T---RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
R W ADQD+ N I E V+ QWNV + N +
Sbjct: 198 CARERQW-RWADQDVLNHIYKER---VFYFDLQWNVIVISNLKQ 237
>gi|257784421|ref|YP_003179638.1| glycosyl transferase family protein [Atopobium parvulum DSM 20469]
gi|257472928|gb|ACV51047.1| glycosyl transferase family 8 [Atopobium parvulum DSM 20469]
Length = 358
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 79/170 (46%), Gaps = 23/170 (13%)
Query: 171 GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
GF D+ + D+ +P+ + +L P +LP + K I LD+D++ TDIA+L+
Sbjct: 77 GFLDVDAALGDIE-LPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYD 134
Query: 231 ------LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
L R TIG ++ D +G KN F GVIL++L ++R
Sbjct: 135 IDITGYLVGATRDADTIGQIDGY-DATVGPYLKNELGMDDPHDYFQAGVILMNLEEIRK- 192
Query: 285 SWAGFWRIIAEKFL---LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
+I E+FL R W L DQD+ N ++ H YL + +WN
Sbjct: 193 ------QISPEEFLKVSTMRTWRWL-DQDVLNRFVNGH-YLRINM--KWN 232
>gi|326486345|gb|ADZ76178.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni]
Length = 456
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KHY+ + +L L +LPE K + LD D++ +D+ +L+ L Q I +
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164
Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
W ++GK K++ + +G FN G +L++L + R S +A +
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
+ DQDI N +I++ V LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247
>gi|350410076|ref|XP_003488938.1| PREDICTED: N-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
impatiens]
Length = 449
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 83/241 (34%), Gaps = 68/241 (28%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ--------------------- 516
VTL + D++ + L + WEGP+SL ++ +A
Sbjct: 126 QVTLCTHATADQVYGIVELARRWEGPLSLAVFTPGLDAGIAVALLDRACRCEPEMYKVSV 185
Query: 517 -------------QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
Q G+ A +R G + YPIN RNVA Q +T
Sbjct: 186 HLVFPASRPPALGQITRIHGDCAASDLQRGDGGTERKRRSMIYPINVARNVARIQANTTR 245
Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR--------------- 608
V + DI+ LP L +R G+ V VVP FE +
Sbjct: 246 VLVTDIELLPSQRLASGFMEIVREKTPK---GRIVFVVPVFEIESNEVPPLTKKELLSAT 302
Query: 609 ------YRTAFPASHAP--TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP--RYDTRFV 658
Y F SH SRW+ P ++ +P I+ R+ P R++ F+
Sbjct: 303 KAGLAVYFHRFLCSHCQRFPGLSRWLIRPDPGKV------KPLIITKREYPHHRWEPVFI 356
Query: 659 G 659
G
Sbjct: 357 G 357
>gi|195028428|ref|XP_001987078.1| GH20168 [Drosophila grimshawi]
gi|193903078|gb|EDW01945.1| GH20168 [Drosophila grimshawi]
Length = 208
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 60/146 (41%), Gaps = 36/146 (24%)
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
+RN+A T YVFL DID +P L + +++ G+ V+P FE R
Sbjct: 1 MRNIARKGCQTKYVFLTDIDIIPSLNSVVELNNFFKTVICEGYCA---YVIPTFEID-VR 56
Query: 611 TAFPAS------------------------HAPTNFSRWVN---ATTPYQIEWAPD---- 639
AFP S TNFS+W++ + T +I
Sbjct: 57 AAFPRSKDLLRGLIKKGLARPFHEKVFIYNQYATNFSKWLSTNRSETAVRISHTVTNFEF 116
Query: 640 -FEPYIVAHRDLPRYDTRFVGFGWNK 664
+EP+ +A P +D RF G+G+ +
Sbjct: 117 LYEPFYIAIDSAPAHDERFTGYGFTR 142
>gi|326486369|gb|ADZ76201.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni]
Length = 456
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KHY+ + +L L +LPE K + LD D++ +D+ +L+ L Q I +
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164
Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
W ++GK K++ + +G FN G +L++L + R S +A +
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
+ DQDI N +I++ V LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247
>gi|384244587|gb|EIE18087.1| hypothetical protein COCSUDRAFT_55028 [Coccomyxa subellipsoidea
C-169]
Length = 136
Score = 49.3 bits (116), Expect = 0.007, Method: Composition-based stats.
Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 17/131 (12%)
Query: 250 WYLGKLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
WY G K P G G N GV+LL+L +LR+ +A I F + L D
Sbjct: 14 WYKGGF---SKSLPFYGEYGLNAGVLLLNLDELRESHFAQERDKIIRHFHPKKA-LPLGD 69
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKH 368
QD+ NA S++P ++ + C +N RSD CY I+H N +KN+
Sbjct: 70 QDVLNAFASKYPTRLHVMSCVFNF------RSDSACYKGFP--AILHGNR----NLKNEF 117
Query: 369 MEFFRNLYLTF 379
+ +LY F
Sbjct: 118 SSTYSSLYKLF 128
>gi|300175712|emb|CBK21255.2| unnamed protein product [Blastocystis hominis]
Length = 499
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 72/165 (43%), Gaps = 26/165 (15%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
D T V ++DRL+ ++L + W+GP S+ +++ + + ++ + R + Y
Sbjct: 293 DFTYVTHTTVDRLRYFKVLRELWKGPFSIAIFLRRWDVNTVDRWVRSNSDVPHLRLLFYV 352
Query: 538 VVYK---------------------EGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMF 575
V EG F YPIN LR++A+ V T + LD+D P +
Sbjct: 353 VPSNLHFPKTDYSLWFDRSSMRKITEGAFLYPINLLRDIAIMNVVTTHYINLDMDLWPSY 412
Query: 576 GLYPYLKSSIRSMDMHGHGGKKVLVVPAFE-TQRYRTAFPASHAP 619
Y L R + K V ++PAF+ T+R+R S P
Sbjct: 413 STYQNLHRIPREL---LDPWKNVFILPAFQLTERFRNRILQSGNP 454
>gi|419633523|ref|ZP_14165956.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni LMG 23269]
gi|419667488|ref|ZP_14197457.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni 1997-10]
gi|419688439|ref|ZP_14216762.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni 1854]
gi|380612022|gb|EIB31561.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni LMG 23269]
gi|380645936|gb|EIB62940.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni 1997-10]
gi|380665463|gb|EIB81031.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni 1854]
Length = 483
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KHY+ + +L L +LPE K + LD D++ +D+ +L+ L Q I +
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164
Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
W ++GK K++ + +G FN G +L++L + R S +A +
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
+ DQDI N +I++ V LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247
>gi|341881070|gb|EGT37005.1| hypothetical protein CAEBREN_10005 [Caenorhabditis brenneri]
Length = 607
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 75/179 (41%), Gaps = 41/179 (22%)
Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
+ + S D V+LV + D L+ E + K W+GPIS+ +++ D ++QQ L Y +
Sbjct: 256 ESFRVSDDLEPVSLVTHGTSDMLKHFENMPKMWDGPISIGVFI-DFDSQQVLQYLTDLHQ 314
Query: 528 LRSR--RNIGYHVVYKEGNF------------------------------------YPIN 549
+ R + H Y++ F YP N
Sbjct: 315 CDEQFSRKVTVHFAYRKSAFQKECLIRESPKTNTPCQKFLRNKVSYAKKIPSPFFLYPYN 374
Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
+RN+A + FL+D D + G+ +K+ M + G+ K VL+V FE+++
Sbjct: 375 LMRNLARKGAKSDLHFLMDGDMIVSNGMAEKVKTIANEM-LDGN-QKNVLLVRRFESEK 431
>gi|427412560|ref|ZP_18902752.1| hypothetical protein HMPREF9282_00159 [Veillonella ratti
ACS-216-V-Col6b]
gi|425716367|gb|EKU79351.1| hypothetical protein HMPREF9282_00159 [Veillonella ratti
ACS-216-V-Col6b]
Length = 310
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 81/190 (42%), Gaps = 59/190 (31%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF-------------------SRLRQ 237
L+L +PK+LP+++ K I DTD++ D+A+LW L SR +
Sbjct: 87 LRLMIPKLLPDSVRKAIYFDTDLVVIGDVAELWQLSLDGHPVGATVDLGIMSSKRSRREK 146
Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG--FWRIIAE 295
++IGL N+SD Y FN+G++++D+++ R ++ I A
Sbjct: 147 HESIGL--NESDDY-----------------FNSGMMVIDVSRWRVENYGTEVLTEITAH 187
Query: 296 KFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV---------QLSDNTRSDELCYT 346
+F DQD N + + LP +WN+ ++ + R + +
Sbjct: 188 QF-------RHHDQDGLNKVFKNN---WQELPLRWNIIPPVFSLPLKILCSGRWRKKAFE 237
Query: 347 ELTDLKIIHW 356
L +IHW
Sbjct: 238 ALKSPAVIHW 247
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 27/161 (16%)
Query: 17 VAIVCAGY-NSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNILQTLFSTWSVPQVEV 73
+AIV A N + SIL + P + ++++D ++ Q + +T + V
Sbjct: 4 IAIVLAADDNYAQHAAVTAASILLHTTEPQRVTLYILSDGISEIKQQKIEATIKDLKGRV 63
Query: 74 SFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
D E + + H S L+L +PK+LP+++ K I DTD++ D+A+LW
Sbjct: 64 QLIPVDG--EAIKGFTSGHISKA-AYLRLMIPKLLPDSVRKAIYFDTDLVVIGDVAELWQ 120
Query: 134 LF-------------------SRLRQRQTIGLVENQSDWYL 155
L SR + ++IGL N+SD Y
Sbjct: 121 LSLDGHPVGATVDLGIMSSKRSRREKHESIGL--NESDDYF 159
>gi|218442752|ref|YP_002381072.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7424]
gi|218175110|gb|ACK73842.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7424]
Length = 347
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 25/176 (14%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS--DWYLGK 254
+L LP++LP + + I LD+D++ ++A LW Q +G + D+
Sbjct: 99 FRLLLPELLPTQVERVIYLDSDLVVEGNLANLW--------EQELGNCPAVAVQDYVFPY 150
Query: 255 LWKNHKPWPALGRGFNT-----GVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQ 309
+ K + LG NT GV+L+++ + R R I E LADQ
Sbjct: 151 VCNGLKTYQQLGLASNTPYCNAGVMLINIKQWR---IEALNRKILEYIRKFYDLVYLADQ 207
Query: 310 DIFNAIISEHPYLVYTLPCQWNVQLSD--NTRSDELCYTE--LTDLKIIHWNSPKK 361
D NA+I+ L L +WNVQ+ N + D LC + + D I+H+ +P K
Sbjct: 208 DGINALIANRFKL---LDLKWNVQIFGVYNGKIDLLCKPKELIRDAFILHFTTPIK 260
>gi|308494240|ref|XP_003109309.1| hypothetical protein CRE_08184 [Caenorhabditis remanei]
gi|308246722|gb|EFO90674.1| hypothetical protein CRE_08184 [Caenorhabditis remanei]
Length = 389
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 71/177 (40%), Gaps = 43/177 (24%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS--RRNIGY 536
+TL + + ++ +E +W+GPISL L++ D +Q+ L Y + RR +
Sbjct: 66 ITLAVHGTPEMMETIERKPSNWDGPISLALFI-DYHSQRALEYLSDVHRCDQEFRRKVTV 124
Query: 537 HVVYK----------------------------------EGNF--YPINTLRNVALNQVS 560
H ++ G+F YPIN +RN+A
Sbjct: 125 HFAFRLSPFQKSCPLINVTSHHQDCKEFLKNRAKYRNEIAGSFQLYPINLMRNIARRGAK 184
Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASH 617
+ F+ DID + G +K S +M KKVLVV FE+ T P +H
Sbjct: 185 SDIHFIADIDMIMSEGFATKVKQI--SNEMIDGKSKKVLVVRRFESN--HTTIPTNH 237
>gi|428162183|gb|EKX31362.1| hypothetical protein GUITHDRAFT_122441 [Guillardia theta CCMP2712]
Length = 737
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 55/257 (21%), Positives = 93/257 (36%), Gaps = 74/257 (28%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS-YAGNSEAL-RSRRNIGY 536
VT+ ++DR ++ HW+ +++ Y E ++ L Y +S S RN
Sbjct: 281 VTMATISTVDRATVLMNTAAHWKEDMAVAFYARSPEDERDLRRYVADSLGPWFSSRNKSL 340
Query: 537 HVVY------KEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
V +E +PIN LR +A + T + D+D + + +++
Sbjct: 341 EAVMLSVCPNREDPLKILVFPINMLRKIACSIAKTDLILYTDVDMIASDDIASHIRQVYE 400
Query: 587 SMDMHGHGGKKVLVVPAFE-----------------TQRYRTAFPAS------------- 616
+ K +LVVP+F+ TQ T P +
Sbjct: 401 GTRLKP---KDLLVVPSFKGSSSWPPVAEIQRGNNTTQVQVTTHPVTVADVIEHFIHCRV 457
Query: 617 ----------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD----- 649
HAPT++ RW+ Y++ + +EPY V +R
Sbjct: 458 HIPGMRCGMWTLASRWSEDGVFHAPTSYDRWMEEKEVYEVSYVVGYEPYFVVNRSAWEGR 517
Query: 650 --LPRYDTRFVGFGWNK 664
YD RFV +GW+K
Sbjct: 518 HGAGIYDDRFVHWGWDK 534
>gi|159476896|ref|XP_001696547.1| hypothetical protein CHLREDRAFT_187123 [Chlamydomonas reinhardtii]
gi|158282772|gb|EDP08524.1| predicted protein [Chlamydomonas reinhardtii]
Length = 417
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 55/242 (22%), Positives = 92/242 (38%), Gaps = 61/242 (25%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSDAEA---------QQFLSYAGNS-EA 527
VT+ ++DRL M+E K + G P + +Y+ + +Q ++ A + +A
Sbjct: 31 VTIFTTATLDRLDMLETQCKSYPGGPHAAAVYVPLVQEHGGELTIPNEQLIALAESRVQA 90
Query: 528 LRSRRN-----IGYHVVY--------KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
L R HV+ + PIN LRN AL V TP + ++D+D
Sbjct: 91 LFDRMEGIVNACHMHVLLLYEVTSDPALAHLTPINGLRNAALLAVRTPLLVMVDVDLCVS 150
Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------- 612
L +L + V+PA++ +
Sbjct: 151 ASLVRFLSEPSNGDKLVASSSNTFWVLPAWDVSAGLSRVEIDHIAESALAGDKMTLAKLW 210
Query: 613 ------------FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD---LPRYDTRF 657
F HAPTN++RW+ + Y I+++ +EP+ + RD + YD RF
Sbjct: 211 ASGRLHWFGQLYFQLGHAPTNYTRWLLSRKSYLIDYSIGYEPWGILSRDRQVVVPYDARF 270
Query: 658 VG 659
G
Sbjct: 271 RG 272
>gi|412991019|emb|CCO18391.1| predicted protein [Bathycoccus prasinos]
Length = 601
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 56/261 (21%), Positives = 99/261 (37%), Gaps = 73/261 (27%)
Query: 476 GNDVTLVAQLSMDRLQMV-EMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN- 533
++ L A ++DRL+ V L + W G +S+ + S+ + ++ + E R+R
Sbjct: 179 AREICLFAPSTLDRLKSVLPRLAQTWSGTMSVAVLASEEDVRREMFGTSTREFTRNRLTV 238
Query: 534 IGYHVVYKEGNFYPINTLRNVALN----QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
I + + +P+N LRN+AL+ ++ YV L D+DF +F P + +
Sbjct: 239 IAVDPLPEYETRFPVNALRNLALSGCKRTLNATYVVLHDVDF-EIFPDAPSKELLEEIEE 297
Query: 590 MHGHGGKKVLVVPAF--------ETQRYRTA----------------------------- 612
+ K LV+P+F + +R A
Sbjct: 298 VLKPNAKHGLVLPSFTADDRYLRRVETFRKAHDFSRESLDVLNSFNRKEKLVELIKEYGI 357
Query: 613 --------FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR---------------- 648
+P +HA TN S+W++ + + A PY V
Sbjct: 358 VESFREKYWPVAHAATNVSKWLSMIIAEKKKDAEKHRPYSVGANHLGSRHPYYYEPWIIL 417
Query: 649 -----DLPRYDTRFVGFGWNK 664
D+P +D FV +G+NK
Sbjct: 418 RADVDDMPAFDESFVTYGFNK 438
>gi|237750590|ref|ZP_04581070.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879]
gi|229373680|gb|EEO24071.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879]
Length = 364
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 47/198 (23%), Positives = 86/198 (43%), Gaps = 45/198 (22%)
Query: 162 HKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 221
+K +P R + L+DS+ + + + N + + +L + LP+ + + LD D+I
Sbjct: 73 NKIYPCEFRVYTLSDSIFQGLPKLNNNYLA----YFRLKIASCLPQDIKTCLYLDVDMIC 128
Query: 222 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG------------- 268
DI + +F Q + G+V L +H+ + L R
Sbjct: 129 VADIRE---IFYTDLQGKICGVV----------LVPDHQQYCVLKRNSAIGDEFVFNAST 175
Query: 269 -FNTGVILLDLTKLRDISWAGFWRIIAEKFL--LTRLWTSLADQDIFNAIISEHPYLVYT 325
FN+G++L+D+ + R + + +K L + L DQD NA++ +H +
Sbjct: 176 YFNSGLMLIDVEQYRKYN-------VEQKCLEWFEQYVPVLLDQDALNAVLGDH---ICA 225
Query: 326 LPCQWN--VQLSDNTRSD 341
LP +WN V+L R D
Sbjct: 226 LPLEWNFFVELLKYKRQD 243
>gi|308494062|ref|XP_003109220.1| hypothetical protein CRE_08183 [Caenorhabditis remanei]
gi|308246633|gb|EFO90585.1| hypothetical protein CRE_08183 [Caenorhabditis remanei]
Length = 326
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 56/221 (25%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS--RRNIGY 536
+TL + + ++ +E +W+GPISL L++ D +Q+ L Y + RR +
Sbjct: 27 ITLAVHGTPEMMETIEKKPSNWDGPISLALFI-DYHSQRALEYLSDVHRCDQEFRRKMTA 85
Query: 537 HVVYK----------------------------------EGNF--YPINTLRNVALNQVS 560
H ++ G+F YPIN +RN+A
Sbjct: 86 HFTFRLSPFQKSCPLINVTSHHQDCKEFLKNRAKYRNEIAGSFQLYPINLMRNIARRGAK 145
Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPT 620
+ F+ DID + G +K S +M KKVLVV F Q + F H
Sbjct: 146 SDIHFIADIDMIMSEGFATKVKRI--SNEMIDGKSKKVLVVRVF--QFHHRHFFVGHQIP 201
Query: 621 NFSRWVNATT----------PYQIEWAPDFEPYIVAHRDLP 651
N S W + PYQ +E ++ HR+ P
Sbjct: 202 NISYWFEVSNASDTISAWEIPYQ---GSVWEVQVILHRNDP 239
>gi|194211847|ref|XP_001488961.2| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1
[Equus caballus]
Length = 376
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 60/150 (40%), Gaps = 18/150 (12%)
Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFNTGVILLDL 278
F + +W+ R Q + + +G W N P G+ G N+GV+L+++
Sbjct: 158 FLRPVDDIWSNGGRFNSTQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGVNSGVMLMNM 215
Query: 279 TKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLP 327
T++R + W + +K+ L W DQD+ N I +P ++ P
Sbjct: 216 TRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW---GDQDLLNIIFFHNPESLFVFP 272
Query: 328 CQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
CQWN + E + I+H N
Sbjct: 273 CQWNYRPDHCIYGSNCQEAEEEGIFILHGN 302
>gi|148665328|gb|EDK97744.1| mCG126098 [Mus musculus]
Length = 182
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 15/84 (17%)
Query: 261 PWPALGRGFNTGVILLDLTKLR---------DISWAGFWRIIAEKFLLTRLWTSLADQDI 311
P P GFN+GV+LL+L +R + SW + +A+K+ L DQD
Sbjct: 69 PPPEGLPGFNSGVMLLNLEAMRQSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDF 122
Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
F I EHP L + L C WN QL
Sbjct: 123 FTMIGMEHPELFHVLDCTWNRQLC 146
>gi|299472732|emb|CBN80300.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
Length = 280
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 60/161 (37%), Gaps = 42/161 (26%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
YP+N LRNVA+ V T + + DID P Y L R + +V AF
Sbjct: 19 YPVNRLRNVAVKAVKTSHFLMTDIDIWPDVNAYSALHMRYRLETERVEDARSAIVFSAFS 78
Query: 606 TQR---------YRTAFPASHA------PTNFSRWVNAT------TPYQIEW-------- 636
+R Y P + A TN +A T Y W
Sbjct: 79 RRRFCEEEDCLQYANDVPETIADLKPCLETNTCGRFDAANQSGQGTAYVYYWRSMLRQPN 138
Query: 637 -----------APDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
+ FEPY+V + LP++D RF+G+G NK
Sbjct: 139 QRTMEHILCFKSHRFEPYLVVRKTALLPKFDERFLGYGKNK 179
>gi|336407713|ref|ZP_08588209.1| hypothetical protein HMPREF1018_00224 [Bacteroides sp. 2_1_56FAA]
gi|335944792|gb|EGN06609.1| hypothetical protein HMPREF1018_00224 [Bacteroides sp. 2_1_56FAA]
Length = 311
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 87/194 (44%), Gaps = 27/194 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
+ L ++LP + K + +D D++ DI++ W + + Q IG +E+ + Y +
Sbjct: 89 RCLLSRILPVNIDKILYMDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
L + K FN GV+L++L R+ G + + FL DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWREHKIDG---MCEQYFLAHSDRIRFNDQDLLNA 197
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
++ + V P +WNVQ + R+ E + LK I+H+ + K +
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254
Query: 368 H---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268
>gi|300119982|emb|CBK19536.2| unnamed protein product [Blastocystis hominis]
Length = 429
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/144 (22%), Positives = 65/144 (45%), Gaps = 13/144 (9%)
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
P+N +Y+ ++ +V +RL + ++GPI L +Y + + + Y
Sbjct: 163 PSNDQKYDVTA------IVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS 216
Query: 524 NSEALRSRRNIGYHVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
+ + R I +++V K +PIN LRN+A+ + T + +LD+D Y
Sbjct: 217 -THFIPKRVTILFYLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKE 275
Query: 581 LKSSIRSMDMHGHGGKKVLVVPAF 604
+ S + + H + +++P F
Sbjct: 276 VMSLPQFL---YHSNRSAVILPVF 296
>gi|419622112|ref|ZP_14155354.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni LMG 23216]
gi|380600476|gb|EIB20813.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
jejuni LMG 23216]
Length = 483
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KHY+ + +L L +LP+ K + LD D++ +D+ +L+ L Q I +
Sbjct: 111 KHYATYF---RLYLGSILPKNCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164
Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
W ++GK K++ + +G FN G +L++L + R S +A +
Sbjct: 165 KQWPGSLFIGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMINVANSYTF- 223
Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
+ DQDI N +I++ V LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247
>gi|300176754|emb|CBK24419.2| unnamed protein product [Blastocystis hominis]
Length = 369
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 46/235 (19%), Positives = 92/235 (39%), Gaps = 52/235 (22%)
Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +V +RL + ++GPI L +Y + + + Y + + R I +
Sbjct: 110 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 168
Query: 537 HVVYK---EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V + +PIN LRN+A+ + T + +LD+D Y + S + + H
Sbjct: 169 YLVSRYLQSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKEVMSLPQFL---YH 225
Query: 594 GGKKVLVVPAF--ETQRYRTAFPASHAPTNFSRWVNATTPYQI----------------- 634
+ +++P F ++++ ++ + + S W+ ++
Sbjct: 226 SNRSAVILPVFFYKSKQILAHCSSTESCSYLSNWIQPENKLELIECINRKICISNKNNIR 285
Query: 635 -------EWAPD----------------FEPYIVAHRD--LPRYDTRFVGFGWNK 664
EW EPY++ LP ++ RFV +G+NK
Sbjct: 286 THMYVMPEWFTTSKESHVSRVRCFITNFMEPYLMVKYSPTLPLFEERFVNYGYNK 340
>gi|332018122|gb|EGI58734.1| Uncharacterized protein C3orf21-like protein [Acromyrmex
echinatior]
Length = 358
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 48/107 (44%), Gaps = 10/107 (9%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P +YS + L L ++ P I+ D D F D+ L+ F++ + GL
Sbjct: 168 PGTYYSDALFFISLGLHRIAPNHQKLAIMFDADTKFRRDVKDLFEEFNKFGKEALFGLAP 227
Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLR 282
+ Y + L++N P G G+N+G++LL+L +LR
Sbjct: 228 ELTPVYRHVLYLYRNKHPTTMFGEPRHKGGYPGYNSGIVLLNLERLR 274
>gi|300175964|emb|CBK22181.2| unnamed protein product [Blastocystis hominis]
Length = 429
Score = 47.4 bits (111), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 50/238 (21%), Positives = 89/238 (37%), Gaps = 58/238 (24%)
Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +V +RL + ++GPI L +Y + + + Y + + R I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 228
Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V K +PIN LRN+A+ + T + +LD+D Y + S + + H
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKEVMSLPQFL---YH 285
Query: 594 GGKKVLVVPAFETQRYRTAFPASHAPTN-----FSRWVNATTPYQI-------------- 634
+ +++P F Y+ +H T S W+ ++
Sbjct: 286 SNRSAVILPVF---LYKGKQILAHCSTTESCSYLSNWIQPENKLELIECINRKICISNKN 342
Query: 635 ----------EWAPD----------------FEPYIVAHRD--LPRYDTRFVGFGWNK 664
EW EPY++ LP ++ RFV +G+NK
Sbjct: 343 NIRTHMYVMPEWFTTSKESHVSRVRCFITNFMEPYLMVKYSPTLPLFEERFVNYGYNK 400
>gi|298480509|ref|ZP_06998706.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Bacteroides sp. D22]
gi|298273330|gb|EFI14894.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Bacteroides sp. D22]
Length = 311
Score = 47.4 bits (111), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 22/147 (14%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y + L+L +P++LP + K + +D+D+IF DI+ L+ + ++
Sbjct: 79 YPSLATYLRLFIPQLLPLNVEKALYIDSDIIFNGDISDLYNCDIANYALAGMEDAPHRQP 138
Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTS 305
LG +P FN G +LL++ LRDI++ + R K +
Sbjct: 139 ERLG--------YPIEDLYFNAGFLLLNIKYLRDINFVDMALSYIRDCHHKIV------- 183
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNV 332
L DQD+ NA++ V +P +WN+
Sbjct: 184 LHDQDVLNALLHGR---VRFVPIKWNM 207
>gi|302853592|ref|XP_002958310.1| hypothetical protein VOLCADRAFT_108086 [Volvox carteri f.
nagariensis]
gi|300256335|gb|EFJ40603.1| hypothetical protein VOLCADRAFT_108086 [Volvox carteri f.
nagariensis]
Length = 1182
Score = 47.4 bits (111), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
H T++S+W N T PY +EW+ +EP++V R + D RF G+G NK
Sbjct: 1035 GHKNTDYSKWFNTTEPYVVEWSGRYEPWVVVDRLSSSWADARFRGYGKNK 1084
>gi|340720433|ref|XP_003398643.1| PREDICTED: n-acetyllactosaminide
beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
terrestris]
Length = 449
Score = 47.4 bits (111), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 84/241 (34%), Gaps = 70/241 (29%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ---------------------- 516
VTL + D++ + L + WEGP+SL ++ +A
Sbjct: 127 VTLCTHATADQVYGIVELARRWEGPLSLAVFTPGLDAGIAVALLDRACRCEPEMYKVSVH 186
Query: 517 ------------QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYV 564
+ G+ A +R G + YPIN RNVA Q +T V
Sbjct: 187 LVFPASRPPALGRITRIHGDCAASDLQRGDGATERKRRNMIYPINVARNVARIQANTTRV 246
Query: 565 FLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAFETQR--------------- 608
+ DI+ LP L S +D G+ V VVP FE +
Sbjct: 247 LVTDIELLP----SQRLASGFMEIDREKTPKGRIVFVVPVFEIESNEVPPLTKKELLLAT 302
Query: 609 ------YRTAFPASHAP--TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP--RYDTRFV 658
Y F SH SRW+ P ++ +P I+ R+ P R++ F+
Sbjct: 303 KAGLAVYFHRFLCSHCQRFPGLSRWLIRPDPGKV------KPLIITKREYPYHRWEPVFI 356
Query: 659 G 659
G
Sbjct: 357 G 357
>gi|300123773|emb|CBK25045.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWRHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGYHV---VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I Y+ VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYNEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|300120974|emb|CBK21216.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W+ I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I Y VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|300176749|emb|CBK24414.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWRHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGYHV---VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I Y+ VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYNEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|300119989|emb|CBK19543.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R + L K W+ I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I Y VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|148673701|gb|EDL05648.1| mCG51561 [Mus musculus]
Length = 230
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 69/161 (42%), Gaps = 10/161 (6%)
Query: 13 EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
E + +A+V G VT++KS L + PLH H+ + N + ++WS + +
Sbjct: 38 EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHNSFKDRLASWSFLRRF 96
Query: 72 EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
+ S Y + + W K +L LP +L E + + +DTD++F +
Sbjct: 97 DYSLYPITFPCDSAADW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLPPVDD 152
Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW-PALGR 170
+W+L + Q + + +G W N W P GR
Sbjct: 153 IWSLLKKFNSTQIAAMAPEHKEPRIG--WYNRFAWHPYYGR 191
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L LP +L E + + +DTD++F + +W+L + Q + + +G W
Sbjct: 124 RLFLPLILKE-VDSLLYVDTDILFLPPVDDIWSLLKKFNSTQIAAMAPEHKEPRIG--WY 180
Query: 258 NHKPW-PALGR-GFNTGVILLDLTKLR 282
N W P GR G N+GV+L+++T++R
Sbjct: 181 NRFAWHPYYGRTGVNSGVMLMNMTRMR 207
>gi|300123357|emb|CBK24630.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 53/125 (42%), Gaps = 6/125 (4%)
Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
+MV+ + N D Y D D+TLV S ++ + WEGPI + + + E Q
Sbjct: 85 KMVQSSSNTFDLY--PDDTFDITLVTYASASMYELAPKYCRQWEGPIVIVVSYNQNEESQ 142
Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL 577
+ + + + I + + P+N ++N + V T + + D +P L
Sbjct: 143 LTQFLASIASFPAVSVIPFPITAVS----PVNAMKNAGIAAVRTTHYVFAEFDLVPSSNL 198
Query: 578 YPYLK 582
Y LK
Sbjct: 199 YSVLK 203
>gi|300123772|emb|CBK25044.2| unnamed protein product [Blastocystis hominis]
Length = 429
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 60/131 (45%), Gaps = 8/131 (6%)
Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +V +RL + ++GPI L +Y + + + Y + + R I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 228
Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V K +PIN LRN+A+ + T + +LD+D Y + S + + H
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSVNTYKEVMSLPQFL---YH 285
Query: 594 GGKKVLVVPAF 604
+ +++P F
Sbjct: 286 SNRSAVILPVF 296
>gi|291527266|emb|CBK92852.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Eubacterium rectale M104/1]
Length = 723
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 22/188 (11%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYL 252
+ ++ LP ++P+ L K I LD+D+ TDI +LW L L Q + N W
Sbjct: 429 MFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W-- 482
Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
G + + R FN GV+ ++L +R + + F ++I +L T L DQD
Sbjct: 483 GTPYAVAAGQTSRDRYFNAGVLCMNLDNIRK-NGSLFQQVI--DYLNDNPRTWLPDQDAL 539
Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
NAI S L+ +WN + + +++ E + KI H+ + L + + + E
Sbjct: 540 NAIFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEID 589
Query: 373 RNLYLTFL 380
R Y T L
Sbjct: 590 RAYYFTIL 597
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 55/117 (47%), Gaps = 29/117 (24%)
Query: 32 TLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNK 91
T ++SI+ K P+ FH++ D I + S ++AD+ D+ +
Sbjct: 362 TTMQSIVENTKAPIVFHILHDDTLNEINKNKLS-----------FIADNSGNDIEF---H 407
Query: 92 HY-SGVYG-------------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
H+ S V+G + ++ LP ++P+ L K I LD+D+ TDI +LW L
Sbjct: 408 HFNSDVFGTFADSMNRFAIGTMFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNL 463
>gi|431910233|gb|ELK13306.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Pteropus alecto]
Length = 148
Score = 46.6 bits (109), Expect = 0.044, Method: Composition-based stats.
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 617 HAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
APTN+SRWVN Y + W +EP+ VA +P +D RF +G+N+
Sbjct: 29 QAPTNYSRWVNLPEETLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNR 82
>gi|300176052|emb|CBK23363.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 46.6 bits (109), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 22/139 (15%)
Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +V +RL + ++GPI L +Y + + + Y + + R I +
Sbjct: 110 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 168
Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V K +PIN LRN+A+ + T + +LD+D Y
Sbjct: 169 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY--------------- 213
Query: 594 GGKKVLVVPAFETQRYRTA 612
K+V+ +P F Q R+A
Sbjct: 214 --KEVMSLPQFLYQSNRSA 230
>gi|300121109|emb|CBK21490.2| unnamed protein product [Blastocystis hominis]
Length = 343
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 68/148 (45%), Gaps = 22/148 (14%)
Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
D +DVTL+ L++ R L K W+ I++ +++++ E + F+
Sbjct: 169 DYDDVTLITILTLVRKDRFPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228
Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
Y S ++I Y VY+ +PIN LR++A+ + T + F+ DID
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288
Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
LY L+ +++ K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312
>gi|444709933|gb|ELW50928.1| hypothetical protein TREES_T100021730 [Tupaia chinensis]
Length = 405
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 66/169 (39%), Gaps = 27/169 (15%)
Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP------- 263
+ I LD D+ + T+I +L+ F IG+ Y W+ P
Sbjct: 236 RIIQLDLDLKYKTNIRELFEEFDNFLPAAVIGIAREMQPVYRHTFWQFRHENPKTRVGGP 295
Query: 264 --ALGRGFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAI 315
GFN+G +L +R S G + +A+K+ L DQD F I
Sbjct: 296 PPEGLPGFNSG---FNLEAMRQSSLYGRLLEPAKVQQLADKY---HFRGHLGDQDFFTMI 349
Query: 316 ISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
EHP L + L C WN QL D+ SD E + +KI H N
Sbjct: 350 GMEHPELFHVLDCTWNRQLCTWWRDHGYSDIFEAYFRCEGHVKIYHGNC 398
>gi|300121608|emb|CBK22126.2| unnamed protein product [Blastocystis hominis]
Length = 224
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 22/139 (15%)
Query: 478 DVTLVA-QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +A +RL + ++GPI L +Y + + + Y ++ + R I +
Sbjct: 71 DVTAIATAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYI-STHFIPKRVTILF 129
Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V K +PIN LRN+A+ + T + +LD+D L+ S+ +
Sbjct: 130 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMD----------LRLSLNTY----- 174
Query: 594 GGKKVLVVPAFETQRYRTA 612
K+VL +P F Q R+A
Sbjct: 175 --KEVLSLPQFLYQSNRSA 191
>gi|300176048|emb|CBK23359.2| unnamed protein product [Blastocystis hominis]
Length = 376
Score = 46.2 bits (108), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 22/139 (15%)
Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
DVT +V +RL + ++GPI L +Y + + + Y + + R I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEIHLVRYIS-THFIPKRVTILF 228
Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
++V K +PIN LRN+A+ + T + +LD+D Y
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY--------------- 273
Query: 594 GGKKVLVVPAFETQRYRTA 612
K+V+ +P F Q R+A
Sbjct: 274 --KEVMSLPQFLYQSNRSA 290
>gi|345313001|ref|XP_001516834.2| PREDICTED: uncharacterized protein C3orf21 homolog, partial
[Ornithorhynchus anatinus]
Length = 111
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 37/74 (50%), Gaps = 9/74 (12%)
Query: 268 GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPY 321
GFN+GV+LLDL +R + + + E + +L L DQD F I EHP
Sbjct: 25 GFNSGVMLLDLEAMRQ---SQLYNQLLEPARIQQLTEKYHFRGHLGDQDFFTMIGMEHPE 81
Query: 322 LVYTLPCQWNVQLS 335
L + L C WN QL
Sbjct: 82 LFHVLDCTWNRQLC 95
>gi|336385107|gb|EGO26254.1| glycosyltransferase family 49 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 355
Score = 45.8 bits (107), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 54/264 (20%), Positives = 110/264 (41%), Gaps = 66/264 (25%)
Query: 462 INPNNVDE--YEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ- 516
+ PN + Y AS N D+T+ ++ +R ++ L + ++GPIS+T+++ +
Sbjct: 15 MQPNKIVPFFYRASGTFNPEDITITTLVTSNRFKVFAQLVERYQGPISVTVHIKSDSSHT 74
Query: 517 -----QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
+ +S ++ + ++ H+V + +NT RN+A T +V +LD+DF
Sbjct: 75 TALLNELHKLYASSPSMATFVDV--HLVLDPFD-RQLNTWRNIARFFARTDFVMMLDVDF 131
Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----------------------- 608
+ +++S M G LVVPAFE +
Sbjct: 132 VVCTDFRAAVRASKAVMARLSE-GYTALVVPAFEYLKQADGADQKKFPRDKNVCISIHSS 190
Query: 609 -----------------------YRTAFPASHAPTNFSRWVNATTPYQI----EWAPDFE 641
+ ++ H T+++R+ ++ P ++ ++ +E
Sbjct: 191 LSLTHFSLLQSLLSLVKAKKIDMFHRSWAPGHNSTDYARFY-SSQPGEVYKVTQYHSAYE 249
Query: 642 PYIVAHRDLPRY-DTRFVGFGWNK 664
PY++ + P + D RF+G+G NK
Sbjct: 250 PYVIFKKTGPPWCDERFIGYGGNK 273
>gi|392356257|ref|XP_003752296.1| PREDICTED: glucoside xylosyltransferase 1-like [Rattus norvegicus]
Length = 197
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)
Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
P G+ G N+GV+L+++T++R + W + +K+ L W DQD
Sbjct: 20 PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 76
Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ N + +P ++ PCQWN + E + I+H N
Sbjct: 77 LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 123
>gi|33988262|gb|AAH15597.2| GLT8D3 protein [Homo sapiens]
Length = 202
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)
Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
P G+ G N+GV+L+++T++R + W + +K+ L W DQD
Sbjct: 25 PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 81
Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ N + +P ++ PCQWN + E + I+H N
Sbjct: 82 LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 128
>gi|159475561|ref|XP_001695887.1| hypothetical protein CHLREDRAFT_149054 [Chlamydomonas reinhardtii]
gi|158275447|gb|EDP01224.1| predicted protein [Chlamydomonas reinhardtii]
Length = 181
Score = 45.8 bits (107), Expect = 0.073, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 3/45 (6%)
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPY--IVAHRDLPRYDTRFVG 659
H P N RW A PY ++WA FEP+ + HRD P YD + G
Sbjct: 121 HGPINHKRWATAKEPYTVKWAQAFEPWGILARHRD-PGYDEVWPG 164
>gi|238924869|ref|YP_002938385.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
ATCC 33656]
gi|238876544|gb|ACR76251.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
ATCC 33656]
Length = 723
Score = 45.8 bits (107), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 86/188 (45%), Gaps = 22/188 (11%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYL 252
+ ++ LP ++P+ L K I LD+D+ TDI +LW L L Q + N W
Sbjct: 429 MFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W-- 482
Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
G + + R FN GV+ ++L +R + + F +++ +L T L DQD
Sbjct: 483 GTPYAVAAGQTSRDRYFNAGVLCMNLDNIRK-NGSLFQQVM--DYLSDNPRTWLPDQDAL 539
Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
NAI S L+ +WN + + +++ E + KI H+ + L + + + E
Sbjct: 540 NAIFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEID 589
Query: 373 RNLYLTFL 380
R Y T L
Sbjct: 590 RAYYFTIL 597
>gi|265764909|ref|ZP_06093184.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides sp. 2_1_16]
gi|263254293|gb|EEZ25727.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides sp. 2_1_16]
Length = 311
Score = 45.8 bits (107), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 87/194 (44%), Gaps = 27/194 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
+ L ++LP + K + +D D++ DI++ W + + Q IG +E+ + Y +
Sbjct: 89 RCLLSRILPVNIDKILYIDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
L + K FN GV+L++L R+ + + FL DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
++ ++ V P +WNVQ + R+ E + LK I+H+ + K +
Sbjct: 198 LLYKNKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254
Query: 368 H---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268
>gi|296491302|tpg|DAA33365.1| TPA: hypothetical protein LOC533703 [Bos taurus]
Length = 333
Score = 45.8 bits (107), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 9/120 (7%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
F + +++ + S +YS L + + +++P+ + + I LD D+ + T+I +L+
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
F IG+ Y W+ N K P P GFN+GV+LL+L +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301
>gi|119578235|gb|EAW57831.1| hCG39367, isoform CRA_a [Homo sapiens]
Length = 177
Score = 45.8 bits (107), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)
Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
P G+ G N+GV+L+++T++R + W + +K+ L W DQD
Sbjct: 20 PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 76
Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ N + +P ++ PCQWN + E + I+H N
Sbjct: 77 LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 123
>gi|291526000|emb|CBK91587.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Eubacterium rectale DSM 17629]
Length = 723
Score = 45.4 bits (106), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 85/186 (45%), Gaps = 22/186 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYLGK 254
++ LP ++P+ L K I LD+D+ TDI +LW L L Q + N W G
Sbjct: 431 RIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W--GT 484
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
+ + R FN GV+ ++L +R + + F ++I +L T L DQD NA
Sbjct: 485 PYAVAAGQTSRDRYFNVGVLCMNLDNIRK-NGSLFQQVI--DYLNDNPRTWLPDQDALNA 541
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
I S L+ +WN + + +++ E + KI H+ + L + + + E R
Sbjct: 542 IFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEIDRA 591
Query: 375 LYLTFL 380
Y T L
Sbjct: 592 YYFTIL 597
>gi|329964475|ref|ZP_08301529.1| glycosyltransferase, family 8 [Bacteroides fluxus YIT 12057]
gi|328524875|gb|EGF51927.1| glycosyltransferase, family 8 [Bacteroides fluxus YIT 12057]
Length = 316
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 90/199 (45%), Gaps = 22/199 (11%)
Query: 172 FYL-ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
FYL + +++D S + H S + ++ L +LP L+K + LD D++ I +LW
Sbjct: 67 FYLIGEEILQDCSIYGDSHIS-LATYYRIFLCSILPADLSKALYLDCDLVVLGSINELWN 125
Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHK-PWPALGRGFNTGVILLDLTKLRDISWAGF 289
+ + Q + VE D + GK + + + FN GV+L++L R++ + G
Sbjct: 126 --TDISQ-YAVACVE---DMWSGKPDNYERLHYASSDSYFNAGVLLVNLDYWRELDFEGL 179
Query: 290 WRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTR----SDE 342
+ ++ DQD+ NA++ + LP +WNVQ L R S
Sbjct: 180 AMAYIKAHHSELVFN---DQDVLNALLHDRKLF---LPFRWNVQDGFLRRKRRIRQESIA 233
Query: 343 LCYTELTDLKIIHWNSPKK 361
+ EL IIH+ KK
Sbjct: 234 MLDEELKSPVIIHYTGGKK 252
>gi|300122825|emb|CBK23832.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 45.4 bits (106), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 34/152 (22%), Positives = 65/152 (42%), Gaps = 27/152 (17%)
Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
P+N +Y+ ++ +V +RL + ++GPI L +Y + + + Y
Sbjct: 103 PSNDQKYDVTA------IVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS 156
Query: 524 NSEALRSRRNIGYHVVYK---EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
+ + R I +++V + +PIN LRN+A+ + T + +LD+D Y
Sbjct: 157 -THFIPKRVTILFYLVSRYLQSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY-- 213
Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA 612
K+V+ +P F Q R+A
Sbjct: 214 ---------------KEVMSLPQFLYQSNRSA 230
>gi|427412548|ref|ZP_18902740.1| hypothetical protein HMPREF9282_00147 [Veillonella ratti
ACS-216-V-Col6b]
gi|425716355|gb|EKU79339.1| hypothetical protein HMPREF9282_00147 [Veillonella ratti
ACS-216-V-Col6b]
Length = 606
Score = 45.4 bits (106), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 76/197 (38%), Gaps = 21/197 (10%)
Query: 173 YLADSVVEDVSWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
Y D +S +P N+ Y + +L + ++ K I +D+DVI A DI LW
Sbjct: 347 YFVDVDARSLSGLPLNRAYISINTYYRLLIHDLIDAD--KIIYIDSDVIVADDILNLWNF 404
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
+ G ++ G + A N GV++ +L +++ + R
Sbjct: 405 --DVTDACIAGALDEG-----GVMQSRRLSLGANSNYINAGVLVFNLKEIK-ARYKDPLR 456
Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS-------DNTRSDELC 344
+ E + R SL DQDI N + LP +WNV D S
Sbjct: 457 LYLETYYFNRDLISLQDQDILNLAFKNE---IKVLPLKWNVNGRIFEVNELDFKYSKADI 513
Query: 345 YTELTDLKIIHWNSPKK 361
L DL IIH+ KK
Sbjct: 514 NEALNDLGIIHYTDHKK 530
>gi|300175897|emb|CBK21893.2| unnamed protein product [Blastocystis hominis]
Length = 276
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 73/198 (36%), Gaps = 73/198 (36%)
Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY----------LKSSIR 586
H+V GN YPIN LRN+A++ V+T + F+ D+D P+ G+ P+ +I
Sbjct: 35 HLVLAYGNDYPINYLRNLAIDAVATSHFFVADMDVWPV-GISPFPLFSDFVGDVYNQAIS 93
Query: 587 SMDMH-------------------GHGGKKVLVVPAFETQRYRTAFPASHAPT------- 620
++M + ++VP +E R F + PT
Sbjct: 94 ILNMQPDPNYSGPINFNPAMVGSVANRNTTAIIVPVYEYMRACNKF--ATCPTMRENEIP 151
Query: 621 ----NFSRWVNATTPYQIEWAPDF----------------------------EPYIVAHR 648
R + +T Q D EPY++ +
Sbjct: 152 ATKKALRRCLETSTCRQFRQGQDLHNYYFDEWLDSNYTQPLTLVKCFVKDKQEPYVILKK 211
Query: 649 --DLPRYDTRFVGFGWNK 664
+PR+D RFV +G+NK
Sbjct: 212 TEGMPRFDERFVNYGYNK 229
>gi|423269915|ref|ZP_17248887.1| hypothetical protein HMPREF1079_01969 [Bacteroides fragilis
CL05T00C42]
gi|423272630|ref|ZP_17251577.1| hypothetical protein HMPREF1080_00230 [Bacteroides fragilis
CL05T12C13]
gi|392700761|gb|EIY93923.1| hypothetical protein HMPREF1079_01969 [Bacteroides fragilis
CL05T00C42]
gi|392708707|gb|EIZ01812.1| hypothetical protein HMPREF1080_00230 [Bacteroides fragilis
CL05T12C13]
Length = 311
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
+ L ++LP + K + +D D++ DI++ W + + Q IG +E+ + Y +
Sbjct: 89 RCLLSRILPVNIDKILYMDCDIVVLNDISEFWD--TDITQ-YAIGCIEDIGSDEEEYYSR 145
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
L + K FN GV+L++L R+ + + FL DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
++ + V P +WNVQ + R+ E + LK I+H+ + K +
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254
Query: 368 H---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268
>gi|237751063|ref|ZP_04581543.1| glycosyl transferase [Helicobacter bilis ATCC 43879]
gi|229373508|gb|EEO23899.1| glycosyl transferase [Helicobacter bilis ATCC 43879]
Length = 365
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 53/98 (54%), Gaps = 13/98 (13%)
Query: 46 HFHLITDTVA------LNILQTLFSTWSVPQVEVSFYLADSVVEDV---SWIPNKHYSGV 96
HFH+ITD++A +ILQT + Q+E ++D +D+ + + Y+
Sbjct: 35 HFHVITDSIAKKTLEQFHILQTTLNDIYPCQIEAHI-ISDEDFKDLPKWGYEEAQQYAAY 93
Query: 97 YGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
Y ++ L LP+ + K + LDTD++ TD+ +L+AL
Sbjct: 94 Y---RVKLVDFLPKNVDKCLYLDTDMLVLTDLRELFAL 128
>gi|60679779|ref|YP_209923.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Bacteroides fragilis NCTC 9343]
gi|375356564|ref|YP_005109335.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
protein [Bacteroides fragilis 638R]
gi|383116528|ref|ZP_09937276.1| hypothetical protein BSHG_1402 [Bacteroides sp. 3_2_5]
gi|423248186|ref|ZP_17229202.1| hypothetical protein HMPREF1066_00212 [Bacteroides fragilis
CL03T00C08]
gi|423253135|ref|ZP_17234066.1| hypothetical protein HMPREF1067_00710 [Bacteroides fragilis
CL03T12C07]
gi|423259422|ref|ZP_17240345.1| hypothetical protein HMPREF1055_02622 [Bacteroides fragilis
CL07T00C01]
gi|423263604|ref|ZP_17242607.1| hypothetical protein HMPREF1056_00294 [Bacteroides fragilis
CL07T12C05]
gi|423282507|ref|ZP_17261392.1| hypothetical protein HMPREF1204_00930 [Bacteroides fragilis HMW
615]
gi|60491213|emb|CAH05961.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
protein [Bacteroides fragilis NCTC 9343]
gi|251948201|gb|EES88483.1| hypothetical protein BSHG_1402 [Bacteroides sp. 3_2_5]
gi|301161244|emb|CBW20782.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
protein [Bacteroides fragilis 638R]
gi|387777002|gb|EIK39102.1| hypothetical protein HMPREF1055_02622 [Bacteroides fragilis
CL07T00C01]
gi|392657035|gb|EIY50672.1| hypothetical protein HMPREF1067_00710 [Bacteroides fragilis
CL03T12C07]
gi|392660293|gb|EIY53907.1| hypothetical protein HMPREF1066_00212 [Bacteroides fragilis
CL03T00C08]
gi|392707026|gb|EIZ00146.1| hypothetical protein HMPREF1056_00294 [Bacteroides fragilis
CL07T12C05]
gi|404582075|gb|EKA86770.1| hypothetical protein HMPREF1204_00930 [Bacteroides fragilis HMW
615]
Length = 311
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
+ L ++LP + K + +D D++ DI++ W + + Q IG +E+ + Y +
Sbjct: 89 RCLLSRILPVNIDKILYIDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
L + K FN GV+L++L R+ + + FL DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
++ + V P +WNVQ + R+ E + LK I+H+ + K +
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254
Query: 368 H---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268
>gi|420236654|ref|ZP_14741135.1| glycosyl transferase, family 8 [Parascardovia denticolens IPLA
20019]
gi|391880108|gb|EIT88604.1| glycosyl transferase, family 8 [Parascardovia denticolens IPLA
20019]
Length = 324
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 69/166 (41%), Gaps = 25/166 (15%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL-----FSRLRQRQTIGLVENQSDWY 251
+L L ++ P+ + K I LD D + DIAQL+ + F + E S +
Sbjct: 99 FRLFLSEMFPD-IDKAIYLDADTVTNVDIAQLYRIDLGDNFVAAVNDNFVAAGEETSYYT 157
Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
L L + N+GV+L++L +R+ AGF + + + ADQD
Sbjct: 158 LNALGIPSSEY------VNSGVLLMNLKAMRE---AGFVDHFVKLLNAYHVESIAADQDY 208
Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
N I + L +WN ++D T E KIIH+N
Sbjct: 209 LNVICRGR---ILMLGYEWNTMMADGTSGPEHP-------KIIHYN 244
>gi|218130745|ref|ZP_03459549.1| hypothetical protein BACEGG_02336 [Bacteroides eggerthii DSM 20697]
gi|217987089|gb|EEC53420.1| general stress protein A [Bacteroides eggerthii DSM 20697]
Length = 321
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 90/193 (46%), Gaps = 23/193 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL-- 255
+L +P++LP + KT LD D++ + + W ++++ + + + +D+
Sbjct: 104 RLFIPQILPLNIKKTFYLDCDLLVLHPLREFWN--TKMQNKGVAAIADQWTDYIEAATRL 161
Query: 256 -WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
++N + + FN GV+LL+L LR+ ++ K ++ DQD+ N
Sbjct: 162 KYRNDREY------FNAGVLLLNLEYLRNHNFTNNAIDFVTKHANDIVY---HDQDVLNK 212
Query: 315 IISEHPYLVYTLPCQWNV-QLSDNTRSDELCYTELTDLK----IIHWNSPKKLKVKNKHM 369
+I E+ + +P +WNV N + + + D + IIH+ +P K ++
Sbjct: 213 LIGENRII---MPVKWNVCSFKINDKIPHIYNATMNDARKDPYIIHFFAPIKPWNQDSSH 269
Query: 370 EFFRNLYLTFLEY 382
+R+ Y FL++
Sbjct: 270 P-YRSYYYYFLQF 281
>gi|53711516|ref|YP_097508.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides fragilis
YCH46]
gi|52214381|dbj|BAD46974.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides fragilis
YCH46]
Length = 311
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
+ L ++LP + K + +D D++ DI++ W + + Q IG +E+ + Y +
Sbjct: 89 RCLLSRILPVNIDKILYIDCDIVVLNDISEFWD--TDITQ-YAIGCIEDIGSDEEEYYSR 145
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
L + K FN GV+L++L R+ + + FL DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
++ + V P +WNVQ + R+ E + LK I+H+ + K +
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254
Query: 368 H---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268
>gi|302848747|ref|XP_002955905.1| hypothetical protein VOLCADRAFT_96805 [Volvox carteri f.
nagariensis]
gi|300258873|gb|EFJ43106.1| hypothetical protein VOLCADRAFT_96805 [Volvox carteri f.
nagariensis]
Length = 1206
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 613 FPASHAPTNFSRWV-NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
FPA H T++ RW A Y I +A FEP+ +A R +P +D R G+G NK
Sbjct: 888 FPAGHGATDYLRWFKQANEAYDISYARKFEPWFMAGRQVVPWHDVRLRGYGHNK 941
>gi|302828420|ref|XP_002945777.1| hypothetical protein VOLCADRAFT_127353 [Volvox carteri f.
nagariensis]
gi|217416170|dbj|BAH03159.1| InvC [Volvox carteri f. nagariensis]
gi|300268592|gb|EFJ52772.1| hypothetical protein VOLCADRAFT_127353 [Volvox carteri f.
nagariensis]
Length = 401
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 62/243 (25%)
Query: 479 VTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSDAEA---------------------- 515
VT+ ++DRL M++ +++ G P +Y+ +A
Sbjct: 24 VTMFTTATLDRLDMLDTQCRNYRGGPHVAAVYLPLVQAANKSGLTNENEQAVRTAEEKLQ 83
Query: 516 QQFLSYAGNSEALRSRRNIGYHVVYKEGNFY--PINTLRNVALNQVSTPYVFLLDIDFLP 573
+ F +EA Y V Y PIN LRN AL T V ++D+D
Sbjct: 84 ELFYRMEATTEACHMHILFLYEVTSDSALAYLTPINALRNAALLASQTQLVMMIDVDLCV 143
Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY------------------------ 609
L ++ S + V+PA++ +
Sbjct: 144 SNTLVQFVSESKYIDQLVRMSSNSFWVLPAWDVNKQLPQEQINEVAASAASGTKMQLVKL 203
Query: 610 ----------RTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD---LPRYDTR 656
+ F H+PTN++RW+ + Y I+++ +EP+ + RD + YD R
Sbjct: 204 WLNGSLNWFGQHYFQLGHSPTNYTRWLMSRRSYVIDYSIGYEPWGLVSRDRMAIVPYDAR 263
Query: 657 FVG 659
F G
Sbjct: 264 FRG 266
>gi|357623047|gb|EHJ74352.1| hypothetical protein KGM_22246 [Danaus plexippus]
Length = 398
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 541 KEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM--DMHGHGG-- 595
+E N YPIN RNVA N T Y + DI+ +P GL P + +R + D G
Sbjct: 167 RENNMIYPINVGRNVARNASRTNYFIVSDIELVPSNGLAPKFLAMLRKLMGDKKRDEGCI 226
Query: 596 --KKVLVVPAFETQR 608
K V VVP FE +R
Sbjct: 227 FSKTVFVVPLFEVER 241
>gi|317474579|ref|ZP_07933853.1| glycosyl transferase family 8 [Bacteroides eggerthii 1_2_48FAA]
gi|316909260|gb|EFV30940.1| glycosyl transferase family 8 [Bacteroides eggerthii 1_2_48FAA]
Length = 313
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 21/141 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+ LP +LP L+K I LD+D++ I ++W + L G+ E +S K
Sbjct: 88 RCMLPSLLPSQLSKAIYLDSDILVLDSIKEIWN--TDLNNIAIAGIEEARS--------K 137
Query: 258 NHKPWPALG-----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
K LG R N GV+L++L R + R K + L+ DQD+
Sbjct: 138 EDKHCDRLGYAPSYRYINAGVLLINLDYWRKYNIEEKCRQYYAKNIDRMLYN---DQDLL 194
Query: 313 NAIISEHPYLVYTLPCQWNVQ 333
NA++ + ++ P ++NVQ
Sbjct: 195 NALLYDKKAVI---PTRYNVQ 212
>gi|294786378|ref|ZP_06751632.1| glycosyltransferase [Parascardovia denticolens F0305]
gi|315225938|ref|ZP_07867726.1| glycosyltransferase [Parascardovia denticolens DSM 10105 = JCM
12538]
gi|294485211|gb|EFG32845.1| glycosyltransferase [Parascardovia denticolens F0305]
gi|315120070|gb|EFT83202.1| glycosyltransferase [Parascardovia denticolens DSM 10105 = JCM
12538]
Length = 324
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 69/166 (41%), Gaps = 25/166 (15%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL-----FSRLRQRQTIGLVENQSDWY 251
+L L ++ P+ + K I LD D + DIAQL+ + F + E S +
Sbjct: 99 FRLFLSEMFPD-IDKAIYLDADTVTNVDIAQLYRIDLGDNFFAAVNDNFVAAGEETSYYT 157
Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
L L + N+GV+L++L +R+ AGF + + + ADQD
Sbjct: 158 LNALGIPSSEY------VNSGVLLMNLKAMRE---AGFVDHFVKLLNAYHVESIAADQDY 208
Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
N I + L +WN ++D T E KIIH+N
Sbjct: 209 LNVICRGR---ILMLGYEWNTMMADGTSGPEHP-------KIIHYN 244
>gi|431910232|gb|ELK13305.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
[Pteropus alecto]
Length = 221
Score = 44.7 bits (104), Expect = 0.15, Method: Composition-based stats.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEA 527
+ + D NDV L S+D L + L + WEGP+S++++ + E Q L+YA +S
Sbjct: 85 KTTMDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHC 144
Query: 528 LRSRRNIGYHVV 539
R + H+V
Sbjct: 145 PDMRARVAMHLV 156
>gi|42518147|ref|NP_964077.1| hypothetical protein LJ0061 [Lactobacillus johnsonii NCC 533]
gi|227889100|ref|ZP_04006905.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
johnsonii ATCC 33200]
gi|385825016|ref|YP_005861358.1| hypothetical protein [Lactobacillus johnsonii DPC 6026]
gi|41582431|gb|AAS08043.1| hypothetical protein LJ_0061 [Lactobacillus johnsonii NCC 533]
gi|227850329|gb|EEJ60415.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
johnsonii ATCC 33200]
gi|329666460|gb|AEB92408.1| hypothetical protein LJP_0069c [Lactobacillus johnsonii DPC 6026]
Length = 316
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L ++ V + S ++ L
Sbjct: 94 RLFIPNLFPQ-YDKAVYLDADTIICTDIAEL---YNTEIGDNMFASVPDMSIRFIKPLQV 149
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ + RD + + + EK+ + DQ N
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKEFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKDPKIVHYN 239
>gi|417838481|ref|ZP_12484719.1| glycosyl transferase family 8 domain containing protein
[Lactobacillus johnsonii pf01]
gi|338762024|gb|EGP13293.1| glycosyl transferase family 8 domain containing protein
[Lactobacillus johnsonii pf01]
Length = 316
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L ++ V + S ++ L
Sbjct: 94 RLFIPNLFPQ-YDKAVYLDADTIICTDIAEL---YNTEIGDNMFASVPDMSIRFIKPLQV 149
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ + RD + + + EK+ + DQ N
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKEFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKDPKIVHYN 239
>gi|356499719|ref|XP_003518684.1| PREDICTED: probable galacturonosyltransferase-like 3-like [Glycine
max]
Length = 342
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 11/94 (11%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
++ L +LP T+ + I D+D+I D+A+LW++ R +G E N ++++
Sbjct: 144 RMYLADLLPATVRRIIYFDSDLIVVDDVAKLWSIDLHAR---VLGAPEYCHANFTNYFTH 200
Query: 254 KLWKNHKPWPALGRG----FNTGVILLDLTKLRD 283
+ W N + R FNTGV+++DL K R+
Sbjct: 201 RFWSNPSYAASFKRRDACYFNTGVMVIDLWKWRE 234
>gi|365175698|ref|ZP_09363125.1| hypothetical protein HMPREF1006_01070 [Synergistes sp. 3_1_syn1]
gi|363612266|gb|EHL63811.1| hypothetical protein HMPREF1006_01070 [Synergistes sp. 3_1_syn1]
Length = 341
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 19/132 (14%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
L +L +P+++ E K + LD D+I DIA+LW + +TIG V + W L +
Sbjct: 94 LFRLLIPQLINEK--KILYLDCDLIIDLDIAELW---NEPLDGKTIGAVCDV--WSLDYI 146
Query: 256 ------WKNHKPWPALG----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS 305
W+ W +G FN GV ++DL ++RD F + + + + +
Sbjct: 147 KGTPVPWRYALAWEIMGIRNDSYFNAGVTIMDLERIRD--RYDFLKEVETFYDKYKKCIT 204
Query: 306 LADQDIFNAIIS 317
LADQD N I +
Sbjct: 205 LADQDCLNYIFA 216
>gi|297672811|ref|XP_002814481.1| PREDICTED: uncharacterized protein LOC100458876 [Pongo abelii]
Length = 570
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 15/101 (14%)
Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
FN+GV+LL+L +R + + + E + +L L DQD F I EHP L
Sbjct: 465 FNSGVMLLNLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPKL 521
Query: 323 VYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWN 357
+ L C WN QL D+ SD E + +KI H N
Sbjct: 522 FHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGN 562
>gi|81299339|ref|YP_399547.1| lipopolysaccharide biosynthesis proteins LPS [Synechococcus
elongatus PCC 7942]
gi|81168220|gb|ABB56560.1| Lipopolysaccharide biosynthesis proteins LPS [Synechococcus
elongatus PCC 7942]
Length = 329
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 9/193 (4%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLGKL 255
++ LP+V P+ L + + DTDV+ D A L + V ++ W Y K
Sbjct: 139 RVWLPQVFPD-LTRILYFDTDVVLLEDPAILDQQAGDFNDQIFFAAVPHSRPAWLYFKKP 197
Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
W+ H A+G FN+GV++ DL + + + + D+ + NA
Sbjct: 198 WRAHSYIKAMGTTFNSGVMVTDLRFWTEAVYQRIQAALDRDRQFRYRFLEPGDEALLNAC 257
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR-SDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
+ LP +WN N R L + + IIHW S K N H + +
Sbjct: 258 FPNY----RALPKRWNRCGYGNARFVARLLACDPQEAAIIHW-SGGHHKPWNTHDIIYGD 312
Query: 375 LYLTFLEYDGNLL 387
L+ + G LL
Sbjct: 313 LWRRYANLPGLLL 325
>gi|366054099|ref|ZP_09451821.1| glycosyl transferase family protein [Lactobacillus suebicus KCTC
3549]
Length = 316
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 76/166 (45%), Gaps = 27/166 (16%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L + + P+ K I LD D+I DI+ +LF + +G V +D ++ L
Sbjct: 97 RLFIADMFPQ-YDKAIYLDADIIVTGDIS---SLFDKNIDNNLMGAV---NDRFIA-LDP 148
Query: 258 NHKPWPALGRG------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
N + G N+GV+LL+L ++R + +G + + +F T DQD
Sbjct: 149 NGSSYAREAIGVDGSLYVNSGVLLLNLDEIRQFNLSGRFLDLLRRF---HFDTIAPDQDY 205
Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
NAI + ++ L WN Q + + +TD++IIH+N
Sbjct: 206 LNAICQDR---IFQLDPAWNYQTA-------VLDDSVTDIQIIHFN 241
>gi|56751002|ref|YP_171703.1| hypothetical protein syc0993_d [Synechococcus elongatus PCC 6301]
gi|56685961|dbj|BAD79183.1| unknown protein [Synechococcus elongatus PCC 6301]
Length = 250
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 9/193 (4%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLGKL 255
++ LP+V P+ L + + DTDV+ D A L + V ++ W Y K
Sbjct: 60 RVWLPQVFPD-LTRILYFDTDVVLLEDPAILDQQAGDFNDQIFFAAVPHSRPAWLYFKKP 118
Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
W+ H A+G FN+GV++ DL + + + + D+ + NA
Sbjct: 119 WRAHSYIKAMGTTFNSGVMVTDLRFWTEAVYQRIQAALDRDRQFRYRFLEPGDEALLNAC 178
Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR-SDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
+ LP +WN N R L + + IIHW S K N H + +
Sbjct: 179 FPNY----RALPKRWNRCGYGNARFVARLLACDPQEAAIIHW-SGGHHKPWNTHDIIYGD 233
Query: 375 LYLTFLEYDGNLL 387
L+ + G LL
Sbjct: 234 LWRRYANLPGLLL 246
>gi|30689156|ref|NP_189474.2| putative galacturonosyltransferase-like 10 [Arabidopsis thaliana]
gi|75273232|sp|Q9LHD2.1|GATLA_ARATH RecName: Full=Probable galacturonosyltransferase-like 10; AltName:
Full=Galactinol synthase 8; Short=AtGolS8; Short=GolS-8
gi|11994580|dbj|BAB02626.1| glycosyl transferase-like protein [Arabidopsis thaliana]
gi|44917577|gb|AAS49113.1| At3g28340 [Arabidopsis thaliana]
gi|51971391|dbj|BAD44360.1| unknown protein [Arabidopsis thaliana]
gi|332643914|gb|AEE77435.1| putative galacturonosyltransferase-like 10 [Arabidopsis thaliana]
Length = 365
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L ++L +++ I LD+DVI DI +LW + L +TIG E N + ++ W
Sbjct: 162 LSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTIGAPEYCHANFTKYFTDSFW 219
Query: 257 KNHKPWPALGRG----FNTGVILLDLTKLRD 283
+ K FNTGV+++DL + R+
Sbjct: 220 SDQKLSSVFDSKTPCYFNTGVMVIDLERWRE 250
Score = 40.0 bits (92), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 64/143 (44%), Gaps = 9/143 (6%)
Query: 28 RSLVTLIKSILFYRKNP--LHFHLI-TDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
R V+ + SIL + P + FH I + T ++ +TL S + +V + +V
Sbjct: 83 RGTVSAVHSILKHTSCPENIFFHFIASGTSQGSLAKTLSSVFPSLSFKVYTFEETTVKNL 142
Query: 85 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
+S + + L ++L +++ I LD+DVI DI +LW + L +TI
Sbjct: 143 ISSSIRQALDSPLNYARSYLSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTI 200
Query: 145 GLVE----NQSDWYLGKLWKNHK 163
G E N + ++ W + K
Sbjct: 201 GAPEYCHANFTKYFTDSFWSDQK 223
>gi|86152043|ref|ZP_01070256.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
260.94]
gi|315124873|ref|YP_004066877.1| sugar transferase [Campylobacter jejuni subsp. jejuni ICDCCJ07001]
gi|56783473|emb|CAI38726.1| putative sugar transferase [Campylobacter jejuni]
gi|85841151|gb|EAQ58400.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
260.94]
gi|315018595|gb|ADT66688.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
ICDCCJ07001]
Length = 459
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 84/179 (46%), Gaps = 34/179 (18%)
Query: 165 WPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 224
+P+ + +++ D++ D + IP KH L+L L +L + + K + LD D++ D
Sbjct: 63 YPSEIKIYHINDNLFYDYN-IP-KHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGD 120
Query: 225 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL------------GRGFNTG 272
I++L+ L + + + ++++ PWP L G FN+G
Sbjct: 121 ISELFDLDLKDKVFAAVFILKH--------------PWPNLNSKDSSEIFYIYGSHFNSG 166
Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
++L++L R+ + R ++ F+ D+ + NAI+S+ +++L +WN
Sbjct: 167 LMLINLDAWREKNIES--RSLS--FIKNYYVPYAVDEYVLNAILSKDD--IFSLKLEWN 219
>gi|418087438|ref|ZP_12724607.1| hypothetical protein SPAR87_1223 [Streptococcus pneumoniae GA47033]
gi|353758454|gb|EHD39046.1| hypothetical protein SPAR87_1223 [Streptococcus pneumoniae GA47033]
Length = 1077
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 77/173 (44%), Gaps = 30/173 (17%)
Query: 151 SDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLA 210
SDW+ +L + ++ G Y+ D++ DV W +H S + + +P+++ E
Sbjct: 41 SDWF-HELKIQVEKLGSVVEGIYIGDTI--DVEWKTQEHISPI-AYARYLIPRLIKE--E 94
Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFN 270
+ I LD+D+I D++ L+ L + D+ + + A G GFN
Sbjct: 95 RVIYLDSDIIVNGDLSSLFEL--------------DFGDYSIAAVRD------ADGNGFN 134
Query: 271 TGVILLDLTKLR--DISWAGFWRIIAEKFLLTRLWTSL--ADQDIFNAIISEH 319
+GV+++D K R D++ F + + L T DQ IFN + H
Sbjct: 135 SGVLVIDSQKWREKDVTSILFDKTVEYMSYLDHTDTDRFNGDQTIFNLVFQNH 187
>gi|297818430|ref|XP_002877098.1| hypothetical protein ARALYDRAFT_905093 [Arabidopsis lyrata subsp.
lyrata]
gi|297322936|gb|EFH53357.1| hypothetical protein ARALYDRAFT_905093 [Arabidopsis lyrata subsp.
lyrata]
Length = 364
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L ++L +++ I LD+DVI DI +LW + L +TIG E N + ++ W
Sbjct: 161 LSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTIGAPEYCHANFTKYFTESFW 218
Query: 257 KNHKPWPALGRG----FNTGVILLDLTKLRD 283
+ K FNTGV+++DL + R+
Sbjct: 219 SDRKLSSVFDSKTPCYFNTGVMVIDLDRWRE 249
>gi|384110005|ref|ZP_10010852.1| Lipopolysaccharide biosynthesis protein [Treponema sp. JC4]
gi|383868445|gb|EID84097.1| Lipopolysaccharide biosynthesis protein [Treponema sp. JC4]
Length = 350
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 71/168 (42%), Gaps = 33/168 (19%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+ +P + P+ K I LD D+I DI++L+ I L N +++
Sbjct: 95 RFFIPNLFPK-YKKIIYLDCDIIVKGDISELY----------NIDLGNNYVAAAPEEVFI 143
Query: 258 NHKPWPALGRG---------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
H W RG FN GV+LL+ ++ A + +AEK+ + + D
Sbjct: 144 LHPNWLNYPRGALGIEPEDYFNAGVMLLNTERMLKDKIAEKFVALAEKY----TFRIVQD 199
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHW 356
+D N ++ +H + LP WN N D+ +LKIIH+
Sbjct: 200 EDYLNVLLKDHKKI---LPLDWNKTAFKNDAYDD------KNLKIIHY 238
>gi|357445497|ref|XP_003593026.1| UDP-glucose:glycoprotein glucosyltransferase [Medicago truncatula]
gi|355482074|gb|AES63277.1| UDP-glucose:glycoprotein glucosyltransferase [Medicago truncatula]
Length = 1650
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 68/141 (48%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + R +N + W
Sbjct: 1431 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDLKGRPLAYTPFCDNNREMDGYRFWRQ 1490
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R+ + R+ E L++ SLA DQ
Sbjct: 1491 G-FWKDH----LRGRPYHISALYVVDLKKFRETAAGDNLRVFYET--LSKDPNSLANLDQ 1543
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1544 DLPN--YAQHTVPIFSLPQEW 1562
>gi|381150345|ref|ZP_09862214.1| LPS:glycosyltransferase [Methylomicrobium album BG8]
gi|380882317|gb|EIC28194.1| LPS:glycosyltransferase [Methylomicrobium album BG8]
Length = 311
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 16/112 (14%)
Query: 181 DVSWIPNKHYSGVYGLLKLT-----LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
+V P + +S + + ++T +P V PET++K + LD D++ DIA L
Sbjct: 71 EVDMKPFREFSTIAHISRITFARFLIPDVFPETVSKVLYLDADILVLDDIAPL------C 124
Query: 236 RQRQTIGLVENQSDWYLGKLWKNHKPW----PALGRGFNTGVILLDLTKLRD 283
R L+ +D YL K +P P + FN GV+L+DL + R+
Sbjct: 125 RMELNGALLGAVTD-YLDACLKRGEPLFAAVPRVSNYFNAGVLLIDLGRWRE 175
>gi|423331375|ref|ZP_17309159.1| hypothetical protein HMPREF1075_01172 [Parabacteroides distasonis
CL03T12C09]
gi|409230671|gb|EKN23533.1| hypothetical protein HMPREF1075_01172 [Parabacteroides distasonis
CL03T12C09]
Length = 301
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 24/198 (12%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY D V + P + + +L + +LP+++ K + LD DV+ + LW
Sbjct: 61 FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
+ G++ + S + +++ + P+LG FN GV+L++L R+ + + F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ--------LSDNTRSDE 342
II + R DQD+ N ++ E + TLP ++NVQ L T DE
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPMKYNVQHGYFFKDPLISRTYRDE 226
Query: 343 LCYTELTDLKIIHWNSPK 360
+TD I+H++ K
Sbjct: 227 R-EQAITDPVILHYSGSK 243
>gi|339448960|ref|ZP_08652516.1| bifunctional glycosyl transferase family protein [Lactobacillus
fructivorans KCTC 3543]
Length = 290
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 89/200 (44%), Gaps = 33/200 (16%)
Query: 205 LPETLAK--TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK---NH 259
+PE + + + LD D+I DI LW + L + +G VEN YL +L + +
Sbjct: 103 IPEEVKRPRILYLDADMICDGDITGLWQ--TDLGGK-VVGAVENAG--YLDRLREMGVSE 157
Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
KP GR FN G++L+D K ++ + R +A DQD NAI + +
Sbjct: 158 KP----GRYFNAGLLLIDTKKWKEQGISQRARNLANDHPEI---LRFQDQDALNAIFNGN 210
Query: 320 PYLVYTLPCQWNVQLS--------DNTRSDELCYTE-LTDLKIIHW-NSPKKLKVKNKHM 369
+LP ++NVQ + T S E L IIH+ N K ++N H+
Sbjct: 211 ---WQSLPSKYNVQSNLVKGKYRKSGTESGRRSQQEALNQPVIIHYTNFDKPWLIRNGHL 267
Query: 370 EFFRNLYLTFLEYDGNLLRR 389
R+LY EY LL +
Sbjct: 268 HPLRSLY---DEYQNKLLNQ 284
>gi|366165519|ref|ZP_09465274.1| glycosyl transferase family protein [Acetivibrio cellulolyticus
CD2]
Length = 484
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 77/182 (42%), Gaps = 21/182 (11%)
Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
P Y+ Y ++ LP++L ++ K + LD D++ DIA+LW + VE
Sbjct: 80 PCASYATYY---RIFLPELLDSSIEKVLYLDCDIVVKGDIAKLW---ETDITGNYLAAVE 133
Query: 246 NQSDWYLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRL 302
+ Y G+ K K ++ R FN GV++++L R G I + +
Sbjct: 134 DVGVEYSGEFGKKVKENLSMDRKDIYFNAGVLIINLDLWRQ---HGISDKICDFLIQNPD 190
Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS------DNTRSDELCYTELTDLKIIHW 356
ADQD NA++S L WN Q++ D D+ L + IIH+
Sbjct: 191 KAPFADQDGLNAVLSGK---WVPLSLLWNQQVALWEHFDDGKPLDQEMLESLHNPFIIHY 247
Query: 357 NS 358
S
Sbjct: 248 TS 249
>gi|255014941|ref|ZP_05287067.1| putative glucosyltransferase [Bacteroides sp. 2_1_7]
gi|410105347|ref|ZP_11300255.1| hypothetical protein HMPREF0999_04027 [Parabacteroides sp. D25]
gi|409232557|gb|EKN25403.1| hypothetical protein HMPREF0999_04027 [Parabacteroides sp. D25]
Length = 301
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 24/198 (12%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY D V + P + + +L + +LP+++ K + LD DV+ + LW
Sbjct: 61 FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
+ G++ + S + +++ + P+LG FN GV+L++L R+ + + F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ--------LSDNTRSDE 342
II + R DQD+ N ++ E + TLP ++NVQ L T DE
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPMKYNVQHGYFFKDPLISRTYRDE 226
Query: 343 LCYTELTDLKIIHWNSPK 360
+TD I+H++ K
Sbjct: 227 R-EQAITDPVILHYSGSK 243
>gi|356495990|ref|XP_003516853.1| PREDICTED: probable galacturonosyltransferase-like 3-like [Glycine
max]
Length = 338
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 52/95 (54%), Gaps = 13/95 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
++ L +LP T+ + I D+D+I D+A+LW++ R +G E N ++++
Sbjct: 140 RMYLADLLPATVRRIIYFDSDLIVVDDVAKLWSIDLHAR---VLGAPEYCHANFTNYFTH 196
Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRD 283
+ W N + A +G FNTGV+++DL K R+
Sbjct: 197 RFWSN-PSYAASFKGRDACYFNTGVMVIDLWKWRE 230
>gi|357146607|ref|XP_003574051.1| PREDICTED: LOW QUALITY PROTEIN: UDP-glucose:glycoprotein
glucosyltransferase-like [Brachypodium distachyon]
Length = 1604
Score = 43.9 bits (102), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + R +N + W
Sbjct: 1385 LFLDVIFPLSLRKVIFVDADQIVRTDMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1444
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R + R++ E L++ SL+ DQ
Sbjct: 1445 G-FWKDH----LRGRPYHISALYVVDLAKFRQTAAGDTLRVVYET--LSKDPNSLSNLDQ 1497
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1498 DLPN--YAQHTVPIFSLPQEW 1516
>gi|324497845|gb|ADY39571.1| putative N-acetyllactosaminide [Hottentotta judaicus]
Length = 232
Score = 43.9 bits (102), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 61/161 (37%), Gaps = 41/161 (25%)
Query: 484 QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR------RNIGYH 537
Q + + L V +L W+ PIS+ +Y A F+S LR N+ +H
Sbjct: 2 QGTYEFLHHVNILCSRWQAPISVAVY---APGDDFISAHLTISYLRQCGPPCIVENVTWH 58
Query: 538 VVY------------------------------KEGNFYPINTLRNVALNQVSTPYVFLL 567
++Y K+G YPIN RNVA + T Y+F
Sbjct: 59 MIYDTDFPPEERKSVVKPINCSDSLNLSSSYKTKKGLLYPINVARNVARLKAETYYIFSS 118
Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
DI+ P G+ ++ + ++ VP FE ++
Sbjct: 119 DIELYPSIGIVTQFLDMVQKEE--NSETLRIYAVPVFEIKK 157
>gi|284047872|ref|YP_003398211.1| glycosyl transferase family protein [Acidaminococcus fermentans DSM
20731]
gi|283952093|gb|ADB46896.1| glycosyl transferase family 8 [Acidaminococcus fermentans DSM
20731]
Length = 309
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 75/173 (43%), Gaps = 25/173 (14%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
L+L +P+++P+ + + I LDTD++ DI +LW + Q + +G V + ++
Sbjct: 87 LRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMDL---QGKPVGAVPDLGILASSRMR 143
Query: 257 KNHKPWPALGRG---FNTGVILLDLTKLRDISWAG-FWRIIAEKFLLTRLWTSLADQDIF 312
+ + + G FN+GV++++L R+ + R + E DQD
Sbjct: 144 RQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIRCVEEGNFRHH------DQDGL 197
Query: 313 NAIISEHPYLVYTLPCQWN---------VQLSDNTRSDELCYTELTDLKIIHW 356
N + ++ LP +WN V++ +R L L + HW
Sbjct: 198 NKVFQDN---WQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEALERPAVFHW 247
>gi|378823526|ref|ZP_09846149.1| glycosyltransferase, family 8 [Sutterella parvirubra YIT 11816]
gi|378597650|gb|EHY30915.1| glycosyltransferase, family 8 [Sutterella parvirubra YIT 11816]
Length = 309
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 4/117 (3%)
Query: 19 IVCAGYNSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNILQTLFSTWSVPQVEVSFY 76
+ C+ N + ++SI+ + P + F+ I D ++ L T ++F
Sbjct: 7 VFCSDDNFAQHSAVTMRSIVEHCSRPADIDFYYIHDGISEEKQVRLRRTIEEFGATLTFL 66
Query: 77 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
D V ++I HY + +L +P LP+ + + I LD D++ D+A+LWA
Sbjct: 67 TPDVSVFKDAFI--NHYYSLASYFRLLVPSCLPQEVHRCIYLDGDMVVDGDVAELWA 121
>gi|47222079|emb|CAG12105.1| unnamed protein product [Tetraodon nigroviridis]
Length = 416
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 92/227 (40%), Gaps = 58/227 (25%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
+L LP +L + + + +D+D++F + LWA S+ Q + + WY
Sbjct: 119 RLFLPLIL-KDVDSIVYVDSDILFLQPMDDLWAFLSKFTPSQLAAMAPEHEEPRIAWY-- 175
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDI-----------SWAGFWRIIAEKFLLTR 301
+ H P GR G N+GV+L+++T++R + W + +K+ L
Sbjct: 176 SRFARH---PFYGRTGVNSGVMLMNMTRMRTVVFQNDMTSVPLRWEELLMPLLQKYKLNI 232
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
W DQD+ N I T+ + + +S+ +C+ +
Sbjct: 233 TW---GDQDLLNIIFP------ITILVRLHTAVSNYASVCSVCF--------------QA 269
Query: 362 LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSN 408
LK KN + L+L F+ REL G +L Q ELS+
Sbjct: 270 LKFKNCNSRIHFPLFLFFVP-------RELAGVSL------QVELSS 303
>gi|336426113|ref|ZP_08606126.1| hypothetical protein HMPREF0994_02132 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336011071|gb|EGN41039.1| hypothetical protein HMPREF0994_02132 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 352
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 68/154 (44%), Gaps = 24/154 (15%)
Query: 181 DVSWIPNKHYSGVYGL------------LKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 228
D+ WIP K S + +L + VLP L + + LD D++ + +L
Sbjct: 82 DIQWIPAKDISEELNMDVVVDRGSLSQYARLFVSSVLPADLERVLYLDCDIVINQSLDEL 141
Query: 229 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG 288
W L + + L + S WY + + KP + FN+GV+L+DL + ++
Sbjct: 142 WNL--DMHGKTIAALKDAFSKWYRANI--DLKPTDIM---FNSGVMLIDLKRWKEQKIEK 194
Query: 289 FWRIIAEKFLLTRLW-TSLADQDIFNAIISEHPY 321
R++ KF+ ++ DQ NA++S Y
Sbjct: 195 --RLM--KFIASKNGRIQQGDQGALNAVLSHDTY 224
>gi|171058066|ref|YP_001790415.1| glycosyl transferase family protein [Leptothrix cholodnii SP-6]
gi|170775511|gb|ACB33650.1| glycosyl transferase family 8 [Leptothrix cholodnii SP-6]
Length = 316
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 53/136 (38%), Gaps = 10/136 (7%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L + +LP L + + LDTD++ D L L +G V + D L
Sbjct: 96 RLLMADLLPAELERVLYLDTDILVLGD---LLPLMRTELDGAILGAVRDGLDAELKSTSP 152
Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
P + FN GV+L+DL + R AG A L+ T ADQD N
Sbjct: 153 APTGMPDVCDYFNAGVLLIDLARWR----AGRVSAAARDHLVAHPQTPFADQDALNVACD 208
Query: 318 EHPYLVYTLPCQWNVQ 333
H L WN Q
Sbjct: 209 GH---WKPLAAHWNFQ 221
>gi|295691931|ref|YP_003600541.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
crispatus ST1]
gi|295030037|emb|CBL49516.1| Lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
crispatus ST1]
Length = 317
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 95 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 207 EICEDE---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240
>gi|296084616|emb|CBI25666.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 8/95 (8%)
Query: 47 FHLITDTVALNILQTLF-------STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGL 99
FH++TD + L +Q +F S V VE +L S V + + N Y +
Sbjct: 250 FHVVTDKMNLGAMQVMFKMRDYNGSHIEVKAVEDYKFLNSSYVPVLRQLENPKYLSMLNH 309
Query: 100 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
L+ LP++ P+ L + + LD DV+ D+ LW +
Sbjct: 310 LRFYLPEMYPK-LHRILFLDDDVVVQRDLTGLWKI 343
>gi|428176520|gb|EKX45404.1| hypothetical protein GUITHDRAFT_163273 [Guillardia theta CCMP2712]
Length = 1204
Score = 43.5 bits (101), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 58/131 (44%), Gaps = 15/131 (11%)
Query: 486 SMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV------V 539
++DR ++ L W GP+S+ Y S E ++ + E +R+ ++ +
Sbjct: 189 TVDRAVVIYNLAHRWRGPLSVVFYTSTEEEREIVRTFVREELIRASKSYCKSLWVQLVSD 248
Query: 540 YKEGN------FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
K+ N +P+N LR A+N + YV +D DFLP ++ ++ D
Sbjct: 249 CKDKNQPDDQFSFPVNKLRFEAINVLPHDYVLYVDADFLPSDNALVEIERGLKEFD---E 305
Query: 594 GGKKVLVVPAF 604
+LV+P F
Sbjct: 306 LESALLVIPCF 316
>gi|365853174|ref|ZP_09393469.1| glycosyltransferase, family 8 [Lactobacillus parafarraginis F0439]
gi|363713150|gb|EHL96796.1| glycosyltransferase, family 8 [Lactobacillus parafarraginis F0439]
Length = 316
Score = 43.5 bits (101), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 71/147 (48%), Gaps = 24/147 (16%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG--LVENQSDWYLGK 254
+L + ++ P+ L K I LD D + DIA+L+ + ++G L+ D+++G
Sbjct: 95 FRLFIAELFPD-LDKAIYLDADTVILDDIAKLYDV--------SLGENLIGAVPDYFIGH 145
Query: 255 L-----WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-D 308
+ N + N+GV+L++L +RD +A + + ++ + SLA D
Sbjct: 146 TPETIDYANQAVGIDSQKYVNSGVLLMNLKVMRDCHFADHFLDLLNRY----HFKSLAPD 201
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
QD NAI + + L WNVQ++
Sbjct: 202 QDYMNAIARKR---ICYLDSSWNVQIT 225
>gi|224108539|ref|XP_002314884.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222863924|gb|EEF01055.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 342
Score = 43.5 bits (101), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 59/117 (50%), Gaps = 16/117 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
++ L +LP ++ + I D+D+I D+A+LW + +G E N ++++
Sbjct: 145 RIYLADLLPMSVRRIIYFDSDLILVDDVAKLWNIN---LGAHVLGAPEYCHANFTNYFNS 201
Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWA---GFWRIIAEKFLLTRL 302
+ W N +L RG FNTGV+++DL K R+ + +W + +K+ + L
Sbjct: 202 RFWSNSACAASL-RGRRACYFNTGVMVIDLGKWREGKYTERLEYWMKVQKKYRIYEL 257
>gi|293380058|ref|ZP_06626154.1| glycosyl transferase family 8 [Lactobacillus crispatus 214-1]
gi|312977025|ref|ZP_07788774.1| glycosyltransferase [Lactobacillus crispatus CTV-05]
gi|423319858|ref|ZP_17297733.1| hypothetical protein HMPREF9250_02166 [Lactobacillus crispatus
FB049-03]
gi|423320134|ref|ZP_17298006.1| hypothetical protein HMPREF9249_00006 [Lactobacillus crispatus
FB077-07]
gi|290923372|gb|EFE00279.1| glycosyl transferase family 8 [Lactobacillus crispatus 214-1]
gi|310896353|gb|EFQ45418.1| glycosyltransferase [Lactobacillus crispatus CTV-05]
gi|405586879|gb|EKB60623.1| hypothetical protein HMPREF9250_02166 [Lactobacillus crispatus
FB049-03]
gi|405609037|gb|EKB81940.1| hypothetical protein HMPREF9249_00006 [Lactobacillus crispatus
FB077-07]
Length = 317
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 95 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 207 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240
>gi|313236397|emb|CBY11715.1| unnamed protein product [Oikopleura dioica]
Length = 275
Score = 43.1 bits (100), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 34/165 (20%)
Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-------QQFLSYAGNS 525
+ D D+TL ++ ++ + L + W+GP+S++L++ +A ++F +G
Sbjct: 86 AGDYEDITLATFATVVDIRQLWKLARKWQGPMSISLFVDGDQATIAHIAIEKFRKCSGLD 145
Query: 526 EALRSRRNIGYHVVY-----------------------KEGNFYPINTLRNVALNQVSTP 562
+++ + +H+VY + G P N LRN+A + ST
Sbjct: 146 DSI----PLHFHLVYSTSLEALTDDARLAELQQRIDFAQTGLRLPKNLLRNIAKDSASTD 201
Query: 563 YVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
+V DI P L ++ K+ V+PAFE +
Sbjct: 202 FVLSADISLSPNDNLRENFLKFANKENLFDATSKQAFVIPAFEIK 246
>gi|428186170|gb|EKX55021.1| hypothetical protein GUITHDRAFT_99661 [Guillardia theta CCMP2712]
Length = 1551
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 69/142 (48%), Gaps = 8/142 (5%)
Query: 193 VYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYL 252
++G L L + P ++ K I +D+D + +D+ QLW + R R +++ +
Sbjct: 1362 IWGYKILMLDVLFPLSVPKIIYIDSDQVVRSDLKQLWDMNLRGRPYAYTPFCDDKREIEG 1421
Query: 253 GKLWKNHKPWPALG--RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--D 308
+ WK LG + + + ++DL + R I R++ + L+R SLA D
Sbjct: 1422 YRFWKQGFWQTHLGDMKYHISALYVVDLNRFRAIGAGDELRVVYSQ--LSRDPNSLANLD 1479
Query: 309 QDIFNAIISEHPYLVYTLPCQW 330
QD+ N ++H +++LP +W
Sbjct: 1480 QDLPN--YAQHSVPIFSLPQEW 1499
>gi|256844221|ref|ZP_05549707.1| conserved hypothetical protein [Lactobacillus crispatus 125-2-CHN]
gi|256613299|gb|EEU18502.1| conserved hypothetical protein [Lactobacillus crispatus 125-2-CHN]
Length = 317
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 95 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 207 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240
>gi|296110269|ref|YP_003620650.1| lipopolysaccharide glycosyltransferase [Leuconostoc kimchii IMSNU
11154]
gi|339490576|ref|YP_004705081.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc sp.
C2]
gi|295831800|gb|ADG39681.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc
kimchii IMSNU 11154]
gi|338852248|gb|AEJ30458.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc sp.
C2]
Length = 277
Score = 43.1 bits (100), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 67/142 (47%), Gaps = 11/142 (7%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
+L + LP+TL + + LD D++ I LW L + Q I + + +
Sbjct: 87 FRLLCGEYLPKTLTRILYLDPDILVINPIRSLWTLDMK---NQMIAAASHNGLTGVSQTI 143
Query: 257 KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
N + + + FN+GV+L+DL K+R+ +++ RL L DQDI N +
Sbjct: 144 NNVR-FQNKNKYFNSGVMLMDLNKMREKVKLDDITSAIDRY-AKRL--ILPDQDILNYLY 199
Query: 317 SEHPYLVYTLPCQ-WNVQLSDN 337
S+H + ++P + WN DN
Sbjct: 200 SDH---ILSIPEEIWNYDTRDN 218
>gi|242066908|ref|XP_002454743.1| hypothetical protein SORBIDRAFT_04g036540 [Sorghum bicolor]
gi|241934574|gb|EES07719.1| hypothetical protein SORBIDRAFT_04g036540 [Sorghum bicolor]
Length = 1568
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N D W
Sbjct: 1349 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGYRFWKQ 1408
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R + R+ E+ L++ SL+ DQ
Sbjct: 1409 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQ--LSKDPNSLSNLDQ 1461
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1462 DLPN--YAQHTVPIFSLPQEW 1480
>gi|395243686|ref|ZP_10420670.1| Putative glucosyl transferase [Lactobacillus hominis CRBIP 24.179]
gi|394484101|emb|CCI81678.1| Putative glucosyl transferase [Lactobacillus hominis CRBIP 24.179]
Length = 316
Score = 43.1 bits (100), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 82/193 (42%), Gaps = 24/193 (12%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLL----KLTLPKVLPETLAKTIVLDTDVIFATDIAQ 227
F+++D +V+ + + G + + +L +P + P+ K + LD D I TDI++
Sbjct: 64 FHISDDMVQPIHKGEENYLRGQFFTMSIFYRLFIPDLFPQ-YNKAVYLDADTIICTDISE 122
Query: 228 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL---GRGFNTGVILLDLTKLRDI 284
++ + + S Y+ L K K + + N GVIL ++ RD
Sbjct: 123 MYDI---EIGDNMFASCPDLSIRYMPLLQKYIKECQGIFPPEKYINNGVILFNMKAFRDK 179
Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
+ + + EK+ L DQ N I + +Y L +W+ +++
Sbjct: 180 KFVDKFYYLMEKYHFDNL---DPDQAYMNEICEDK---IYHLDKEWDAMPNESM------ 227
Query: 345 YTELTDLKIIHWN 357
E+ D KI+H+N
Sbjct: 228 -PEIKDPKIVHYN 239
>gi|414159380|ref|ZP_11415666.1| hypothetical protein HMPREF9310_00040 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410884382|gb|EKS32208.1| hypothetical protein HMPREF9310_00040 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 280
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 21/165 (12%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
+L ++LPE++ K + LD D++ I LW L +R ++ +
Sbjct: 88 FRLACGELLPESVTKVLYLDPDILVINTIDDLWNLN---LERNIFAAATHEGFTKFSQGM 144
Query: 257 KNHKPWPALGRG-FNTGVILLDLTKLR-DISWAGFWRIIAE--KFLLTRLWTSLADQDIF 312
N + G+ FN+GV+L+D+ K R +IS + I + ++LL L DQD+
Sbjct: 145 NNLRLGTKQGQVYFNSGVMLIDVEKARQEISMDDIFNFIEKNNQYLL------LPDQDVM 198
Query: 313 NAIISEHPYLVYTLPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW 356
N++ + + +P + WN +TR + YT+ HW
Sbjct: 199 NSLYGDK---IKEIPEEIWNY----DTRQANMYYTKSVGKYDTHW 236
>gi|227879001|ref|ZP_03996898.1| glucosyl transferase, partial [Lactobacillus crispatus JV-V01]
gi|227861406|gb|EEJ69028.1| glucosyl transferase [Lactobacillus crispatus JV-V01]
Length = 287
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 65 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 120
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 121 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 176
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 177 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 210
>gi|262047884|ref|ZP_06020832.1| glucosyl transferase [Lactobacillus crispatus MV-3A-US]
gi|260571828|gb|EEX28401.1| glucosyl transferase [Lactobacillus crispatus MV-3A-US]
Length = 288
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 66 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 121
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 122 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 177
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 178 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 211
>gi|255564294|ref|XP_002523144.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223537706|gb|EEF39329.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 340
Score = 42.7 bits (99), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 11/94 (11%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DIA+LW + L R TIG E N + ++ W
Sbjct: 170 LADLLEPCVRRVIYLDSDLVVVDDIAKLWN--TNLGSR-TIGAPEYCHANFTKYFTSSFW 226
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISW 286
N + GR FNTGV+++DL K R + +
Sbjct: 227 SNKRFSSTFSGRKPCYFNTGVMVIDLVKWRRVGY 260
>gi|256849379|ref|ZP_05554812.1| conserved hypothetical protein [Lactobacillus crispatus MV-1A-US]
gi|256714155|gb|EEU29143.1| conserved hypothetical protein [Lactobacillus crispatus MV-1A-US]
Length = 286
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P++ P+ K + LD D I TDIA+++ + + S Y+ L K
Sbjct: 64 RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 119
Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
K +PA + N GVIL ++ RD + + + K+ + DQ N
Sbjct: 120 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 175
Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ D KI+H+N
Sbjct: 176 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 209
>gi|423349485|ref|ZP_17327141.1| hypothetical protein HMPREF9156_00679 [Scardovia wiggsiae F0424]
gi|393703033|gb|EJD65235.1| hypothetical protein HMPREF9156_00679 [Scardovia wiggsiae F0424]
Length = 319
Score = 42.7 bits (99), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 78/173 (45%), Gaps = 39/173 (22%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLR------QRQTIGLV 244
+L +P++ P+ + K I LD+D + TD+A+L++ L + + ++T V
Sbjct: 96 FRLFIPEMFPD-IDKAIYLDSDTVINTDVAELYSYDLGDNLVAGVHDVFMSANKETTDYV 154
Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
EN LG +++ N+G++L++L +R+ + + + K+ +
Sbjct: 155 ENA----LGMPVRDY---------INSGMLLMNLKAMREEHFTEHFVRLLRKY---HVEC 198
Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
+ DQD N I L L +WN ++D T + KIIH+N
Sbjct: 199 AAVDQDYLNVICRGRILL---LAREWNTMMTDGTAGSDHP-------KIIHYN 241
>gi|356500226|ref|XP_003518934.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
max]
Length = 357
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 11/95 (11%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DIA+LW+ + L R TIG E N + ++ W
Sbjct: 168 LADLLEPCVERVIYLDSDLVLVDDIAKLWS--TSLGSR-TIGAPEYCHANFTKYFTAGFW 224
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWA 287
+ + A GR FNTGV+++DL + R I ++
Sbjct: 225 SDMRFASAFAGRRPCYFNTGVMVIDLVRWRKIGYS 259
>gi|283956813|ref|ZP_06374287.1| hypothetical protein C1336_000310011 [Campylobacter jejuni subsp.
jejuni 1336]
gi|283791674|gb|EFC30469.1| hypothetical protein C1336_000310011 [Campylobacter jejuni subsp.
jejuni 1336]
Length = 396
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 33/186 (17%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK--- 254
++ +P++ K I D+DVIF DI+ L+ F+ L ++ IG + + Y +
Sbjct: 105 RIFIPEIF-SNFKKVIYCDSDVIFKADISHLF--FTDLNNKE-IGACRDIAALYAYRKRE 160
Query: 255 -LWK-------NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL 306
+W+ + + ++ FN+GVI+ D+ K + + + + L+
Sbjct: 161 TIWQQNIGNNFDKINFRSISDYFNSGVIVFDIVKCIQMKTVSKCLTVIKN--IDNLY--F 216
Query: 307 ADQDIFNAIISEHPYLVYTLPCQWN------VQLSDN-----TRSDELCYTELTDLKIIH 355
DQD+ N + H V+ LP +WN ++ DN + Y T KIIH
Sbjct: 217 PDQDVLNIVFCGH---VHFLPLEWNFLWTTYIEYRDNFMYLPKKIINEIYRAKTKPKIIH 273
Query: 356 WNSPKK 361
+ S K
Sbjct: 274 YISETK 279
>gi|224068414|ref|XP_002302739.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222844465|gb|EEE82012.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 367
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 20/104 (19%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWA--LFSRLRQRQTIGLVE----NQSDWYLGK 254
L +L + + I LD+D++ DIA+LW L SR+ IG E N + ++
Sbjct: 178 LADLLEPCVKRVIYLDSDLVVVDDIAKLWTTNLGSRI-----IGAPEYCHANFTKYFTAD 232
Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWAGFWRII 293
W + K + RG FNTGV+++DL K R WAG+ + I
Sbjct: 233 FWSD-KRFSGTFRGRKPCYFNTGVMVIDLVKWR---WAGYTKRI 272
>gi|323452188|gb|EGB08063.1| hypothetical protein AURANDRAFT_64432 [Aureococcus anophagefferens]
Length = 2067
Score = 42.7 bits (99), Expect = 0.62, Method: Composition-based stats.
Identities = 44/152 (28%), Positives = 59/152 (38%), Gaps = 15/152 (9%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI--------GLVENQS 248
+KL LP++L T+ + LDTDV+FA D A LW G V
Sbjct: 1300 VKLFLPQILDPTIRWAVYLDTDVVFAADPAALWLGAVGTLAAAGAAVAAAAEHGDVAPSC 1359
Query: 249 DWYLGKLWKNHKPWPALGRGF-----NTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLW 303
+ + AL R F NTGV+ LD RD W + T+
Sbjct: 1360 EGSAEHYYNQSAAARALPRAFPDDGANTGVLALDAAAARDAGWGPTLAAVMADLAATKAN 1419
Query: 304 TS--LADQDIFNAIISEHPYLVYTLPCQWNVQ 333
S L DQ +FN + V LPC+ N +
Sbjct: 1420 GSLDLGDQSVFNVAAGGDQFTVARLPCEANFR 1451
Score = 39.7 bits (91), Expect = 5.9, Method: Composition-based stats.
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 100 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+KL LP++L T+ + LDTDV+FA D A LW
Sbjct: 1300 VKLFLPQILDPTIRWAVYLDTDVVFAADPAALW 1332
>gi|429209656|ref|ZP_19200885.1| glycosyl transferase, family 8 [Rhodobacter sp. AKP1]
gi|428187382|gb|EKX55965.1| glycosyl transferase, family 8 [Rhodobacter sp. AKP1]
Length = 334
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 73/151 (48%), Gaps = 15/151 (9%)
Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN 246
++H S L+ P+VLPE + + + LD D+I D+A++ ++ R + + +
Sbjct: 75 DRHLSAA-AYLRFLAPEVLPEAVERVLYLDCDLIVLDDVAKILSIDLR---GKAVAAAPD 130
Query: 247 QSDWYLGKLWKNHKPWPALGRGF-NTGVILLDLTKLR--DISWAGFWRIIAEKFLLTRLW 303
+ + H L R + N+GV+L+DL + R +S F + LL R
Sbjct: 131 LGWKDAAQAARFHTLGIPLDRAYVNSGVLLMDLGRWRRDGLSQKLFDYVARHGSLLLR-- 188
Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
DQD NA++++ ++ L +WN+Q+
Sbjct: 189 ---HDQDALNAVLADD---IHLLDRRWNLQV 213
>gi|440800079|gb|ELR21122.1| hypothetical protein ACA1_283190 [Acanthamoeba castellanii str.
Neff]
Length = 563
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 70/144 (48%), Gaps = 10/144 (6%)
Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSEA 527
EY A++ +DVTLV LS+D+L ++ L +W+GPIS + + + +S N A
Sbjct: 223 EYAATA--SDVTLVTHLSLDQLSLLSDLASYWKGPISAAVLLPNQLSLGDLVSAHNNDPA 280
Query: 528 LRSRRNIGYHVVYKE---GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
L+ + H Y + + YP LR A + V++ YV +D+++ L +
Sbjct: 281 LKQWATL--HATYPDKVRSSGYPSGFLRKAATDAVTSTYVLQMDVNYRTEDSLRSDAERF 338
Query: 585 IRSMDMHGHGGKKVLVVPAFETQR 608
+ S+D V VVP+F R
Sbjct: 339 VASLDWTNE--DAVYVVPSFAVSR 360
>gi|227498918|ref|ZP_03929057.1| general stress protein A [Acidaminococcus sp. D21]
gi|352683476|ref|YP_004895459.1| general stress protein A [Acidaminococcus intestini RyC-MR95]
gi|226904369|gb|EEH90287.1| general stress protein A [Acidaminococcus sp. D21]
gi|350278129|gb|AEQ21319.1| general stress protein A [Acidaminococcus intestini RyC-MR95]
Length = 311
Score = 42.7 bits (99), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 22/143 (15%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
L+L + K+LP +AK + DTD++ D+A+LW IG V++ ++
Sbjct: 87 LRLLIAKLLPLAVAKALYFDTDLVVKDDVAKLWDF---PLDGHPIGAVKDFGIMASSRMR 143
Query: 257 KNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA----DQ 309
+ L G FN+GV+++DL R + EK L + TS A DQ
Sbjct: 144 RQKAESLGLPLGAPYFNSGVMIMDLAAFRKEGY-------GEKVL--QCVTSHAYRHHDQ 194
Query: 310 DIFNAIISEHPYLVYTLPCQWNV 332
D N + + + LP +WNV
Sbjct: 195 DGLNKVFMGNWSI---LPLRWNV 214
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
Query: 43 NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKL 102
PLHF+L D + + + T + V+F + D ++ + Y L+L
Sbjct: 33 RPLHFYLFDDGITGVKKEMVERTIKDLKGLVTFIDTKGIAVD-AYTSGHIHKAAY--LRL 89
Query: 103 TLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
+ K+LP +AK + DTD++ D+A+LW
Sbjct: 90 LIAKLLPLAVAKALYFDTDLVVKDDVAKLW 119
>gi|195382571|ref|XP_002050003.1| GJ20432 [Drosophila virilis]
gi|194144800|gb|EDW61196.1| GJ20432 [Drosophila virilis]
Length = 208
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 57/146 (39%), Gaps = 36/146 (24%)
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
+RN+A + YVFL DID +P L + +++ G+ V+P FE R
Sbjct: 1 MRNIARKGCQSKYVFLTDIDIIPSINSVVLLNNFFKTVTCDGNCA---YVIPTFEID-VR 56
Query: 611 TAFPASHA------------------------PTNFSRWV----NATTPYQIEWAPDF-- 640
FP + TNFS+W+ N T +F
Sbjct: 57 APFPRTKENLLGLIKKGLARPFHEKVFIYNQYATNFSKWLSSHRNETVVRISHTVTNFEF 116
Query: 641 --EPYIVAHRDLPRYDTRFVGFGWNK 664
EP+ + + P +D RF G+G+ +
Sbjct: 117 LYEPFYITIDNAPIHDERFTGYGFTR 142
>gi|418056880|ref|ZP_12694931.1| glycosyl transferase family 2 [Hyphomicrobium denitrificans 1NES1]
gi|353207652|gb|EHB73059.1| glycosyl transferase family 2 [Hyphomicrobium denitrificans 1NES1]
Length = 674
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 13/121 (10%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
LLK LP++LP + + LD D+I D++ L+ FS + G++ + Y
Sbjct: 94 ALLKFALPELLPHH-DRVLYLDGDLIVRDDLSDLF--FSDI-DGYVAGVISDSGQIYFKH 149
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
W +G FN+GV+LLDL ++R + I A+K SL DQ+ FN
Sbjct: 150 EWVRR-----VGNYFNSGVMLLDLKEMRRSNVTELL-IKAKK---ENCDGSLLDQNAFNI 200
Query: 315 I 315
+
Sbjct: 201 V 201
>gi|328958101|ref|YP_004375487.1| hypothetical protein CAR_c18140 [Carnobacterium sp. 17-4]
gi|328674425|gb|AEB30471.1| hypothetical protein CAR_c18140 [Carnobacterium sp. 17-4]
Length = 284
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 76/184 (41%), Gaps = 29/184 (15%)
Query: 162 HKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 221
H +P + +L + VV N++YS + +L P VLP L + + LD D+I
Sbjct: 57 HTFFPVDCKKLFLEEVVV-------NRYYS-IEMYYRLLAPFVLPIELERILYLDPDIIN 108
Query: 222 ATDIAQLW------ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVIL 275
+ + LF ++ ++ LG L + FNTGVIL
Sbjct: 109 LRSFSSFYKQDFEGKLFVATTHDYATKWIQPINNIRLGTL--------SSEDYFNTGVIL 160
Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI----ISEHPYLVYTLPCQWN 331
++LT +R I + + W L DQDIFN + + E + +Y L ++
Sbjct: 161 MNLTLIRSTRNL---EEITTAIKINKNWLVLPDQDIFNHLYWNEVKEADWRIYNLDARFY 217
Query: 332 VQLS 335
QL
Sbjct: 218 SQLK 221
>gi|402912676|ref|XP_003918876.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
[Papio anubis]
Length = 405
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 74/176 (42%), Gaps = 23/176 (13%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTD-VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG 253
GL +L+LP L K + TD F + +W+L + Q + + +G
Sbjct: 166 GLQRLSLPLFL-----KXVDSHTDSXFFLRPVDTIWSLLKKFHSTQIAAMAPEPKEPLIG 220
Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLT-----R 301
+ P G+ G N+GV+L ++T +R D++ R+ L+T +
Sbjct: 221 -WYSRFARHPYRGKAGGNSGVMLTNMTGMRRKYFKNDVTTV---RLQXGDILMTWLKIYK 276
Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
L + DQD+ + + +P + PCQWN L D+ C + I+H N
Sbjct: 277 LNMTWXDQDLLDIMXFHNPESPFVSPCQWNYHL-DHCLYGSSCQEAEEGIFILHGN 331
>gi|225447947|ref|XP_002268972.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Vitis
vinifera]
Length = 1611
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N D W
Sbjct: 1389 LFLDVIFPLSLEKVIFVDADQIVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQ 1448
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G WK+H KP+ + + ++DL K R+ + R+ E L++ SL+
Sbjct: 1449 G-FWKDHLRGKPYHI------SALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNL 1499
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 1500 DQDLPN--FAQHTVPIFSLPQEW 1520
>gi|297838947|ref|XP_002887355.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis lyrata
subsp. lyrata]
gi|297333196|gb|EFH63614.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis lyrata
subsp. lyrata]
Length = 1616
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + R +N + W
Sbjct: 1388 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYRFWRQ 1447
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK H GR ++ + + ++DL K R+ + R+ E L++ SL+ DQ
Sbjct: 1448 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1500
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1501 DLPN--YAQHTVPIFSLPQEW 1519
>gi|253826760|ref|ZP_04869645.1| glycosyl transferase [Helicobacter canadensis MIT 98-5491]
gi|253510166|gb|EES88825.1| glycosyl transferase [Helicobacter canadensis MIT 98-5491]
Length = 397
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 93/215 (43%), Gaps = 40/215 (18%)
Query: 163 KPWPALGRGFYLADSVVEDVSWIP--NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
K +P + L D + + S +P N +Y Y +L + LP ++ + + LD D+I
Sbjct: 61 KIYPCTLTIYILDDQLFREYS-MPTLNGNYLAYY---RLKIGSALPLSIKRCVYLDVDMI 116
Query: 221 FATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLT 279
D+ + LF Q + G+V E+ S + +KP G FN+G++L+DL
Sbjct: 117 VLGDLRE---LFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLD 173
Query: 280 KLRDISWAGFWR---IIAEKFLLTRLWT-SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
WR I F + + + S DQDI N ++S + V +WN+ +
Sbjct: 174 ---------LWRQENIEDRAFEIGKNYHYSFHDQDILNIVLSGKTHKV---GIEWNLMVC 221
Query: 336 --------DNTRSDELCY------TELTDLKIIHW 356
D D+L Y + L + KI+H+
Sbjct: 222 VYYRAICKDEKGRDKLPYYRKDFNSALRNPKILHY 256
>gi|413939392|gb|AFW73943.1| hypothetical protein ZEAMMB73_654737 [Zea mays]
Length = 417
Score = 42.4 bits (98), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 14/139 (10%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N D W
Sbjct: 201 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGFRFWKQ 260
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
G WK+H GR ++ + + ++DL K R + R+ E+ S DQD+
Sbjct: 261 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQLSKDPNSLSNLDQDL 315
Query: 312 FNAIISEHPYLVYTLPCQW 330
N ++H +++LP +W
Sbjct: 316 PN--YAQHTVPIFSLPQEW 332
>gi|147846492|emb|CAN79512.1| hypothetical protein VITISV_014158 [Vitis vinifera]
Length = 648
Score = 42.4 bits (98), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query: 47 FHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK 106
FH++TD ++ ++ F S +V + VE++ N Y + L+ LP+
Sbjct: 409 FHIVTDKLSFAAMKMWFLVNSPAKVTIQ-------VENIDDFKNPKYLSMLNHLRFYLPE 461
Query: 107 VLPETLAKTIVLDTDVIFATDIAQLWAL 134
V P+ L K + LD D++ D+ LW+L
Sbjct: 462 VYPK-LEKILFLDDDIVVQKDLTPLWSL 488
>gi|145337405|ref|NP_177278.3| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
gi|122236068|sp|Q0WL80.1|UGGG_ARATH RecName: Full=UDP-glucose:glycoprotein glucosyltransferase; AltName:
Full=EMS-mutagenized BRI1 suppressor 1; AltName:
Full=Protein PRIORITY IN SWEET LIFE 2; Flags: Precursor
gi|110740466|dbj|BAF02127.1| putative UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis
thaliana]
gi|332197053|gb|AEE35174.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
Length = 1613
Score = 42.4 bits (98), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + R +N + W
Sbjct: 1386 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYKFWKQ 1445
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK H GR ++ + + ++DL K R+ + R+ E L++ SL+ DQ
Sbjct: 1446 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1498
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1499 DLPN--YAQHTVPIFSLPQEW 1517
>gi|334183827|ref|NP_001185370.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
gi|332197054|gb|AEE35175.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
Length = 1614
Score = 42.4 bits (98), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + R +N + W
Sbjct: 1387 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYKFWKQ 1446
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK H GR ++ + + ++DL K R+ + R+ E L++ SL+ DQ
Sbjct: 1447 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1499
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1500 DLPN--YAQHTVPIFSLPQEW 1518
>gi|298204497|emb|CBI23772.3| unnamed protein product [Vitis vinifera]
Length = 1715
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N D W
Sbjct: 1493 LFLDVIFPLSLEKVIFVDADQIVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQ 1552
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G WK+H KP+ + + ++DL K R+ + R+ E L++ SL+
Sbjct: 1553 G-FWKDHLRGKPYHI------SALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNL 1603
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 1604 DQDLPN--FAQHTVPIFSLPQEW 1624
>gi|21751080|dbj|BAC03899.1| unnamed protein product [Homo sapiens]
Length = 187
Score = 42.4 bits (98), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 47/102 (46%), Gaps = 15/102 (14%)
Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
FN+GV+LL+L +R + + + E + +L L DQD F I EHP L
Sbjct: 82 FNSGVMLLNLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPKL 138
Query: 323 VYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
+ L C WN QL D+ SD E + +KI H N
Sbjct: 139 FHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 180
>gi|323453752|gb|EGB09623.1| hypothetical protein AURANDRAFT_63327 [Aureococcus anophagefferens]
Length = 2161
Score = 42.4 bits (98), Expect = 0.91, Method: Composition-based stats.
Identities = 40/155 (25%), Positives = 62/155 (40%), Gaps = 20/155 (12%)
Query: 195 GLLKLTLPKVLPET----LAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVENQS 248
++KL LP +L + L + +DTDV+F D ++W A+ +
Sbjct: 1196 AVVKLFLPSLLAKNDTAGLGAAVYVDTDVLFLDDPQRVWVDAVGALAASGAVAAAAAEHG 1255
Query: 249 DWYLGKLWKNH--------KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
D G +H P G NTGV+ L+ T+ R + + R++ L
Sbjct: 1256 DLEAGCSSTDHFYDHEERETPRAFADDGANTGVVALNFTRARAVGYEA--RLVEALLDLD 1313
Query: 301 R----LWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
R SL DQ +FN V LPC++N
Sbjct: 1314 RSGGERALSLGDQSVFNVAAGNASLDVARLPCEFN 1348
>gi|224128416|ref|XP_002320324.1| predicted protein [Populus trichocarpa]
gi|222861097|gb|EEE98639.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 42.0 bits (97), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 13/91 (14%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DIA+LWA + L R TIG E N + ++ W
Sbjct: 178 LADLLETCVKRVIYLDSDLVVVDDIAKLWA--TNLGSR-TIGAPEYCHANFTKYFTSGFW 234
Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
+ K + RG FNTGV+++DL K R
Sbjct: 235 SD-KRFSGAFRGRKPCYFNTGVMVIDLVKWR 264
>gi|300362623|ref|ZP_07058799.1| glycosyltransferase [Lactobacillus gasseri JV-V03]
gi|300353614|gb|EFJ69486.1| glycosyltransferase [Lactobacillus gasseri JV-V03]
Length = 316
Score = 42.0 bits (97), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L+ + V + S ++ L
Sbjct: 94 RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ RD + + + EK+ + DQ N
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239
>gi|313142229|ref|ZP_07804422.1| glycosyl transferase family protein [Helicobacter canadensis MIT
98-5491]
gi|313131260|gb|EFR48877.1| glycosyl transferase family protein [Helicobacter canadensis MIT
98-5491]
Length = 357
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 93/215 (43%), Gaps = 40/215 (18%)
Query: 163 KPWPALGRGFYLADSVVEDVSWIP--NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
K +P + L D + + S +P N +Y Y +L + LP ++ + + LD D+I
Sbjct: 21 KIYPCTLTIYILDDQLFREYS-MPTLNGNYLAYY---RLKIGSALPLSIKRCVYLDVDMI 76
Query: 221 FATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLT 279
D+ + LF Q + G+V E+ S + +KP G FN+G++L+DL
Sbjct: 77 VLGDLRE---LFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLD 133
Query: 280 KLRDISWAGFWR---IIAEKFLLTRLWT-SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
WR I F + + + S DQDI N ++S + V +WN+ +
Sbjct: 134 ---------LWRQENIEDRAFEIGKNYHYSFHDQDILNIVLSGKTHKV---GIEWNLMVC 181
Query: 336 --------DNTRSDELCY------TELTDLKIIHW 356
D D+L Y + L + KI+H+
Sbjct: 182 VYYRAICKDEKGRDKLPYYRKDFNSALRNPKILHY 216
>gi|241569994|ref|XP_002402644.1| hypothetical protein IscW_ISCW009544 [Ixodes scapularis]
gi|215502040|gb|EEC11534.1| hypothetical protein IscW_ISCW009544 [Ixodes scapularis]
Length = 85
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 8/78 (10%)
Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHKPWPALGRG 268
+ +D D +F + + +LW++F ++ + L D WY P+PA G
Sbjct: 9 LYVDADTLFLSPVEELWSVFEKMNESHLTALTYETEDVRTNWYQQ---HGKHPYPA-PFG 64
Query: 269 FNTGVILLDLTKLRDISW 286
N GV+ ++LT++R W
Sbjct: 65 VNAGVMPMNLTRMRSFDW 82
>gi|116628739|ref|YP_813911.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
gasseri ATCC 33323]
gi|311111532|ref|ZP_07712929.1| glycosyltransferase [Lactobacillus gasseri MV-22]
gi|116094321|gb|ABJ59473.1| Lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
gasseri ATCC 33323]
gi|311066686|gb|EFQ47026.1| glycosyltransferase [Lactobacillus gasseri MV-22]
Length = 317
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L+ + V + S ++ L
Sbjct: 95 RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 150
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ RD + + + EK+ + DQ N
Sbjct: 151 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 207
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ + KI+H+N
Sbjct: 208 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 240
>gi|145297263|ref|YP_001140104.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
salmonicida A449]
gi|418362015|ref|ZP_12962660.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
salmonicida 01-B526]
gi|142850035|gb|ABO88356.1| glycosyl transferase family 8 [Aeromonas salmonicida subsp.
salmonicida A449]
gi|356686830|gb|EHI51422.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
salmonicida 01-B526]
Length = 321
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 21/140 (15%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG-KLW 256
+ +P VL + + K + +D D+I D++ LW++ +V SD LG
Sbjct: 91 RFAIPNVL-QNIEKVLFIDADMIAVGDVSSLWSI------EMGEAVVAVVSDHILGYDKE 143
Query: 257 KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK---FLLTRLWTSLADQDIFN 313
K + + G+ FN G +L+DL K R+ + I+E+ L+ DQD N
Sbjct: 144 KQQERGISSGKYFNAGFMLMDLDKWRE-------KNISEQALGLLIDNNGFEHNDQDALN 196
Query: 314 AIISEHPYLVYTLPCQWNVQ 333
I+ + + + T +WN Q
Sbjct: 197 IILEKKTFYLDT---KWNAQ 213
>gi|297809943|ref|XP_002872855.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
lyrata]
gi|297318692|gb|EFH49114.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
L +L +++ I LD+D++ DIA+LW +L SR+ IG E N + ++ G
Sbjct: 157 LADLLEPCVSRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211
Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
W + + RG FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGAFRGRKPCYFNTGVMVIDLKKWR 243
>gi|282852386|ref|ZP_06261728.1| glycosyltransferase family 8 [Lactobacillus gasseri 224-1]
gi|282556128|gb|EFB61748.1| glycosyltransferase family 8 [Lactobacillus gasseri 224-1]
Length = 316
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L+ + V + S ++ L
Sbjct: 94 RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ RD + + + EK+ + DQ N
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239
>gi|356534929|ref|XP_003536003.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
max]
Length = 359
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 11/95 (11%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DIA+LW+ + L R TIG E N + ++ W
Sbjct: 170 LADLLEPCVERVIYLDSDLVVVDDIAKLWS--TSLGSR-TIGAPEYCHANFTKYFTAAFW 226
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWA 287
+ + A GR FNTGV+++DL + R I ++
Sbjct: 227 SDTRFARAFAGRRPCYFNTGVMVIDLVRWRRIGYS 261
>gi|449523017|ref|XP_004168521.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like
[Cucumis sativus]
Length = 1056
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + + +N D W
Sbjct: 835 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDIKGKPLAYTPFCDNNKDMDGYRFWRQ 894
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G WK H KP+ + + ++DL K R+ + R+ E L++ SL+
Sbjct: 895 G-FWKEHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVFYES--LSKDPNSLSNL 945
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 946 DQDLPN--YAQHTVPIFSLPQEW 966
>gi|358332612|dbj|GAA51246.1| glycosyltransferase-like protein LARGE2 [Clonorchis sinensis]
Length = 156
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 32/59 (54%)
Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
+ L+++R+ W + A+ L + +Q + N II E P L Y +PC+W+VQL
Sbjct: 1 MQLSRMREAGWWDMFIEEAKNILSVYRALPIGEQSVLNNIILERPELYYRVPCEWHVQL 59
>gi|238852953|ref|ZP_04643352.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
gasseri 202-4]
gi|238834403|gb|EEQ26641.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
gasseri 202-4]
Length = 316
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
+L +P + P+ K + LD D I TDIA+L+ + V + S ++ L
Sbjct: 94 RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149
Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
K + + N GVIL ++ RD + + + EK+ + DQ N
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206
Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
I + +Y LP +W+ +++ E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239
>gi|357486289|ref|XP_003613432.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355514767|gb|AES96390.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
Length = 395
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 50/95 (52%), Gaps = 18/95 (18%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DI++LW++ ++ ++ IG E N + ++ + W
Sbjct: 173 LGDMLDTCVERVIYLDSDIVVVDDISKLWSV--KMDAKKVIGAPEYCHANFTKYFTDEFW 230
Query: 257 KNHKPWPALGRG--------FNTGVILLDLTKLRD 283
+ P L R FNTGV+++DL K R+
Sbjct: 231 ND----PLLSRVFKARKACYFNTGVMVMDLMKWRE 261
>gi|404369533|ref|ZP_10974867.1| hypothetical protein CSBG_03347 [Clostridium sp. 7_2_43FAA]
gi|226914520|gb|EEH99721.1| hypothetical protein CSBG_03347 [Clostridium sp. 7_2_43FAA]
Length = 273
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 25/125 (20%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWY 251
+L P +LP+++ K + LD D++ I LW +F+ G++ +D
Sbjct: 90 RLLAPLILPKSIKKILYLDPDILIINSIRPLWETELGNYIFAAASHVGVTGVI---NDIN 146
Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS---LAD 308
+L +H +N+GV+L+DLTK R I + E F R L D
Sbjct: 147 RVRLRVDHD-------YYNSGVMLMDLTKARSIV------NVEEIFQCVREHKEELLLPD 193
Query: 309 QDIFN 313
QDIFN
Sbjct: 194 QDIFN 198
>gi|303273630|ref|XP_003056175.1| glycosyltransferase family 24 protein [Micromonas pusilla CCMP1545]
gi|226462259|gb|EEH59551.1| glycosyltransferase family 24 protein [Micromonas pusilla CCMP1545]
Length = 1657
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 66/137 (48%), Gaps = 10/137 (7%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
L L + P TL K I +D D + +++ +LW + R +N + + WK
Sbjct: 1452 LFLDVLFPLTLNKVIFVDADQVVRSNLKELWEMDLRGAPYAYTPFCDNNPEMEGYRFWK- 1510
Query: 259 HKPWPA--LGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQDIFN 313
H W G+ ++ + + ++DL R + R+I E L++ +SLA DQD+ N
Sbjct: 1511 HGFWQTHLAGKPYHISALYVVDLETFRHTAAGDKLRLIYET--LSKDPSSLANLDQDLPN 1568
Query: 314 AIISEHPYLVYTLPCQW 330
++H ++TLP QW
Sbjct: 1569 --YAQHQVPIFTLPQQW 1583
>gi|301309800|ref|ZP_07215739.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
protein [Bacteroides sp. 20_3]
gi|423340274|ref|ZP_17318013.1| hypothetical protein HMPREF1059_03938 [Parabacteroides distasonis
CL09T03C24]
gi|300831374|gb|EFK62005.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
protein [Bacteroides sp. 20_3]
gi|409227709|gb|EKN20605.1| hypothetical protein HMPREF1059_03938 [Parabacteroides distasonis
CL09T03C24]
Length = 301
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 78/163 (47%), Gaps = 15/163 (9%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY D V + P + + +L + +LP+++ K + LD DV+ + LW
Sbjct: 61 FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
+ G++ + S + +++ + P+LG FN GV+L++L R+ + + F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173
Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
II + R DQD+ N ++ E + TLP ++NVQ
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPLKYNVQ 209
>gi|413939391|gb|AFW73942.1| hypothetical protein ZEAMMB73_654737 [Zea mays]
Length = 388
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 14/139 (10%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N D W
Sbjct: 201 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGFRFWKQ 260
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
G WK+H GR ++ + + ++DL K R + R+ E+ S DQD+
Sbjct: 261 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQLSKDPNSLSNLDQDL 315
Query: 312 FNAIISEHPYLVYTLPCQW 330
N ++H +++LP +W
Sbjct: 316 PN--YAQHTVPIFSLPQEW 332
>gi|321464173|gb|EFX75183.1| hypothetical protein DAPPUDRAFT_306884 [Daphnia pulex]
Length = 498
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 61/165 (36%), Gaps = 49/165 (29%)
Query: 496 LFKHWEGPISLTLYM--SDAE-AQQFLSYAGNSEALRSRRNIGYHVVYKEGNF------- 545
L + W+GP+S+ +Y SD E A + ++Y R+ + +H+V E +F
Sbjct: 180 LTRRWDGPLSVAVYTPGSDYESAIRSMAYLRQCTGPDIRQRVSFHLVLDERHFPTVLRKL 239
Query: 546 --------------------------------------YPINTLRNVALNQVSTPYVFLL 567
YP+N LRNVA T Y+
Sbjct: 240 HSQQGPPTTRSPLQTSAVNGLSTATPETQDNKRTMNLTYPVNLLRNVARQNAPTHYILAS 299
Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGK-KVLVVPAFETQRYRT 611
D + P + +R + H G+ KV V+P FE R +T
Sbjct: 300 DAELYPSLDMVRLFFDMLRRNESVLHLGRPKVFVLPIFEVYRNQT 344
>gi|449457101|ref|XP_004146287.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Cucumis
sativus]
Length = 1575
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I TD+ +L+ + + + +N D W
Sbjct: 1359 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDIKGKPLAYTPFCDNNKDMDGYRFWRQ 1418
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G WK H KP+ + + ++DL K R+ + R+ E L++ SL+
Sbjct: 1419 G-FWKEHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVFYES--LSKDPNSLSNL 1469
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 1470 DQDLPN--YAQHTVPIFSLPQEW 1490
>gi|390603232|gb|EIN12624.1| hypothetical protein PUNSTDRAFT_59955 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 332
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 7/109 (6%)
Query: 44 PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVS------WIPNKHYSGVY 97
P FH+I A+ ++ + +S P V +D+S + H +G+
Sbjct: 6 PTVFHIICSQDAIPTIEEKLALFSRPAYPVRATFYPLTQDDISRRAARAGVGTSHEAGMG 65
Query: 98 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
GL+K + +VL + + I +DTD++F D A LW F + + Q +
Sbjct: 66 GLVKTFIHEVL-HDVERIIFVDTDMLFLVDPALLWREFDAMDENQMVAF 113
>gi|15217851|ref|NP_171772.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|42571307|ref|NP_973744.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|75172933|sp|Q9FWY9.1|GATL5_ARATH RecName: Full=Probable galacturonosyltransferase-like 5
gi|9972380|gb|AAG10630.1|AC022521_8 Unknown protein [Arabidopsis thaliana]
gi|25083419|gb|AAN72073.1| Unknown protein [Arabidopsis thaliana]
gi|26452192|dbj|BAC43184.1| unknown protein [Arabidopsis thaliana]
gi|31711862|gb|AAP68287.1| At1g02720 [Arabidopsis thaliana]
gi|332189341|gb|AEE27462.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|332189342|gb|AEE27463.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
Length = 361
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 13/91 (14%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DI +LW + L QR TIG E N + ++ G W
Sbjct: 172 LADLLEPCVKRVIYLDSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFW 228
Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
+ K + +G FNTGV+++DL K R
Sbjct: 229 SD-KRFNGTFKGRNPCYFNTGVMVIDLKKWR 258
Score = 38.9 bits (89), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 11/169 (6%)
Query: 2 SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNIL 59
SS E+ + ++ VAI + R + + SIL + P + FH + + + N+
Sbjct: 68 SSGEDSGVCNPNLVHVAIT-LDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSESQNLE 126
Query: 60 QTLFSTW-SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 118
+ ST+ + +++ ++ ++V +S + + L +L + + I L
Sbjct: 127 SLIRSTFPKLTNLKIYYFAPETVQSLISSSVRQALEQPLNYARNYLADLLEPCVKRVIYL 186
Query: 119 DTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLWKNHK 163
D+D++ DI +LW + L QR TIG E N + ++ G W + +
Sbjct: 187 DSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFWSDKR 232
>gi|357473531|ref|XP_003607050.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|357496195|ref|XP_003618386.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355493401|gb|AES74604.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355508105|gb|AES89247.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|388506006|gb|AFK41069.1| unknown [Medicago truncatula]
Length = 371
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 14/94 (14%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L E + + I LD+DVI DI LW + L + IG E N + ++ + W
Sbjct: 172 LADLLEECVERVIYLDSDVIVVDDIQDLWKV--SLTDSKVIGAPEYCHANFTRYFSYEFW 229
Query: 257 KNHKPWPALGRG-------FNTGVILLDLTKLRD 283
+++ + + +G FNTGV+++DL K R+
Sbjct: 230 SSYE-FSEVFKGRKNRPCYFNTGVMVMDLMKWRE 262
>gi|225437483|ref|XP_002274175.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Vitis
vinifera]
Length = 450
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 11/94 (11%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D+I DI +LW+ + L R TIG E N + ++ K W
Sbjct: 261 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 317
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISW 286
+ + GR FNTGVI++DL K R +
Sbjct: 318 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGF 351
>gi|7362746|emb|CAB83116.1| putative protein [Arabidopsis thaliana]
Length = 357
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 32/178 (17%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D+I DIA+LW ++L + TIG E N + ++ W
Sbjct: 167 LADLLEPCVRRVIYLDSDLIVVDDIAKLW--MTKLGSK-TIGAPEYCHANFTKYFTPAFW 223
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ + A GR FNTGV+++DL + R + G+ +I + W + D
Sbjct: 224 SDERFSGAFSGRKPCYFNTGVMVMDLERWRRV---GYTEVIEK-------WMEIQKSDRI 273
Query: 313 NAIISEHPYL------VYTLPCQWNVQ--LSDNTRSDELCYT-ELTDLKIIHWNSPKK 361
+ S P+L V + +WN DN R C + ++HW+ K
Sbjct: 274 YELGSLPPFLLVFAGEVAPIEHRWNQHGLGGDNVRGS--CRDLHPGPVSLLHWSGSGK 329
>gi|297843072|ref|XP_002889417.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
lyrata]
gi|297335259|gb|EFH65676.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 13/91 (14%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ DI +LW + L QR TIG E N + ++ G W
Sbjct: 172 LADLLEPCVKRVIYLDSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFW 228
Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
+ K + +G FNTGV+++DL K R
Sbjct: 229 SD-KRFNGTFKGRNPCYFNTGVMVIDLKKWR 258
>gi|147771376|emb|CAN62993.1| hypothetical protein VITISV_021618 [Vitis vinifera]
Length = 367
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 11/90 (12%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D+I DI +LW+ + L R TIG E N + ++ K W
Sbjct: 178 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 234
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLR 282
+ + GR FNTGVI++DL K R
Sbjct: 235 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWR 264
>gi|339007385|ref|ZP_08639960.1| sporulation kinase E [Brevibacillus laterosporus LMG 15441]
gi|338776594|gb|EGP36122.1| sporulation kinase E [Brevibacillus laterosporus LMG 15441]
Length = 599
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 98/228 (42%), Gaps = 39/228 (17%)
Query: 359 PKKLKVKNK--HMEFFRNLYLTFLEYDGNLLRRELFGCNL-----TQTTLQQAE-LSNLN 410
P +L+V+N H F RN + L DGN+L L G N+ T+ LQQ+E LS +
Sbjct: 326 PIELRVRNHDGHFLFIRNSCIPMLSKDGNILGFALIGENIHHQKQTENLLQQSEKLSLIG 385
Query: 411 E--EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVD 468
E +++ R+ LT+LR L+ E S+ D+ L +DR+ + N
Sbjct: 386 ELAAGIAHEI-RNPLTALRGFTQLLQTHPEDSNKYTDIMLK---ELDRIHFIA----NEL 437
Query: 469 EYEASSDG-----NDVTLVAQLSMDRLQMVEMLFK-----HWEGPISLTLYMSDAEAQQF 518
A G ND+ + Q + L+ +L HWE I L Q F
Sbjct: 438 LLLAKPQGYQLAPNDICSLLQDVITLLESQAILANVTISTHWEPDIPSLLCEPRQLKQVF 497
Query: 519 LSYAGNS-EALRSRRNIG-------YHVVYK---EGNFYPINTLRNVA 555
++ NS EA+ + I +HVV + EG P++TL V
Sbjct: 498 INIIKNSVEAMPAGGCIHIEVTKLEHHVVIRFSDEGCGIPLSTLEKVG 545
>gi|356528410|ref|XP_003532796.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Glycine
max]
Length = 1676
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D + D+ L+ + R + +N + W
Sbjct: 1455 LFLDVIFPLSLEKVIFVDADQVVRADMGVLYDMDIRGKPLAYTPFCDNNKEMDGYRFWRQ 1514
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G WK+H KP+ + + ++DL K R+ + R+I E L++ SLA
Sbjct: 1515 G-FWKDHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVIYET--LSKDPNSLANL 1565
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 1566 DQDLPN--YAQHTVPIFSLPQEW 1586
>gi|159487158|ref|XP_001701602.1| predicted protein [Chlamydomonas reinhardtii]
gi|158271543|gb|EDO97360.1| predicted protein [Chlamydomonas reinhardtii]
Length = 377
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
+ H PT+++ W + PY ++W +EP++ R + D RF G+G NK
Sbjct: 236 SGHRPTDYTAWFRSAEPYPVQWQERYEPWVFVDRLGSSWADARFRGYGKNK 286
>gi|20258800|gb|AAM13982.1| putative glycosyl transferase [Arabidopsis thaliana]
Length = 313
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
L +L + + I LD+D++ DIA+LW +L SR+ IG E N + ++ G
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211
Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
W + + RG FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243
>gi|20089086|ref|NP_615161.1| hypothetical protein MA0188 [Methanosarcina acetivorans C2A]
gi|19913948|gb|AAM03641.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans
C2A]
Length = 914
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 20/128 (15%)
Query: 93 YSGVYGLLKLTLPKVLPETLAK-------TIVLDTDVIFATDIAQ-----LWALFSRLRQ 140
Y GV G+ K +L K LP L K T V+ T + FAT+ + L L S+LRQ
Sbjct: 43 YYGVAGIGKTSLRKELPALLEKHNESDLHTRVIWTSIDFATEQYKQPYKFLEVLSSQLRQ 102
Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDV-----SWIPNKHYSGVYG 195
+ + N D L WK P L + Y S+V D+ ++P VY
Sbjct: 103 KYNVKF--NSFDIALATYWKKINPHNPLVKENYSEGSIVHDILDVCDEFVPANLIPNVYN 160
Query: 196 LLKLTLPK 203
L K LPK
Sbjct: 161 LTK-NLPK 167
>gi|366165525|ref|ZP_09465280.1| putative glycosyl transferase (general stress protein) [Acetivibrio
cellulolyticus CD2]
Length = 482
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 78/180 (43%), Gaps = 16/180 (8%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y G ++ + ++ ++ K I LD D+I DIA+LW V ++D
Sbjct: 81 YFGQVTFFRIFVTELFDPSVEKIIFLDCDMIIKGDIAELWETDVSGYYAAAGEDVGIEND 140
Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-RLWTSLAD 308
G K + FN GV+++++T R+ + G +LLT R D
Sbjct: 141 GLFGTQHKRSLGIKRKSKYFNAGVMVINMTMWRNHNIPG----QTSDYLLTHRNEIKFPD 196
Query: 309 QDIFNAIISEHPYLVYTLPCQWN----VQLSDNTR---SDELCYTELTDLKIIHWNSPKK 361
QD NA++ + L++ +WN +QL + D+L + + IIH++ P K
Sbjct: 197 QDALNAVLCDKWKLLHP---KWNQVATLQLFYKKKWVIRDDLL-EAVHNPAIIHYSEPSK 252
>gi|431918394|gb|ELK17619.1| hypothetical protein PAL_GLEAN10007398 [Pteropus alecto]
Length = 148
Score = 40.8 bits (94), Expect = 2.2, Method: Composition-based stats.
Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 9/72 (12%)
Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
FN+GV+LL L +R + + + E + +L L DQD F I EHP L
Sbjct: 43 FNSGVMLLKLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPEL 99
Query: 323 VYTLPCQWNVQL 334
+ L C WN QL
Sbjct: 100 FHVLDCTWNRQL 111
>gi|222623395|gb|EEE57527.1| hypothetical protein OsJ_07839 [Oryza sativa Japonica Group]
Length = 1597
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N + W
Sbjct: 1381 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1440
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R + R+ E L++ SL+ DQ
Sbjct: 1441 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1493
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1494 DLPN--YAQHTVPIFSLPQEW 1512
>gi|22331906|ref|NP_191825.2| putative galacturonosyltransferase-like 7 [Arabidopsis thaliana]
gi|75161472|sp|Q8VYF4.1|GATL7_ARATH RecName: Full=Probable galacturonosyltransferase-like 7
gi|18175835|gb|AAL59936.1| unknown protein [Arabidopsis thaliana]
gi|20465549|gb|AAM20257.1| unknown protein [Arabidopsis thaliana]
gi|23397213|gb|AAN31889.1| unknown protein [Arabidopsis thaliana]
gi|332646856|gb|AEE80377.1| putative galacturonosyltransferase-like 7 [Arabidopsis thaliana]
Length = 361
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 32/178 (17%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D+I DIA+LW ++L + TIG E N + ++ W
Sbjct: 171 LADLLEPCVRRVIYLDSDLIVVDDIAKLW--MTKLGSK-TIGAPEYCHANFTKYFTPAFW 227
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ + A GR FNTGV+++DL + R + G+ +I + W + D
Sbjct: 228 SDERFSGAFSGRKPCYFNTGVMVMDLERWRRV---GYTEVIEK-------WMEIQKSDRI 277
Query: 313 NAIISEHPYL------VYTLPCQWNVQ--LSDNTRSDELCYT-ELTDLKIIHWNSPKK 361
+ S P+L V + +WN DN R C + ++HW+ K
Sbjct: 278 YELGSLPPFLLVFAGEVAPIEHRWNQHGLGGDNVRGS--CRDLHPGPVSLLHWSGSGK 333
>gi|297743944|emb|CBI36914.3| unnamed protein product [Vitis vinifera]
Length = 364
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 11/90 (12%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D+I DI +LW+ + L R TIG E N + ++ K W
Sbjct: 151 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 207
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLR 282
+ + GR FNTGVI++DL K R
Sbjct: 208 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWR 237
>gi|218191315|gb|EEC73742.1| hypothetical protein OsI_08377 [Oryza sativa Indica Group]
Length = 1673
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N + W
Sbjct: 1457 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1516
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R + R+ E L++ SL+ DQ
Sbjct: 1517 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1569
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1570 DLPN--YAQHTVPIFSLPQEW 1588
>gi|50251355|dbj|BAD28382.1| putative UDP-glucose:glycoprotein glucosyltransferase [Oryza sativa
Japonica Group]
Length = 1626
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N + W
Sbjct: 1410 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1469
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
G WK+H GR ++ + + ++DL K R + R+ E L++ SL+ DQ
Sbjct: 1470 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1522
Query: 310 DIFNAIISEHPYLVYTLPCQW 330
D+ N ++H +++LP +W
Sbjct: 1523 DLPN--YAQHTVPIFSLPQEW 1541
>gi|359405620|ref|ZP_09198369.1| glycosyltransferase, family 8 [Prevotella stercorea DSM 18206]
gi|357557990|gb|EHJ39504.1| glycosyltransferase, family 8 [Prevotella stercorea DSM 18206]
Length = 308
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 84/203 (41%), Gaps = 45/203 (22%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQL--WALFSRLRQRQTIGLVENQSDWYLGK 254
KL +P++LP T + + LD D+I DI L W L Q + I E+ D K
Sbjct: 85 FKLLIPQLLPATEKRCLFLDVDMIVYDDITSLYHWNL-----QDKVIAAAEDMPDCITFK 139
Query: 255 LWKNHKPWPALGR-----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
P LG N+GV++ DL K WR++ ++ + T++
Sbjct: 140 --------PRLGLKDTDFCINSGVMVCDLEK---------WRLMEQEQPIMAYATNIISI 182
Query: 308 ---DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWN 357
+QD+ + + LP +WN+ R ++ L +L+ IIH+
Sbjct: 183 IVNEQDVLACYFKDK---IALLPIRWNMTTFYFNRKPKIFSKYLPELEAAKRFPGIIHFA 239
Query: 358 SPKKLKVKNKHMEFFRNLYLTFL 380
+P K K+ +R LY +L
Sbjct: 240 APIKPWFKDCQHP-YRKLYKKYL 261
>gi|15235247|ref|NP_192122.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|30679061|ref|NP_849285.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|79324977|ref|NP_001031573.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|75317781|sp|O04253.1|GATL6_ARATH RecName: Full=Probable galacturonosyltransferase-like 6; AltName:
Full=Like glycosyl transferase 10
gi|2104536|gb|AAC78704.1| predicted glycosyl transferase [Arabidopsis thaliana]
gi|7268597|emb|CAB80706.1| predicted glycosyl transferase [Arabidopsis thaliana]
gi|24030376|gb|AAN41350.1| putative glycosyl transferase [Arabidopsis thaliana]
gi|26453088|dbj|BAC43620.1| putative glycosyl transferase [Arabidopsis thaliana]
gi|110738563|dbj|BAF01207.1| glycosyl transferase like protein [Arabidopsis thaliana]
gi|332656727|gb|AEE82127.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|332656728|gb|AEE82128.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|332656729|gb|AEE82129.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
Length = 346
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
L +L + + I LD+D++ DIA+LW +L SR+ IG E N + ++ G
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211
Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
W + + RG FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243
>gi|255566847|ref|XP_002524407.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223536368|gb|EEF38018.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 344
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 59/117 (50%), Gaps = 16/117 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+ L +LP ++ + I D+D+I D+A+LW + + +G E N ++++
Sbjct: 147 RFYLADLLPASVPRIIYFDSDLIVVDDVAKLWNINLGV---HVLGAPEYCHANFTNYFNS 203
Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWA---GFWRIIAEKFLLTRL 302
+ W N + + A R FNTGV+++DL K R+ + +W + +K+ + L
Sbjct: 204 RFWSN-EGYAASFRERRACYFNTGVMVIDLMKWREGKYREKLEYWMKVQKKYRIYEL 259
>gi|428175077|gb|EKX43969.1| hypothetical protein GUITHDRAFT_140108 [Guillardia theta CCMP2712]
Length = 919
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 64/175 (36%), Gaps = 60/175 (34%)
Query: 547 PINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET 606
P N LR +A + + V D+D L L +++ + R G G ++LV+P+F +
Sbjct: 544 PTNMLRKIACSIAKSNMVLNADVDMLASDELADHIRKA-RGDGWMGRG--ELLVIPSFRS 600
Query: 607 -----QRYRTAF--------PAS------------------------------------- 616
+ +T F PA+
Sbjct: 601 AGSWPRAVKTHFDSHQLTVKPAAITMEDLKAALLRCELLLPLLGCGMWSPSTRWSENAVF 660
Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-------DLPRYDTRFVGFGWNK 664
HAPT + RW+ A Y++++ +EP V R YD RF GW+K
Sbjct: 661 HAPTEYGRWMEAEEVYEVDYLLGYEPSFVVDRRAWVGGQGAGMYDDRFEQGGWDK 715
>gi|21554300|gb|AAM63375.1| putative glycosyl transferase [Arabidopsis thaliana]
Length = 346
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
L +L + + I LD+D++ DIA+LW +L SR+ IG E N + ++ G
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCYANFTKYFTGG 211
Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
W + + RG FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243
>gi|424665842|ref|ZP_18102878.1| hypothetical protein HMPREF1205_01717 [Bacteroides fragilis HMW
616]
gi|404574095|gb|EKA78846.1| hypothetical protein HMPREF1205_01717 [Bacteroides fragilis HMW
616]
Length = 304
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+ +LPE + K + LD D++ DI++ W + ++ VE N+ + Y
Sbjct: 89 RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 143
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ K K + FN GV+L++L R+ + FL DQD+ N
Sbjct: 144 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 196
Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
A++ E V P +WN+Q + + + +L I+H+ + K +
Sbjct: 197 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 253
Query: 367 KH---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 254 MHPLKGEYFKYLDMT 268
>gi|356511033|ref|XP_003524236.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Glycine
max]
Length = 1647
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 22/143 (15%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D + D+ L+ + R + +N + W
Sbjct: 1426 LFLDVIFPLSLEKVIFVDADQVVRADMGVLYDMDIRGKPLAYTPFCDNNKEMDGYRFWRQ 1485
Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
G W +H KP+ + + ++DL K R+ + R+I E L+R SLA
Sbjct: 1486 G-FWNDHLQGKPYHI------SALYVVDLKKFRETAAGDNLRVIYET--LSRDPNSLANL 1536
Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
DQD+ N ++H +++LP +W
Sbjct: 1537 DQDLPN--YAQHTVPIFSLPQEW 1557
>gi|242041219|ref|XP_002468004.1| hypothetical protein SORBIDRAFT_01g037870 [Sorghum bicolor]
gi|241921858|gb|EER95002.1| hypothetical protein SORBIDRAFT_01g037870 [Sorghum bicolor]
Length = 371
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 50/94 (53%), Gaps = 11/94 (11%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + + I LD+D++ D+A+LW + L R T+G E N + ++ G+ W
Sbjct: 175 LADLLEPCVRRVIYLDSDLVLVDDVAKLWR--TDLGGR-TVGAPEYCHANFTKYFTGRFW 231
Query: 257 KNHK-PWPALGRG---FNTGVILLDLTKLRDISW 286
+ + +GR FNTGV++LDL + R +
Sbjct: 232 SDQRFAGTFVGRRPCYFNTGVMVLDLERWRQAGY 265
>gi|423281223|ref|ZP_17260134.1| hypothetical protein HMPREF1203_04351 [Bacteroides fragilis HMW
610]
gi|404583387|gb|EKA88068.1| hypothetical protein HMPREF1203_04351 [Bacteroides fragilis HMW
610]
Length = 305
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+ +LPE + K + LD D++ DI++ W + ++ VE N+ + Y
Sbjct: 89 RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 143
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ K K + FN GV+L++L R+ + FL DQD+ N
Sbjct: 144 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 196
Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
A++ E V P +WN+Q + + + +L I+H+ + K +
Sbjct: 197 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 253
Query: 367 KH---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 254 MHPLKGEYFKYLDMT 268
>gi|219363085|ref|NP_001137113.1| uncharacterized protein LOC100217291 precursor [Zea mays]
gi|194698410|gb|ACF83289.1| unknown [Zea mays]
gi|413952525|gb|AFW85174.1| hypothetical protein ZEAMMB73_350653 [Zea mays]
Length = 372
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 73/183 (39%), Gaps = 38/183 (20%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +LP + + I LD+DV+ A D+ +LW +RL + E N S ++ W
Sbjct: 159 LADLLPRCVPRAIYLDSDVLAADDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 216
Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS 305
+ P LG FNTGV+++DL + R AG +R E+ W
Sbjct: 217 SD----PVLGARVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRQRIER------WME 262
Query: 306 LADQDIFNAIISEHPYL------VYTLPCQWNVQ-LSDNTRSDELCYTELTDLKIIHWNS 358
+ Q + S P+L V + +WN L N + ++HW+
Sbjct: 263 IQKQKRIYELGSLPPFLLVFAGEVEAVDHRWNQHGLGGNNVHGSCRPLHAGPVSLMHWSG 322
Query: 359 PKK 361
K
Sbjct: 323 KGK 325
>gi|298480796|ref|ZP_06998991.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Bacteroides sp. D22]
gi|298272819|gb|EFI14385.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Bacteroides sp. D22]
Length = 312
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 73/163 (44%), Gaps = 15/163 (9%)
Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
FY DS + + I + +Y +L L +L +++ + + LD+D+I I LW
Sbjct: 62 FYEVDSNILNNLPISTRFRKSIY--YRLLLDNILDKSIERILYLDSDIIVRDSIEALW-- 117
Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
+ +G V +QS + N P L FN+GV+L+++ K R + R
Sbjct: 118 -NENIDEYVLGAVLDQSCDDIRNF--NRTKLPYLSDYFNSGVLLININKWRAFNIGK--R 172
Query: 292 IIAEKFLLTRLWTSL-ADQDIFNAIISEHPYLVYTLPCQWNVQ 333
I K++ + L DQD N I S + LP +NVQ
Sbjct: 173 CI--KYISENPESCLYPDQDALNVITSNSHKI---LPLCFNVQ 210
>gi|443710200|gb|ELU04498.1| hypothetical protein CAPTEDRAFT_192237 [Capitella teleta]
Length = 221
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
DVTL+ L+ DR++ + L ++W GPIS +Y+ + + N +SR NI H
Sbjct: 138 DVTLMVHLTSDRIEKLIRLTENWSGPISAAIYVDNITDSLMIE---NMCHFQSRPNIYIH 194
Query: 538 VVYKEG 543
+++K G
Sbjct: 195 ILHKVG 200
>gi|300176891|emb|CBK25460.2| unnamed protein product [Blastocystis hominis]
Length = 234
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 8/72 (11%)
Query: 543 GNFYPINTLRNVALNQVSTPYVFLLDIDFLPM-------FGLYPYLKSSIRSMDMH-GHG 594
GN + IN LRN+AL++ ST +VF+ D D LP F L+ + S+ H
Sbjct: 147 GNSFRINALRNLALSKASTTHVFIADADILPACTQNESNFDFTDNLRDTFLSLPFHLLQD 206
Query: 595 GKKVLVVPAFET 606
+ +VP F+
Sbjct: 207 SAQAFIVPLFDV 218
>gi|297850896|ref|XP_002893329.1| hypothetical protein ARALYDRAFT_472678 [Arabidopsis lyrata subsp.
lyrata]
gi|297339171|gb|EFH69588.1| hypothetical protein ARALYDRAFT_472678 [Arabidopsis lyrata subsp.
lyrata]
Length = 389
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 12/92 (13%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L ++ + I LD+DVI DI +LW + L + IG E N + ++ W
Sbjct: 175 LGDILDRSVERVIYLDSDVITVDDITKLWN--TVLTGSRVIGAPEYCHANFTQYFTSGFW 232
Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLRD 283
+ P L G FNTGV+++DL + R+
Sbjct: 233 SD-PALPGLISGQKPCYFNTGVMVMDLVRWRE 263
>gi|313147947|ref|ZP_07810140.1| glycosyl transferase [Bacteroides fragilis 3_1_12]
gi|313136714|gb|EFR54074.1| glycosyl transferase [Bacteroides fragilis 3_1_12]
Length = 311
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+ +LPE + K + LD D++ DI++ W + ++ VE N+ + Y
Sbjct: 96 RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 150
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+ K K + FN GV+L++L R+ + FL DQD+ N
Sbjct: 151 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 203
Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
A++ E V P +WN+Q + + + +L I+H+ + K +
Sbjct: 204 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 260
Query: 367 KH---MEFFRNLYLT 378
H E+F+ L +T
Sbjct: 261 MHPLKGEYFKYLDMT 275
>gi|326803152|ref|YP_004320970.1| glycosyltransferase family protein [Aerococcus urinae
ACS-120-V-Col10a]
gi|326650610|gb|AEA00793.1| glycosyltransferase, family 8 [Aerococcus urinae ACS-120-V-Col10a]
Length = 274
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 59/132 (44%), Gaps = 22/132 (16%)
Query: 205 LPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWYLGKLWKN 258
LP+ L K + LD D++ D++ L+ L+ + + + + LG +
Sbjct: 91 LPDQLEKILYLDADILCINDVSPLYETELGNNLYGAASHAKLTKITDQINKLRLGNSQAD 150
Query: 259 HKPWPALGRGFNTGVILLDLTKLRD-ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
H +N+GV+L++L +R + + I E R L DQD+ NA+ S
Sbjct: 151 HY--------YNSGVLLMNLKAIRQKVKAEDIFDFIHE----NRYQLILPDQDVLNALYS 198
Query: 318 EHPYLVYTLPCQ 329
++ +Y +P Q
Sbjct: 199 QY---IYDVPDQ 207
>gi|357464109|ref|XP_003602336.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355491384|gb|AES72587.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
Length = 343
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 49/95 (51%), Gaps = 13/95 (13%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
++ L ++P T + I D+D+I D+A+LW++ +G E N + ++
Sbjct: 144 RMYLADLVPATAQRIIYFDSDLIVVDDVAKLWSI---DLGNHVLGAPEYCHANFTTYFTH 200
Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRD 283
+ W N + A +G FNTGV+++DL K R+
Sbjct: 201 RFWSN-PSYSASFKGREACYFNTGVMVIDLWKWRE 234
>gi|308807441|ref|XP_003081031.1| Predicted glycosyltransferase (ISS) [Ostreococcus tauri]
gi|116059493|emb|CAL55200.1| Predicted glycosyltransferase (ISS) [Ostreococcus tauri]
Length = 476
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 8/59 (13%)
Query: 613 FPASHAPTNFSRWVNATT--PYQIEWAPD---FEPYIVAHRD--LPRYDTRFVGFGWNK 664
FP +HAP+N+++W + T PY++ +EPY++ D LP +D FV +G+NK
Sbjct: 260 FPRAHAPSNYNQWFSNVTCEPYRVAAHKHPWYYEPYVILRADMALP-FDESFVTYGFNK 317
>gi|291222851|ref|XP_002731428.1| PREDICTED: glycosyltransferase 8 domain-containing protein 4-like
[Saccoglossus kowalevskii]
Length = 238
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 63/131 (48%), Gaps = 14/131 (10%)
Query: 15 IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEV 73
I +++V G + +L+ L ++LF R PLHFH+ + +I + W S + ++
Sbjct: 66 IHLSVVACGDRANETLIMLKSAVLFTRA-PLHFHIFAEDELHSIFRNRLELWPSKYRAKI 124
Query: 74 SFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATD 127
+ + +++ P ++ L K L LP VL + + + +DTDV+F
Sbjct: 125 KY-----DIYSITFPPGENAMEWKKLFKPCASQRLFLPDVLTD-VDSILYVDTDVLFLFP 178
Query: 128 IAQLWALFSRL 138
+ ++W F++
Sbjct: 179 LDEIWEFFNKF 189
>gi|189465050|ref|ZP_03013835.1| hypothetical protein BACINT_01394 [Bacteroides intestinalis DSM
17393]
gi|189437324|gb|EDV06309.1| general stress protein A [Bacteroides intestinalis DSM 17393]
Length = 301
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 75/167 (44%), Gaps = 23/167 (13%)
Query: 172 FYLAD-SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
FY D S+++D H S + L+L + +++P + K + LD D+I I +LW
Sbjct: 63 FYDVDYSLIKDFPIRKQDHLS-LAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELW- 120
Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKP----WPALGRGFNTGVILLDLTKLRDISW 286
+ + VE +S + P +P FN+GV+L++L K R+ +
Sbjct: 121 --EKNIDNIAVAAVEERSP------FDTESPVTLKYPVEYSYFNSGVMLINLQKWREKKF 172
Query: 287 AGFWR-IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
+ IA + + L DQD+ NA++ + + +WN+
Sbjct: 173 VEACKSYIASNYENIK----LHDQDVLNALLYKEKQFI---SIRWNL 212
>gi|449469050|ref|XP_004152234.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
gi|449515897|ref|XP_004164984.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
Length = 367
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +L + K I LD+D+I DI +LW + L + TIG E N S ++ + W
Sbjct: 178 LAGLLESCVKKVIYLDSDLIVVDDIRKLWT--TNLGE-WTIGAPEYCHANFSKYFTTRFW 234
Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRD 283
+ + + GR FNTGV+++DL K R+
Sbjct: 235 SDERFFGTFAGRKPCYFNTGVMVIDLVKWRN 265
>gi|307108997|gb|EFN57236.1| expressed protein, partial [Chlorella variabilis]
Length = 429
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
T+ SRW + Y+I + FEPY++ R +P YD RF G+ NK
Sbjct: 340 TDHSRWFETSISYRIPYEEGFEPYVLVQRRFVPWYDERFKGYRKNK 385
>gi|52076753|dbj|BAD45664.1| putative Avr9/Cf-9 rapidly elicited protein 231 [Oryza sativa
Japonica Group]
gi|125554743|gb|EAZ00349.1| hypothetical protein OsI_22365 [Oryza sativa Indica Group]
Length = 371
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 25/113 (22%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +LP + + I LD+DV+ D+ +LW +RL + E N S ++ W
Sbjct: 150 LADLLPRCVPRAIYLDSDVLAVDDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 207
Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
+ P LGR FNTGV+++DL + R AG +R E+++
Sbjct: 208 SD----PGLGRRVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRHRIERWM 252
>gi|226324415|ref|ZP_03799933.1| hypothetical protein COPCOM_02196 [Coprococcus comes ATCC 27758]
gi|225206863|gb|EEG89217.1| glycosyltransferase, family 8 [Coprococcus comes ATCC 27758]
Length = 348
Score = 40.0 bits (92), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 65/150 (43%), Gaps = 26/150 (17%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGL---VENQ 247
+L +P++L + K + +D+D++ DIA+L+A L + + T GL E
Sbjct: 99 FRLLMPQILAD-YDKAVYIDSDLVVNADIAELYATDVDGYLLAAAKDADTAGLYNGFEPN 157
Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAE--KFLLTRLWTS 305
Y+ + K KP+ F GVI+ +L + R AE KF + W
Sbjct: 158 KKKYMDTILKIKKPY----EYFQAGVIVFNLAEFRKTY------TTAEMLKFAASYEW-E 206
Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
L DQD+ N + V + WNV +
Sbjct: 207 LLDQDVLNYLAQGR---VKFVDMAWNVMVD 233
>gi|366165534|ref|ZP_09465289.1| putative glycosyl transferase (general stress protein) [Acetivibrio
cellulolyticus CD2]
Length = 491
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 44/181 (24%), Positives = 77/181 (42%), Gaps = 18/181 (9%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y G ++ + + ++ K + LD D+I DIA+LW + V ++D
Sbjct: 81 YFGQATFFRIFVTDLFDPSVEKIVFLDCDMIIKGDIAELWKTDVSGYYMAAVEDVGLEND 140
Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLW-TSLAD 308
G K + FN GV+++++T R+ + R +LLT L D
Sbjct: 141 GLYGIQHKRSLGIKRRSKYFNAGVMVINMTLWRNHNIPDRTR----NYLLTHHNDVKLPD 196
Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDN--------TRSDELCYTELTDLKIIHWNSPK 360
QD NA++ ++ L++ +WN Q + R D L + + IIH++ P
Sbjct: 197 QDALNAVLCDNWKLLHP---KWNQQATLQLFYKKKWVIREDLL--EAVHNPAIIHYSEPS 251
Query: 361 K 361
K
Sbjct: 252 K 252
>gi|195116431|ref|XP_002002758.1| GI11232 [Drosophila mojavensis]
gi|193913333|gb|EDW12200.1| GI11232 [Drosophila mojavensis]
Length = 458
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 54/149 (36%), Gaps = 38/149 (25%)
Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS----MDMHGHGGKKVLVV 601
YPIN RN+A V+T ++F DI+ P GL + + + + +V +
Sbjct: 233 YPINVGRNIARKAVNTHFIFACDIELFPSLGLIEQFLDMVHRNRSVLALSPNQRPRVYPL 292
Query: 602 PAFET-----------------QRYRTAF------PASHAPTNFSRWVNATTPYQIEWA- 637
P FE Q+ R + H + RW+N TTP E
Sbjct: 293 PVFEIEENEQVPNDKLELLDLLQKRRAQLFHAKVCASCHKVPGYQRWINRTTPIPDELQL 352
Query: 638 ----------PDFEPYIVAHRDLPRYDTR 656
+EP+ ++ P +D R
Sbjct: 353 FSMTLRQDTFRHWEPFYISDNKEPVFDER 381
>gi|330996530|ref|ZP_08320412.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
11841]
gi|329573086|gb|EGG54705.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
11841]
Length = 637
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 59/138 (42%), Gaps = 11/138 (7%)
Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
+ L +LP +++K + LD DV+ + + LW + L + + +
Sbjct: 87 FFRCLLSSILPLSVSKVLYLDCDVLVLSSLYGLWET-----DLTGVALAGVPDSFTVNPV 141
Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
+ + FN GV+LL+L R G R+ AE + + DQD+ N++
Sbjct: 142 HCRRLHYASSYNYFNGGVLLLNLEYWRA---HGVERLCAEHYRMYPDRIVYNDQDLLNSL 198
Query: 316 ISEHPYLVYTLPCQWNVQ 333
+ E L L +WNVQ
Sbjct: 199 LHERKRL---LDMKWNVQ 213
>gi|145350151|ref|XP_001419480.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579712|gb|ABO97773.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 471
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 9/61 (14%)
Query: 613 FPASHAPTNFSRWVNATT-----PYQIEWAPD---FEPYIVAHRDLP-RYDTRFVGFGWN 663
FP +HAPTN++ W TT Y++ +EPY++ DL +D FV +G+N
Sbjct: 253 FPIAHAPTNYTAWFENTTTGADSTYRVATPKHPWYYEPYVIVRADLALPFDESFVQYGFN 312
Query: 664 K 664
K
Sbjct: 313 K 313
>gi|237747745|ref|ZP_04578225.1| lipopolysaccharide 3-alpha-galactosyltransferase [Oxalobacter
formigenes OXCC13]
gi|229379107|gb|EEO29198.1| lipopolysaccharide 3-alpha-galactosyltransferase [Oxalobacter
formigenes OXCC13]
Length = 307
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 80/170 (47%), Gaps = 18/170 (10%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN-QSDWYLG 253
L+ +P++L + + LD D+I +I+ L+ L + + + + +VE+ S+ YL
Sbjct: 88 AFLRFFIPELLQGLTDRALYLDADIICINNISDLFHL--EMDENEILAVVEDIDSETYLN 145
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKL-RDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+ K R FN+GV+++D+ K ++ + ++ EK +L DQD
Sbjct: 146 ENASFQK------RYFNSGVLMMDIEKWNKNNVYGQLLSVLNEK----GSGFNLIDQDAL 195
Query: 313 NAIISEHPYLVYTLPCQWNVQL-SDNTRSDELCYTELTDLKIIHWNSPKK 361
N ++ + V+ L WN + ++ + Y+ + K IH+ P K
Sbjct: 196 NLVMIDK---VHYLDNIWNYMINAEQLDKKKEKYSVPENAKFIHFVGPVK 242
>gi|222635310|gb|EEE65442.1| hypothetical protein OsJ_20804 [Oryza sativa Japonica Group]
Length = 341
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 25/113 (22%)
Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
L +LP + + I LD+DV+ D+ +LW +RL + E N S ++ W
Sbjct: 114 LADLLPRCVPRAIYLDSDVLAVDDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 171
Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
+ P LGR FNTGV+++DL + R AG +R E+++
Sbjct: 172 SD----PGLGRRVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRHRIERWM 216
>gi|254421706|ref|ZP_05035424.1| Glycosyl transferase family 8 [Synechococcus sp. PCC 7335]
gi|196189195|gb|EDX84159.1| Glycosyl transferase family 8 [Synechococcus sp. PCC 7335]
Length = 298
Score = 39.7 bits (91), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 69/167 (41%), Gaps = 21/167 (12%)
Query: 207 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN--QSDWYLGKLWKNHKPWPA 264
E L + I LDTD+I DIA+L+A L + G + + +Y K + P
Sbjct: 112 EDLERVIYLDTDLIVLGDIAELYAYTKALDEHCYFGSIPHFYPCIFYFSNFMKMREEIPK 171
Query: 265 LGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYL-- 322
+ FN GV +L+ + ++ ++ +L D+ +FN + + YL
Sbjct: 172 FKQTFNAGVWFTNLSFWNEKTYERLNYYLSLDAKSNYKLYTLGDEPVFNLMFKD--YLQA 229
Query: 323 --------VYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
T P N+ L+ + L++ K+IHW+ P K
Sbjct: 230 DKNWNRCGYGTHPAVTNLFLASGEKF-------LSEAKLIHWSGPFK 269
>gi|218673597|ref|ZP_03523266.1| putative glycosyltransferase protein [Rhizobium etli GR56]
Length = 311
Score = 39.3 bits (90), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 21/123 (17%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
++ LP+ LP+T + + LD D++ T + QLW + IG V D++L +
Sbjct: 88 RILLPQFLPQTCDRALYLDGDILVLTALEQLW---NTDLGDAVIGAVP---DYWLDSAAR 141
Query: 258 NHKPWPALG----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFL--LTRLWTS-LADQD 310
N P G R FN G++L+DL K R+ I+E+ L L R T+ +DQD
Sbjct: 142 N-GPGATGGARVKRYFNAGILLIDLAKWRN-------ERISERSLDYLERFPTTEYSDQD 193
Query: 311 IFN 313
N
Sbjct: 194 ALN 196
>gi|90657605|gb|ABD96904.1| hypothetical protein [Cleome spinosa]
Length = 528
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 8/94 (8%)
Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLL--- 197
+ +V ++ + K+W +P LGRG ++ VVED ++ N Y V L
Sbjct: 280 KHVFHVVTDRMNLAAMKVWFKMRP---LGRGAHIEIKVVEDFKFL-NSSYVPVLRQLESA 335
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
K LP++ P+ L K + LD DV+ D+ LW +
Sbjct: 336 KFYLPEMYPK-LHKILFLDDDVVVQKDLTGLWKI 368
>gi|295398746|ref|ZP_06808760.1| family 8 glycosyl transferase [Aerococcus viridans ATCC 11563]
gi|294973009|gb|EFG48822.1| family 8 glycosyl transferase [Aerococcus viridans ATCC 11563]
Length = 283
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 83/200 (41%), Gaps = 38/200 (19%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL---FSRLRQRQTIGLVENQSDWYLG 253
+L ++LP+ + + I LD D++ + +LW L +GL + +
Sbjct: 93 FRLLAGEILPKEMKRVIYLDPDILVVNPLLELWQTDLEGHMLAAATHVGLTDVSTRVNQV 152
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
+L +H +N+GV+++DL K R+I W IA+ L L DQDI N
Sbjct: 153 RLDVDHA-------YYNSGVMVIDLDKAREIV---KWSDIAQMIEKYNLLLFLPDQDILN 202
Query: 314 AIISEHPYLVYT--LPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW-----------NSP 359
H Y YT +P + WN +TR +T+ IHW P
Sbjct: 203 -----HLYGKYTKEIPEEIWNY----DTRKYMRYFTKSLTQHDIHWVMANTSILHFCGGP 253
Query: 360 KKLKVKNKHMEFFRNLYLTF 379
K +KH F +LYL +
Sbjct: 254 KPW--DDKHDNRFTSLYLNY 271
>gi|404328965|ref|ZP_10969413.1| hypothetical protein SvinD2_02667 [Sporolactobacillus vineae DSM
21990 = SL153]
Length = 255
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 19/142 (13%)
Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
++ +P +L + + K + LD+D+I D+ LW +LR IG VE+ D +
Sbjct: 86 AFYRIVIPDLLDKKIKKAVYLDSDLIIKDDLLNLWN-NEQLRH-YFIGAVEDPVDVTGIR 143
Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQDIF 312
L P + FN+GV+L++L K R + RI+ + R + + DQD
Sbjct: 144 L-------PKKYKYFNSGVMLMNLEKWRKNKTSQ--RILQ---FIRRHSSKIMWWDQDAL 191
Query: 313 NAIISEHPYLVYTLPCQWNVQL 334
NAI+ + L WN Q+
Sbjct: 192 NAILYNK---WFELGYTWNFQV 210
>gi|115447761|ref|NP_001047660.1| Os02g0664200 [Oryza sativa Japonica Group]
gi|113537191|dbj|BAF09574.1| Os02g0664200, partial [Oryza sativa Japonica Group]
Length = 242
Score = 39.3 bits (90), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 61/139 (43%), Gaps = 14/139 (10%)
Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
L L + P +L K I +D D I D+ +L+ + + R +N + W
Sbjct: 26 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 85
Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
G WK+H GR ++ + + ++DL K R + R+ E S DQD+
Sbjct: 86 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYETLSKDPNSLSNLDQDL 140
Query: 312 FNAIISEHPYLVYTLPCQW 330
N ++H +++LP +W
Sbjct: 141 PN--YAQHTVPIFSLPQEW 157
>gi|198275401|ref|ZP_03207932.1| hypothetical protein BACPLE_01564 [Bacteroides plebeius DSM 17135]
gi|198271737|gb|EDY96007.1| glycosyltransferase, family 8 [Bacteroides plebeius DSM 17135]
Length = 308
Score = 39.3 bits (90), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 68/144 (47%), Gaps = 26/144 (18%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
+ L ++LP + + + LD D++ DI++ W + L + VE N+ Y
Sbjct: 89 RCILSELLPADIDRLLYLDCDIVIVGDISEYWN--TPLDDETGVAAVEDMGCNEPARY-- 144
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLR--DISWA--GFWRIIAEKFLLTRLWTSLADQ 309
++ K +P FN GV+L++L R D++ A ++ E+ L DQ
Sbjct: 145 EILK----YPMEDSYFNAGVLLINLDYWRKNDVAHACVDYFHKYPERILFN-------DQ 193
Query: 310 DIFNAIISEHPYLVYTLPCQWNVQ 333
D+ N+I+ ++ LV +WNVQ
Sbjct: 194 DLLNSILHKNKILV---DLKWNVQ 214
>gi|424790536|ref|ZP_18217073.1| glycosyltransferase, family 8 [Enterococcus faecium V689]
gi|424955172|ref|ZP_18370024.1| glycosyltransferase, family 8 [Enterococcus faecium R494]
gi|425039651|ref|ZP_18444168.1| glycosyltransferase, family 8 [Enterococcus faecium 513]
gi|402920658|gb|EJX41154.1| glycosyltransferase, family 8 [Enterococcus faecium V689]
gi|402934695|gb|EJX54015.1| glycosyltransferase, family 8 [Enterococcus faecium R494]
gi|403015200|gb|EJY28128.1| glycosyltransferase, family 8 [Enterococcus faecium 513]
Length = 280
Score = 38.9 bits (89), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 87/199 (43%), Gaps = 36/199 (18%)
Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYLG 253
+L ++LP+ + + I LD D++ + LW L +GL + +
Sbjct: 90 FRLLAGEILPKEMKRVIYLDPDILVVNPLLDLWQTDLEGYMLAAATHVGLTDVSTRVNQM 149
Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRD-ISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
+L +H +N+GV+++DL K R+ I W+ ++I EK+ L + L DQDI
Sbjct: 150 RLDVDHA-------YYNSGVMVIDLDKAREIIKWSDIAQMI-EKYNLLLI---LPDQDIL 198
Query: 313 NAIISEHPYLVYTLPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW-----------NSPK 360
N + ++ +P + WN +TR +T+ IHW PK
Sbjct: 199 NRLYGKY---TKEIPEEIWNY----DTRKYMRYFTKSLAQHDIHWVMAHTAILHFCGGPK 251
Query: 361 KLKVKNKHMEFFRNLYLTF 379
+KH F +LYLT+
Sbjct: 252 PW--DDKHDNRFTSLYLTY 268
>gi|224117396|ref|XP_002317564.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222860629|gb|EEE98176.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 532
Score = 38.9 bits (89), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 21/83 (25%), Positives = 42/83 (50%), Gaps = 3/83 (3%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y+ V +++ LP++ P +L K + LD D++ TD++ LW + + + Q
Sbjct: 325 YNSVMNHIRIHLPELFP-SLNKVVFLDDDIVVQTDLSPLWDIDMNGKVNGAVETCRGQDK 383
Query: 250 WYLGKLWKNHKPW--PALGRGFN 270
+ + K KN+ + P + + FN
Sbjct: 384 FVMSKRLKNYLNFSHPLIAKNFN 406
>gi|224141469|ref|XP_002324094.1| glycosyltransferase [Populus trichocarpa]
gi|222867096|gb|EEF04227.1| glycosyltransferase [Populus trichocarpa]
Length = 528
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 93 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 152
Y + L++ +P++ P +L K + LD DV+ D++ LW + + + + + + +
Sbjct: 319 YISILNHLRIYIPELFP-SLDKVVFLDDDVVIQRDLSPLWEIDLKGKVNGAVETCKGEDE 377
Query: 153 WYLGKLWKNH 162
W + K +KN+
Sbjct: 378 WVMSKHFKNY 387
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y + L++ +P++ P +L K + LD DV+ D++ LW + + + + + + +
Sbjct: 319 YISILNHLRIYIPELFP-SLDKVVFLDDDVVIQRDLSPLWEIDLKGKVNGAVETCKGEDE 377
Query: 250 WYLGKLWKNH 259
W + K +KN+
Sbjct: 378 WVMSKHFKNY 387
>gi|218706579|ref|YP_002414098.1| putative Capsule biosynthesis protein [Escherichia coli UMN026]
gi|300900284|ref|ZP_07118463.1| hypothetical protein HMPREF9552_04335 [Escherichia coli MS 198-1]
gi|419934881|ref|ZP_14451973.1| putative Capsule biosynthesis protein [Escherichia coli 576-1]
gi|432403347|ref|ZP_19646092.1| hypothetical protein WEK_03550 [Escherichia coli KTE26]
gi|432462309|ref|ZP_19704444.1| hypothetical protein A15I_03183 [Escherichia coli KTE204]
gi|432632844|ref|ZP_19868765.1| hypothetical protein A1UW_03231 [Escherichia coli KTE80]
gi|432767333|ref|ZP_20001727.1| hypothetical protein A1S9_00118 [Escherichia coli KTE50]
gi|432963419|ref|ZP_20152838.1| hypothetical protein A15E_03777 [Escherichia coli KTE202]
gi|433049396|ref|ZP_20236734.1| hypothetical protein WII_03332 [Escherichia coli KTE120]
gi|433064414|ref|ZP_20251326.1| hypothetical protein WIO_03240 [Escherichia coli KTE125]
gi|433069285|ref|ZP_20256061.1| hypothetical protein WIQ_03168 [Escherichia coli KTE128]
gi|218433676|emb|CAR14591.1| putative Capsule biosynthesis protein [Escherichia coli UMN026]
gi|300356174|gb|EFJ72044.1| hypothetical protein HMPREF9552_04335 [Escherichia coli MS 198-1]
gi|388406150|gb|EIL66559.1| putative Capsule biosynthesis protein [Escherichia coli 576-1]
gi|430923733|gb|ELC44466.1| hypothetical protein WEK_03550 [Escherichia coli KTE26]
gi|430986553|gb|ELD03120.1| hypothetical protein A15I_03183 [Escherichia coli KTE204]
gi|431167973|gb|ELE68227.1| hypothetical protein A1UW_03231 [Escherichia coli KTE80]
gi|431322497|gb|ELG10082.1| hypothetical protein A1S9_00118 [Escherichia coli KTE50]
gi|431471994|gb|ELH51886.1| hypothetical protein A15E_03777 [Escherichia coli KTE202]
gi|431562785|gb|ELI36028.1| hypothetical protein WII_03332 [Escherichia coli KTE120]
gi|431579442|gb|ELI52025.1| hypothetical protein WIO_03240 [Escherichia coli KTE125]
gi|431580783|gb|ELI53338.1| hypothetical protein WIQ_03168 [Escherichia coli KTE128]
Length = 330
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 6/62 (9%)
Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
LRN A V + YV LLD+D P F L+ K I+S G K ++P +Y
Sbjct: 77 LRNQAFKSVESEYVILLDVDIYPDFNLFEKYKEKIKS------GIKPFYILPCLYLTKYG 130
Query: 611 TA 612
T+
Sbjct: 131 TS 132
>gi|405376632|ref|ZP_11030585.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
gi|397326770|gb|EJJ31082.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
Length = 322
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 23/124 (18%)
Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
++ +P+ LPET ++ + LD D++ T + QLW + IG V D++L +
Sbjct: 94 RILMPQFLPETCSRVLYLDGDILVLTALEQLWNIDL---GEAVIGAVP---DYWLDNVVS 147
Query: 258 NHKPWPA---LGRGFNTGVILLDLTKLRDISWAGFWRI--IAEKFL--LTRLWTS-LADQ 309
+ + + R FN G++L+DL + WR I+E+ L L R T+ +DQ
Sbjct: 148 SGRGATGGARVERYFNAGILLIDLAR---------WRAERISERSLDYLDRFPTTEYSDQ 198
Query: 310 DIFN 313
D N
Sbjct: 199 DALN 202
>gi|418935516|ref|ZP_13489286.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
gi|375057763|gb|EHS53917.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
Length = 306
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 11/146 (7%)
Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
Y V + +L + ++PET+++ + LD D + +A + LFS +G V++
Sbjct: 81 YWSVATIARLYMDTLVPETISRLVYLDADTL---AVASIVPLFSLDMGGHPVGAVDDCLM 137
Query: 250 WYLGKL--WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA 307
+ K+ K+ A R FN GV+L D + L + + + E FL
Sbjct: 138 AFPDKMSERKSRIGMAAEARYFNAGVLLFDWSVLPETN---VLKEAREIFLKDPERYPFK 194
Query: 308 DQDIFNAIISEHPYLVYTLPCQWNVQ 333
DQD+ N +++E+ L +WN Q
Sbjct: 195 DQDVLNVVLAEN---WLALDPRWNTQ 217
>gi|313225951|emb|CBY21094.1| unnamed protein product [Oikopleura dioica]
Length = 330
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 28/141 (19%)
Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY--PYLK-------SSIRSMDMHG 592
E +++P N LRNVAL +FL ++ L LY P + +S +D +
Sbjct: 141 EESWFPNNLLRNVALKPKIE--IFLKNMSDLSNKDLYLIPVFEKESSTQINSFPQLDENY 198
Query: 593 HGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTP--------YQIEWAPDFEPYI 644
GK V + A+ T+ ++ +F +W T Y+I++ FEPY
Sbjct: 199 QNGK-VQMFHAWCTE-------WCYSFYDFPKWFEKTKKNPKSTEVGYKIKYKLSFEPYY 250
Query: 645 VAHRD-LPRYDTRFVGFGWNK 664
+ D +P YD RF G+G+NK
Sbjct: 251 LGRLDAVPFYDIRFRGYGYNK 271
>gi|302840832|ref|XP_002951962.1| hypothetical protein VOLCADRAFT_92485 [Volvox carteri f.
nagariensis]
gi|300262863|gb|EFJ47067.1| hypothetical protein VOLCADRAFT_92485 [Volvox carteri f.
nagariensis]
Length = 550
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 10/74 (13%)
Query: 594 GGKKVLVVPAFETQRYRT----AFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD 649
G KK L + ++T+ RT FP H T++ RW++ PY I + +EP+ + R
Sbjct: 342 GDKKRLQM-LWQTRHLRTFSEDIFPQGHNATDYRRWLSTDQPYTIRYGRGYEPWGITSRK 400
Query: 650 ----LPRYDTRFVG 659
+P YD RF G
Sbjct: 401 QYTWMP-YDVRFRG 413
>gi|300770875|ref|ZP_07080752.1| family 2 glycosyl transferase [Sphingobacterium spiritivorum ATCC
33861]
gi|300762148|gb|EFK58967.1| family 2 glycosyl transferase [Sphingobacterium spiritivorum ATCC
33861]
Length = 271
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFL 572
E R R NI ++ E N + ++ +RN A++QVS PY+ +D D +
Sbjct: 53 EKFRLRTNISIQHIWHEDNGFQLSQIRNKAISQVSNPYIIQIDGDVI 99
>gi|209877503|ref|XP_002140193.1| glycosyl transferase family 8 protein [Cryptosporidium muris RN66]
gi|209555799|gb|EEA05844.1| glycosyl transferase family 8 protein, putative [Cryptosporidium
muris RN66]
Length = 304
Score = 38.9 bits (89), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 50/101 (49%), Gaps = 11/101 (10%)
Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
KH S + +L LP ++ +++ K + LDTDVI T + +L+ + G+V
Sbjct: 129 KHVSEA-TMCRLLLPNIIDKSIDKLLYLDTDVIVNTPLRELFG----ININSQCGIVARS 183
Query: 248 SDW--YLGKLWKNHKPWPAL----GRGFNTGVILLDLTKLR 282
S + + K K +P + + FN GV+L+ L +LR
Sbjct: 184 STKADLINEWLKKDKIYPHIIYNGTKSFNAGVLLISLNELR 224
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.137 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,707,228,967
Number of Sequences: 23463169
Number of extensions: 457256966
Number of successful extensions: 933385
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 382
Number of HSP's successfully gapped in prelim test: 661
Number of HSP's that attempted gapping in prelim test: 929761
Number of HSP's gapped (non-prelim): 2398
length of query: 664
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 515
effective length of database: 8,863,183,186
effective search space: 4564539340790
effective search space used: 4564539340790
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)