BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10760
(356 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|307213205|gb|EFN88700.1| Lactosylceramide 4-alpha-galactosyltransferase [Harpegnathos
saltator]
Length = 351
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 206/355 (58%), Gaps = 21/355 (5%)
Query: 8 LQRALLYLTAVCLTLSFLLGVHIFLSPRFLNHFVFRVKQPLRYIEYWRSGYRVQNINCVL 67
++R LL+ + L+ L V+I +S V+Q + +++ R N
Sbjct: 1 MKRRLLF----GIILASLFFVYIIVSTE-------EVRQRIPFLDARAYNTRCYNEASTA 49
Query: 68 SGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVV 127
G+ +DF + P +IFF ETSC +G+ L RQAC++ESAA MNP + VY++
Sbjct: 50 EGI-RDFEPESNVDKLKPGRNIFFHETSCFGEEGLMLNARQACAVESAARMNPSMTVYLL 108
Query: 128 VIASVR--NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHM 185
++ N TR ++ L Y+N+ I + Y +NTPL +Y + A+ S WP SHM
Sbjct: 109 FVSKSEFSNNTRE-IVKHLLNYRNIKIRHIQPDNYVKNTPLEAWYARGALKKSRWPNSHM 167
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW---LSG 242
SD+LRY+TL+KYGG YLDLD +V SLE L N+AGAE VAAGVI D ++
Sbjct: 168 SDVLRYLTLWKYGGIYLDLDVVVTTSLEDLTNFAGAEDWDDVAAGVIGFDMSELGRRIAD 227
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS--CRNFTIYPPRFFYPVH 300
A +R+L+ NF+ WG NGPGV+TR L+ C + YA ++ + C +FT++PP FYPVH
Sbjct: 228 ACVRDLKKNFRGDLWGNNGPGVITRTLQKFCATK-YARDMTTARCHSFTVFPPSIFYPVH 286
Query: 301 WEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
++ W +Y N+ TM + + A+HVWN +K V++ S PYA IARRYCP
Sbjct: 287 YKKWKNYFEVKNSNETMKILNKAKAIHVWNKLSKAEQVRVDSNVPYAVIARRYCP 341
>gi|380019673|ref|XP_003693727.1| PREDICTED: uncharacterized protein LOC100863408 [Apis florea]
Length = 656
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 136/345 (39%), Positives = 197/345 (57%), Gaps = 14/345 (4%)
Query: 20 LTLSFLLGVHIF-LSPRFLNHFVFRVKQPLRYIEYWRSGYRVQNINCVLSGVEKDFAFDD 78
L L F V F L N F+ + PL I W +I+C D D
Sbjct: 21 LVLCFFCAVIFFILMISTDNDFIQQHVSPLMNIVVW------DDISCYNQPNMPDSLLDF 74
Query: 79 ITRLDVP--DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT 136
+LD P D +IFF ETSC +G+ L RQAC++ESAA MNP + VY++ ++ +
Sbjct: 75 DPKLDKPNTDRNIFFHETSCFDENGLVLNARQACAVESAAKMNPNMNVYLLFVSPSKISI 134
Query: 137 RNP-LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLY 195
++ + ++L Y N+HI + Y ++TPL +Y D + S WP +HMSD+LRY+TL+
Sbjct: 135 QSKEMFEQLQTYPNIHIRYIKPENYIKDTPLDLWYKSDILKRSRWPRNHMSDILRYLTLW 194
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNF 252
KYGG YLDLD +VI S+E L N+AGAE VAAGVI D ++ A +R++R NF
Sbjct: 195 KYGGIYLDLDVVVITSIEHLTNFAGAEDWKYVAAGVIGFDFSTLGRRMADACIRDIRANF 254
Query: 253 KTTEWGANGPGVLTRLLKAECKPQSYAHNIIS-CRNFTIYPPRFFYPVHWEHWADYLNET 311
+ WG NGPGV+TR L+ C ++ + S C+ F ++PP FYP+H+++W Y
Sbjct: 255 RGDIWGNNGPGVITRTLQKFCSTKNIQNMSTSRCQGFKVFPPSAFYPIHYDNWKVYFQTK 314
Query: 312 NAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
N ATM + + A+HVWN + V + S+ PY IAR++CP+
Sbjct: 315 NMNATMKMLEKAMAIHVWNKLSSAEKVDVNSDVPYVNIARKHCPK 359
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 133/348 (38%), Positives = 195/348 (56%), Gaps = 39/348 (11%)
Query: 47 PLRYIEYWRSGYRVQNINCVLSGVEKDFAFDDITRL----------DVP----------- 85
P+ Y + W+ ++ +N+N + +EK A +L DVP
Sbjct: 301 PIHY-DNWKVYFQTKNMNATMKMLEKAMAIHVWNKLSSAEKVDVNSDVPYVNIARKHCPK 359
Query: 86 -----------DN-SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVR 133
DN +IFF ETSC + + L RQAC++ESAA MNP + VYV+ ++ +
Sbjct: 360 IFNNCGKHMLIDNRNIFFHETSCFDENDLALNPRQACAVESAAKMNPDMNVYVLFLSPSK 419
Query: 134 -NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYV 192
+R L+ +L Y N+HI ++ Y +NTPL +Y + + S WP++HMSD+LRY+
Sbjct: 420 VSRRFRKLLKQLQTYPNIHIRRIKPEDYMKNTPLDAWYKTEILKKSKWPINHMSDILRYL 479
Query: 193 TLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKD---HWLSGAALRELR 249
TL+KYGG YLDLD IVIK ++ L N+AGAE S VAAG+I D ++ ++E+R
Sbjct: 480 TLWKYGGIYLDLDIIVIKPIKYLANFAGAEDESQVAAGIIGFDTSMIGRRMANECIQEIR 539
Query: 250 DNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYL 308
NF+ W NGPGV+TR+LK C + ++I C+ F IYP FYP+H+ +W Y
Sbjct: 540 SNFRGDIWNHNGPGVITRILKKICSTEKLQDMSMIRCKGFKIYPSSTFYPIHYGNWKSYF 599
Query: 309 NETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TN ATM ++ +HVWN + V + S+ PY IAR+YCP+
Sbjct: 600 QTTNKNATMKKIEEAITIHVWNKLSNGEKVNVNSDVPYVTIARKYCPK 647
>gi|328710182|ref|XP_003244188.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Acyrthosiphon pisum]
Length = 352
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 115/289 (39%), Positives = 175/289 (60%), Gaps = 4/289 (1%)
Query: 71 EKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA 130
++D + ++ +V NSIFF+ETSC H G+ L+LRQ C+IESAA +NP + ++V+ +A
Sbjct: 54 KQDNSLLELNPEEVTKNSIFFVETSCNHKKGINLSLRQGCAIESAANLNPNLNIFVLFVA 113
Query: 131 SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLR 190
+ +I+R Y NV++ ++ +Y NTPL F + I TS WP+SH SDLLR
Sbjct: 114 PSFISDESEIINRFKMYSNVNLRYINFVKYSHNTPLQDFVASNTISTSQWPVSHASDLLR 173
Query: 191 YVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDH---WLSGAALRE 247
++TL+K+GGTYLDLD +++KSLE L N+A ES++ VA+ V++ D D +S ++ +
Sbjct: 174 FLTLWKFGGTYLDLDVVLMKSLEGLSNFASIESNTSVASLVLNFDVDKIGRTVSNTSIND 233
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWAD 306
N+ +WG NGPGV+TR L+ C N C+ F ++ F P+ W W
Sbjct: 234 FASNYYANDWGYNGPGVITRTLEKICNTNLVTDMNKQKCKGFMVFGTEAFCPISWTDWRL 293
Query: 307 YLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
Y N + M +DS +HVWN +K P+ +GS+QPY +A +YCP
Sbjct: 294 YFNTNTSDEVMVKLKDSIGIHVWNLHSKHTPINIGSKQPYGLVAEKYCP 342
>gi|328700186|ref|XP_003241171.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Acyrthosiphon pisum]
Length = 353
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/297 (39%), Positives = 176/297 (59%), Gaps = 5/297 (1%)
Query: 63 INCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGV 122
I+C L + D + ++ +V NSIFF+E+SC H G+ L+LRQ C+IESAA +NP +
Sbjct: 48 ISCYLEE-KPDNSLPELNPEEVTKNSIFFIESSCNHKKGINLSLRQGCAIESAANLNPNL 106
Query: 123 QVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL 182
++V+ +A + +I+R Y N+++ ++ +Y NTPL F T + I TS WP+
Sbjct: 107 NIFVLFVAPTFISNESEIINRFQMYSNINLRYINFVKYSHNTPLQNFVTSNTISTSQWPV 166
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW--- 239
H SDLLR +TL+K+GGTYLDLD +++KSLE L N+ G ES+++VA+ V++ D D
Sbjct: 167 HHASDLLRLLTLWKFGGTYLDLDVVLMKSLEGLSNFVGIESNTLVASCVLNFDVDKIGRT 226
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYP 298
+S ++ E N+ WG NGPG++TR LK C N C+ F ++ F+P
Sbjct: 227 VSNTSINEFASNYHANYWGYNGPGLITRTLKKICNTNLVPDMNKKKCKGFMVFGTEEFFP 286
Query: 299 VHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ W W Y N + M +DS +HVWN +K + +GS QPY +A +YCP
Sbjct: 287 ISWTDWRLYFNTNTSDEVMVKLKDSIGIHVWNFLSKDTAINIGSRQPYGLVAEKYCP 343
>gi|328793424|ref|XP_001123219.2| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like,
partial [Apis mellifera]
Length = 480
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 197/354 (55%), Gaps = 19/354 (5%)
Query: 10 RALLYLTAVCLTLSFLLGVHIFLSPRFLNHFVFRVKQPLRYIEYWRSGYRVQNINCVLSG 69
+ L L C + F+L + N F+ + PL I W +I+C
Sbjct: 2 KKRLLLCFFCAVIFFILMIST------DNDFIQQHVSPLMSIVVW------DDISCYNQP 49
Query: 70 VEKDFAFDDITRLDVP--DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVV 127
D D ++D P D +IFF ETSC +G+ L RQAC++ESAA MNP + VY++
Sbjct: 50 NMPDSLLDFDPKIDKPNTDKNIFFHETSCFDENGLMLNARQACAVESAAKMNPNMNVYLL 109
Query: 128 VIASVR-NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMS 186
++ + + + +L Y N+HI + Y ++TPL +Y D + S WP +HMS
Sbjct: 110 FVSPSKISIDSKEMFKQLQTYPNIHIRYIKPENYMKDTPLDLWYKSDILKRSRWPRNHMS 169
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLD---KDHWLSGA 243
D+LRY+TL+KYGG YLDLD +VI S+E L N+AGAE VAAGVI D ++ A
Sbjct: 170 DILRYLTLWKYGGIYLDLDVVVITSIEHLTNFAGAEDWKYVAAGVIGFDFTTLGRRMADA 229
Query: 244 ALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS-CRNFTIYPPRFFYPVHWE 302
+R++R NF+ WG NGPGV+TR L+ C ++ S C+ F ++PP FYP+H++
Sbjct: 230 CIRDIRANFRGDIWGNNGPGVITRTLQKFCSTKNIQDMSTSRCQGFKVFPPSAFYPIHYD 289
Query: 303 HWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+W Y N ATM + + A+HVWN + V + S+ PY IAR++CP+
Sbjct: 290 NWKVYFQTKNKNATMKILEKAMAIHVWNKLSSAEKVDVNSDVPYVNIARKHCPK 343
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 88/130 (67%), Gaps = 1/130 (0%)
Query: 81 RLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVR-NRTRNP 139
+L + + IFF ETSC +G+ L RQAC++ESAA +NP + +Y++ ++ + +R
Sbjct: 351 KLFIQNRCIFFHETSCFDENGLALNARQACAVESAAKINPDMNIYLLFLSPSKISRRFRR 410
Query: 140 LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGG 199
L +L Y N+HI ++ Y +NTPL+ +Y + + S WP++HMSD+LRY+TL+KYGG
Sbjct: 411 LFKQLQTYPNIHIRRIKPEDYMKNTPLNEWYKTEILKKSKWPINHMSDILRYLTLWKYGG 470
Query: 200 TYLDLDFIVI 209
YLDLD IVI
Sbjct: 471 IYLDLDVIVI 480
>gi|332025645|gb|EGI65807.1| Lactosylceramide 4-alpha-galactosyltransferase [Acromyrmex
echinatior]
Length = 291
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 125/275 (45%), Positives = 173/275 (62%), Gaps = 9/275 (3%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVR--NRTRNPLIDRLY 145
+IFF ETSC +G+ L RQAC+IESAA MNP + VY++ I+ N TR ++ L
Sbjct: 9 NIFFHETSCFGDEGLTLNARQACAIESAARMNPSMTVYLLFISKSEFSNSTR-EIVRHLL 67
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
Y NV I +D +Y + TPL +YT + S WP+SHMSD+LRY+TL+KYGG YLDLD
Sbjct: 68 NYSNVRIRHIDPQKYVKETPLDAWYTSGVLKKSHWPVSHMSDILRYLTLWKYGGIYLDLD 127
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW---LSGAALRELRDNFKTTEWGANGP 262
+V SLE+L N+AGAE VAAGV+ D ++ A +R+LR NF+ WG NGP
Sbjct: 128 VVVTSSLENLTNFAGAEDWDDVAAGVMGFDLSKLGRRVADACVRDLRKNFRGDVWGNNGP 187
Query: 263 GVLTRLLKAECKPQSYAHNIIS--CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
GV+TR L+ C +YA ++ + C FT+Y P FYP+H++ W Y ++ T+ +
Sbjct: 188 GVITRTLQKLC-ATTYARDMTTNRCHGFTVYSPSVFYPIHYKKWKKYFEIKDSNVTLKIL 246
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ A+HVWN+ +K V++ S PYA IAR YCP
Sbjct: 247 NKAKAIHVWNNLSKAEKVRVNSNVPYAVIARNYCP 281
>gi|340716416|ref|XP_003396694.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Bombus terrestris]
Length = 350
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 184/305 (60%), Gaps = 11/305 (3%)
Query: 61 QNINCVLSGVEKDFAFDDITRLDVP--DNSIFFLETSCTHADGVELTLRQACSIESAAMM 118
+I C D D +LD P D +IFF ETSC DG+ L RQAC++ESAA M
Sbjct: 39 NDITCYEEESTPDGLPDFDPQLDKPMTDRNIFFHETSCFDKDGLVLNARQACAVESAAKM 98
Query: 119 NPGVQVYVVVIA--SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAIL 176
NP + VY++ I+ + +++R + ++L Y NV I + +Y ++TPLH +Y+ +
Sbjct: 99 NPSMNVYLLFISPSKISDQSRE-MFNQLQTYSNVRIRHIKPEKYMRDTPLHLWYSSGILK 157
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
S WP SHMSD+LRY+TL+KYGG YLDLD +VI SLE L N+AGAE VAAGV+ D
Sbjct: 158 KSYWPRSHMSDILRYLTLWKYGGIYLDLDVVVIASLERLTNFAGAEDWKNVAAGVMGFDV 217
Query: 237 DHW---LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS--CRNFTIY 291
++ A +R++R NF+ WG NGPGV+TR L+ C + Y ++ + C F ++
Sbjct: 218 TELGRRMADACIRDMRANFRGDVWGNNGPGVITRTLQKLCSTK-YVRDMTTSRCHGFKVF 276
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIAR 351
P FYP+H+E W Y + TM + + A+HVWN F+K V + ++ PY IAR
Sbjct: 277 SPSTFYPIHYEKWKIYFETKDKNTTMQIVNKAMAIHVWNKFSKFEKVDVNNDVPYVIIAR 336
Query: 352 RYCPR 356
++CP+
Sbjct: 337 KHCPK 341
>gi|350406339|ref|XP_003487738.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Bombus impatiens]
Length = 350
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 178/285 (62%), Gaps = 11/285 (3%)
Query: 81 RLDVP--DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVI--ASVRNRT 136
+LD P D +IFF ETSC DG+ L RQAC++ESAA MNP + VY++ I + + +++
Sbjct: 59 QLDKPMTDRNIFFHETSCFDKDGLVLNARQACAVESAAKMNPSMNVYLLFISPSKISDQS 118
Query: 137 RNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYK 196
R + ++L Y NV I + +Y ++TPLH +Y+ + S WP SHMSD+LRY+TL+K
Sbjct: 119 RE-MFNQLQTYSNVRIRHIKPEKYMRDTPLHLWYSSGILKKSYWPRSHMSDILRYLTLWK 177
Query: 197 YGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW---LSGAALRELRDNFK 253
YGG YLDLD +VI SLE L N+AGAE VAAGV+ D ++ A +R++R NF+
Sbjct: 178 YGGIYLDLDVVVIASLERLTNFAGAEDWKNVAAGVMGFDVTELGRRMADACIRDMRANFR 237
Query: 254 TTEWGANGPGVLTRLLKAECKPQSYAHNIIS--CRNFTIYPPRFFYPVHWEHWADYLNET 311
WG NGPGV+TR L+ C + Y ++ + C F ++ P FYP+H+E W Y
Sbjct: 238 GDVWGNNGPGVITRTLQKLCSTK-YVRDMTTSRCHGFKVFSPSTFYPIHYEKWKIYFETK 296
Query: 312 NAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ TM + + +HVWN F+K V + ++ PY IAR++CP+
Sbjct: 297 DKNTTMQIVDKAMTIHVWNKFSKFEKVDVNNDVPYVIIARKHCPK 341
>gi|383866067|ref|XP_003708493.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Megachile rotundata]
Length = 390
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 125/291 (42%), Positives = 179/291 (61%), Gaps = 10/291 (3%)
Query: 73 DFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-- 130
DF ++ + D +IFF ETSC DG+ L RQAC++ESAA MNP + VY++ ++
Sbjct: 56 DFNPKSVSNNAISDRNIFFHETSCFQ-DGLVLNARQACAVESAAKMNPNMNVYLLFVSPS 114
Query: 131 SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLR 190
+ N+++ L +L Y N+H+ + Y +NTPL +Y + S WP SHMSD+LR
Sbjct: 115 KISNQSKE-LFKQLETYPNIHLGHIYPDEYVKNTPLDLWYKSGVLKKSRWPRSHMSDILR 173
Query: 191 YVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK---DHWLSGAALRE 247
Y+TL+KYGG YLDLD +V SLE L N+AGAE VAAGVI LD ++ A +R+
Sbjct: 174 YLTLWKYGGIYLDLDVVVTTSLEHLTNFAGAEDWDDVAAGVIGLDATPLGRRVADACIRD 233
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNII--SCRNFTIYPPRFFYPVHWEHWA 305
L NF+ WG NGPGV+TR+L+ C + Y ++ C F +Y P FYPVH++ W
Sbjct: 234 LMKNFRGNVWGNNGPGVITRILRRICSAK-YVRDMTPARCGGFKVYSPSAFYPVHYKKWK 292
Query: 306 DYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
Y + ATM + + + A+HVWN +K + V + S+ PYA IAR++CP+
Sbjct: 293 MYFETKDKNATMKMLKKALAIHVWNKLSKSMEVHVNSDVPYAIIARKHCPK 343
>gi|307189322|gb|EFN73753.1| Nudix hydrolase 8 [Camponotus floridanus]
Length = 898
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/305 (41%), Positives = 181/305 (59%), Gaps = 10/305 (3%)
Query: 60 VQNINCV--LSGVEKDFAFD-DITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAA 116
+ NI C +S E FD + V +IFF ETSC +G+ L RQAC++ESAA
Sbjct: 39 IHNIKCYDEVSSAESIRDFDAESNDKPVLGRNIFFHETSCFGEEGLVLNARQACAVESAA 98
Query: 117 MMNPGVQVYVVVIAS--VRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDA 174
MNP + VY++ I+ N TR ++ L Y+N+ I + RY ++TP +YT
Sbjct: 99 RMNPTMTVYLLFISKSDFSNSTRE-IVRHLLNYRNIKIRHIYPNRYVKDTPFEVWYTSGM 157
Query: 175 ILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHL 234
+ S WP SHMSD+LRY+TL+KYGG YLDLD +VI SLE+L N+AGAE VAAGV+
Sbjct: 158 LKKSHWPASHMSDMLRYLTLWKYGGIYLDLDVVVISSLENLTNFAGAEDWDDVAAGVMGF 217
Query: 235 DKDHW---LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS-CRNFTI 290
D ++ A +R+ + NF+ WG NGPGV+TR L+ C + CR FT+
Sbjct: 218 DTSELGRRIADACVRDFKKNFRGDVWGNNGPGVITRTLQKLCGTINVRDMTTDRCRGFTV 277
Query: 291 YPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIA 350
+PP FYP+H++ W Y + + AT+ + + A+HVWN +K V++ S PYA IA
Sbjct: 278 FPPSVFYPIHYKKWKKYFDTRDNNATLKILSKAKAIHVWNKLSKAEQVRVNSHVPYAVIA 337
Query: 351 RRYCP 355
R++CP
Sbjct: 338 RKHCP 342
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 124/277 (44%), Gaps = 16/277 (5%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVR-NRTRNPLID 142
+P IFF T+C+ LT RQAC++ESAA N + V+V+ A+ + + ID
Sbjct: 354 IPSRCIFFHNTNCSPP---YLTPRQACAVESAAKNNHNLNVFVLFFATDQLSEKSKAFID 410
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYT---QDAILTSLWPLSHMSDLLRYVTLYKYGG 199
L Y NV+I +V L Y +TPL +++ Q+ W D +R +TL+K+GG
Sbjct: 411 ILQTYDNVYIRRVHLSTYVIDTPLENWFSINVQEFSENHDWLRKDFHDYIRMLTLWKFGG 470
Query: 200 TYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
L+L+ I+ SL+ L+ +A A+ + V +D + K
Sbjct: 471 VALNLNNIIPTSLDELNTFAVAQDNQSADVDVFGVDTSTNFGMNFVNTSIKVIKNINVEN 530
Query: 260 NGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
+TR++ + Y C+ FTIY P FY V N M +
Sbjct: 531 YSRYNITRVIVEVLQKLCYEQYDRECKKFTIYSPEKFYSV-------LPYSRNMDKLMKV 583
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+++ ++ W ++++ K + Y A+ +CP+
Sbjct: 584 MQNAKMIYSWTNYSR--DTKADTMATYRIAAKLHCPK 618
>gi|158296878|ref|XP_317211.3| AGAP008258-PA [Anopheles gambiae str. PEST]
gi|157014923|gb|EAA12323.3| AGAP008258-PA [Anopheles gambiae str. PEST]
Length = 358
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/277 (46%), Positives = 169/277 (61%), Gaps = 8/277 (2%)
Query: 88 SIFFLETSCTHADG-VELTLRQACSIESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRL 144
+IFF ETSC DG V L RQAC+IESAA NP VYV+ A V RN T P++D L
Sbjct: 73 NIFFHETSCLKEDGIVRLNARQACAIESAARANPEWNVYVLFAAPVGFRNHTTQPILDAL 132
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
EY+NVH+ V+L Y TPL + I SL+ SH+SD++RY+TLYKYGGTYLDL
Sbjct: 133 LEYRNVHLRYVNLTTYANETPLEEWMASGEIFRSLYMNSHLSDVMRYLTLYKYGGTYLDL 192
Query: 205 DFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGAN 260
D IV +S E L NYAGAES VAAGV++ + H L+ +R+L NF +WG N
Sbjct: 193 DVIVQQSFEKLEPNYAGAESVRWVAAGVMNFEPKGHGHELAEMCVRDLLANFNGKDWGNN 252
Query: 261 GPGVLTRLLKAECKPQSYAHNIIS-CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GPGV+TR+L+ C +S AH CR+FT+YP FY + +E + + E +
Sbjct: 253 GPGVVTRVLQKYCHTRSTAHMTRERCRHFTVYPISAFYAIGYEDYRQFFEEQYLEHALYT 312
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
S +HVWN F+K PV++GS Y +A R+CP+
Sbjct: 313 LNQSIVVHVWNKFSKNHPVRVGSRVAYGVLAERHCPK 349
>gi|157108368|ref|XP_001650194.1| lactosylceramide 4-alpha-galactosyltransferase (alpha-
1,4-galactosyltransferase) [Aedes aegypti]
gi|108879300|gb|EAT43525.1| AAEL005019-PA [Aedes aegypti]
Length = 354
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 121/276 (43%), Positives = 173/276 (62%), Gaps = 7/276 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRLY 145
+IFF ETSC+ ++L RQAC+IESAA MNP +V+ A V RNR+ PL+D L+
Sbjct: 70 NIFFHETSCSEDGVIKLNARQACAIESAARMNPDWNAFVLFAAPVGFRNRSALPLLDALH 129
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
Y NV++ V+L Y Q+TPL + I S + SH+SD++RY+TL+KYGGTYLDLD
Sbjct: 130 SYPNVNLRYVNLSTYAQDTPLEEWMQSGEIFRSKYMNSHLSDIMRYLTLFKYGGTYLDLD 189
Query: 206 FIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANG 261
+V+KS ++ NYAGAES+ VAAGV++ + D H L+ +R+L NF +WG NG
Sbjct: 190 VVVLKSFNAMEPNYAGAESARWVAAGVMNFEPDGHGHELAAMCVRDLLLNFNGQDWGNNG 249
Query: 262 PGVLTRLLKAECKPQS-YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
PGV+TR+LK C Q+ CR+FT+YPP FY ++++ + + E M+
Sbjct: 250 PGVITRVLKRICSTQAPLMMTRERCRHFTVYPPEAFYAINFDDYLQFFEERWLEQAMATV 309
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
S +HVWN F+K V++GS Y +A ++CPR
Sbjct: 310 NRSIVVHVWNKFSKDHKVRVGSRVAYGVLAEQFCPR 345
>gi|158296876|ref|XP_001237894.2| AGAP008259-PA [Anopheles gambiae str. PEST]
gi|157014922|gb|EAU76327.2| AGAP008259-PA [Anopheles gambiae str. PEST]
Length = 302
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 123/277 (44%), Positives = 166/277 (59%), Gaps = 8/277 (2%)
Query: 88 SIFFLETSCTHADG-VELTLRQACSIESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRL 144
+IFF ETSC DG V L RQAC+IESAA NP VYV+ A V RNRT P++D L
Sbjct: 17 NIFFHETSCLKKDGIVRLNARQACAIESAARANPEWNVYVLFAAPVGFRNRTTQPILDAL 76
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
Y NVH+ V+L Y +TPL + + IL S + SH+SD++RY+TLYKYGGTYLDL
Sbjct: 77 LAYPNVHLRYVNLTTYANDTPLKEWMARGEILRSQYMNSHLSDVMRYLTLYKYGGTYLDL 136
Query: 205 DFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGAN 260
D IV +S E + NYAGAES +AAGVI+ + H L+ +R+L N+ +W N
Sbjct: 137 DVIVQQSFEKMKPNYAGAESPQYIAAGVINFESKGHGHELAEMCVRDLLANYNGYQWAQN 196
Query: 261 GPGVLTRLLKAECKPQSYAHNIISC-RNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GPGV+TR+L C QS + C +FT+YPP FY + + ++ + E + +
Sbjct: 197 GPGVITRVLVKHCHTQSITNMTQHCSEHFTVYPPSAFYAIAYWNYKQFFEEPYLKSALEA 256
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
S +H+WN +K PV +GS Y +A RYCP+
Sbjct: 257 LNQSIVVHLWNKLSKDNPVWVGSRVAYGVLAERYCPK 293
>gi|57914725|ref|XP_555205.1| AGAP008260-PA [Anopheles gambiae str. PEST]
gi|55237431|gb|EAL39606.1| AGAP008260-PA [Anopheles gambiae str. PEST]
Length = 404
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 123/276 (44%), Positives = 169/276 (61%), Gaps = 7/276 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRLY 145
+IFF ETSC V L RQAC+IESAA NPG VYV+ A V RNRT P++D L
Sbjct: 120 NIFFHETSCWKDGIVRLNARQACAIESAARANPGWNVYVLFAAPVGFRNRTTQPVLDALL 179
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
EY+NVH+ V+L Y +TPL + + IL S + SH+SD++RY+TLYKYGGTYLDLD
Sbjct: 180 EYRNVHLRYVNLTTYANDTPLKEWMARGDILQSQYMNSHLSDVMRYLTLYKYGGTYLDLD 239
Query: 206 FIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANG 261
IV +S E L NYAGAES ++ + V++L+ H L+ +R+L NF +W NG
Sbjct: 240 VIVQQSFEKLEPNYAGAESFDLINSAVMNLEPKGHGHDLAEICVRDLLTNFNGNDWVNNG 299
Query: 262 PGVLTRLLKAECKPQSYAHNIISC-RNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
PGV+TR+L+ C+ QS A C R+FT+YP FY + + ++ E + F
Sbjct: 300 PGVITRVLQEHCRTQSIAEMTRHCSRHFTVYPSSAFYAIEYWNYELLFEEQCLEEALVAF 359
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
S +H+WN +K PV++GS Y +A R+CP+
Sbjct: 360 NRSIVVHLWNKLSKDSPVRVGSRVAYGVLAERHCPK 395
>gi|380019675|ref|XP_003693728.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Apis florea]
Length = 337
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 105/276 (38%), Positives = 168/276 (60%), Gaps = 6/276 (2%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVR-NRTRNPLIDRL 144
D +IFF ETSC +G+ L RQAC+IESAA MN + VY++ ++ + ++ + ++L
Sbjct: 54 DKNIFFHETSCFDKNGLILNARQACAIESAAKMNSNMNVYLLFLSPSKISKQSKKIFEQL 113
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
Y N+ I ++ Y + TPL +Y D + S WP M+D+LR++TL+KYGG YLDL
Sbjct: 114 QTYPNIRIRRIKFKNYIKKTPLDVWYKTDILKKSKWPRIQMADILRFLTLWKYGGIYLDL 173
Query: 205 DFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW---LSGAALRELRDNFKTTEWGANG 261
D ++++S+E L N+AGAE VAAG++ D + + E+R+NF+ WG NG
Sbjct: 174 DVVIMRSIEHLTNFAGAEDWYQVAAGILGFDTSKLGRHIVNECIHEIRNNFRGDIWGYNG 233
Query: 262 PGVLTRLLKAECKPQSYAHNIIS-CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
PGV+TR+L+ C ++ S C+ F ++PP FYP+H+ +W+ Y TN TM +
Sbjct: 234 PGVITRVLQRICLTENVQDMSTSRCQGFRVFPPSMFYPIHYTNWSLYFT-TNDTKTMKII 292
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ A+HVWN + V + ++ PY +A+++CP+
Sbjct: 293 EQAMAIHVWNKLSNAEKVDVNNDVPYVNVAKKHCPK 328
>gi|156553705|ref|XP_001600253.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Nasonia vitripennis]
Length = 357
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 113/277 (40%), Positives = 171/277 (61%), Gaps = 8/277 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRL 144
+IFF ETSC + EL RQAC++ESAA MNP V ++ ++ NRT + L
Sbjct: 73 KNIFFHETSCFESGRSELNCRQACAVESAARMNPSTSVNLLFLSPSPPSNRTARLVDLVL 132
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
Y+NV +++V + Y +++P+ +Y + +S WP SHMSD++RY+TL+K+GG YLDL
Sbjct: 133 QSYRNVRVMRVQVDEYVRDSPIEQWYASGILGSSHWPRSHMSDIMRYLTLWKFGGVYLDL 192
Query: 205 DFIVIKSLESLHNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWGANG 261
D +V SLE L ++AGAE VAAGVI ++ A LR+L NF+ WG NG
Sbjct: 193 DVVVTTSLEDLTDFAGAEDWMDVAAGVIGFGATGLGRRVANACLRDLMRNFRGNLWGNNG 252
Query: 262 PGVLTRLLKAECKPQSYAHNIIS--CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
PGV+TR L+ C + +A ++ S C F ++PP FYP+ ++ W Y + + TM L
Sbjct: 253 PGVITRTLQKFCAVE-HAKDMTSSRCSGFRVFPPSAFYPIFYKEWRRYFSAQDFNETMRL 311
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+++ A+HVWN + V++GS+ PYA +A+++CPR
Sbjct: 312 IQEARAIHVWNKLSATEVVRVGSKVPYAIVAQKHCPR 348
>gi|195148474|ref|XP_002015199.1| GL19577 [Drosophila persimilis]
gi|198474841|ref|XP_001356832.2| GA14400 [Drosophila pseudoobscura pseudoobscura]
gi|194107152|gb|EDW29195.1| GL19577 [Drosophila persimilis]
gi|198138571|gb|EAL33898.2| GA14400 [Drosophila pseudoobscura pseudoobscura]
Length = 384
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 167/293 (56%), Gaps = 20/293 (6%)
Query: 84 VPDNSIFFLETSCTHADG-----VELTLRQACSIESAAMMNPGVQVYVVVIASV------ 132
P SIFF ETSC A+ +ELT RQAC+IESAA+ NP QV+V+
Sbjct: 83 TPGRSIFFHETSCHQAENKKYKLLELTARQACAIESAALHNPNFQVFVLFAGPTYRPSPK 142
Query: 133 --RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLR 190
N + PL+D + Y NVH+ ++L RY TP+ + ++ S + SH+SD LR
Sbjct: 143 GRNNSNQQPLVDAILSYSNVHLRNLNLWRYAAGTPIEEWLKDGSLFRSRYLFSHISDFLR 202
Query: 191 YVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALR 246
Y+TLY+YGG YLD+D +V++ +E + NY GAES++ +AAGV+ L H ++ + LR
Sbjct: 203 YLTLYRYGGLYLDMDVVVLQKMEDVPPNYTGAESNTHLAAGVMSLAATGFGHEIAESCLR 262
Query: 247 ELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEH 303
+ + NF +WG NGPGV+TR+ + C + + C F +Y FY V W+
Sbjct: 263 DFQHNFAGKDWGNNGPGVITRVAQQICGTKDITLMQEDSKRCLGFKVYGRGAFYAVPWKQ 322
Query: 304 WADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
W D+ TM +DSY +HVWN + ++P+K+GS YA+ A + CPR
Sbjct: 323 WRDFFEPEKLEETMGRTKDSYVVHVWNKHSNQLPIKVGSNNAYAKYAEQNCPR 375
>gi|195342161|ref|XP_002037670.1| GM18175 [Drosophila sechellia]
gi|194132520|gb|EDW54088.1| GM18175 [Drosophila sechellia]
Length = 369
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/301 (38%), Positives = 174/301 (57%), Gaps = 20/301 (6%)
Query: 76 FDDITRLD---VPDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVV 127
DD+ + D P NSIFF ETSC ++ +L T RQAC+IESAAM NP QV+V+
Sbjct: 60 LDDVLQADPKPSPGNSIFFHETSCRLSENRQLETLKVTARQACAIESAAMHNPNFQVFVL 119
Query: 128 VIA-----SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL 182
S N PL++ + Y NVH+ +++L Y TP+ + + S +
Sbjct: 120 FAGPTYRISNNNSHPQPLVEAILSYSNVHLRRLNLESYASGTPMEEWLKDGRLSRSKYLF 179
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DH 238
SH+SD LRY+TLY+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H
Sbjct: 180 SHISDFLRYLTLYRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGH 239
Query: 239 WLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRF 295
++ + LR+ + NF +WG NGPGV+TR+ + C + A + C F ++
Sbjct: 240 EIAASCLRDFQHNFNGEDWGNNGPGVITRVAQKICGTEDIALMQEDPKRCMGFKVFGRGA 299
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
FY V W+ W D+ TM+ +DSY +HVWN + ++P+K GS+ YA A + CP
Sbjct: 300 FYAVPWKRWRDFFEPEKLEQTMARCKDSYVVHVWNKHSSKLPIKQGSKNAYALYAEQNCP 359
Query: 356 R 356
R
Sbjct: 360 R 360
>gi|170071309|ref|XP_001869873.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
gi|167867207|gb|EDS30590.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
Length = 351
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 184/311 (59%), Gaps = 13/311 (4%)
Query: 58 YRVQNINCVLSGV-EKDFAFDDITRLDVP---DNSIFFLETSCTHADGV-ELTLRQACSI 112
Y N NC G E +D+ R +IFF ETSC+ ADGV +L RQAC+I
Sbjct: 33 YHTYNENCFARGFPEPQRTLEDVQRAHKQPHRGRNIFFHETSCS-ADGVIKLNARQACAI 91
Query: 113 ESAAMMNPGVQVYVVVIASV--RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFY 170
ESAA MNP +V+ A V RN++ PL+D L Y NV++ V+L Y ++TPL +
Sbjct: 92 ESAARMNPSWNAFVLFAAPVGFRNKSALPLLDALLAYPNVNLRFVNLTTYAEDTPLDAWM 151
Query: 171 TQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAA 229
I S + SH+SD++RY+TL+KYGGTYLDLD +V+KS ++L NYAGAES VAA
Sbjct: 152 QSGEIFRSRYMNSHLSDIMRYLTLFKYGGTYLDLDVVVLKSFDTLEPNYAGAESPRWVAA 211
Query: 230 GVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS-YAHNIISC 285
GV++ + D H L+ +R+L NF +WG NGPGV+TR+LK C +S C
Sbjct: 212 GVMNFEPDGHGHELAEMCVRDLLINFNGQDWGNNGPGVITRVLKQICATKSPLMMTRERC 271
Query: 286 RNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQP 345
R FT+YPP FY ++++ + + E M+ S +HVWN F+K V++GS
Sbjct: 272 RFFTVYPPEAFYAINFDDYKQFFEERWLDQAMATVNRSVVVHVWNKFSKDHKVRVGSRVL 331
Query: 346 YAQIARRYCPR 356
Y +A +YCPR
Sbjct: 332 YGVLAEQYCPR 342
>gi|195550750|ref|XP_002076096.1| GD11999 [Drosophila simulans]
gi|194201745|gb|EDX15321.1| GD11999 [Drosophila simulans]
Length = 369
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/301 (38%), Positives = 174/301 (57%), Gaps = 20/301 (6%)
Query: 76 FDDITRLDV---PDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVV 127
DD+ + D P NSIFF ETSC ++ +L T RQAC+IESAAM NP QV+V+
Sbjct: 60 LDDVLQADSKPSPGNSIFFHETSCRLSENRQLETLKVTARQACAIESAAMHNPNFQVFVL 119
Query: 128 VIA-----SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL 182
S N PL++ + Y NVH+ +++L Y TP+ + + S +
Sbjct: 120 FAGPTYRISNNNSHPQPLVEAILSYSNVHLRRLNLESYASGTPMEEWLKDGRLSRSKYLF 179
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DH 238
SH+SD LRY+TLY+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H
Sbjct: 180 SHISDFLRYLTLYRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGH 239
Query: 239 WLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRF 295
++ + LR+ + NF +WG NGPGV+TR+ + C + A + C F ++
Sbjct: 240 EIAASCLRDFQHNFNGEDWGNNGPGVITRVAQKICGTKDIALMQEDPKRCMGFKVFGRGA 299
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
FY V W+ W D+ TM+ +DSY +HVWN + ++P+K GS+ YA A + CP
Sbjct: 300 FYAVPWKRWRDFFEPEKLEQTMARCKDSYVVHVWNKHSSKLPIKQGSKNAYALYAEQNCP 359
Query: 356 R 356
R
Sbjct: 360 R 360
>gi|19920618|ref|NP_608737.1| alpha4GT1 [Drosophila melanogaster]
gi|7295862|gb|AAF51162.1| alpha4GT1 [Drosophila melanogaster]
gi|15291359|gb|AAK92948.1| GH17972p [Drosophila melanogaster]
gi|220945468|gb|ACL85277.1| alpha4GT1-PA [synthetic construct]
gi|220955278|gb|ACL90182.1| alpha4GT1-PA [synthetic construct]
Length = 369
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 112/289 (38%), Positives = 171/289 (59%), Gaps = 17/289 (5%)
Query: 85 PDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVVVIASVRNRTRN- 138
P NSIFF ETSC ++ +L T RQAC+IESAAM NP QV+V+ + N
Sbjct: 72 PGNSIFFHETSCRLSENRQLETLKVTARQACAIESAAMHNPNFQVFVLFAGPTYRISNNK 131
Query: 139 ----PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTL 194
PL++ + Y NVH+ +++L Y TP+ + + S + SH+SD LRY+TL
Sbjct: 132 SHPQPLLEAILSYSNVHLRRLNLESYASGTPMEEWLKDGRLSRSKYLFSHISDFLRYLTL 191
Query: 195 YKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRD 250
Y+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H ++ + LR+ +
Sbjct: 192 YRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGHEIAASCLRDFQH 251
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEHWADY 307
NF +WG NGPGV+TR+ + C + A + C F ++ FY V W+ W D+
Sbjct: 252 NFNGGDWGNNGPGVITRVAQKICGTKDIALMREDPKRCMGFKVFGRGAFYAVPWKQWRDF 311
Query: 308 LNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
N T++ +DSY +HVWN + ++P+K+GS+ YA A + CPR
Sbjct: 312 FEPENLEETIARCKDSYVVHVWNKHSSKLPIKIGSKNAYALYAEQNCPR 360
>gi|194759101|ref|XP_001961788.1| GF14753 [Drosophila ananassae]
gi|190615485|gb|EDV31009.1| GF14753 [Drosophila ananassae]
Length = 379
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 176/304 (57%), Gaps = 23/304 (7%)
Query: 76 FDDITRLD---VPDNSIFFLETSCTHADG-----VELTLRQACSIESAAMMNPGVQVYVV 127
DD+ + D P +IFF ETSC + +ELT RQAC+IESAA+ NP QV+V+
Sbjct: 67 LDDVLQADPKPTPGKTIFFHETSCHQKENNRYKVLELTARQACAIESAALHNPNFQVFVL 126
Query: 128 V------IASVRNRTRNP--LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL 179
I + R+ +P L++ + Y NVH+ +++L Y TP+ + + S
Sbjct: 127 FAGPTYQIPTARHNDSHPQTLVEAILSYSNVHLRRLNLWSYAAATPIEEWLKDGRLFRSK 186
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK-- 236
+ SH+SD LRY+TLY+YGG YLD+D +V++S+E + NY GAES + +AAGV++L+
Sbjct: 187 YLFSHISDFLRYLTLYRYGGLYLDMDVVVLRSMEEVPPNYTGAESDTHLAAGVMNLEPTG 246
Query: 237 -DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS---YAHNIISCRNFTIYP 292
H ++ + LR+ + NF +WG NGPGV+TR+ + C+ + C FT++
Sbjct: 247 FGHGIAESCLRDFQHNFDGRDWGNNGPGVITRVAQKICQTNDIRLMQEDRKRCLGFTVFG 306
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARR 352
FY + W+ W D+ TM+ +DSY +HVWN + ++P+K GS YA+ A +
Sbjct: 307 RAAFYAIPWKQWKDFFEPEKMEETMARAKDSYVVHVWNKHSSKLPIKHGSSNAYAKYAEK 366
Query: 353 YCPR 356
CPR
Sbjct: 367 NCPR 370
>gi|194855265|ref|XP_001968508.1| GG24467 [Drosophila erecta]
gi|190660375|gb|EDV57567.1| GG24467 [Drosophila erecta]
Length = 369
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/301 (37%), Positives = 174/301 (57%), Gaps = 20/301 (6%)
Query: 76 FDDITRLD---VPDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVV 127
+D+ + D P NSIFF ETSC +D +L T RQAC+IESAAM NP QV+V+
Sbjct: 60 LEDVLQADPKPSPGNSIFFHETSCRLSDNRQLETLRVTARQACAIESAAMHNPSFQVFVL 119
Query: 128 VIA-----SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL 182
S N PL++ + Y NVH+ ++++ Y TP+ + + S +
Sbjct: 120 FAGPTYRISNNNSHPQPLVEAILSYSNVHLRRLNIETYASGTPMEEWLKDGRLSRSKYLF 179
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DH 238
SH+SD LRY+TLY+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H
Sbjct: 180 SHISDFLRYLTLYRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGH 239
Query: 239 WLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRF 295
++ + LR+ + NF +WG NGPGV+TR+ + C + A + C F ++
Sbjct: 240 EIAASCLRDFQHNFNGKDWGNNGPGVITRVAQKICGTKDIALMQEDPKRCMGFKVFGRGA 299
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
FY V W+ W D+ T++ +DSY +HVWN + ++P+K GS+ YA A + CP
Sbjct: 300 FYAVPWKQWRDFFEPEKLEETIARCKDSYVVHVWNKHSSKLPIKQGSKNAYAMYAEQNCP 359
Query: 356 R 356
R
Sbjct: 360 R 360
>gi|48473847|emb|CAF18556.1| alpha1,4-N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
Length = 357
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 169/289 (58%), Gaps = 17/289 (5%)
Query: 85 PDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVVVIA-----SVRN 134
P NSIFF ETSC ++ +L T RQAC+IESAAM NP QV+V+ S N
Sbjct: 60 PGNSIFFHETSCRLSENRQLETLKVTARQACAIESAAMHNPNFQVFVLFAGPTYRISNNN 119
Query: 135 RTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTL 194
PL++ + Y N H+ +++L Y TP+ + + S + SH+SD LRY+TL
Sbjct: 120 SHPQPLVEAILSYSNGHLRRLNLESYASGTPMEEWLKDGRLSRSKYLFSHISDFLRYLTL 179
Query: 195 YKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRD 250
Y+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H ++ + LR+ +
Sbjct: 180 YRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGHEIAASCLRDFQH 239
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEHWADY 307
NF +WG NGPGV+TR+ + C + A + C F ++ FY V W+ W D+
Sbjct: 240 NFNGVDWGNNGPGVITRVAQKICGTKDIALMREDPKRCMGFKVFGRGAFYAVPWKQWRDF 299
Query: 308 LNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T++ +DSY +HVWN + ++P+K+GS+ YA A + CPR
Sbjct: 300 FEPEKLEETIARCKDSYVVHVWNKHSSKLPIKIGSKNAYALYAEQNCPR 348
>gi|195470949|ref|XP_002087769.1| GE14957 [Drosophila yakuba]
gi|194173870|gb|EDW87481.1| GE14957 [Drosophila yakuba]
Length = 369
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 173/301 (57%), Gaps = 20/301 (6%)
Query: 76 FDDITRLD---VPDNSIFFLETSCTHADGVEL-----TLRQACSIESAAMMNPGVQVYVV 127
+D+ + D P +SIFF ETSC +D +L T RQAC+IESAA NP QV+V+
Sbjct: 60 LEDVLQADPKPSPGSSIFFHETSCRLSDNRQLETLRVTARQACAIESAAKHNPNFQVFVL 119
Query: 128 VIA-----SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL 182
S N PL++ + Y NVH+ +++L Y TP+ + + S +
Sbjct: 120 FAGPTYRISNNNSQPQPLVEAILSYSNVHLRRLNLESYASGTPMEEWLKDGRLSRSKYLF 179
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DH 238
SH+SD LRY+TLY+YGG YLD+D +V++++E + NY GAES++ +AAGV++L H
Sbjct: 180 SHISDFLRYLTLYRYGGLYLDMDVVVLRNMEKVPPNYTGAESNTHLAAGVMNLAATGFGH 239
Query: 239 WLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRF 295
++ + LR+ + NF +WG NGPGV+TR+ + C + + C F ++
Sbjct: 240 EIAASCLRDFQHNFNGKDWGNNGPGVITRVAQKICGTKDITLMQEDPKRCMGFKVFGRGA 299
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
FY V W+ W+D+ T++ +DSY +HVWN + ++P+K G++ YA A + CP
Sbjct: 300 FYAVPWKQWSDFFEPEKLEETIARCKDSYVVHVWNKHSSKLPIKQGTKNAYAMYAEQNCP 359
Query: 356 R 356
R
Sbjct: 360 R 360
>gi|198474838|ref|XP_002132786.1| GA26016 [Drosophila pseudoobscura pseudoobscura]
gi|198138570|gb|EDY70188.1| GA26016 [Drosophila pseudoobscura pseudoobscura]
Length = 350
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 181/313 (57%), Gaps = 25/313 (7%)
Query: 69 GVEKDFAFDDITRLD---VPDNSIFFLETSCTHADG-----VELTLRQACSIESAAMMNP 120
VE+ DD + P SIFF ETSC A ++LT RQAC+IESAA+ NP
Sbjct: 29 AVEESIRLDDALLAEPRPTPGQSIFFHETSCYDAKNKNLNILKLTARQACAIESAALHNP 88
Query: 121 GVQVYV---------VVIASVRNRT-RNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFY 170
QV+V VV +V NR+ + PL D + Y NVH+ +++L RY TP+ +
Sbjct: 89 NFQVFVLFVDRKYSHVVDPNVGNRSIQQPLFDAILSYSNVHLRRLNLWRYAAGTPMEEWL 148
Query: 171 TQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAA 229
+ + S + +SH+SD LRY+TL++YGG YLD+D +V++S+E + NY GAES + +AA
Sbjct: 149 REGNLFRSRYLVSHISDFLRYLTLFRYGGLYLDMDVVVLRSMEDIPPNYTGAESDTFLAA 208
Query: 230 GVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNII 283
G+++L H ++ + L + + NF WG NGP V+TR+ + C ++ + N
Sbjct: 209 GIMNLAASGFGHQIAASCLHDFQTNFDGNVWGQNGPEVITRVAQQICGTKNISVMQTNRK 268
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSE 343
C F ++ FY V + W ++ N T++ +DSYA+HVWNS +++ P+K+GS
Sbjct: 269 RCLGFKVFGRGAFYAVTSDKWLNFFNPHKLEETLARTKDSYAVHVWNSQSEKKPIKIGST 328
Query: 344 QPYAQIARRYCPR 356
YA+ A + CPR
Sbjct: 329 CAYAKYAEKNCPR 341
>gi|322795112|gb|EFZ17954.1| hypothetical protein SINV_80258 [Solenopsis invicta]
Length = 253
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 106/245 (43%), Positives = 151/245 (61%), Gaps = 9/245 (3%)
Query: 118 MNPGVQVYVVVIA--SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAI 175
MNP + VY++ ++ + N TR +I L Y NV I +D RY ++TPL +Y +
Sbjct: 1 MNPSMAVYLLFVSKSNFSNSTRE-IIRNLLNYPNVRIQHIDPQRYVKDTPLDAWYNSGIL 59
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLD 235
S WP SHMSD+LRY+TL+KYGG YLDLD +V SLE+L N+AGAE VAAGV+ D
Sbjct: 60 KKSHWPTSHMSDMLRYLTLWKYGGIYLDLDVVVTSSLENLTNFAGAEDWDDVAAGVMGFD 119
Query: 236 KDHW---LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS--CRNFTI 290
++ A +R+L+ NF+ WG NGPGV+TR L+ C YA ++ + C FT+
Sbjct: 120 MSKLGRRVADACVRDLKKNFRGDVWGNNGPGVITRTLQKLCATM-YARDMTTDRCHGFTV 178
Query: 291 YPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIA 350
YPP FYP+H++ W Y ++ AT+ + + A+HVWN +K V++ PYA IA
Sbjct: 179 YPPSVFYPIHYKKWKKYFEIKDSNATLKILSKAKAIHVWNKLSKAEQVRVNINVPYAVIA 238
Query: 351 RRYCP 355
R++CP
Sbjct: 239 RKHCP 243
>gi|195148470|ref|XP_002015197.1| GL19576 [Drosophila persimilis]
gi|194107150|gb|EDW29193.1| GL19576 [Drosophila persimilis]
Length = 350
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 179/313 (57%), Gaps = 25/313 (7%)
Query: 69 GVEKDFAFDDITRLD---VPDNSIFFLETSCTHADG-----VELTLRQACSIESAAMMNP 120
VE+ DD + P SIFF ETSC A ++LT RQ C+IESAA+ NP
Sbjct: 29 AVEESIRLDDALLAEPRPTPGRSIFFHETSCYDAKNKNLNILKLTARQVCAIESAALHNP 88
Query: 121 GVQVYV---------VVIASVRNRT-RNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFY 170
QV+V VV +V NR+ + PL D + Y NVH+ +++L RY TP+ +
Sbjct: 89 NFQVFVLFVDRKYSLVVDPNVGNRSIQQPLFDAILSYSNVHLRRLNLWRYAAGTPMEKWL 148
Query: 171 TQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAA 229
+ + S + +SH+SD LRY+TL++YGG YLD+D +V++ +E + NY GAES + +AA
Sbjct: 149 REGNLFRSRYLVSHISDFLRYLTLFRYGGLYLDMDVVVLRRMEDIPPNYTGAESDTFLAA 208
Query: 230 GVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNII 283
G+++L H ++ + L + + NF WG NGP V+TR+ + C ++ + N
Sbjct: 209 GIMNLAASGFGHQIAASCLHDFQTNFDGNVWGQNGPEVITRVAQQICGTKNISVMQTNRK 268
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSE 343
C F ++ FY V + W ++ N T++ +DSYA+HVWNS +++ P+K+GS
Sbjct: 269 RCLGFKVFGRGAFYAVTSDKWLNFFNPHKLEETLARTKDSYAVHVWNSQSEKQPIKIGST 328
Query: 344 QPYAQIARRYCPR 356
YA+ A + CPR
Sbjct: 329 CAYAKYAEKNCPR 341
>gi|332373546|gb|AEE61914.1| unknown [Dendroctonus ponderosae]
Length = 368
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/280 (40%), Positives = 160/280 (57%), Gaps = 12/280 (4%)
Query: 88 SIFFLETSCT--HADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDR-- 143
SIFF ETSC + +T RQAC++ESAA MNP VY++ + + + DR
Sbjct: 79 SIFFHETSCNSFRNGKITITARQACAVESAARMNPDYDVYLLYASPGTYKMEDTESDRFL 138
Query: 144 --LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L +Y+NV I +D+ RYF NTP+ + Q + S + SH SD+LR++TL+KYGG Y
Sbjct: 139 MELLKYRNVRIYHIDMDRYFMNTPVESLWKQQQMKQSRFAQSHTSDVLRFLTLWKYGGIY 198
Query: 202 LDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEW 257
LDLD IV KSL+ L ++ G ES + VAAG++ + H + + L +L++NFK +W
Sbjct: 199 LDLDVIVTKSLDDLGTDFTGFESKTSVAAGILSFNYTGDGHDFANSCLEDLKNNFKGHDW 258
Query: 258 GANGPGVLTRLLKAECKPQSYAHNI-ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT 316
G NGPG +TRL+K C+ + + +C+ F IYPP FY + W +W + E
Sbjct: 259 GWNGPGTVTRLIKRLCEENNIPKLVNKTCKGFKIYPPNRFYSIPWWNWKYFFQEEFLDFV 318
Query: 317 MSLFRDSYALHVWNSFTKRVPVKLGSEQ-PYAQIARRYCP 355
DSY +HVWN F+ + L E PY A+ YCP
Sbjct: 319 KKQTADSYLIHVWNKFSTDTNISLHYENVPYLNFAKLYCP 358
>gi|195433803|ref|XP_002064896.1| GK15176 [Drosophila willistoni]
gi|194160981|gb|EDW75882.1| GK15176 [Drosophila willistoni]
Length = 386
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/297 (36%), Positives = 168/297 (56%), Gaps = 28/297 (9%)
Query: 88 SIFFLETSC------THADG-------VELTLRQACSIESAAMMNPGVQVYVVV------ 128
SIFF ETSC G ++LT RQAC+IESAA+ NP ++V+
Sbjct: 81 SIFFHETSCHLFRKEERKQGKKKEYKLLQLTPRQACAIESAALHNPNFDIFVLFADPTYK 140
Query: 129 IASVRNRT--RNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMS 186
+ RN + ++PLI+ + Y NVH+ ++L Y + TP+ ++ + S + SH+S
Sbjct: 141 VTKWRNGSLEQSPLIEAILGYNNVHLRSLNLWTYSEGTPIEAWFKDGQLFQSRYLFSHLS 200
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSG 242
D LRY+TLY+YGG YLD+D +V++S+E + NY GAES S +AAGV++ H ++
Sbjct: 201 DFLRYLTLYRYGGLYLDMDVVVLRSMEDIPPNYTGAESHSSLAAGVMNFAAHGFGHEIAE 260
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPV 299
+ LR+ + NF ++WG NGPGV+TR+ + C Q + + C F +Y FY +
Sbjct: 261 SCLRDFQQNFDGSDWGQNGPGVITRVAQKICGTQDISLMIEDSKRCLGFKVYSRGAFYAI 320
Query: 300 HWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
W W D+ TM+ +DSY +HVWN + ++ ++ G++ YAQ A CPR
Sbjct: 321 PWRQWQDFFEPHKLETTMARAKDSYVIHVWNKHSSKLKIRYGTKSAYAQYAESNCPR 377
>gi|195053081|ref|XP_001993459.1| GH13060 [Drosophila grimshawi]
gi|193900518|gb|EDV99384.1| GH13060 [Drosophila grimshawi]
Length = 371
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/291 (37%), Positives = 172/291 (59%), Gaps = 16/291 (5%)
Query: 82 LDVPDNSIFFLETSCTHA-------DGVELTLRQACSIESAAMMNPGVQVYVVVIASV-- 132
L V SIFF+ETS + ++LT RQAC+IESAA NP + V+V+
Sbjct: 72 LPVAGRSIFFIETSGVLKPRKEVVKNLLQLTARQACAIESAARHNPNLMVFVLFATPTYR 131
Query: 133 RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYV 192
+ + PLID + Y NV + Q+++ RY TP++ + + +S + +SH+SDLLR+V
Sbjct: 132 QKEEKLPLIDAIRSYGNVQLRQLNIRRYALRTPINEWVKHGELFSSRYLVSHISDLLRFV 191
Query: 193 TLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDK---DHWLSGAALREL 248
TLY++GG YLD+D +V++SLE + NYAG ES + +AAGV+ + H ++ A LR+
Sbjct: 192 TLYRFGGIYLDMDVVVLRSLEDVSLNYAGPESETHLAAGVMGMAPFGFGHEIAEACLRDF 251
Query: 249 RDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIIS---CRNFTIYPPRFFYPVHWEHWA 305
+ NF +WG NGPGV+TR+ + C ++ + + C F ++ FY V +HW
Sbjct: 252 QQNFDGQKWGNNGPGVITRVAQKICATKNISLMLADRKRCLGFRVFERNAFYAVPRKHWR 311
Query: 306 DYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ T+ L RDSY +H+WN +K++P+K+GS Y + A ++CP+
Sbjct: 312 HFFEPKYLEETLELTRDSYLVHMWNKHSKQLPIKVGSSTAYGKYAEQHCPK 362
>gi|195116905|ref|XP_002002992.1| GI17680 [Drosophila mojavensis]
gi|193913567|gb|EDW12434.1| GI17680 [Drosophila mojavensis]
Length = 395
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 174/304 (57%), Gaps = 31/304 (10%)
Query: 84 VPDNSIFFLETSC-----------THADGVELTLRQACSIESAAMMNPGVQVYVVVIASV 132
+P +SIFF ETSC + + ++LT RQAC+IESAA+ NP QV+V+ +
Sbjct: 83 IPGHSIFFHETSCHRPKRKRQRPDSQYNMLQLTARQACAIESAALHNPNFQVFVLFASPT 142
Query: 133 -------------RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL 179
+ +ID + Y+NVH+ Q++L RY TP+ + + S
Sbjct: 143 YLHYGNSSTGGSSGYQRHQSIIDAILSYKNVHLRQLNLWRYAAGTPIEEWLKDGRLFRSS 202
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK-- 236
+ SH+SD LR++TLY+YGG YLD+D ++++S+E + NY GAES++ +AAGV+ L
Sbjct: 203 YLFSHISDFLRFLTLYRYGGIYLDMDVVMLRSMEDVPPNYTGAESNTHLAAGVMSLAPTG 262
Query: 237 -DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYP 292
H ++ A L++ + NF ++WG NGPGV+TR+ + C + + + CR F ++
Sbjct: 263 FGHEIAEACLQDFQRNFDGSDWGNNGPGVITRVAQKICATKDISLMLEDRKRCRGFKVFD 322
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARR 352
FY V W+HW + TM+ +DS+ +HVWN +K++ +K+GS Y + A +
Sbjct: 323 RNAFYAVPWKHWRHFFEPQLLEQTMAHTKDSFLVHVWNKHSKQLAIKVGSSTAYGKYAEQ 382
Query: 353 YCPR 356
+CPR
Sbjct: 383 HCPR 386
>gi|91088471|ref|XP_969928.1| PREDICTED: similar to GA14400-PA [Tribolium castaneum]
gi|270011737|gb|EFA08185.1| hypothetical protein TcasGA2_TC005812 [Tribolium castaneum]
Length = 356
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 157/279 (56%), Gaps = 11/279 (3%)
Query: 88 SIFFLETSCTH--ADGVELTLRQACSIESAAMMNPGVQVYVVV----IASVRNRTRNPLI 141
SIFF ETSC + +T RQAC++ESAA +NP +++++ I + +
Sbjct: 68 SIFFHETSCNSFLNGKITITARQACAVESAARLNPNFEIHLLFASPGIFKFEGTQSDRFL 127
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L Y NV I VD RY + TP+ Y + I S + SH SD+LRY+TL+K+GG Y
Sbjct: 128 QNLMTYPNVRIHHVDYERYTKGTPVETLYRKGKIEVSGYAQSHASDVLRYITLWKFGGIY 187
Query: 202 LDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEW 257
LDLD IV K LESL NYAGAES VAAGV+ + H L+ L++L +NFK +W
Sbjct: 188 LDLDVIVTKPLESLPLNYAGAESDRNVAAGVLSFSPEGLGHELAQRCLQDLSENFKGYDW 247
Query: 258 GANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT 316
G NGPGV+TRLLK C ++ C F ++P FYP+ W W Y +E
Sbjct: 248 GYNGPGVITRLLKKLCGAETAKEMQSKDCEGFKVFPVDAFYPIPWWDWRLYFDENLTEKV 307
Query: 317 MSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+++ +DS+ +HVWN + V YA +A+++CP
Sbjct: 308 LNISKDSHVIHVWNKHSGGTRVAARGNSAYAVLAQKFCP 346
>gi|195573999|ref|XP_002104977.1| GD21237 [Drosophila simulans]
gi|194200904|gb|EDX14480.1| GD21237 [Drosophila simulans]
Length = 400
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 160/281 (56%), Gaps = 12/281 (4%)
Query: 88 SIFFLETS----CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLID 142
SIFF ET+ V+LT R+AC+IESAA+ NPG+ V+V+ + R + +PLI
Sbjct: 111 SIFFHETTNFRRIEKNAVVQLTAREACAIESAALHNPGLTVFVLFAGATHRPLSEDPLIR 170
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L+ Y+N+ + ++L RY TP+ + + S + H+SDLLRYVTLYKYGG YL
Sbjct: 171 ALHNYKNIRLRHLNLWRYAAGTPIAKWLKSGKLFKSKFLFPHVSDLLRYVTLYKYGGLYL 230
Query: 203 DLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWG 258
DLD +V ++LE L N++GAES+ VA GV+ + H ++ LR+L N+ +WG
Sbjct: 231 DLDVVVQQNLEKLPPNFSGAESNISVACGVMKMSPGGLGHKIATMCLRDLEANYNANKWG 290
Query: 259 ANGPGVLTRLLKAECKPQ---SYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
NGPGV+TR+ K +C + S +N C F I+ FY + W W D+
Sbjct: 291 TNGPGVITRVAKKQCNTENIKSVINNPKRCNGFKIFDANAFYAISWRQWKDFFEPNRLNV 350
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM S +HVWN F+K +K + Y ++A+ +CPR
Sbjct: 351 TMKRISKSPVIHVWNKFSKGWKLKTKANCAYTKLAKTHCPR 391
>gi|195504212|ref|XP_002098984.1| GE23622 [Drosophila yakuba]
gi|194185085|gb|EDW98696.1| GE23622 [Drosophila yakuba]
Length = 399
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 160/281 (56%), Gaps = 12/281 (4%)
Query: 88 SIFFLETS----CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLID 142
SIFF ET+ + V+LT R AC+IESAA+ NPG+ V+V+ + R + +PLI
Sbjct: 110 SIFFHETTNFRRLEKSALVQLTARSACAIESAALHNPGLTVFVLFAGATHRPLSGDPLIK 169
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L++Y N+ + ++L RY TP+ + + S + H+SDLLRYVTLYKYGG YL
Sbjct: 170 ALHKYNNIRLRHLNLWRYAAGTPITKWLNSGKLFKSKFLFPHVSDLLRYVTLYKYGGLYL 229
Query: 203 DLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWG 258
DLD +V ++LE + N+ GAES+ +A GV+ + H ++ LR+L N+K +WG
Sbjct: 230 DLDVVVQQNLEKMPPNFTGAESNKSLACGVMKMSSAGLGHKIATMCLRDLEANYKGDKWG 289
Query: 259 ANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
NGPGV+TR+ K +CK ++ +N C F +Y P FY + W W D+
Sbjct: 290 TNGPGVITRVAKKQCKTENVKAMINNSKGCNGFRVYDPNAFYAIPWLQWKDFFLPNRLNV 349
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM S +HVWN F+K +K Y +A+ +CPR
Sbjct: 350 TMRRVSKSPVVHVWNKFSKGWKLKTKDSCAYISLAKTHCPR 390
>gi|195053083|ref|XP_001993460.1| GH13059 [Drosophila grimshawi]
gi|193900519|gb|EDV99385.1| GH13059 [Drosophila grimshawi]
Length = 394
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 171/303 (56%), Gaps = 30/303 (9%)
Query: 84 VPDNSIFFLETSC---------------THADGVELTLRQACSIESAAMMNPGVQVYVVV 128
+P SIFF ETSC T + ++LT RQAC+IESAA+ NP QV+V+
Sbjct: 83 IPGRSIFFHETSCHRPKRRRQRPGQRPNTQYNMLQLTARQACAIESAALHNPNFQVFVLF 142
Query: 129 IA--------SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLW 180
S + P+ID + Y+N+ + Q++L Y TP+ + + S +
Sbjct: 143 AGPTYRYHGNSSIGQRHQPIIDAILSYKNIQLRQLNLWSYAAGTPIEEWLKDGQLFRSSY 202
Query: 181 PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK--- 236
SH+SD LR++TLY+YGG YLD+D ++++S+E + N+ GAES++ +AAGV+ L
Sbjct: 203 LFSHISDFLRFLTLYRYGGIYLDMDVVMLRSMEDVPPNFTGAESNTHLAAGVMSLAPTGF 262
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPP 293
H ++ A LR+ + NF ++WG NGPGV+TR+ + C + + + C F ++
Sbjct: 263 GHEIAEACLRDFQQNFDGSDWGNNGPGVITRVAQHICGTKDISLMLEDRKRCLGFKVFER 322
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRY 353
FY V W+HW + T++ +DSY +HVWN +K++ +K+GS Y + A ++
Sbjct: 323 NAFYAVPWKHWRHFFEPQLLEQTLAHTKDSYLVHVWNKHSKQLAIKVGSSTAYGKYAEQH 382
Query: 354 CPR 356
CP+
Sbjct: 383 CPK 385
>gi|221459580|ref|NP_651434.3| alpha4GT2 [Drosophila melanogaster]
gi|220903215|gb|AAF56516.3| alpha4GT2 [Drosophila melanogaster]
Length = 395
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/281 (38%), Positives = 160/281 (56%), Gaps = 12/281 (4%)
Query: 88 SIFFLETS----CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLID 142
+IFF ET+ + V+LT R+AC+IESAA+ NPG+ V+V+ + R + +PLI
Sbjct: 106 NIFFHETTNFKRIEKSSVVQLTAREACAIESAALHNPGLTVFVLFAGATHRPSSGDPLIR 165
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L+ Y+N+ + ++L RY TP+ + + S + H+SDLLRYV+LYKYGG YL
Sbjct: 166 ALHNYKNIRLRHLNLWRYAAGTPIAKWLKSGKLFKSKFLFPHVSDLLRYVSLYKYGGLYL 225
Query: 203 DLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWG 258
DLD +V ++LE L N+ GAES+ VA GV+ + H ++ LR+L N+ +WG
Sbjct: 226 DLDVVVQQNLEKLPPNFTGAESNISVACGVMKMSPGGLGHKIATMCLRDLEANYNANKWG 285
Query: 259 ANGPGVLTRLLKAECKP---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
NGPGV+TR++K +C +S +N C F ++ FY + W W D+
Sbjct: 286 TNGPGVITRVVKKQCNTDNIKSVINNPKRCNGFKVFDANAFYAISWLQWKDFFQPNRHNV 345
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM S +HVWN F+K VK S Y +A+ +CPR
Sbjct: 346 TMKRVSKSPVIHVWNKFSKGWKVKTKSNCAYTTLAKIHCPR 386
>gi|195387175|ref|XP_002052275.1| GJ17463 [Drosophila virilis]
gi|194148732|gb|EDW64430.1| GJ17463 [Drosophila virilis]
Length = 392
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 176/312 (56%), Gaps = 31/312 (9%)
Query: 76 FDDITRLD---VPDNSIFFLETSC--------THADG-------VELTLRQACSIESAAM 117
+D+ D +P SIFF ETSC H ++LT RQAC+IESAA+
Sbjct: 72 LEDVLSTDPKPIPGRSIFFHETSCHRPKRRRQRHGQSPDLQYNMLQLTARQACAIESAAL 131
Query: 118 MNPGVQVYVVVIA------SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYT 171
NP QV+V+ S ++ + P+ID + Y+NV + Q++L RY TP+ +
Sbjct: 132 HNPNFQVFVLFAGPTYRHLSNSSQRQQPIIDAILSYKNVQLRQLNLWRYAAGTPIEEWLK 191
Query: 172 QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAG 230
+ S + SH+SD LR++TLY+YGG YLD+D ++++S+E + N+ GAES++ +AAG
Sbjct: 192 DGRLFRSSYLFSHISDFLRFLTLYRYGGIYLDMDVVMLRSMEDVPPNFTGAESNTHLAAG 251
Query: 231 VIHLDK---DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIIS 284
V+ L H ++ + LR+ + +F ++WG NGPGV+TR+ + C Q + +
Sbjct: 252 VMSLAPTGFGHEIAESCLRDFQLHFDGSDWGNNGPGVITRVAQQICGTQDISLMLEDRKR 311
Query: 285 CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQ 344
C F ++ FY V W+HW + T+ +DSY +HVWN +K++ +K+GS
Sbjct: 312 CLGFKVFDRNAFYAVPWKHWRHFFEPQLLEQTLEHTKDSYLVHVWNKHSKQLAIKVGSST 371
Query: 345 PYAQIARRYCPR 356
Y + A ++CP+
Sbjct: 372 AYGKYAEQHCPK 383
>gi|242024268|ref|XP_002432550.1| Lactosylceramide 4-alpha-galactosyltransferase, putative [Pediculus
humanus corporis]
gi|212518010|gb|EEB19812.1| Lactosylceramide 4-alpha-galactosyltransferase, putative [Pediculus
humanus corporis]
Length = 361
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 115/321 (35%), Positives = 167/321 (52%), Gaps = 23/321 (7%)
Query: 50 YIEYWRSGYRVQNINCVLSGVEKDFAFDDITRLDVPDNSIFFLETSC-THADGVELTLRQ 108
YI +W+ R +NI+C K ++ V I+F ETSC + + + RQ
Sbjct: 41 YIIHWKGKTR-KNIDCFYLTGNKLITLNE--NEPVLGKGIYFHETSCPKNLTDIVINSRQ 97
Query: 109 ACSIESAAMMNPGVQVYVVVIASVR---NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTP 165
ACS+ESA +P V++++ + V T N LI +L Y+N + ++ Y NTP
Sbjct: 98 ACSVESALRAHPEKDVFLLITSPVLTPLKNTNNKLIHQLLSYKNFKVRHINFSEYLSNTP 157
Query: 166 LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSS 225
LH YT A+ S WP++H SD+LRY TLYK+GG YLDLD IV+KSL L N+AGAES+S
Sbjct: 158 LHVIYTSGALNNSYWPVAHSSDVLRYATLYKFGGIYLDLDVIVLKSLSGLKNFAGAESNS 217
Query: 226 VVAAGVIHLDKDHWLSGAA---LRELRDNFKTTEWGANGPGVLTRLLKAEC-------KP 275
+ +G++ D A + EL F WG NGPGV+TR+LK +C P
Sbjct: 218 SLGSGILGFSYDEVGRSTAKECVEELASTFIGYFWGHNGPGVITRVLKRKCMTDDVLRMP 277
Query: 276 QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKR 335
+S C+ F+IYP +F P + + E N + SY +H WN + +
Sbjct: 278 ES------DCQGFSIYPTEYFAPFSYNDPEFFFQEGNTDLARKISNHSYTIHYWNKMSSK 331
Query: 336 VPVKLGSEQPYAQIARRYCPR 356
+K + P+ A YCP+
Sbjct: 332 FILKNNLDIPFFYYALNYCPK 352
>gi|195349547|ref|XP_002041304.1| GM10267 [Drosophila sechellia]
gi|194122999|gb|EDW45042.1| GM10267 [Drosophila sechellia]
Length = 400
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 107/281 (38%), Positives = 157/281 (55%), Gaps = 12/281 (4%)
Query: 88 SIFFLETS----CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR-NPLID 142
SIFF ET+ V+LT R+AC+IESAA+ NPG+ V+V+ + R +PLI
Sbjct: 111 SIFFHETTNFRRIEKNAVVQLTAREACAIESAALHNPGLTVFVLFAGATHRPLRGDPLIR 170
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L+ Y+N+ + ++L RY TP+ + + S + H+SDLLRYVTLYKYGG YL
Sbjct: 171 ALHNYKNIRLRNLNLWRYAAGTPIAKWLKSGKLFKSKFLFPHVSDLLRYVTLYKYGGLYL 230
Query: 203 DLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWG 258
DLD +V ++LE L N+ GAES+ +A GV+ + H ++ LR+L N+ +WG
Sbjct: 231 DLDVVVQQNLEKLPPNFTGAESNISLACGVMKMSPGGLGHKIATMCLRDLEANYNANKWG 290
Query: 259 ANGPGVLTRLLKAECKP---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
NGPGV+TR+ K +C +S +N C F I+ FY + W W +
Sbjct: 291 TNGPGVITRVAKKQCNTDNIKSVINNPKHCNGFQIFDANAFYAISWRQWKYFFEPNRLNV 350
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM S +HVWN F+K +K + Y ++A+ +CPR
Sbjct: 351 TMKRISKSPVIHVWNKFSKGWKLKTKTNCAYTKLAKTHCPR 391
>gi|194908313|ref|XP_001981745.1| GG11427 [Drosophila erecta]
gi|190656383|gb|EDV53615.1| GG11427 [Drosophila erecta]
Length = 325
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 161/281 (57%), Gaps = 12/281 (4%)
Query: 88 SIFFLETS----CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLID 142
+IFF ET+ + V+LT R AC+IESAA+ NPG+ V+V+ + R +PLI
Sbjct: 36 NIFFHETTNFMRIERSALVQLTARAACAIESAALHNPGLTVFVLFAGATHRKLGGDPLIK 95
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L++Y+N+ + ++L RY TP+ + + + S + H+SDLLRYVTLYKYGG YL
Sbjct: 96 ALHKYKNIRLRHLNLWRYAAGTPIAKWLSTGKLFKSKFLFPHVSDLLRYVTLYKYGGLYL 155
Query: 203 DLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWG 258
DLD +V ++L + N+ GAES++ VA GV+ + H ++ LR+L N+ +WG
Sbjct: 156 DLDVVVQQNLVKMPPNFTGAESNTSVACGVMKMSAGGVGHKIATMCLRDLEANYNGDKWG 215
Query: 259 ANGPGVLTRLLKAECKPQ---SYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
NGPGV+TR++K +CK + + +N C F +Y P FY + W W D+
Sbjct: 216 TNGPGVITRVVKKQCKTENVRAMINNPKRCNGFRVYDPSAFYAIPWLQWKDFFQPNKLNV 275
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T+ S +HVWN +K + + Y ++A+ +CPR
Sbjct: 276 TLRRVSKSPVVHVWNKLSKGWKLNTKASCAYTKLAKTHCPR 316
>gi|195387179|ref|XP_002052277.1| GJ17465 [Drosophila virilis]
gi|194148734|gb|EDW64432.1| GJ17465 [Drosophila virilis]
Length = 378
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 101/292 (34%), Positives = 171/292 (58%), Gaps = 21/292 (7%)
Query: 85 PDNSIFFLETS-------CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV----- 132
P +IFFLET+ + ++LT RQAC+IESAA+ NP +V+V+ +
Sbjct: 79 PGRTIFFLETTRIVRPKPEMMNNMLQLTARQACAIESAALHNPTFEVFVLFAGATYRYHE 138
Query: 133 -RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRY 191
++ + P+ID + Y+NV + Q++L RY TP+ + + S + ++H+SDLLR+
Sbjct: 139 NSSQHQQPIIDAILSYENVKLRQLNLRRYVTGTPIEDWVKYGELFGSSFLINHISDLLRF 198
Query: 192 VTLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDK---DHWLSGAALRE 247
+TLY++GG YLD+D ++++S+E + NYAGAES S VA+GV+ + H + LR+
Sbjct: 199 ITLYRFGGIYLDMDVVLLRSMEDVPLNYAGAESDSHVASGVMSMAPHGFGHQFAKYCLRD 258
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEHW 304
+ +F WG NGPGV+TR+++ C+ ++ + C F ++ FY V WE W
Sbjct: 259 FQRHFDGDAWGNNGPGVITRVIQRVCRTKNITLMMEDEKRCLGFKVFDQNAFYAVPWEDW 318
Query: 305 ADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ + T++ +DSY +H+WN +K++ ++GS Y A+++CPR
Sbjct: 319 RHFFDPNLQKQTLARTKDSYLVHIWNDVSKQIN-RVGSRTAYGIYAQQHCPR 369
>gi|195387177|ref|XP_002052276.1| GJ17464 [Drosophila virilis]
gi|194148733|gb|EDW64431.1| GJ17464 [Drosophila virilis]
Length = 379
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/292 (34%), Positives = 163/292 (55%), Gaps = 20/292 (6%)
Query: 85 PDNSIFFLETSCTHA-------DGVELTLRQACSIESAAMMNPGVQVYVVVIASV----- 132
P +IFFLETS T + ++LT RQAC+IESAA+ NP +V+V+
Sbjct: 79 PGRTIFFLETSRTVRPKPEMINNMLQLTARQACAIESAALHNPNFEVFVLFAGPTYRYHE 138
Query: 133 -RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRY 191
++ + P+ID + Y+NV + Q++L RY TP+ + + +S +P+ H+SDLLR
Sbjct: 139 NSSQHQQPIIDAILSYKNVQLRQLNLRRYVTGTPIEDWVKYGELFSSRFPIHHVSDLLRL 198
Query: 192 VTLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDK---DHWLSGAALRE 247
+TLY+YGG YLD+D ++++S+E + NYAG ES + VA GV+ + H + + L +
Sbjct: 199 ITLYRYGGIYLDMDVVLLRSMEDVPLNYAGVESFTHVANGVLSMAPTGFGHKFAESCLLD 258
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYA---HNIISCRNFTIYPPRFFYPVHWEHW 304
+ F WG NGPGV+TR+ + C + + + C+ F ++ FY V + +W
Sbjct: 259 FQQQFDGDAWGHNGPGVITRVAQRICGTANISLLLEDRTRCQGFNVFNHTAFYAVSYRNW 318
Query: 305 ADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ T++ +DSY +HVWN +K +K+GS Y A + CPR
Sbjct: 319 RHFFQPQYLQQTLARTKDSYLVHVWNQISKLFHIKVGSRTAYGMYAEKNCPR 370
>gi|194741216|ref|XP_001953085.1| GF19918 [Drosophila ananassae]
gi|190626144|gb|EDV41668.1| GF19918 [Drosophila ananassae]
Length = 347
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 106/291 (36%), Positives = 158/291 (54%), Gaps = 16/291 (5%)
Query: 81 RLDVPDNSIFFLET----SCTH-ADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN- 134
++ P SIFF+ET S H + +EL+ RQAC+IESAA+ N +V+V+ + S
Sbjct: 49 QMPAPGRSIFFIETLFQPSANHKSRSLELSPRQACAIESAALHNRNSEVFVLFVGSTARI 108
Query: 135 -RTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVT 193
+PLI+ + Y NVH +D+ Y TPL + + + TS + SH+SD LR++T
Sbjct: 109 PGDAHPLIEIISSYVNVHFRSLDIWSYAAGTPLESWLKKGDLFTSKYLFSHLSDFLRFLT 168
Query: 194 LYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHL---DKDHWLSGAALRELR 249
LY++GG YLD+D +V+++L+ L N GAE S + + VI + ++ L +LR
Sbjct: 169 LYRFGGVYLDMDVVVLQTLDRLPPNCVGAEDSGSINSAVIKIAATSTGRKIAKLFLYDLR 228
Query: 250 DNFKTTEWGANGPGVLTRLLKAECK----PQSYAHNIISCRNFTIYPPRFFYPVHWEHWA 305
DNF + WG NGPGV+TR+ + CK P+ Y C ++ P FY VHW W
Sbjct: 229 DNFNGSLWGNNGPGVVTRVSQKLCKTHEIPRIYLR-YSRCSGIRVFSPSAFYAVHWSKWQ 287
Query: 306 DYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
D+ + TM SY HVWN +K + S+ Y +I + CPR
Sbjct: 288 DFFDSDKLEKTMVAMEHSYVAHVWNHMSKNWILTATSKNAYRKITEKNCPR 338
>gi|194743750|ref|XP_001954363.1| GF16775 [Drosophila ananassae]
gi|190627400|gb|EDV42924.1| GF16775 [Drosophila ananassae]
Length = 371
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 161/282 (57%), Gaps = 12/282 (4%)
Query: 87 NSIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLI 141
+IFFL+T+ + ++LT R+AC+IESAA+ NPG+ V+V+ + R + NP I
Sbjct: 81 KNIFFLQTTGVQKEETLQLIKLTAREACAIESAALHNPGLTVFVLFAGATHRMYSGNPWI 140
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L Y+N+ + +++L RY TP+ + I S + H SDLLRY+TLYKYGG Y
Sbjct: 141 RSLNFYKNIRLRRLNLLRYAAGTPIEKWIKLGKIYKSKYIFPHASDLLRYLTLYKYGGIY 200
Query: 202 LDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEW 257
LDLD +V++SLE + N+ GAE+S +A GV+ + + H ++ L++L+ NF W
Sbjct: 201 LDLDVVVLRSLEKMPPNFTGAETSKSLACGVMKMSSTGEGHQIAALCLQDLQANFNANNW 260
Query: 258 GANGPGVLTRLLKAEC---KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAP 314
G+ GP V+TR+ K C + Q+ C+ T++ + FY + W W D+ ++
Sbjct: 261 GSIGPAVITRVAKKTCNTTRIQAMIDKPSHCKGLTVFDAKAFYAIPWRQWMDFFRSSSLN 320
Query: 315 ATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T+ +SY +HVWN F+K ++ Y + AR CPR
Sbjct: 321 KTLKATSNSYVIHVWNKFSKFQRLRAREITAYTKYARTNCPR 362
>gi|195392082|ref|XP_002054688.1| GJ22668 [Drosophila virilis]
gi|194152774|gb|EDW68208.1| GJ22668 [Drosophila virilis]
Length = 360
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 11/283 (3%)
Query: 85 PDNSIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL 140
P +IFF+ETSC AD + QAC IESAA+ NP +QV+V+ S + P+
Sbjct: 68 PGRTIFFIETSCHRADFQYNLSSIKAHQACPIESAALHNPNMQVFVLFACSTHHAKSMPI 127
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+D L Y+NV ++ L RY Q+TP+ + + +S + + H+SDLLR +TLY++GG
Sbjct: 128 VDALLSYKNVQFRELTLERYAQDTPVADWIKNGKLFSSRFLMYHLSDLLRLITLYRFGGV 187
Query: 201 YLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTE 256
YLD+D + +++LE + NYAGAES + VI L+ + H L L+ + N+ +
Sbjct: 188 YLDMDVLSLRTLEDVPLNYAGAESLDSIGNSVISLEPNGFGHQLGELFLQNFQKNYIGSA 247
Query: 257 WGANGPGVLTRLLKAECKPQS---YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNA 313
W NGP VL R+L+ C Q+ +N C F ++ Y + W W + +A
Sbjct: 248 WAHNGPMVLVRVLRELCGTQNITLMVNNRERCYGFQVFNVSDIYEIPWRQWTLFFEPKHA 307
Query: 314 PATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T+ +DS +H+WN ++ P+K+ S+ Y A ++CPR
Sbjct: 308 NLTLERTKDSRMVHMWNHLVRKWPLKIDSKAAYLHWAAQHCPR 350
>gi|195115956|ref|XP_002002522.1| GI17430 [Drosophila mojavensis]
gi|193913097|gb|EDW11964.1| GI17430 [Drosophila mojavensis]
Length = 367
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 103/290 (35%), Positives = 155/290 (53%), Gaps = 22/290 (7%)
Query: 85 PDNSIFFLETSCTHADGVE----LTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL 140
P SIFF ETSCT D + LT RQACSIESAA+ NP Q++ V T PL
Sbjct: 73 PGRSIFFHETSCTSPDSISNIMTLTARQACSIESAALNNPNFQIFTVFSCP----TFRPL 128
Query: 141 -------IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVT 193
+D + Y+NV++ ++L Y NT + + + +LTS +P+ H SDL+R ++
Sbjct: 129 AGRQKLFVDAIESYENVNLRHLNLRNYAMNTSIEDWVKRGDLLTSSYPMHHTSDLIRLIS 188
Query: 194 LYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELR 249
LY++GG YLD+D IV+ SLE L NY GAES+ + VI L + H ++ L+ +
Sbjct: 189 LYRFGGIYLDMDIIVLTSLEKLPLNYVGAESTDTLCNAVIGLTAEGIGHEVADLFLQHYQ 248
Query: 250 DNFKTTEWGANGPGVLTRLLKAECKP---QSYAHNIISCRNFTIYPPRFFYPVHWEHWAD 306
F + NGP ++T +L C ++ SC+ F I+ FY +HW+ W
Sbjct: 249 KYFNGKNYVQNGPALVTAVLLKFCGTSLMEAMEGGRKSCKGFRIFNSTAFYSIHWQEWRH 308
Query: 307 YLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ TM+ +DSY +H+WN + +K+ S Y + A ++CPR
Sbjct: 309 FTEPRYLEETMARTKDSYMIHMWNKVSSGERIKVASNTAYVKYAEKHCPR 358
>gi|195034301|ref|XP_001988866.1| GH10340 [Drosophila grimshawi]
gi|193904866|gb|EDW03733.1| GH10340 [Drosophila grimshawi]
Length = 355
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 158/290 (54%), Gaps = 17/290 (5%)
Query: 84 VPDNSIFFLETSCTHADG-------VELTLRQACSIESAAMMNPGVQVYVVVIASVR--- 133
P +IFF ET+C + + LT+RQAC+IESAA+ NP V+V+ R
Sbjct: 57 TPGRTIFFHETTCNNRSSEDSQFQVMNLTVRQACAIESAALHNPNFDVFVLFTCPTRRPL 116
Query: 134 NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVT 193
+ + LI+ + Y NV Q+D+ RY + TP+ + + +L +P+ H SDLLR ++
Sbjct: 117 SIAQKSLINAIESYSNVQFRQLDVKRYAKGTPVEVWIKKGYVLKGRFPVHHTSDLLRLIS 176
Query: 194 LYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELR 249
LY+YGG Y+D+D IV++SLE + NY GAE+ ++ VI L + H ++ LR+ +
Sbjct: 177 LYRYGGIYMDMDVIVLRSLEDVPLNYLGAETFDLLGNAVISLKPNGTGHEIAELFLRDFQ 236
Query: 250 DNFKTTEWGANGPGVLTRLLKAECK---PQSYAHNIISCRNFTIYPPRFFYPVHWEHWAD 306
N+ + NGP ++TR++ A C ++ + +CR ++ FY + W+ W
Sbjct: 237 INYNGRHYVDNGPALVTRVVTAICGVKLVKAIQEDSKTCRGLKLFNSTAFYAIPWQQWEH 296
Query: 307 YLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM +DSY +H+WN + R +K+GS Y + A ++CP+
Sbjct: 297 LTKPQYLKDTMEKTKDSYLIHLWNKVSSRGLIKVGSNTAYGKYAEKHCPK 346
>gi|195118539|ref|XP_002003794.1| GI21127 [Drosophila mojavensis]
gi|193914369|gb|EDW13236.1| GI21127 [Drosophila mojavensis]
Length = 347
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/286 (35%), Positives = 154/286 (53%), Gaps = 14/286 (4%)
Query: 85 PDNSIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTR 137
P SIFF ETSCT D + LT RQACSIESAA+ NP Q++ V +
Sbjct: 53 PGRSIFFHETSCTSPDTASNIMNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQ 112
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
L+D + Y+NV+ ++L Y NTP+ + + +L S +P+ H SDLLR ++LY++
Sbjct: 113 KLLVDAIESYENVNFRHLNLRDYAMNTPIEDWVKRGELLNSSFPMEHTSDLLRLISLYRF 172
Query: 198 GGTYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFK 253
GG YLD+D IV+K LE L NY GA+S+ +A GVI L D H ++ L++ + F
Sbjct: 173 GGIYLDMDIIVLKRLEKLPLNYVGAQSNYTLANGVIGLTADGIGHEVAELFLQQYQKYFN 232
Query: 254 TTEWGANGPGVLTRLLKAECKP---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNE 310
+ NGP ++T +L C ++ SC+ F ++ FY + W W +
Sbjct: 233 GKNYVQNGPTLVTAVLLKYCGTSLMKAIEGGRKSCKGFRLFNTTAFYSIPWPEWRHFTEP 292
Query: 311 TNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM+ +DS +H+WN ++ +K+GS Y + A + CPR
Sbjct: 293 RYLEETMARTKDSLMIHMWNKVSRGERIKVGSNAAYVKYAEKKCPR 338
>gi|195118537|ref|XP_002003793.1| GI21138 [Drosophila mojavensis]
gi|193914368|gb|EDW13235.1| GI21138 [Drosophila mojavensis]
Length = 347
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/283 (36%), Positives = 152/283 (53%), Gaps = 14/283 (4%)
Query: 88 SIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTRNPL 140
SIFF ETSCT D + LT RQACSIESAA+ NP Q++ V + L
Sbjct: 56 SIFFHETSCTSPDTASNIMNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQKLL 115
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+D + Y+NV+ ++L Y NTP+ + + +L S +P+ H SDLLR ++LY++GG
Sbjct: 116 VDAIESYENVNFRHLNLRDYAMNTPIEDWVKRGELLNSSFPMEHTSDLLRLISLYRFGGI 175
Query: 201 YLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTE 256
YLD+D IV+KSLE L NY GAES+ + VI L D H ++ L F +
Sbjct: 176 YLDMDIIVLKSLEKLPLNYVGAESNYSLCNAVIGLAADGIGHEVAELFLERYVKYFNGKD 235
Query: 257 WGANGPGVLTRLLKAECKP---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNA 313
+ NGP ++T +L C ++ SC+ F ++ FY + W+ W +
Sbjct: 236 YAQNGPALVTAVLLKFCGTSLMKAIEGGRKSCKGFRLFNTTAFYSIPWQEWKHFTEPRYL 295
Query: 314 PATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
TM+ +DS +H+WN ++R +K+ S Y + A +YCPR
Sbjct: 296 EETMARTKDSLMIHIWNIASRRERIKVASNIAYVKYAEKYCPR 338
>gi|195118543|ref|XP_002003796.1| GI21107 [Drosophila mojavensis]
gi|193914371|gb|EDW13238.1| GI21107 [Drosophila mojavensis]
Length = 274
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 147/265 (55%), Gaps = 10/265 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQACSIESAA+ NP Q++ V + L+D + Y+NV+ ++L
Sbjct: 1 MNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQKLLVDAIESYENVNFRHLNLR 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
Y NTP+ + + +L S +P+ H SDLLR ++LY++GG YLD+D IV+KSLE L N
Sbjct: 61 DYAMNTPIEDWVKRGELLNSSYPMQHTSDLLRLISLYRFGGIYLDMDIIVLKSLEKLPLN 120
Query: 218 YAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GAES+ +A GVI L D H ++ L++ + F ++ NGP ++T +L C
Sbjct: 121 YVGAESNYTLANGVISLTADGIGHEVAELFLQQYQKYFNGNDYVQNGPTLVTAVLLKYCG 180
Query: 275 P---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
++ SC+ F ++ FY + W+ W + TM+ +DS +H+WN
Sbjct: 181 TSLMKAIEGGRKSCKGFRLFNTTAFYSIPWQEWKHFTEPRYLEETMARTKDSLMIHIWNI 240
Query: 332 FTKRVPVKLGSEQPYAQIARRYCPR 356
++R +K+ S Y + A ++CPR
Sbjct: 241 ASRRERIKVASNIAYVKYAEKHCPR 265
>gi|321477481|gb|EFX88440.1| hypothetical protein DAPPUDRAFT_305581 [Daphnia pulex]
Length = 370
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 148/268 (55%), Gaps = 12/268 (4%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL-IDRLYEYQ 148
FF ETS A L RQACS+ES A++NP + V+++ S ++ N + L Y+
Sbjct: 103 FFHETSGVRA----LDFRQACSVESLAIVNP--HLTVILLMSGKDIDWNSTTMKTLGNYE 156
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
N+ I ++LG YF ++P +Y + +SH+SD LR++TLY YGG Y DLD I+
Sbjct: 157 NIKIYNINLGDYFIHSPFRQWYFCSTWNYGSFAVSHLSDALRFLTLYNYGGYYFDLDIIM 216
Query: 209 IKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
++ + N+ AE+ +AAG +H+D H + A+ E R+ ++ WG NGP +LTR+
Sbjct: 217 VQPVTHYRNFIVAENEKNLAAGALHVDYLHPVIRMAVEEFRETYRKDIWGHNGPLLLTRV 276
Query: 269 LKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
+ C ++ A N SC F I PP+ FYP+ W W Y E + ++ D+ +H
Sbjct: 277 MTKWCLTENIAEMNSDSCYGFKILPPKTFYPIEWPKWLRYFLEQD----ITWDNDTIGIH 332
Query: 328 VWNSFTKRVPVKLGSEQPYAQIARRYCP 355
VWN + V SEQ Y ++AR CP
Sbjct: 333 VWNQKSAGQAVSKTSEQVYTKLARFQCP 360
>gi|157125005|ref|XP_001654206.1| lactosylceramide 4-alpha-galactosyltransferase (alpha-
1,4-galactosyltransferase) [Aedes aegypti]
gi|108882735|gb|EAT46960.1| AAEL001900-PA [Aedes aegypti]
Length = 371
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/275 (34%), Positives = 154/275 (56%), Gaps = 7/275 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVY-VVVIASVRNRTRNPLIDRLYE 146
+IFF+ T+ T+ ++LT RQAC+IESAA NP +V+ + V A N + +P I L +
Sbjct: 87 NIFFILTTLTNDGTIQLTPRQACAIESAARANPDWKVFPLFVFAKWFNISSDPFIPSLLQ 146
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
+ N+ + V+L + +P+ + A+ S + + H +D+LR +TLYKYGGTYLD D
Sbjct: 147 FCNIRMRHVNLDTFAVGSPVEKLFADGALGKSSFIVEHTADVLRLLTLYKYGGTYLDTDV 206
Query: 207 IVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGP 262
+V ++L L NY G+E S VA GVI+L+ H + + L +L NF T+W ANGP
Sbjct: 207 VVRRTLNMLQPNYLGSEGSGYVANGVINLEASGYGHEFAESCLNDLALNFDGTQWAANGP 266
Query: 263 GVLTRLLKAECKPQSYAH-NIISCRN-FTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
++TR L+ C + N C T+YPP FY + + + + M+
Sbjct: 267 FMVTRNLRKFCNVTDVSQMNRQQCGGQLTVYPPDVFYRIRYPRHDWFFYPERSEVVMNSI 326
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
++ +H+WN T + +K+ S Y Q+A+++CP
Sbjct: 327 KNDVLVHMWNKATSGIQLKITSTAAYIQLAQQFCP 361
>gi|332376428|gb|AEE63354.1| unknown [Dendroctonus ponderosae]
Length = 358
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 156/282 (55%), Gaps = 14/282 (4%)
Query: 88 SIFFLETSCTHADGVELTL--RQACSIESAAMMNPGVQVYVVV----IASVRNRTRNPLI 141
+IFFLET+C +L + RQAC++ESAA +NP + V+++ I N + L+
Sbjct: 67 NIFFLETTCNSFRRGKLFIKARQACAVESAARLNPDMDVHLLFASPGILKDENTQSDRLL 126
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L Y NV ++ DL ++ Q++P+ ++ I S +P++H+SD+LR +TL+K+GG Y
Sbjct: 127 HSLTAYPNVKLLHFDLQQFIQDSPVEELWSSGRIKESRYPVAHVSDILRLLTLWKFGGIY 186
Query: 202 LDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEW 257
LDLD IV+KSL +L N+AGA+S +VA GV+ K H L + NF W
Sbjct: 187 LDLDVIVLKSLTTLPENFAGAQSVDLVANGVMGFSRTGKGHQYMQECLEDAALNFNGIIW 246
Query: 258 GANGPGVLTRLLKAECKPQSYAHNII--SCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
G NGP ++TR + C SY I C F I PP FY + +++W + E A
Sbjct: 247 GENGPILITRNIFKHCSKFSYPMLIRFGICDEFRIIPPSGFYLLPYQNWQLFFEEDLAGN 306
Query: 316 TMSLFR-DSYALHVWNSFTKRVPV-KLGSEQPYAQIARRYCP 355
+S +SY +HVWN + + K PY Q+A +YCP
Sbjct: 307 IVSYAEANSYLVHVWNKLSLNWTILKRDLNVPYLQLASKYCP 348
>gi|195385990|ref|XP_002051687.1| GJ10950 [Drosophila virilis]
gi|194148144|gb|EDW63842.1| GJ10950 [Drosophila virilis]
Length = 273
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 88/256 (34%), Positives = 142/256 (55%), Gaps = 7/256 (2%)
Query: 108 QACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLH 167
QAC IESAA+ NP QV+V+ P+ID L Y+NVH ++L Y Q+TP+
Sbjct: 8 QACPIESAALHNPNFQVFVLFACPTHRNKSMPIIDALLSYKNVHFRSLNLDLYAQDTPIA 67
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSV 226
+ + + TS + + H+SDLLR +TLY+YGG YLD+D + ++SLE N+AGAE +
Sbjct: 68 DWVKKGDLYTSRYLIFHLSDLLRLITLYRYGGVYLDMDVLQLRSLEDEPLNFAGAERADS 127
Query: 227 VAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS---YAH 280
+ VI L+ + H L L++ + N+ + W NGP L R+L C ++ +
Sbjct: 128 IGNSVISLEPNGFGHQLGELFLQDFQKNYDSDAWAHNGPMGLVRVLSEICGTKNVTLMVN 187
Query: 281 NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKL 340
N C F ++ FY V ++ + N N+ T++ ++S +H+WN ++ P+K+
Sbjct: 188 NRKRCHGFRVFDINAFYEVKFDECIMFFNPENSTETLARTKNSRMVHIWNHIVRKWPLKI 247
Query: 341 GSEQPYAQIARRYCPR 356
S+ Y A ++CPR
Sbjct: 248 DSKSAYMHWAAQHCPR 263
>gi|170029757|ref|XP_001842758.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
gi|167864077|gb|EDS27460.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
Length = 371
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/276 (34%), Positives = 151/276 (54%), Gaps = 9/276 (3%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV-RNRTRNPLIDRLYE 146
+IFF+ ++ + ++LT RQAC+IESAA N +V+ + +A+ N + N I L
Sbjct: 87 NIFFIVSTLSPEGVIKLTARQACAIESAARSNADWKVFPLFVATTWFNSSNNEFISPLLR 146
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y N+H+ +DL + TPL +T+ A+ S + + H +D+LR + LYKYGGTYLD D
Sbjct: 147 YCNIHMRYIDLETFAFGTPLESLFTKHALQNSSYIVEHTADVLRLLVLYKYGGTYLDTDV 206
Query: 207 IVIKSLE-SLHNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTEWGANGP 262
IV +S + L NY G+E S VA GVI+L+ H + + L +L ++F W ANGP
Sbjct: 207 IVRRSFDLLLPNYLGSEGSGYVANGVINLEATGYGHRFAESCLNDLAEHFDGQVWAANGP 266
Query: 263 GVLTRLLKAECKPQSYAHNIISCR---NFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
++TR L+ C A N+ R +++PP FY + + + A M+
Sbjct: 267 FMVTRNLQKFCNVSEVA-NMTRARCGGQLSVHPPDVFYRIRYPRHDWFFYPERTEAVMTS 325
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+D +H+WN T + +K+ S Y ++A YCP
Sbjct: 326 IKDDILVHMWNKATSGIQLKVNSTAAYVKLAHEYCP 361
>gi|195385996|ref|XP_002051690.1| GJ10917 [Drosophila virilis]
gi|194148147|gb|EDW63845.1| GJ10917 [Drosophila virilis]
Length = 272
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/256 (34%), Positives = 141/256 (55%), Gaps = 8/256 (3%)
Query: 108 QACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLH 167
QAC+IESAA+ NP QV+V+ S+RN + P+ID L Y+NV ++L RY ++TP+
Sbjct: 8 QACAIESAALHNPKFQVFVLFACSIRNESV-PIIDALLSYKNVQFRSINLDRYAEDTPIA 66
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-NYAGAESSSV 226
+ + + S + + H+SDLLR +TLY++GG Y+D+D +V++SLE N+AGAE +
Sbjct: 67 DWLKKGDLFKSSYLMFHLSDLLRLITLYRFGGVYMDMDVLVLRSLEEEPLNFAGAERADS 126
Query: 227 VAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS---YAH 280
+ GVI L+ + H L L++ + N++ W NGP L R+L C + +
Sbjct: 127 IGNGVIGLEPNGFGHQLCELFLQDFQVNYRGETWAHNGPMGLVRVLSEICGTNNVTLMVN 186
Query: 281 NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKL 340
N C+ F ++ FY V W+ W + A + +S +H+WN P++
Sbjct: 187 NRQRCQGFKVFDVNAFYEVPWQEWRLFFQPETALFVRARTENSIMVHIWNHVVTEWPLQT 246
Query: 341 GSEQPYAQIARRYCPR 356
S Y A ++CPR
Sbjct: 247 NSITAYMMWAAQHCPR 262
>gi|195118545|ref|XP_002003797.1| GI21097 [Drosophila mojavensis]
gi|193914372|gb|EDW13239.1| GI21097 [Drosophila mojavensis]
Length = 274
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 143/265 (53%), Gaps = 10/265 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQACSIESAA+ NP +Q++ V + L+D + Y+NV+ ++L
Sbjct: 1 MNLTARQACSIESAALNNPNLQIFTVFSCPTYRPLVGRQKLLVDAIESYENVNFRHLNLR 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
Y +TP+ + + +L S +P+ H SDLLR ++LY++GG YLD+D IV+KSLE L N
Sbjct: 61 DYAMSTPIEDWVKRGELLNSSFPMEHTSDLLRLISLYRFGGIYLDMDIIVLKSLEKLPLN 120
Query: 218 YAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GAES+ + VI L D H ++ L F ++ NGP ++T +L C
Sbjct: 121 YVGAESNYSLCNAVIGLAADGIGHEVAELFLERYVKYFNGKDYAQNGPALVTAVLLKFCG 180
Query: 275 P---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
++ SC+ F ++ FY + W+ W + TM+ +DS +H+WN
Sbjct: 181 TSLMKAIEGGRKSCKGFRLFNTTAFYSIPWQEWKHFTEPRYLEETMARTKDSLMIHIWNI 240
Query: 332 FTKRVPVKLGSEQPYAQIARRYCPR 356
++R +K+ S Y + A +YCPR
Sbjct: 241 ASRRERIKVASNIAYVKYAEKYCPR 265
>gi|195385994|ref|XP_002051689.1| GJ10928 [Drosophila virilis]
gi|194148146|gb|EDW63844.1| GJ10928 [Drosophila virilis]
Length = 379
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 151/283 (53%), Gaps = 11/283 (3%)
Query: 85 PDNSIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL 140
P +IFF+E+SC ++ + QAC IESAA+ NP QV+++ P+
Sbjct: 87 PGKTIFFIESSCYRSESEYNMSSIKAHQACPIESAALHNPNFQVFLLFACPTHRNKSMPI 146
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
ID L Y+NVH ++L RY Q TP+ + + + S + + H+SDLLR +TLY++GG
Sbjct: 147 IDALLSYKNVHFRSLNLDRYAQGTPIADWLKKGDLFKSRYLMFHLSDLLRLLTLYRFGGV 206
Query: 201 YLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTE 256
YLD+D + ++SLE NYAGAE + + VI L+ + H L L++ N+
Sbjct: 207 YLDMDVLQLQSLEDEPLNYAGAERADSIGNSVISLEPNGFGHQLGELFLQDFHVNYNGDA 266
Query: 257 WGANGPGVLTRLLKAECKPQS---YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNA 313
W NGP L R++ C + +N C+ F ++ FY V W W + N ++
Sbjct: 267 WAHNGPMGLVRVMSEICGTNNVTLMVNNRQRCQGFKVFDVNAFYEVPWPQWPLFFNAESS 326
Query: 314 PATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T++ ++S +H+WN + P+K S+ PY A ++CPR
Sbjct: 327 NETLARTKNSRMVHIWNHIVTKWPLKTDSKSPYMHWAAQHCPR 369
>gi|195385998|ref|XP_002051691.1| GJ10906 [Drosophila virilis]
gi|194148148|gb|EDW63846.1| GJ10906 [Drosophila virilis]
Length = 274
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 89/270 (32%), Positives = 146/270 (54%), Gaps = 20/270 (7%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASV---RNRTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQAC+IESA + NP QV+V+ + + LID + Y+NV +++
Sbjct: 1 MNLTARQACAIESAGLNNPNFQVFVLFSCPTYRPLSGGQKLLIDAIEIYKNVRFRHLNIR 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
Y +TP+ + + +L S +P+ H +DLLR ++LY++GG YLD+D +V++SLE+ N
Sbjct: 61 NYASDTPVEDWIKKGDLLNSSFPMQHTADLLRLISLYRFGGIYLDMDVVVLRSLENEPLN 120
Query: 218 YAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GA + + VI L+ K H ++ LR+ N+ E+ NGP ++TR++K C
Sbjct: 121 YVGAHDNITLGNAVIGLEPTGKGHEIAELFLRDYEKNYNGKEYVQNGPALVTRVVKKLC- 179
Query: 275 PQSYAHNII--------SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYAL 326
NI+ SC+ ++ FYP W W + TM++ +DSY +
Sbjct: 180 ----GDNIVKLIEEGRTSCQGLKVFNSTAFYPFGWPQWMHFTEPKYLKETMTITKDSYLI 235
Query: 327 HVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
H+WN + R +++GS A+R+CPR
Sbjct: 236 HLWNKASYRGLIRVGSNTALGIYAKRHCPR 265
>gi|195385992|ref|XP_002051688.1| GJ10939 [Drosophila virilis]
gi|194148145|gb|EDW63843.1| GJ10939 [Drosophila virilis]
Length = 390
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 152/284 (53%), Gaps = 13/284 (4%)
Query: 85 PDNSIFFLETSCTHADGVELTL-----RQACSIESAAMMNPGVQVYVVVIASVRNRTRNP 139
P +IFF+E+SC + G E + QAC IESAA+ NP QV+++ P
Sbjct: 98 PGQTIFFIESSC-YRSGSEYNMSSIKAHQACPIESAALHNPNFQVFLLFACPTHRNKSMP 156
Query: 140 LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGG 199
+ID L Y+NVH ++L RY Q TP+ + + + S + + H+SDLLR +TLY++GG
Sbjct: 157 IIDALLSYKNVHFRSLNLDRYAQGTPIADWLKKGDLFKSRYLMFHLSDLLRLLTLYRFGG 216
Query: 200 TYLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTT 255
YLD+D + ++SLE NYAGAE + + VI L+ + H L L++ N+
Sbjct: 217 VYLDMDVLQLQSLEDEPLNYAGAERADSIGNSVISLEPNGFGHQLGELFLQDFHVNYNGD 276
Query: 256 EWGANGPGVLTRLLKAECKPQS---YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN 312
W NGP L R++ C + +N C+ F ++ FY V W W + N +
Sbjct: 277 AWAHNGPMGLVRVMSEICGTNNVTLMVNNRQRCQGFKVFDVNAFYEVPWPQWPLFFNAES 336
Query: 313 APATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ T++ ++S +H+WN + P+K S+ PY A ++CPR
Sbjct: 337 SNETLARTKNSRMVHIWNHIVTKWPLKTDSKSPYMHWAGQHCPR 380
>gi|195118547|ref|XP_002003798.1| GI21086 [Drosophila mojavensis]
gi|193914373|gb|EDW13240.1| GI21086 [Drosophila mojavensis]
Length = 274
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 143/265 (53%), Gaps = 10/265 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQACSIESAA+ NP Q++ V + L+D + Y+NV+ ++L
Sbjct: 1 MNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQKLLVDAIESYENVNFRHLNLR 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
Y NTP+ + + +L S +P+ H SDLLR ++LY++GG YLD+D IV+KSLE L N
Sbjct: 61 DYAMNTPIEDWVKRGELLNSSYPMQHTSDLLRLISLYRFGGIYLDMDIIVLKSLEKLPLN 120
Query: 218 YAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GAES+ + VI L D H ++ L++ + F ++ NGP ++T +L C
Sbjct: 121 YVGAESNYSLCNAVIGLAADGIGHEVAELFLQQYQKYFNGNDYVQNGPTLVTAVLLKYCG 180
Query: 275 P---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
++ SC+ F ++ FY + W W + TM+ +DS +H+WN
Sbjct: 181 TSLMKAIEGGRKSCKGFRLFNTTAFYSIPWPEWRHFTEPRYLEETMARTKDSLMIHMWNK 240
Query: 332 FTKRVPVKLGSEQPYAQIARRYCPR 356
++ +K+GS Y + + + CPR
Sbjct: 241 LSRGERIKVGSNAAYVKYSEKNCPR 265
>gi|195118541|ref|XP_002003795.1| GI21116 [Drosophila mojavensis]
gi|193914370|gb|EDW13237.1| GI21116 [Drosophila mojavensis]
Length = 274
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 143/265 (53%), Gaps = 10/265 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQACSIESAA+ NP Q++ V + L+D + Y+NV+ ++L
Sbjct: 1 MNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQKLLVDAIESYENVNFRHLNLR 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
Y +TP+ + + +L S +P+ H SDLLR ++LY++GG YLD+D IV+K LE L N
Sbjct: 61 DYAMSTPIEDWVKRGELLNSSYPMEHTSDLLRLISLYRFGGIYLDMDIIVLKRLEKLPLN 120
Query: 218 YAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GA+S+ +A GVI L D H ++ L++ + F + NGP ++T +L C
Sbjct: 121 YVGAQSNYTLANGVIGLTADGIGHEVAELFLQQYQKYFNGKNYVQNGPTLVTAVLLKYCG 180
Query: 275 P---QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
++ SC+ F ++ FY + W W + TM+ +DS +H+WN
Sbjct: 181 TSLMKAIEGGRKSCKGFRLFNTTAFYSIPWPEWRHFTEPRYLEETMARTKDSLMIHMWNK 240
Query: 332 FTKRVPVKLGSEQPYAQIARRYCPR 356
++ +K+GS Y + A + CPR
Sbjct: 241 VSRGERIKVGSNAAYVKYAEKKCPR 265
>gi|74194122|dbj|BAE36956.1| unnamed protein product [Mus musculus]
Length = 359
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 98/278 (35%), Positives = 142/278 (51%), Gaps = 9/278 (3%)
Query: 81 RLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---R 137
R+ P N IFFLETS D CS+ESAA +P QV V++ R+ T R
Sbjct: 78 RVSAPGN-IFFLETS----DRTSPNFLFMCSVESAARAHPESQVVVLMKGLPRDTTAQPR 132
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N I L + NV I +DL F++TPL +Y++ + L +SD R L+K+
Sbjct: 133 NLGISLLSCFPNVWIRPLDLQELFEDTPLAAWYSEARHRWEPYQLPVLSDASRIALLWKF 192
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
GG YLD DFIV+K+L +L N G +S V+ + ++ H L + N+ W
Sbjct: 193 GGIYLDTDFIVLKNLLNLTNTLGIQSRYVLNGAFLAFERKHEFLALCLHDFVANYNGWIW 252
Query: 258 GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
G GP +LTR+ K C QS + +CR T PP FYP+ W++W Y + +
Sbjct: 253 GHQGPQLLTRVFKKWCSIQSLEKS-HACRGVTALPPEAFYPIPWQNWKKYFEDISPEELT 311
Query: 318 SLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
L +YA+HVWN ++ ++ S+ AQ+ RYCP
Sbjct: 312 QLLNATYAVHVWNKKSQGTHLEATSKALLAQLHARYCP 349
>gi|51921295|ref|NP_001004150.1| lactosylceramide 4-alpha-galactosyltransferase [Mus musculus]
gi|283483963|ref|NP_001164425.1| lactosylceramide 4-alpha-galactosyltransferase [Mus musculus]
gi|59797925|sp|Q67BJ4.1|A4GAT_MOUSE RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Alpha4Gal-T1; AltName: Full=Globotriaosylceramide
synthase; Short=Gb3 synthase; AltName:
Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|38350359|gb|AAR18365.1| Gb3 synthase [Mus musculus]
gi|74226011|dbj|BAE28761.1| unnamed protein product [Mus musculus]
gi|148672530|gb|EDL04477.1| alpha 1,4-galactosyltransferase [Mus musculus]
gi|187951225|gb|AAI38846.1| Alpha 1,4-galactosyltransferase [Mus musculus]
gi|187952061|gb|AAI38845.1| Alpha 1,4-galactosyltransferase [Mus musculus]
Length = 359
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 98/278 (35%), Positives = 142/278 (51%), Gaps = 9/278 (3%)
Query: 81 RLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---R 137
R+ P N IFFLETS D CS+ESAA +P QV V++ R+ T R
Sbjct: 78 RVSAPGN-IFFLETS----DRTSPNFLFMCSVESAARAHPESQVVVLMKGLPRDTTAQPR 132
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N I L + NV I +DL F++TPL +Y++ + L +SD R L+K+
Sbjct: 133 NLGISLLSCFPNVWIRPLDLQELFEDTPLAAWYSEARHRWEPYQLPVLSDASRIALLWKF 192
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
GG YLD DFIV+K+L +L N G +S V+ + ++ H L + N+ W
Sbjct: 193 GGIYLDTDFIVLKNLLNLTNTLGIQSRYVLNGAFLAFERKHEFLALCLHDFVANYNGWIW 252
Query: 258 GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
G GP +LTR+ K C QS + +CR T PP FYP+ W++W Y + +
Sbjct: 253 GHQGPQLLTRVFKKWCSIQSLEKS-HACRGVTALPPEAFYPIPWQNWKKYFEDISPEELT 311
Query: 318 SLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
L +YA+HVWN ++ ++ S+ AQ+ RYCP
Sbjct: 312 QLLNATYAVHVWNKKSQGTHLEATSKALLAQLHARYCP 349
>gi|75706609|gb|ABA25853.1| alpha-1,4-galactosyltransferase [Mus musculus]
Length = 348
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 98/278 (35%), Positives = 142/278 (51%), Gaps = 9/278 (3%)
Query: 81 RLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---R 137
R+ P N IFFLETS D CS+ESAA +P QV V++ R+ T R
Sbjct: 67 RVSAPGN-IFFLETS----DRTSPNFLFMCSVESAARAHPESQVVVLMKGLPRDTTAQPR 121
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N I L + NV I +DL F++TPL +Y++ + L +SD R L+K+
Sbjct: 122 NLGISLLSCFPNVWIRPLDLQELFEDTPLAAWYSEARHRWEPYQLPVLSDASRIALLWKF 181
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
GG YLD DFIV+K+L +L N G +S V+ + ++ H L + N+ W
Sbjct: 182 GGIYLDTDFIVLKNLLNLTNTLGIQSRYVLNGAFLAFERKHEFLALCLHDFVANYNGWIW 241
Query: 258 GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
G GP +LTR+ K C QS + +CR T PP FYP+ W++W Y + +
Sbjct: 242 GHQGPQLLTRVFKKWCSIQSLEKS-HACRGVTALPPEAFYPIPWQNWKKYFEDISPEELT 300
Query: 318 SLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
L +YA+HVWN ++ ++ S+ AQ+ RYCP
Sbjct: 301 QLLNATYAVHVWNKKSQGTHLEATSKALLAQLHARYCP 338
>gi|157125007|ref|XP_001654207.1| lactosylceramide 4-alpha-galactosyltransferase (alpha-
1,4-galactosyltransferase) [Aedes aegypti]
gi|108882736|gb|EAT46961.1| AAEL001895-PA [Aedes aegypti]
Length = 404
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 96/281 (34%), Positives = 151/281 (53%), Gaps = 13/281 (4%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA----SVRNRTRNPL 140
P+ +IFF+ ++ D +LT RQ+C+IESAA N V+V+ + S +N T +
Sbjct: 117 PEKNIFFVMSTVIEDDIAKLTPRQSCAIESAARANSDWSVFVLFTSARMFSFQNSTN--M 174
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+ L+ Y N+H ++++ Y TPL F+ +++ SL+ + H SD+LR +TLYKYGGT
Sbjct: 175 VPLLF-YSNIHFRRLNMETYAIGTPLEKFFRDNSLRNSLFIVEHTSDVLRLLTLYKYGGT 233
Query: 201 YLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDK---DHWLSGAALRELRDNFKTTE 256
YLD D +V+ SL L HNY G+E +A G+I+L H ++ A L E+ +NF +
Sbjct: 234 YLDSDVVVMDSLNELPHNYLGSEGDGYIANGIINLQATGYGHTVAEAFLNEIAENFNGSV 293
Query: 257 WGANGPGVLTRLLKAECKPQS-YAHNIISC-RNFTIYPPRFFYPVHWEHWADYLNETNAP 314
W ANGP ++TR+++ C + + C +I PP F+ V + Y E +A
Sbjct: 294 WAANGPAMVTRVMRKFCNVTNVWDMTRERCGGKMSILPPDTFFQVTYPRHTWYFEEAHAS 353
Query: 315 ATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
M H+WN T + ++ S Y +A YCP
Sbjct: 354 EVMEKVAGHILTHLWNKLTGGIVLRKDSPVAYIILANVYCP 394
>gi|170029759|ref|XP_001842759.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
gi|167864078|gb|EDS27461.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
Length = 341
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 148/275 (53%), Gaps = 7/275 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRN-PLIDRLYE 146
+IFF+ ++ A +L RQ C+IESAA NP +V+V+ +++ N P I L
Sbjct: 57 NIFFIISTIFQAGLAKLNPRQTCAIESAARSNPEWKVFVLFVSATEFSFFNSPNIVPLLP 116
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y NV + +V++ + +P+ + + A+ S + + H SD+LR +T+YKYGGTYLD D
Sbjct: 117 YPNVFLRRVNMTTFGLGSPVEEMFERGALRNSSFIVEHTSDVLRLLTVYKYGGTYLDTDV 176
Query: 207 IVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGANGP 262
+V+KSL+ L NY +E VA G+I+L H L+ + LR++ N+ TEW ANGP
Sbjct: 177 VVMKSLDELPLNYLVSEGDGFVANGIINLQASGVGHTLAESMLRDVAKNYSATEWAANGP 236
Query: 263 GVLTRLLKAECKPQSYAHNI-ISC-RNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
++TR+L+ C H C F + PP F+ V + H + Y M
Sbjct: 237 FLVTRILRQYCNVTEPWHMTREQCGGQFGVLPPDQFFQVFYPHQSWYFEANRTREVMERM 296
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ H+WN T + +K ++ Y ++AR YCP
Sbjct: 297 KGKVLTHLWNKLTNGIKLKKDADAAYIELARLYCP 331
>gi|11560038|ref|NP_071576.1| lactosylceramide 4-alpha-galactosyltransferase [Rattus norvegicus]
gi|59797638|sp|Q9JI93.1|A4GAT_RAT RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Alpha4Gal-T1; AltName: Full=Globotriaosylceramide
synthase; Short=Gb3 synthase; AltName:
Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|9082162|gb|AAF82758.1|AF248544_1 Gb3 synthase [Rattus norvegicus]
gi|67677925|gb|AAH97323.1| Alpha 1,4-galactosyltransferase [Rattus norvegicus]
gi|149065761|gb|EDM15634.1| alpha 1,4-galactosyltransferase, isoform CRA_a [Rattus norvegicus]
gi|149065762|gb|EDM15635.1| alpha 1,4-galactosyltransferase, isoform CRA_a [Rattus norvegicus]
Length = 360
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 141/278 (50%), Gaps = 9/278 (3%)
Query: 81 RLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---R 137
R+ P N IFFLETS D + CS+ESAA +P QV V++ R+ T R
Sbjct: 79 RVSAPGN-IFFLETS----DRTNPSFLFMCSVESAARAHPESQVVVLMKGLPRDTTAWPR 133
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N I L + NV I +DL F++TPL +Y + + L +SD R L+K+
Sbjct: 134 NLGISLLSCFPNVQIRPLDLQELFEDTPLAAWYLEAQHRWEPYLLPVLSDASRIALLWKF 193
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
GG YLD DFIV+K+L +L N G +S V+ + ++ H +R+ ++ W
Sbjct: 194 GGIYLDTDFIVLKNLRNLTNMLGIQSRYVLNGAFLAFERKHEFLALCIRDFVAHYNGWIW 253
Query: 258 GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
G GP +LTR+ K C S + +CR T PP FYP+ W++W Y + +
Sbjct: 254 GHQGPQLLTRVFKKWCSIHSLKES-RACRGVTALPPEAFYPIPWQNWKKYFEDVSPEELA 312
Query: 318 SLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
L +YA+HVWN ++ ++ S AQ+ RYCP
Sbjct: 313 QLLNATYAVHVWNKKSQGTHLEATSRALLAQLHARYCP 350
>gi|344247208|gb|EGW03312.1| Lactosylceramide 4-alpha-galactosyltransferase [Cricetulus griseus]
Length = 348
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 138/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---RNPLIDR 143
+IFFLETS D + + CS+ESAA +P QV V++ R +T RN I
Sbjct: 72 GNIFFLETS----DRTKPSFLFMCSVESAARAHPETQVVVLMKGLHRYKTALPRNLGISL 127
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV I +DL FQ+TPL +Y + + L +SD R L+K+GG YLD
Sbjct: 128 LRCFPNVQIRPLDLKELFQDTPLAAWYLKVQHSWEPYLLPVLSDASRIALLWKFGGIYLD 187
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ DN+ WG GP
Sbjct: 188 TDFIVLKNLRNLTNTLGVQSRYVLNGAFLAFERHHNFLALCIRDFVDNYNGWIWGHQGPQ 247
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S +CR PP FYP+ W++W Y + + L +
Sbjct: 248 LLTRVFKKWCSIRSLKET-HTCRGVIALPPEAFYPIPWQNWKKYFEDISPEELARLLNAT 306
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 307 YAVHVWNKKSQGTHLDSMSRALLAQLYGRYCP 338
>gi|354491879|ref|XP_003508081.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Cricetulus griseus]
Length = 466
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 138/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---RNPLIDR 143
+IFFLETS D + + CS+ESAA +P QV V++ R +T RN I
Sbjct: 190 GNIFFLETS----DRTKPSFLFMCSVESAARAHPETQVVVLMKGLHRYKTALPRNLGISL 245
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV I +DL FQ+TPL +Y + + L +SD R L+K+GG YLD
Sbjct: 246 LRCFPNVQIRPLDLKELFQDTPLAAWYLKVQHSWEPYLLPVLSDASRIALLWKFGGIYLD 305
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ DN+ WG GP
Sbjct: 306 TDFIVLKNLRNLTNTLGVQSRYVLNGAFLAFERHHNFLALCIRDFVDNYNGWIWGHQGPQ 365
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S +CR PP FYP+ W++W Y + + L +
Sbjct: 366 LLTRVFKKWCSIRSLKET-HTCRGVIALPPEAFYPIPWQNWKKYFEDISPEELARLLNAT 424
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 425 YAVHVWNKKSQGTHLDSMSRALLAQLYGRYCP 456
>gi|442748467|gb|JAA66393.1| Putative alpha-14-n-acetylglucosaminyltransferase [Ixodes ricinus]
Length = 345
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 146/299 (48%), Gaps = 11/299 (3%)
Query: 58 YRVQNINCVLSGVEKDFAFD-DITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAA 116
Y +N+ C E D D+ L + FFLET A L RQ+CSIESAA
Sbjct: 45 YHSRNMICACQPGENAILTDRDLYELTKSGRNFFFLET----AGSTCLNSRQSCSIESAA 100
Query: 117 MMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAIL 176
NP ++++ + R R +D L +N + ++D+ +TPL+G+Y DA +
Sbjct: 101 RQNPEFTIFLLTLWGTR---RCRYLDHLKSLRNFRLARIDVKSLVNDTPLNGWYHSDAWI 157
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
S + +H SD LR + L+KYGG Y DLD +V+KS+ + N E ++ ++ K
Sbjct: 158 VSPFRTNHFSDALRLLVLWKYGGVYADLDTLVLKSVANPQNSVSRELFPLIGNSMMSFQK 217
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHN-IISCRNFTIYPPRF 295
H A L+E N+K W NGP +L R+LK C + + C + +I P
Sbjct: 218 GHPFLLACLQEFAINYKPRRWAYNGPRLLERVLKTWCPKEPVMQQPYVYCSDVSILPGEA 277
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
FYPV + W + A L +SYA+H+WN+ +K ++ GS Y + R C
Sbjct: 278 FYPVPYTEWKLPFQASEASHVAMLLSNSYAIHLWNALSKITRIEEGSA--YDVLQRNVC 334
>gi|327273441|ref|XP_003221489.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Anolis carolinensis]
Length = 353
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 138/273 (50%), Gaps = 8/273 (2%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---ID 142
I+FLETS + CS+ESAA ++P +++ +++ V P I
Sbjct: 76 SKDIYFLETS----ERTNPNFLFMCSVESAARVHPELKIVILMKGLVNYNNTLPKHLGIS 131
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
+ N+ I +DL F NTPL +Y+ + L +SD R ++KYGG YL
Sbjct: 132 FFSCFPNLEIKPLDLNELFSNTPLIRWYSLAQQRWEPYFLPILSDACRIAIMWKYGGIYL 191
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D DFIV+K+L++L N G +S V+ + + H + E +N+ WG GP
Sbjct: 192 DTDFIVLKNLKNLINTLGIQSKYVLNGAFLSFEPKHKFIQLCMEEFVNNYNRWIWGHQGP 251
Query: 263 GVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRD 322
+ TR+ K C +S + SC+ T +P FYP+HW+ W Y TNA LF++
Sbjct: 252 QLFTRMFKKWCAIRSLQSST-SCKGVTTFPQEAFYPIHWQDWRKYYEVTNASELPKLFKN 310
Query: 323 SYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ ++ SE AQ+ +YCP
Sbjct: 311 TYAVHVWNMKSQGKQFEITSETLLAQLHSKYCP 343
>gi|195034296|ref|XP_001988865.1| GH10342 [Drosophila grimshawi]
gi|193904865|gb|EDW03732.1| GH10342 [Drosophila grimshawi]
Length = 274
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 147/265 (55%), Gaps = 10/265 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVV-IASVR--NRTRNPLIDRLYEYQNVHIVQVDLG 158
+ LT RQAC+IESAA+ NP V+V+ ++R + + LI+ + NVH Q++L
Sbjct: 1 MNLTARQACAIESAALHNPNFDVFVLFSCPTLRPLSIAQKSLINVIESIPNVHFRQLNLQ 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH-N 217
+Y +TP+ + + +L +P+ H SDLLR ++LY+YGG Y+DLD +V++SLE + N
Sbjct: 61 KYATDTPVEDWIKKGYMLKGRYPMEHTSDLLRLISLYRYGGIYIDLDVVVLRSLEDVPLN 120
Query: 218 YAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK 274
Y GA + + GV+ ++ H ++ LR+ + N+ E+ NGP + R+++A C
Sbjct: 121 YVGAFDNVTLGNGVLSVEPTGTGHEIAELFLRDFKSNYTGEEYTRNGPQGIRRVVRAICG 180
Query: 275 PQ---SYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+ + CR F ++ FY + ++ W + TM +DSY +H+WN+
Sbjct: 181 VEIVKAIEEGRKICRGFQVFNSTAFYALPYQQWRHSTDPEFLEDTMEKTKDSYLIHLWNN 240
Query: 332 FTKRVPVKLGSEQPYAQIARRYCPR 356
+ + K+GS Y + A +CP+
Sbjct: 241 LSHKKLFKVGSNTAYGKYAEIHCPK 265
>gi|242000458|ref|XP_002434872.1| secreted protein, putative [Ixodes scapularis]
gi|215498202|gb|EEC07696.1| secreted protein, putative [Ixodes scapularis]
Length = 293
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 137/268 (51%), Gaps = 8/268 (2%)
Query: 78 DITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR 137
D+ L + FFLET A L RQ+CSIESAA NP ++++ + R R R
Sbjct: 11 DLYELTKSGRNFFFLET----AGSTCLNSRQSCSIESAARQNPEFTIFLLTLWGTR-RCR 65
Query: 138 NPL--IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLY 195
+D L +N + ++D+ +TPL+G+Y DA + S + +H SD LR + L+
Sbjct: 66 YAFMYLDHLQSLRNFRLARIDVNSLVNDTPLNGWYHSDAWIVSPFRTNHFSDALRLLVLW 125
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTT 255
KYGG Y DLD +V++S+ +L N E ++ ++ K H A L+E N+K
Sbjct: 126 KYGGVYADLDTLVLRSVANLQNSVSRERFPLIGNSMMSFQKGHPFLLACLQEFAINYKPR 185
Query: 256 EWGANGPGVLTRLLKAECKPQSYAHN-IISCRNFTIYPPRFFYPVHWEHWADYLNETNAP 314
W NGP +L R+LK C + + C + +I P FYPV + W + A
Sbjct: 186 RWAYNGPRLLERVLKTWCPKEPVMQQPYVDCVDVSILPGEAFYPVSYTEWKLPFQASEAS 245
Query: 315 ATMSLFRDSYALHVWNSFTKRVPVKLGS 342
L +SYA+H+WN+ +K ++ GS
Sbjct: 246 HVAMLLSNSYAIHLWNALSKITRIEEGS 273
>gi|109094415|ref|XP_001107622.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Macaca mulatta]
gi|297261239|ref|XP_002798423.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 2
[Macaca mulatta]
gi|297261241|ref|XP_002798424.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 3
[Macaca mulatta]
Length = 353
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNMLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 253 LLTRVFKKWCSIRSLAES-HTCRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPQLFNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFKATSRALLAQLHARYCP 343
>gi|355785047|gb|EHH65898.1| hypothetical protein EGM_02761 [Macaca fascicularis]
gi|384943442|gb|AFI35326.1| lactosylceramide 4-alpha-galactosyltransferase [Macaca mulatta]
Length = 353
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNMLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 253 LLTRVFKKWCSIRSLAES-HACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPQLFNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFKATSRALLAQLHARYCP 343
>gi|380795485|gb|AFE69618.1| lactosylceramide 4-alpha-galactosyltransferase, partial [Macaca
mulatta]
Length = 322
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 46 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 101
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 102 LSCFPNVQMLPLDLQELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 161
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 162 TDFIVLKNLRNLTNMLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 221
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 222 LLTRVFKKWCSIRSLAES-HACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPQLFNAT 280
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 281 YAVHVWNKKSQGTRFKATSRALLAQLHARYCP 312
>gi|355563733|gb|EHH20295.1| hypothetical protein EGK_03117 [Macaca mulatta]
gi|383414667|gb|AFH30547.1| lactosylceramide 4-alpha-galactosyltransferase [Macaca mulatta]
Length = 353
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNMLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 253 LLTRVFKKWCSIRSLAES-HACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPQLFNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFKATSRALLAQLHARYCP 343
>gi|321465528|gb|EFX76529.1| hypothetical protein DAPPUDRAFT_3999 [Daphnia pulex]
Length = 273
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 142/271 (52%), Gaps = 11/271 (4%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPG--VQVYVVVIASVRNRTRNPLIDRLYE 146
I+F ETS LTLRQ+C++ESAA NP VQ+ + S NR R ++ L
Sbjct: 1 IYFHETSGRS----RLTLRQSCAVESAAKENPDRPVQLIMQTDMSSINR-RGTWLNVLSN 55
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y NV ++ + YF++TPL +Y + S L H +D +R ++ K GG Y+DLDF
Sbjct: 56 YPNVAVILIKEMDYFRDTPLEDWYRKGQWRQSPHKLEHFADYIRMLSSLKGGGLYMDLDF 115
Query: 207 IVIKSLESLHNYAGAESSSV--VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
+ IK L+ + N+ E ++ ++ G+ H D H L + +L ++ EW A+GP +
Sbjct: 116 VTIKQLD-IGNFLAVEDAAANHISNGIFHFDHGHRLIREIVNQLAARYQPEEWNAHGPAL 174
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
+ ++ C ++ C++ + P F YP+HW W Y + N M SY
Sbjct: 175 IFSIMSRICGLKTGQPLSNQCQDVALMPYNFVYPIHWPDWRIYFQKANR-NVMQWINGSY 233
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
A+HVWN + P+ + S+Q YA +A R+CP
Sbjct: 234 AVHVWNKMSHSEPLLINSDQVYATLASRHCP 264
>gi|403308828|ref|XP_003944846.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Saimiri boliviensis boliviensis]
gi|403308830|ref|XP_003944847.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 2
[Saimiri boliviensis boliviensis]
Length = 353
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 135/274 (49%), Gaps = 8/274 (2%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---I 141
P IFFLETS D CS+ESAA +P +V V++ P I
Sbjct: 75 PPGDIFFLETS----DRTNPNFLFMCSVESAARTHPESRVLVLMKGLPGGNASLPRHLGI 130
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG Y
Sbjct: 131 SLLSCFPNVQMLPLDLRELFRDTPLADWYAAAQRRWEPYLLPVLSDASRIALMWKFGGIY 190
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD DFIV+K+L +L N GA+S V+ + ++ H +R+ D++ WG G
Sbjct: 191 LDTDFIVLKNLRNLSNVLGAQSRYVLNGAFLAFERQHEFMALCMRDFVDHYNGWVWGHQG 250
Query: 262 PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
P +LTR+ K C +S + +CR T PP FYP+ W+ W Y + + L
Sbjct: 251 PQLLTRVFKKWCSIRSLTES-HACRGVTALPPEAFYPIPWQDWKKYFEDISPQELPRLLN 309
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ K+ S AQ+ RYCP
Sbjct: 310 ATYAVHVWNKKSQGTHFKVTSRALLAQLHARYCP 343
>gi|57114111|ref|NP_001009123.1| lactosylceramide 4-alpha-galactosyltransferase [Pan troglodytes]
gi|25452795|sp|Q9N291.1|A4GAT_PANTR RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Globotriaosylceramide synthase; Short=Gb3 synthase;
AltName: Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|7593034|dbj|BAA94504.1| alpha-1,4-N-acetylglucosaminyltransferase/alpha-
1,4-galactosyltransferase [Pan troglodytes]
Length = 353
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL FQ+TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFQDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + SCR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RSCRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|321476295|gb|EFX87256.1| hypothetical protein DAPPUDRAFT_4146 [Daphnia pulex]
Length = 271
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 132/263 (50%), Gaps = 16/263 (6%)
Query: 107 RQACSIESAAMMNPGVQVYVVVIASVRNRTRN-------PLIDRLYE-YQNVHIVQVDLG 158
RQAC++ES A NP + V V+ + + + +++L E Y NV + DLG
Sbjct: 1 RQACAVESLAFHNPNLTVNVLFMVDDGHHQKQNQSKVMIKTMEKLREKYLNVQFIVADLG 60
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
Y T L +Y T + ++H+SD LR +TL+KYGG Y DLD I+++ + N+
Sbjct: 61 EYLAGTLLEKWYHCTDWRTGPYHVAHLSDGLRLLTLHKYGGYYFDLDVILVRPVTFYRNF 120
Query: 219 AGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSY 278
AES S VIH D H + A+ + N+ W NGP +L R++K C+ +++
Sbjct: 121 VAAESGSEFGNSVIHADYGHPIMQLAVNDFPSNYNKNAWAHNGPDLLMRVMKTYCREENF 180
Query: 279 -AHNIISCRNFTIYPPRFFYPVHWEHWADYL-----NETNAPATMSLFRDSYALHVWNSF 332
A N +SCR F P F P+HW +W + NET AP ++ + +HVWN
Sbjct: 181 NAINYVSCRGFGALPNSTFSPIHWSNWRSFFSQRPANETGAPGWIT--KQVVGVHVWNKL 238
Query: 333 TKRVPVKLGSEQPYAQIARRYCP 355
+ S Q Y ++ R CP
Sbjct: 239 SFNETAYKNSTQEYVRLVRDNCP 261
>gi|397466996|ref|XP_003805220.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Pan paniscus]
gi|397466998|ref|XP_003805221.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 2
[Pan paniscus]
gi|397467000|ref|XP_003805222.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 3
[Pan paniscus]
Length = 353
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL FQ+TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFQDTPLADWYAAARGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|402884446|ref|XP_003905692.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Papio anubis]
gi|402884448|ref|XP_003905693.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 2
[Papio anubis]
gi|402884450|ref|XP_003905694.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 3
[Papio anubis]
gi|402884452|ref|XP_003905695.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 4
[Papio anubis]
gi|402884454|ref|XP_003905696.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 5
[Papio anubis]
gi|402884456|ref|XP_003905697.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 6
[Papio anubis]
gi|402884458|ref|XP_003905698.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 7
[Papio anubis]
Length = 353
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 253 LLTRVFKKWCSIRSLAES-HACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPQLFNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|392283971|gb|AFM54542.1| alpha 1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G ES V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTESRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|410213716|gb|JAA04077.1| alpha 1,4-galactosyltransferase [Pan troglodytes]
gi|410253608|gb|JAA14771.1| alpha 1,4-galactosyltransferase [Pan troglodytes]
gi|410297134|gb|JAA27167.1| alpha 1,4-galactosyltransferase [Pan troglodytes]
gi|410334721|gb|JAA36307.1| alpha 1,4-galactosyltransferase [Pan troglodytes]
gi|410334723|gb|JAA36308.1| alpha 1,4-galactosyltransferase [Pan troglodytes]
Length = 353
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + SCR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RSCRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|444723844|gb|ELW64473.1| Lactosylceramide 4-alpha-galactosyltransferase [Tupaia chinensis]
Length = 353
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 90/275 (32%), Positives = 141/275 (51%), Gaps = 8/275 (2%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA---SVRNRTRNPL 140
+P +IFFLETS D + CS+ESAA +P +V V++ S + R+
Sbjct: 74 LPPGNIFFLETS----DRISPNFLFTCSVESAARAHPESRVVVLMKGLPGSNASLPRHLG 129
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+ L + NV ++++DL F++TPL +Y + L +SD R L+K+GG
Sbjct: 130 LSLLGCFPNVQLLRLDLEELFRDTPLAAWYASGRRRWEPYLLPVLSDASRLALLWKFGGI 189
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
YLD DFIV+KSL +L N G +S V+ + + H +R+ ++ + WG
Sbjct: 190 YLDTDFIVLKSLRNLSNALGTQSRYVLNGAFLAFQRRHEFLALCMRDFVAHYNSWVWGHQ 249
Query: 261 GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
GP +LTR+ K C +S + +CR T P + FYPV W++W Y + + L
Sbjct: 250 GPQLLTRVFKKWCSIRSLDES-QACRGVTTLPSQAFYPVPWQNWKKYFEDISPEELPRLL 308
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ ++ S AQ+ RYCP
Sbjct: 309 GATYAVHVWNKKSQGTRLQATSRALLAQLQARYCP 343
>gi|449277571|gb|EMC85684.1| Lactosylceramide 4-alpha-galactosyltransferase [Columba livia]
Length = 360
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 136/270 (50%), Gaps = 8/270 (2%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP---LIDRLY 145
+FF+ETS T CS+ESAA +PG +V V++ + P L
Sbjct: 83 VFFVETSEQTNPSYLFT----CSVESAARAHPGTRVVVLMKGLAKGNVSLPSHWAFSLLS 138
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+ NV I +DL F TPL +Y Q + L +SD R ++K+GG YLD D
Sbjct: 139 CFPNVEIRPLDLAELFSGTPLAKWYLQAQHRWEPYFLPVLSDACRIAIMWKFGGIYLDTD 198
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVL 265
FIV+K+L++L N G +S V+ + H +++ +N+ + WG GP +L
Sbjct: 199 FIVLKNLKNLTNVLGTQSKYVLNGAFLSFQPKHKFIELCMQDFVENYNSWIWGHQGPQLL 258
Query: 266 TRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
TR+ K C +S + SC+ + P FYP+ W+ W Y ++ LFR++YA
Sbjct: 259 TRVFKKWCSIRSLRSS-TSCKGVSALPREAFYPIRWQDWKKYFEMVSSSELQHLFRNTYA 317
Query: 326 LHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+HVWN ++ +++ S+ AQ+ ++CP
Sbjct: 318 VHVWNKKSQGTRLEIPSQALLAQLHSQFCP 347
>gi|146217396|gb|ABQ10740.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKTLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|426394713|ref|XP_004063632.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Gorilla gorilla gorilla]
gi|426394715|ref|XP_004063633.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 2
[Gorilla gorilla gorilla]
gi|426394717|ref|XP_004063634.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 3
[Gorilla gorilla gorilla]
gi|426394719|ref|XP_004063635.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 4
[Gorilla gorilla gorilla]
gi|426394721|ref|XP_004063636.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 5
[Gorilla gorilla gorilla]
gi|426394723|ref|XP_004063637.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 6
[Gorilla gorilla gorilla]
gi|426394725|ref|XP_004063638.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 7
[Gorilla gorilla gorilla]
Length = 353
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 134/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +++ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMQDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLFSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|8392830|ref|NP_059132.1| lactosylceramide 4-alpha-galactosyltransferase [Homo sapiens]
gi|25452796|sp|Q9NPC4.1|A4GAT_HUMAN RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Alpha4Gal-T1; AltName: Full=CD77 synthase; AltName:
Full=Globotriaosylceramide synthase; Short=Gb3 synthase;
AltName: Full=P1/Pk synthase; AltName:
Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|7959011|dbj|BAA95915.1| Gb3/CD77 synthase [Homo sapiens]
gi|8250233|emb|CAB93532.1| alpha-4-galactosyltransferase [Homo sapiens]
gi|16877647|gb|AAH17068.1| Alpha 1,4-galactosyltransferase [Homo sapiens]
gi|60459546|gb|AAX20109.1| alpha 1,4-galactosyltransferase [Homo sapiens]
gi|119593680|gb|EAW73274.1| alpha 1,4-galactosyltransferase (globotriaosylceramide synthase)
[Homo sapiens]
gi|123981432|gb|ABM82545.1| alpha 1,4-galactosyltransferase (globotriaosylceramide synthase)
[synthetic construct]
gi|123996271|gb|ABM85737.1| alpha 1,4-galactosyltransferase (globotriaosylceramide synthase)
[synthetic construct]
gi|189066521|dbj|BAG35771.1| unnamed protein product [Homo sapiens]
gi|307685263|dbj|BAJ20562.1| alpha 1,4-galactosyltransferase [synthetic construct]
Length = 353
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|33115183|gb|AAH55286.1| A4GALT protein [Homo sapiens]
gi|34421674|gb|AAQ68076.1| globotriaosylceramide/CD77 synthase [Homo sapiens]
Length = 353
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|7593032|dbj|BAA94503.1| alpha-1,4-N-acetylglucosaminyltransferase/alpha-
1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|31324076|gb|AAP47170.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 354
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|440900832|gb|ELR51879.1| Lactosylceramide 4-alpha-galactosyltransferase, partial [Bos
grunniens mutus]
Length = 351
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 136/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA---SVRNRTRNPLIDR 143
+IFFLETS D CS+ESAA +P V V++ +R R+ +
Sbjct: 75 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESPVVVLMKGLPGGNASRPRHLGLSL 130
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV + +DLG FQ+TPL +Y + L +SD R L+K+GG YLD
Sbjct: 131 LGCFPNVQMRPLDLGELFQDTPLAAWYAAVQRRWEPYLLPVLSDASRIALLWKFGGIYLD 190
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 191 TDFIVLKDLRNLTNALGTQSRYVLNGAFLAFEQHHEFMAQCMRDFVAHYNGWIWGHQGPQ 250
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + + +CR T PP FYP+ W++W Y + + L +
Sbjct: 251 LLTRVFKKWCSIRSLSES-RACRGVTTLPPEAFYPIPWQNWKKYFEDISPQELTRLLNAT 309
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+A+HVWN ++ + S+ AQ+ RYCP
Sbjct: 310 FAVHVWNKKSQGTRFEATSKALLAQLHARYCP 341
>gi|25452794|sp|Q9N290.1|A4GAT_GORGO RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Globotriaosylceramide synthase; Short=Gb3 synthase;
AltName: Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|7593036|dbj|BAA94505.1| alpha-1,4-N-acetylglucosaminyltransferase/alpha-
1,4-galactosyltransferase [Gorilla gorilla]
Length = 327
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 51 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 106
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 107 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 166
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H + + D++ WG GP
Sbjct: 167 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERXHEFMALCMXDFVDHYNGWIWGHQGPQ 226
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N LF +
Sbjct: 227 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLFSAT 285
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 286 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 317
>gi|321460399|gb|EFX71441.1| hypothetical protein DAPPUDRAFT_255611 [Daphnia pulex]
Length = 364
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 140/271 (51%), Gaps = 9/271 (3%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQ 148
IFF ETS +L LRQ C++ESAA NPG + V + A R +P ++ L Y
Sbjct: 87 IFFHETSGRS----DLGLRQTCTVESAAKNNPGRPIQVFMTAE-RLDYSSPWLEVLQNYP 141
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
NV IV VD YF TPL G+YT+ S++ + H+SD +R +TL K GG Y+DLD I
Sbjct: 142 NVSIVLVDPRSYFAETPLDGWYTEGEWRKSMYSVVHLSDYIRVLTLLKGGGMYMDLDIIT 201
Query: 209 IKSLES--LHNY--AGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
+KSL+ L N+ E ++ V+HL++ HWL ++ L + E+ +GP +
Sbjct: 202 LKSLDEKLLRNFFLFETEEMKLLTNSVLHLERGHWLIEEMIQRLVKYYDPNEYMWHGPSM 261
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
++ ++ +C + N +C + + P F P+ W + + SY
Sbjct: 262 ISNIMSRKCGVKRGQPNSNNCTDVRLLPHYKFAPISNNGWEILFGDATPDRLAQVTNGSY 321
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+H W+ +K +K+ S Q Y+ +AR +CP
Sbjct: 322 GVHCWSGKSKEEQLKVHSNQVYSVLAREHCP 352
>gi|197099292|ref|NP_001125876.1| lactosylceramide 4-alpha-galactosyltransferase [Pongo abelii]
gi|55729514|emb|CAH91488.1| hypothetical protein [Pongo abelii]
Length = 353
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +YT + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYTAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + + H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFQRRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + + L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDISPEELPRLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|321471395|gb|EFX82368.1| hypothetical protein DAPPUDRAFT_25035 [Daphnia pulex]
Length = 273
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 138/265 (52%), Gaps = 15/265 (5%)
Query: 104 LTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLIDRLY-EYQNVHIVQVD 156
L +RQA ++ES A NP + V V+ + ++ R +++L +Y+N+ + VD
Sbjct: 1 LNIRQAFAVESLAFHNPNLFVNVLFMMDDGHQKRINKSKVAETVEKLQSKYENIQFIVVD 60
Query: 157 LGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
L Y T + ++ T + ++H+SD LR++TL+KYGG Y DLD I ++ +
Sbjct: 61 LDEYMAGTSMEKWFHCTDWRTGPYHVAHLSDGLRFLTLHKYGGYYFDLDVIFVRPVTYYR 120
Query: 217 NYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQ 276
N+ A S++ A G+IH D H ++ A+ + N+K +W NGP ++ R++K C +
Sbjct: 121 NFITAASATNFANGIIHADHGHPITQLAVNDFPSNYKKNKWTHNGPDLVLRVMKIFCGEE 180
Query: 277 SY-AHNIISCRNFTIYPPRFFYPVHWEHWADYL-----NETNAPATMSLFRDSYALHVWN 330
++ A N +SCR F + P F P+HW HW + NET P+ ++ +HVWN
Sbjct: 181 NFNAINDVSCRGFGVLPRSTFLPIHWSHWQSFFIRRPANETGEPSWIT--NQVVGVHVWN 238
Query: 331 SFTKRVPVKLGSEQPYAQIARRYCP 355
+ S Q Y ++ CP
Sbjct: 239 KLSCNETAYKNSTQEYVRLVSHNCP 263
>gi|427782387|gb|JAA56645.1| Putative secreted protein [Rhipicephalus pulchellus]
Length = 353
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 171/345 (49%), Gaps = 27/345 (7%)
Query: 15 LTAVCLTLSFLLGVHIFLSPRFLNHFVFRVKQPLRYIEYWRSGYRVQNIN--CVLSGVEK 72
LTA+ +TL F + + + LS R L + P + G+R Q + G+E+
Sbjct: 15 LTAL-ITLVFFVLLTVTLSARHLGY------DPRALLSGASDGFRQQCLGEQDQRKGIER 67
Query: 73 DFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV 132
+D T+ +IFFLET A + R ACSIESAA+ +P V+++ I +
Sbjct: 68 HVNLNDSTK------NIFFLET----AGASCINERAACSIESAALRHPHFTVWLLTILDM 117
Query: 133 RNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYV 192
R+ PL + L + N ++ +DL +++ L +Y +D S + ++H+SD LR +
Sbjct: 118 RDC--RPLRN-LQQLPNFRLLNIDLNSMVKDSVLVHWYLKDDWNHSPFRVNHLSDALRLL 174
Query: 193 TLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNF 252
L+KYGG Y D+D + +KS L N E V V+ D+ H L E +
Sbjct: 175 VLWKYGGVYADMDVLTLKSFSELRNVVSRELFPDVGNSVLVFDRGHPFLLRCLEEFSRTY 234
Query: 253 KTTEWGANGPGVLTRLLKAECKPQSYAHNI--ISCRNFTIYPPRFFYPVHWEHWADYLNE 310
K+ +W NGP +L R+L C P++ + + C T+ P FYP+++ W
Sbjct: 235 KSRKWAHNGPRLLERVLSWFC-PRNLLGKVPLVECSGLTVLPGTAFYPMNYMVWQKAFQR 293
Query: 311 TNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ A + DSYALH+WNS+++ V+ GS Y + R+ CP
Sbjct: 294 NHTAAVLRAASDSYALHLWNSYSRTAAVERGS--AYDLLRRKLCP 336
>gi|321455618|gb|EFX66746.1| hypothetical protein DAPPUDRAFT_262661 [Daphnia pulex]
Length = 375
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 157/300 (52%), Gaps = 14/300 (4%)
Query: 64 NCVLSGVEKDFAFDDITR--LDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPG 121
NC L+ E + + ++ + + FF+ETS + L++RQAC++ES A+ NP
Sbjct: 72 NCTLTCTEPEPSIPELESDPANFNRDRAFFIETSGSGG----LSVRQACAVESLALHNPN 127
Query: 122 VQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWP 181
+ VYV+ + N + + + + +Y NVH++ ++L Y T L +Y +
Sbjct: 128 LTVYVLFVNVKINTSLDTVQEVEKKYNNVHLISINLDYYMAGTALEHWYHCSDWRNGFY- 186
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLS 241
++++S+ LR +TL KYGG Y DLD I ++ + N+ A+ + + VIH D ++ +
Sbjct: 187 VNNLSNGLRLLTLSKYGGYYFDLDIISVRPVTYYRNFVAAQDHNDINNDVIHADLNNPVI 246
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVH 300
A+++ NFK WG NGP ++ R+L C + + + ++CR F I P F+PVH
Sbjct: 247 QLAIKDFIINFKPDVWGHNGPSMILRVLTKWCNVGNLTSMDYVTCRGFNILPTSSFHPVH 306
Query: 301 WEHWADYL-----NETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ + NET+A + L +H+WN +K P+ S Q Y ++AR +CP
Sbjct: 307 YSKMKELFIRRMANETDALSDW-LTEKVIGVHIWNKLSKDQPIYKNSTQDYNRLARDHCP 365
>gi|45775237|gb|AAS77216.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGVFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|321462009|gb|EFX73036.1| hypothetical protein DAPPUDRAFT_308011 [Daphnia pulex]
Length = 372
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 146/293 (49%), Gaps = 10/293 (3%)
Query: 67 LSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV 126
+ G+ + +F I+ L D I+F ET+ L LRQ C++ES A N V +
Sbjct: 71 VKGLNQSSSFHSISLLS-DDKRIYFHETTGRD----HLNLRQLCAVESTAKENSNRSVQM 125
Query: 127 VVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMS 186
++ N T PL L +Y N+ ++ +D+ YF TPL +Y + A S + H S
Sbjct: 126 FFQSNHVNLTVGPLAHILEKYPNIFVILIDVRDYFNQTPLEDWYLRGAWRQSPYKTEHFS 185
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLES--LHNYAGAESSSVVAAGVIHLDKDHWLSGAA 244
D +R ++ YK GG Y+DLDF+ +K + N+ E SV+ H KDH +
Sbjct: 186 DYIRILSSYKGGGMYMDLDFVALKPFDDNIFWNFVPEEDDSVLTGSSFHFQKDHPIVRKM 245
Query: 245 LRELRDNFKTTEWGANGPGVLTRLLKAECKPQS--YAHNIISCRNFTIYPPRFFYPVHWE 302
+ L ++ EW +GP + ++ C+ ++ + + C + + P + YP +
Sbjct: 246 MTYLASSYHPKEWSYSGPAMFQSVVLKFCQRRAPLPTYPVFLCPDIRVLPKTYLYPYKFA 305
Query: 303 HWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
W +N ++F+ SYA+H +N +K+ P+ +GS Q Y++IAR +CP
Sbjct: 306 EWKRMFRFSNVSGDDAIFQ-SYAVHTYNKLSKKEPIFVGSNQLYSKIARLHCP 357
>gi|37726539|gb|AAO39149.1| alpha-1,4-galactosyltransferase [Homo sapiens]
gi|37726541|gb|AAO39150.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R +K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALKWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|321463537|gb|EFX74552.1| hypothetical protein DAPPUDRAFT_32452 [Daphnia pulex]
Length = 265
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 91/271 (33%), Positives = 137/271 (50%), Gaps = 11/271 (4%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQN 149
FF ETS L RQAC +ES A NP + V++++ + + Y N
Sbjct: 1 FFHETS----GATFLNFRQACVVESLAFHNPNLTVHLLMTGQHLDLNSVTMKTLRLNYPN 56
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
+ I ++LG Y TPL +Y + ++H+SD LR++TL KYGG Y DLD I +
Sbjct: 57 LQITSINLGDYMVATPLERWYFCTEWNRGWYAVAHLSDALRFLTLSKYGGYYFDLDVIQL 116
Query: 210 KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLL 269
+ + N+ AE + + + VIH+D H + A+ + ++K W NGP ++TR+L
Sbjct: 117 RPVTPYRNFVVAEDADKLGSSVIHVDHQHPIIRTAVEKFAADYKWYVWSHNGPDLVTRIL 176
Query: 270 KAECKPQSYAHNIIS---CRNFTIYPPRFFYPVHWEHWADYL-NETNAPATMSLFRDS-Y 324
+ C Q Y + ++ C+ F I P+ FYPVH+ W DY + P + +S
Sbjct: 177 QNWC--QVYYISWMTPERCQGFRILAPKSFYPVHYHRWRDYFYKRGDRPVDKVNWDESVV 234
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
HVWNS + V S Q YAQ+AR CP
Sbjct: 235 GAHVWNSMSSHWLVNKNSNQYYAQMARSSCP 265
>gi|359066145|ref|XP_003586205.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
isoform 1 [Bos taurus]
gi|359066148|ref|XP_003586206.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
isoform 2 [Bos taurus]
Length = 355
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 136/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA---SVRNRTRNPLIDR 143
+IFFLETS D CS+ESAA +P V V++ +R R+ +
Sbjct: 79 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESPVVVLMKGLPGGNASRPRHLGLSL 134
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV + +DLG F++TPL +Y + L +SD R L+K+GG YLD
Sbjct: 135 LGCFPNVQMRPLDLGELFRDTPLAAWYAAVQRRWEPYLLPVLSDASRIALLWKFGGIYLD 194
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 195 TDFIVLKDLRNLTNALGTQSRYVLNGAFLAFEQHHEFMAQCMRDFVTHYNGWIWGHQGPQ 254
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + + +CR T PP FYP+ W++W Y + + L +
Sbjct: 255 LLTRVFKKWCSIRSLSES-RACRGVTTLPPEAFYPIPWQNWKKYFEDISPQELTRLLNAT 313
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+A+HVWN ++ + S+ AQ+ RYCP
Sbjct: 314 FAVHVWNKKSQGTRFEATSKALLAQLHARYCP 345
>gi|332231408|ref|XP_003264888.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase isoform 1
[Nomascus leucogenys]
gi|441617625|ref|XP_004088462.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Nomascus
leucogenys]
Length = 353
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + + +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLQELFRDTPLADWYVAVQGRWEPYLVPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +++ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMQDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQAWKKYFEDINPEELPRLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTQFEATSRALLAQLQARYCP 343
>gi|358412545|ref|XP_003582335.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like [Bos
taurus]
Length = 355
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 136/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA---SVRNRTRNPLIDR 143
+IFFLETS D CS+ESAA +P V V++ +R R+ +
Sbjct: 79 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESPVVVLMKGLPGGNASRPRHLGLSL 134
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV + +DLG F++TPL +Y + L +SD R L+K+GG YLD
Sbjct: 135 LGCFPNVQMRPLDLGELFRDTPLAAWYAAVQRRWEPYLLPVLSDASRIALLWKFGGIYLD 194
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 195 TDFIVLKDLRNLTNALGTQSRYVLNGAFLAFEQHHEFMAQCMRDFVTHYNGWIWGHQGPQ 254
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + + +CR T PP FYP+ W++W Y + + L +
Sbjct: 255 LLTRVFKKWCSIRSLSGS-RACRGVTTLPPEAFYPIPWQNWKKYFEDISPQELTRLLNAT 313
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+A+HVWN ++ + S+ AQ+ RYCP
Sbjct: 314 FAVHVWNKKSQGTRFEATSKALLAQLHARYCP 345
>gi|449277565|gb|EMC85678.1| Lactosylceramide 4-alpha-galactosyltransferase [Columba livia]
Length = 360
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 139/274 (50%), Gaps = 9/274 (3%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP---LI 141
P N +FF+ETS + + + CS+ESAA +PG +V V++ + P
Sbjct: 75 PGN-VFFVETS----ERINPSYLFTCSVESAARAHPGTRVVVLMKGLAKGNVSLPSHWAF 129
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L + N+ I +DL F TPL +Y+Q + L+H+SD R ++K+GG Y
Sbjct: 130 SLLSCFPNMEIRPLDLAELFSGTPLAKWYSQPEHQKGRYYLAHLSDACRIAIMWKFGGIY 189
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD DFIV+K+L++L N G E+ +V+ + H +++ N+K+ WG G
Sbjct: 190 LDTDFIVLKNLKNLTNALGIEAQNVLNGAFLSFKPKHEFIELCMQDFVGNYKSWIWGHQG 249
Query: 262 PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
P +LTR+ K C + +I C+ FYP+ W++W + +L +
Sbjct: 250 PQLLTRVFKKWCSISNIQSGMI-CKGVGALSREAFYPIPWQNWKKLFEAIGSSELHNLLK 308
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
++YA+H+WN + +++ S+ AQ+ ++CP
Sbjct: 309 NTYAVHIWNKLSHDARLEITSQALLAQLYSQFCP 342
>gi|345776768|ref|XP_538343.3| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Canis
lupus familiaris]
Length = 353
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 136/275 (49%), Gaps = 8/275 (2%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL--- 140
+P +IFFLETS D CS+ESAA +P +V V++ P
Sbjct: 74 LPPGNIFFLETS----DRTNPNFLFMCSVESAARAHPESRVVVLMKGLPGGNASLPRHLG 129
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+ L + NVH++ +DL F++TPL +Y + L +SD R ++K+GG
Sbjct: 130 LSLLGCFPNVHMLPLDLEELFRDTPLAAWYAGRQRRWEPYLLPVLSDACRIALMWKFGGI 189
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
YLD DFIV+K+L +L N GA+S V+ + ++ H +R+ ++ WG
Sbjct: 190 YLDTDFIVLKNLHNLTNTLGAQSRYVLNGAFLAFERHHEFMALCMRDFVAHYNGWIWGHQ 249
Query: 261 GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
GP +LTR+ K C +S + +CR T P FYP+ W+ W Y + + L
Sbjct: 250 GPQLLTRVFKKWCSIRSLDES-HACRGVTTLPCEAFYPIPWQDWKKYFQDISPEELHRLL 308
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ ++ S AQ+ RYCP
Sbjct: 309 NATYAVHVWNKKSQGTRLEATSRALLAQLHARYCP 343
>gi|145076242|gb|ABP35532.1| alpha-1,4-galactosyltransferase [Homo sapiens]
gi|145076271|gb|ABP35533.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+E AA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVELAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|321472679|gb|EFX83648.1| hypothetical protein DAPPUDRAFT_239739 [Daphnia pulex]
Length = 339
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 134/275 (48%), Gaps = 10/275 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE 146
++ FF+ETS A L+ RQAC+IES A+ NP +++ V+ N + + +
Sbjct: 59 DNAFFIETSGNGA----LSYRQACAIESLALHNPNLKINVLFTDVKINADLDTVQQLMEN 114
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y NV ++ + + Y T + +Y + ++++S+ LR +T+YK+GG Y DLD
Sbjct: 115 YANVQLIDIKVDEYMAGTLMEHWYQCTNWRKGTYHVNNLSNALRLLTVYKFGGYYFDLDI 174
Query: 207 IVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
I ++ + S N+ A +V VIH D H A+++ NF+ WG NGP ++
Sbjct: 175 ISVRPVTSYRNFVAAVDREIVNNNVIHADAKHRFIELAIKDFVTNFRPDLWGNNGPALIF 234
Query: 267 RLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYL-----NETNAPATMSLF 320
R+LK C + + +SC F + P F+PVH NET A L
Sbjct: 235 RVLKKWCNSEDHKSLEYVSCPGFNVLPAPSFHPVHHFEMQKLFDEPMANETEEMAISWLT 294
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
D +H+WN K P+ PY ++ R +CP
Sbjct: 295 EDVVGVHLWNRMNKDDPIYKNKTHPYKRLVRDHCP 329
>gi|31324064|gb|AAP47164.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D C +ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCLVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|37726543|gb|AAO39151.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+ G YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFDGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|431900010|gb|ELK07945.1| Lactosylceramide 4-alpha-galactosyltransferase [Pteropus alecto]
Length = 353
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 136/272 (50%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVV---IASVRNRTRNPLIDR 143
+IFFLETS D CS+ESAA +P +V V++ + S + R+ +
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESRVMVLMKGLLGSNASLPRHLGLSL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
+ + NV + +DLG F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 MSCFPNVQMRPLDLGELFRDTPLAAWYAAARRRWEPYLLPVLSDASRLALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K L +L N G +S V+ + + H +R+ ++ WG GP
Sbjct: 193 TDFIVLKDLRNLSNALGIQSRYVLNGAFLAFKRRHAFIALCMRDFVAHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + + +C T PP FYP+ W++W Y + + LF +
Sbjct: 253 LLTRVFKKWCSIRSLSES-HACHGVTTLPPEAFYPIPWQNWKKYFEDVSPDELPRLFNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFKATSRALLAQLHARYCP 343
>gi|45775233|gb|AAS77214.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG G
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGSQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|289157685|gb|ADC84388.1| UDP-galactose:beta-D-galactoside alpha-1,4-galactosyltransferase
[Columba livia]
Length = 360
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 135/270 (50%), Gaps = 8/270 (2%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP---LIDRLY 145
+FF+ETS + ++ + CS+ESAA +PG +V V++ + P L
Sbjct: 78 VFFVETS----ERIKPSYLFTCSVESAARAHPGTRVVVLMKGLAKGNVSLPSHWAFSLLS 133
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+ NV I +DL F TPL +Y+Q + +SD R ++K+GG YLD D
Sbjct: 134 RFPNVEIQPLDLAELFSGTPLAKWYSQPEHQKEPYFFPVLSDACRITIMWKFGGIYLDTD 193
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVL 265
FIV+K+L++L N G +S V+ + H +++ DN+ W GP +L
Sbjct: 194 FIVLKNLKNLTNALGLQSQDVLNGAFLSFKPKHEFMELCIQDFVDNYNGWIWAHQGPELL 253
Query: 266 TRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
TR+ K C + + +I C+ + PP YP+ W+ W ++ +L +++YA
Sbjct: 254 TRVFKKLCSISNIQNGMI-CKGVSALPPDALYPIPWQDWKKLFEAISSSELHNLLKNTYA 312
Query: 326 LHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+HVWN + +++ S+ AQ+ ++CP
Sbjct: 313 VHVWNKLSHDARLEITSQALLAQLYSQFCP 342
>gi|335287503|ref|XP_003126033.2| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like [Sus
scrofa]
Length = 353
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 135/272 (49%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
SIFFLETS D CS+ESAA +P +V V++ P +
Sbjct: 77 GSIFFLETS----DRTSPNFLFMCSVESAARAHPEARVAVLMKGLPGGNASLPRHLGLSL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R L+K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLEELFRDTPLAAWYAAARRRWEPYLLPVLSDASRIALLWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNALGTQSRYVLNGAFLAFERHHEFMALCMRDFVAHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + SCR T P FYP+ W+ W Y + + A L +
Sbjct: 253 LLTRVFKKWCSIRSLRQS-HSCRGVTALPSEAFYPIPWQDWKKYFEDISPEALPRLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ +++ S+ AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRLEVTSQALLAQLQARYCP 343
>gi|31324070|gb|AAP47167.1| alpha-1,4-galactosyltransferase [Homo sapiens]
gi|31324072|gb|AAP47168.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 352
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 131/274 (47%), Gaps = 8/274 (2%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---I 141
P FLETS D CS+ESAA +P V V++ P I
Sbjct: 74 PTPGNIFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGI 129
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG Y
Sbjct: 130 SLLSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIY 189
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG G
Sbjct: 190 LDTDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQG 249
Query: 262 PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
P +LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L
Sbjct: 250 PQLLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLS 308
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ + S AQ+ RYCP
Sbjct: 309 ATYAVHVWNKKSQGTRFEATSRALLAQLHARYCP 342
>gi|146217398|gb|ABQ10741.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG G
Sbjct: 193 TDFIVLKTLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGLQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|301782637|ref|XP_002926729.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Ailuropoda melanoleuca]
gi|281344758|gb|EFB20342.1| hypothetical protein PANDA_016429 [Ailuropoda melanoleuca]
Length = 353
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 134/271 (49%), Gaps = 8/271 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDRL 144
+IFFLETS D CS+ESAA +P +V V++ P + L
Sbjct: 78 NIFFLETS----DRTSPNFLFMCSVESAARAHPESRVVVLMKGLPGGNASLPRHLGLSLL 133
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
+ NVH++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 134 GCFPNVHLLPLDLEELFRDTPLAAWYAARQHRWEPYLLPVLSDASRIALMWKFGGIYLDT 193
Query: 205 DFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
DFIV+K+L++L N G +S V+ + ++ H +R+ ++ WG GP +
Sbjct: 194 DFIVLKNLQNLTNTLGTQSRYVLNGAFLAFERHHEFMALCMRDFVAHYNGWIWGHQGPQL 253
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
LTR+ K C +S + +CR T P FYP+ W+ W Y E + L + +Y
Sbjct: 254 LTRVFKKWCSIRSLDES-HACRGVTALPCEAFYPIPWQDWKKYFQEVSPEELHQLLKATY 312
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
A+HVWN ++ + S AQ+ RYCP
Sbjct: 313 AVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|395541315|ref|XP_003772590.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase
[Sarcophilus harrisii]
Length = 355
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 138/277 (49%), Gaps = 14/277 (5%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---I 141
P IFF+ETS D CS+ESA+ +PG ++ V++ + + P I
Sbjct: 77 PPGDIFFVETS----DRTNPNFLFMCSVESASRTHPGTRIVVLMKGLLGRNSSLPKHLGI 132
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWP---LSHMSDLLRYVTLYKYG 198
L + NV +DL F+ TPL +Y A L W + +SD R ++K+G
Sbjct: 133 SLLSCFSNVEFQPLDLANLFEGTPLASWY---ASLNQRWHPYLVPTVSDAARIAIMWKFG 189
Query: 199 GTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG 258
G YLD DFIV+K+L++ N G +S V+ + + H +++ D++ WG
Sbjct: 190 GIYLDTDFIVLKNLKNFTNVLGIQSKYVLNGAFLAFEPKHEFIYLCMQDFVDHYNGWIWG 249
Query: 259 ANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP +LTR+ K C QS + SCR P FYP+ W++W Y + ++ +
Sbjct: 250 HQGPQLLTRVFKKWCGSQSL-QDRRSCRGVHALPQEAFYPITWQNWKRYFEDISSQEFHT 308
Query: 319 LFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
L +YA+HVWN ++ V ++ S+ A++ RYCP
Sbjct: 309 LLNKTYAVHVWNKKSQSVSFEVTSKVLLARLYSRYCP 345
>gi|321471284|gb|EFX82257.1| hypothetical protein DAPPUDRAFT_25188 [Daphnia pulex]
Length = 282
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 144/283 (50%), Gaps = 22/283 (7%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL--------- 140
FFLETS +G L +RQAC++ES A NP + V V+ + N +
Sbjct: 3 FFLETS---GNGKFLNIRQACAVESLAFHNPNLTVNVLFMMDNTAIHSNGINSKVLPAET 59
Query: 141 -IDRLYE-YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYG 198
I++L E Y N+ + ++L Y T L +Y + + ++H+SD LR +TL+KYG
Sbjct: 60 NIEKLREKYTNIEFITLNLDDYVAGTLLEKWYHCNDWRRGPYHVAHLSDGLRLLTLHKYG 119
Query: 199 GTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG 258
G Y DLD I ++ + HN+ AE+S+ + IH+D H + A+R+ +++ W
Sbjct: 120 GYYFDLDIIFVRRVTYYHNFVSAEASNGLCNNAIHVDYGHPVIQLAVRDFPLHYRKEAWT 179
Query: 259 ANGPGVLTRLLKAECKPQSYAHN-IISCRNFTIYPPRFFYPVHWEHWADYL-----NETN 312
NGP +L R++K C ++ + I+CR F + P F +H+ W D NET
Sbjct: 180 HNGPDLLMRVMKTFCGEENLSKMYYITCRGFNVLPMLTFNSLHYSRWKDLFSQRPTNETR 239
Query: 313 APATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
AP+ +S ++ +H+WN + S Q Y ++ R CP
Sbjct: 240 APSWIS--KEIVGVHIWNKLSYNETAYKNSTQEYVRLVRDNCP 280
>gi|45775235|gb|AAS77215.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 353
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 132/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D S+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMYSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 343
>gi|326912413|ref|XP_003202546.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Meleagris gallopavo]
Length = 398
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 138/275 (50%), Gaps = 10/275 (3%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---- 140
P +FF+ETS + CS+ESAA +PG +V VV++ + NR +
Sbjct: 117 PPGDVFFVETS----ERTNPNYLFMCSVESAARTHPGTRV-VVLMKGLANRNASLPNHWG 171
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L + NV I +DL F TPL +Y Q + L +SD R ++K+GG
Sbjct: 172 FSLLSCFPNVEIRPLDLSELFSGTPLAKWYLQAQQRWEPYFLPILSDACRIAIMWKFGGI 231
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
YLD DFIV+K+L++L N G +S V+ + H +++ +N+ + WG
Sbjct: 232 YLDTDFIVLKNLKNLTNVLGTQSKYVLNGAFLSFTPKHKFIELCMQDFVENYNSWIWGHQ 291
Query: 261 GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
GP +LTR+ K C +S + SC+ + P FYP+ W+ W Y ++ L
Sbjct: 292 GPQLLTRVFKKWCSIRSLRSS-KSCKGVSALPREAFYPIRWQDWKKYFEVVSSTELNELL 350
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+++YA+HVWN ++ +++ S+ AQ+ +CP
Sbjct: 351 KNTYAVHVWNKKSQGTRLEITSQALLAQLHSHFCP 385
>gi|321467171|gb|EFX78162.1| hypothetical protein DAPPUDRAFT_246526 [Daphnia pulex]
Length = 317
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 154/290 (53%), Gaps = 27/290 (9%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPG--VQVYVVVIAS----VRNRTRNPLID 142
IFF ETS + +L L+Q+C++ESAA NP VQ+++ ++ + +++ L
Sbjct: 23 IFFHETSGS----AQLNLQQSCAVESAAKHNPDRPVQLFLRPESTEGCYSSDGSQSSLFY 78
Query: 143 R------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYK 196
R L +Y NV + V+ RYF +PL +Y L S ++H+SD +R +TLYK
Sbjct: 79 RPVWLEILSQYPNVAAILVNEDRYFAGSPLQDWYQNGRWLQSQHQVAHLSDYIRILTLYK 138
Query: 197 YGGTYLDLDFIVIKSLESLHNY------AGAESSSVVAAGVIHLDKDHWLSGAALRELRD 250
GG YLD D + +K+ + + G+ V + GVIHLD HWLS +R L +
Sbjct: 139 GGGLYLDTDILTLKTYDQGGDMFRNCLVYGSGRMEVFSNGVIHLDAGHWLSAEIIRLLAE 198
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH----WAD 306
+ + +GP +++ ++ +C + N C++ + RFFYP+ + D
Sbjct: 199 EYDPEAYAYHGPALVSEVMGRKCGVVAGNSNSNHCKDVKLLSDRFFYPIEAPFSDVIFRD 258
Query: 307 YLNETNAPATMSLFRDSYALHVWNSFT-KRVPVKLGSEQPYAQIARRYCP 355
N T+ AT+ R+SY LH+WNS + PV++ S Q ++ +AR++CP
Sbjct: 259 NNNTTDVMATLVKIRNSYGLHLWNSLSYVHRPVEIHSNQIFSLLARQHCP 308
>gi|290563090|gb|ADD38939.1| Lactosylceramide 4-alpha-galactosyltransferase [Lepeophtheirus
salmonis]
Length = 348
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 143/279 (51%), Gaps = 13/279 (4%)
Query: 81 RLDVP-DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP 139
+++P D S FFLETS LTL++ C++ESAA ++P ++ ++ + + R+
Sbjct: 70 EVNIPKDTSFFFLETSGRS----HLTLKELCALESAAKLHPKRNIFYLM--TNKTYARSA 123
Query: 140 LIDRLYE-YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYG 198
L+ + E Y N+ ++ +DL F+ T + + ++ I S++ +HMSD+LRY +Y YG
Sbjct: 124 LVHEIMEKYSNIQLLSIDLTYVFKGTVIESLWLKNKIQNSIYFNAHMSDVLRYWFVYNYG 183
Query: 199 GTYLDLDFIVIKSLESLHNYAGAESSS--VVAAGVIHLDKDHWLSGAALRELRDNFKTTE 256
GTYLD D IV+K L +NYAG E+ +VA V+H H L + ++ N+ +
Sbjct: 184 GTYLDSDIIVLKELPLNYNYAGVENMEPLLVANSVLHFTHHHKLLKMIIADVSQNYDGSA 243
Query: 257 WGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
W NGP ++T L CK + N C N + PP F+ +++ W Y + ++
Sbjct: 244 WAKNGPLMVTSNLIQLCKAKIMKTINDAKCYNIQLLPPNTFFSIYYPSWKLYFDTSSREI 303
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
+S H W + + +K G P +A C
Sbjct: 304 VKKRLNNSLIAHYWGKLSSKTKIKSG--MPIHDLALEKC 340
>gi|410965751|ref|XP_003989405.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Felis
catus]
Length = 353
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P +V V++ P +
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESRVVVLMKGLPGGNASLPRHLGLSL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NVH++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LGCFPNVHVLPLDLEELFRDTPLAAWYAARRRRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+KSL +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 193 TDFIVLKSLRNLTNTLGTQSRYVLNGAFLAFERHHEFMALCMRDFVAHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + +CR T P FYP+ W++W Y + + L +
Sbjct: 253 LLTRVFKKWCSVRSLGDS-HACRGVTALPCEAFYPIPWQNWKKYFEDISPQELRRLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ RYCP
Sbjct: 312 YAVHVWNRKSQGTRFKATSRALLAQLHARYCP 343
>gi|296238759|ref|XP_002764295.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase
[Callithrix jacchus]
Length = 353
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 132/274 (48%), Gaps = 8/274 (2%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---I 141
P IFFLETS D CS+ESAA +P +V V++ P I
Sbjct: 75 PPGDIFFLETS----DRTNPNFLFMCSVESAARTHPESRVLVLMKGLPGGNASLPRHLGI 130
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG Y
Sbjct: 131 SLLSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIY 190
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG G
Sbjct: 191 LDTDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHKFMALCMRDFVDHYNGWIWGHQG 250
Query: 262 PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
P +LTR+ K C +S + +C T PP FYP+ W+ W Y + + L
Sbjct: 251 PQLLTRVFKKWCSIRSLTES-HACHGVTTLPPEAFYPIPWQDWKKYFEDISPQELPRLLN 309
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ ++ S AQ+ YCP
Sbjct: 310 ATYAVHVWNKKSQGTHFEVTSRALLAQLHAHYCP 343
>gi|348569630|ref|XP_003470601.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Cavia porcellus]
Length = 353
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 135/272 (49%), Gaps = 7/272 (2%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVV--IASVRNRTRNPLIDR 143
++IFFLETS + C++ESAA +P V V V++ + R+ I
Sbjct: 77 GSNIFFLETS----ERTNPNFLFMCAVESAARAHPEVLVVVLMKGLHPGAPLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F+ TPL ++ + L SD R L+K+GG YLD
Sbjct: 133 LSCFPNVQLLPLDLDALFRGTPLAAWHAGLWQRWEPYRLPVTSDAARLALLWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+++L +L N G +S V+ + ++ H +R+ N+ + WG GP
Sbjct: 193 TDFIVLRNLRNLSNTLGTQSRYVLNGAFLAFERRHEFLALCMRDFVANYNSWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +SCR T P + FYPV W+ W Y + + L +
Sbjct: 253 LLTRVFKKWCGTRSLAQS-LSCRGVTTLPRQAFYPVPWQGWRRYFEDISPEGMSQLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA HVWN ++ K + AQ+ RYCP
Sbjct: 312 YAAHVWNKMSQGQRFKATPQTLLAQLQARYCP 343
>gi|351708366|gb|EHB11285.1| Lactosylceramide 4-alpha-galactosyltransferase [Heterocephalus
glaber]
Length = 353
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 134/273 (49%), Gaps = 10/273 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT----RNPLID 142
+IFFLETS D + CS+ESAA +P +V V++ + R R+ I
Sbjct: 77 GNIFFLETS----DRTNPSFLFMCSVESAARAHPEARV-AVLMKGLHGRGAPLPRHLGIS 131
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L + NV ++ +DL F+ TPL ++ + L SD R L+K+GG YL
Sbjct: 132 LLGCFPNVEMLPLDLDALFRGTPLAAWHAAAQWRWEPYLLPVTSDAARLALLWKFGGIYL 191
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D DFIV++SL +L N G +S V+ + ++ H +R+ + WG GP
Sbjct: 192 DTDFIVLRSLGNLSNALGMQSRHVLNGAFLAFERQHEFLALCMRDFVAQYNGWVWGHQGP 251
Query: 263 GVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRD 322
+LTR+ K C +S A + SC T P + FYPV W+ W Y + + L
Sbjct: 252 QLLTRVFKKWCSTRSLAQS-HSCHGVTTLPRQAFYPVPWQSWKKYFEDVSPEELARLLNG 310
Query: 323 SYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+YA+HVWN ++ + S+ AQ+ RYCP
Sbjct: 311 TYAVHVWNKKSQGTHFEATSQALLAQLHARYCP 343
>gi|321470038|gb|EFX81016.1| hypothetical protein DAPPUDRAFT_224373 [Daphnia pulex]
Length = 352
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 134/271 (49%), Gaps = 9/271 (3%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQ 148
I+F ET T D V L RQ C++ESAA NP V + N T +PL + Y
Sbjct: 77 IYFHET--TGRDSVNL--RQLCAVESAAKENPKRSVQLFFQTDYVNLTDSPLGSIMKYYP 132
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
N+ ++ ++ YF +TPL G+Y + S + H SD +R +TL+K GG Y+DLDF+
Sbjct: 133 NIAVILINASDYFADTPLEGWYLRGVWKRSPYRTEHFSDYIRILTLHKGGGMYMDLDFVT 192
Query: 209 IKSLES--LHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
+K+L+ ++ E V+ H +DH + + L ++ EW +GP +
Sbjct: 193 LKTLDPKIFQDFVPEEDKGVLTGSSFHFHRDHPIIRKMITYLASSYHPEEWTYSGPAMFQ 252
Query: 267 RLLKAECKPQ--SYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
++ C+ + H C I P ++ YP + W + + P SY
Sbjct: 253 SIVLKYCRKRLPKPTHPAFLCPGVKIMPRKYLYPYKFADWKECF-RNDVPRGRDQTSKSY 311
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
A+H++N +K PV +GS Q Y++IAR +CP
Sbjct: 312 AVHIYNKLSKNEPVLVGSNQIYSRIARIHCP 342
>gi|363728187|ref|XP_416448.3| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Gallus
gallus]
Length = 359
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 138/275 (50%), Gaps = 10/275 (3%)
Query: 85 PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---- 140
P +FF+ETS + CS+ESAA +PG +V VV++ + NR +
Sbjct: 78 PPGDVFFVETS----ERTNPNYLFMCSVESAARTHPGTRV-VVLMKGLANRNASLPNHWG 132
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L + NV I +DL F TPL +Y Q + L +SD R ++K+GG
Sbjct: 133 FSLLSCFPNVEIRPLDLPELFSGTPLAQWYLQAQQRWEPYFLPILSDACRIAIMWKFGGI 192
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
YLD DFIV+K+L++L N G +S V+ + H +++ +N+ + WG
Sbjct: 193 YLDTDFIVLKNLKNLTNVLGTQSKYVLNGAFLSFTPKHKFIELCMQDFVENYNSWIWGHQ 252
Query: 261 GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
GP +LTR+ K C +S + SC+ + P FYP+ W+ W Y ++ L
Sbjct: 253 GPQLLTRVFKKWCSIRSLRSS-KSCKGVSALPREAFYPIRWQDWKKYFEVVSSTELNELL 311
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+++YA+HVWN ++ +++ S+ AQ+ +CP
Sbjct: 312 KNTYAVHVWNKKSQGTRLEITSQALLAQLHSLFCP 346
>gi|345315396|ref|XP_001519949.2| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Ornithorhynchus anatinus]
Length = 392
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 131/270 (48%), Gaps = 8/270 (2%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDRLY 145
IFF+ETS + CS+ESAA +P +V V++ P I L
Sbjct: 118 IFFVETS----ERTNPNFLFMCSVESAARTHPETRVVVLMKGLGGGNGSLPRHLGISLLR 173
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+ NV + +DL F TPL G++ + L +SD R ++K+GG YLD D
Sbjct: 174 CFHNVELRPLDLRELFAGTPLAGWHATVQARWEPYLLPVLSDACRIAIMWKFGGIYLDTD 233
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVL 265
FIV++SL++L N G +S V+ + ++ H +++ D++ WG GP +L
Sbjct: 234 FIVLRSLKNLTNVLGTQSEYVLNGAFLAFERGHKFIELCMQDFVDHYNGWVWGHQGPQLL 293
Query: 266 TRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
TR+ K C +S SCR P FYP+ W++W Y E L +D+YA
Sbjct: 294 TRVFKKWCSVRSLRAR-QSCRGVRALPREAFYPIRWQNWKKYFQEIGPLEFRRLLKDTYA 352
Query: 326 LHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+HVWN ++ ++ S+ AQ+ YCP
Sbjct: 353 VHVWNKKSQGAHFEIASKALLAQLHSHYCP 382
>gi|395819630|ref|XP_003783185.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Otolemur
garnettii]
Length = 353
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 131/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT---RNPLIDR 143
+IFFLETS D CS+ESAA +P V V++ R RN I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVVVLMKGLPRGNASLPRNLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LGCFPNVQMLPLDLEELFRDTPLAAWYVAAQGRWEPYWLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+++L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 193 TDFIVLRNLRNLTNALGIQSRYVLNGAFLAFERQHTFMELCMRDFVAHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+ TR+ K C S + +CR T P FYP+ W++W Y + + + L +
Sbjct: 253 LFTRVFKKWCAIHSLQES-RACRGVTTLPTEAFYPIPWQNWKKYFEDISPKEVVQLLNAT 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ K S AQ+ YCP
Sbjct: 312 YAVHVWNKKSQGTRFKATSRALLAQLHAHYCP 343
>gi|291410265|ref|XP_002721418.1| PREDICTED: alpha 1,4-galactosyltransferase-like [Oryctolagus
cuniculus]
Length = 348
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 87/272 (31%), Positives = 133/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVI---ASVRNRTRNPLIDR 143
SIFFLETS + CS+ESAA +P +V V++ A + R+ I
Sbjct: 72 GSIFFLETS----ERTNPNFLFMCSVESAARTHPEARVVVLMKGLPAGNASLPRHLGISL 127
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F+ TPL +Y + L +SD R L+K+GG YLD
Sbjct: 128 LGCFPNVQMLPLDLEELFRGTPLAAWYAAAQRRWEPYLLPVLSDASRIALLWKFGGIYLD 187
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV++SL +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 188 TDFIVLRSLRNLTNALGTQSRYVLNGAFLAFERRHEFLALCMRDFVAHYNRWVWGHQGPQ 247
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C + A + +CR T P FYP+ W+ W Y + + + +
Sbjct: 248 LLTRVFKKWCATRRLADS-HACRGVTALPREAFYPIAWQDWKKYFEDVSPEELARMLNAT 306
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S AQ+ RYCP
Sbjct: 307 YAVHVWNKKSQGTRFEATSRALLAQLHARYCP 338
>gi|170029761|ref|XP_001842760.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
gi|167864079|gb|EDS27462.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
Length = 377
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 138/277 (49%), Gaps = 7/277 (2%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL-IDRL 144
+ +IFF+ ++ A +LT RQ+C+IESAA NP +V+V+ +A+ N I L
Sbjct: 91 EMNIFFVVSTIFQAGMAKLTPRQSCAIESAARANPSWKVFVLFVAARGFSFFNSSDIVSL 150
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
+NV++ V++ TPL + + + + + SDLLR + +YKYGGTYLD
Sbjct: 151 LPLENVYLSSVNMSVIAYGTPLEELVAAGTLNNASYVVENTSDLLRLLAVYKYGGTYLDT 210
Query: 205 DFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTEWGAN 260
D +V+KS L NY + VA G I+L H ++ LR+ F W AN
Sbjct: 211 DVVVMKSFNELPLNYMVSSGDGYVANGFINLQASGVGHEIAELFLRDAAQTFNGDRWAAN 270
Query: 261 GPGVLTRLLKAECKPQS--YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP ++TR+L+ C Y F + PP F+ V++ + + E + M
Sbjct: 271 GPSLVTRVLQKFCNITEPWYMTREKCGGQFVVLPPEQFFQVYYPQHSWFFEEKHTEEVME 330
Query: 319 LFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ HVWN T + ++ S+ Y ++A++YCP
Sbjct: 331 RMKGRILTHVWNRMTSDIKIRKDSKVAYIELAKQYCP 367
>gi|449277570|gb|EMC85683.1| Lactosylceramide 4-alpha-galactosyltransferase, partial [Columba
livia]
Length = 361
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 133/270 (49%), Gaps = 8/270 (2%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP---LIDRLY 145
+FF+ETS + + + CS+ESAA +PG +V V++ + P L
Sbjct: 79 VFFVETS----ERINPSYLFTCSVESAARAHPGTRVVVLMKGLAKGNVSLPSHWAFSLLS 134
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+ N+ I +DL F TPL +Y+Q + +SD R ++K+GG YLD D
Sbjct: 135 CFPNMEIRPLDLAELFSGTPLAKWYSQPEHQKEPYFFPVLSDACRITIMWKFGGIYLDTD 194
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVL 265
FIV+K+L++L N G +S V+ + H +++ DN+ W GP +L
Sbjct: 195 FIVLKNLKNLTNALGLQSQDVLNGAFLSFKPKHEFMELCIQDFVDNYNGWIWAHQGPELL 254
Query: 266 TRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
TR+ K C + + +I C+ + PP P+ W+ W ++ +L +++YA
Sbjct: 255 TRVFKKLCSISNIQNGMI-CKGVSALPPDALCPIPWQDWKKLFEAISSSELHNLLKNTYA 313
Query: 326 LHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+HVWN + +++ S+ AQ+ ++CP
Sbjct: 314 VHVWNKLSHDARLEITSQALLAQLYSQFCP 343
>gi|417410099|gb|JAA51527.1| Putative alpha-14-n-acetylglucosaminyltransferase, partial
[Desmodus rotundus]
Length = 365
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 131/272 (48%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ + P
Sbjct: 89 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESWVVVLMKGLPGGNSSLPRHLGFSL 144
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R L+K+GG YLD
Sbjct: 145 LSCFPNVQMLPLDLEELFRDTPLAAWYVATQRRWEPYLLPVLSDASRLALLWKFGGVYLD 204
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+++L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 205 TDFIVLRNLRNLTNTLGTQSRYVLNGAFLAFERHHEFLALCMRDFVAHYNGWIWGHQGPQ 264
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S + +CR T P FYP+ W++W Y + L + +
Sbjct: 265 LLTRVFKKWCSIRSLRES-HACRGVTALPSEAFYPIPWQNWKKYFEDIRPEELPQLLKGT 323
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA HVWN ++ ++ S AQ+ RYCP
Sbjct: 324 YAAHVWNKKSQGTRLEATSRALLAQLHARYCP 355
>gi|224095998|ref|XP_002189441.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Taeniopygia guttata]
Length = 360
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 138/292 (47%), Gaps = 25/292 (8%)
Query: 75 AFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN 134
A D P ++FF+ETS + + +CS+ESAA +P +V V++ +
Sbjct: 65 ALDIAAGSSPPTGNVFFVETS----EQTSPSYLFSCSVESAARTHPTSRVVVLMKGLAKG 120
Query: 135 RTRNP---LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSH------- 184
+ P L + NV I +DL F TPL +Y LWPL H
Sbjct: 121 KASLPEHWAFSLLSCFPNVEIRPLDLTELFSGTPLQRWY--------LWPLRHWEPYFLP 172
Query: 185 -MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGA 243
+SD R V ++K+GG YLD DFIV+K+L++L N G + + + + H
Sbjct: 173 VLSDACRIVLMWKFGGIYLDTDFIVLKNLDNLTNALGIQDNHELNGAFLSFKAKHKFMEL 232
Query: 244 ALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH 303
+++ N+ WG GPG+LTR+ K C ++ ++C+ + YP+ W+
Sbjct: 233 CMQDFVQNYNGWVWGHQGPGLLTRVFKKWCSLRTLKS--MNCKGVSALAQEVVYPIPWQD 290
Query: 304 WADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
W +A L +++YA+H+WN + +++ S+ AQ+ ++CP
Sbjct: 291 WKKLFEAVSALELEKLLKNTYAVHIWNKLSHGTKLEIPSQALLAQLYAQFCP 342
>gi|224096002|ref|XP_002189476.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Taeniopygia guttata]
Length = 373
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 137/284 (48%), Gaps = 9/284 (3%)
Query: 75 AFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN 134
A D P ++FF+ETS + + +CS+ESAA +P +V V++ +
Sbjct: 78 ALDIAAGSSPPTGNVFFVETS----EQTSPSYLFSCSVESAARTHPMSRVVVLMKGLAKG 133
Query: 135 RTRNP---LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRY 191
T P L + NV I +DL F TPL +++Q L +SD R
Sbjct: 134 NTSLPEHWAFSLLSCFPNVEIRPLDLTELFSGTPLALWFSQPQRQQEPHFLPVLSDACRI 193
Query: 192 VTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDN 251
V ++K+GG YLD DFIV+K+LE+L N G + +V+ + H +++ N
Sbjct: 194 VLMWKFGGIYLDTDFIVLKNLENLTNALGIQGDNVLNGAFLSFKAKHKFVELCMQDFVQN 253
Query: 252 FKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNET 311
+ WG GPG+LTR+ K C ++ ++C+ + YP+ W+ W
Sbjct: 254 YNGWVWGHQGPGLLTRVFKKWCSLRTLKS--MNCKGVSALAQEVVYPIPWQDWKKLFEAV 311
Query: 312 NAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+A L +++YA+H+WN + +++ S+ AQ+ ++CP
Sbjct: 312 SALELEKLLKNTYAVHIWNKLSHGTKLEIPSQALLAQLYAQFCP 355
>gi|321456791|gb|EFX67891.1| hypothetical protein DAPPUDRAFT_330627 [Daphnia pulex]
Length = 304
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 155/298 (52%), Gaps = 26/298 (8%)
Query: 78 DITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR 137
D T ++ D IFF ETS +L+LRQ C++ESAA NP V++++ +S N
Sbjct: 3 DGTSRNISDQRIFFHETSGR----SQLSLRQCCAVESAARNNPDRPVHLLMRSST-NCVT 57
Query: 138 NPL-----------IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMS 186
NP+ + L Y NV IV ++ YF TPL +YT+ TS + + H+S
Sbjct: 58 NPINSQQLPINPSWLKVLSRYPNVEIVLLNEDHYFAGTPLEDWYTKGVWRTSRFEMGHLS 117
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLES--LHNYAGAESSS--VVAAGVIHLDKDHWLSG 242
D +R +TLYK GG YLD+D + +K+ + N E+++ + V+HL++ H LSG
Sbjct: 118 DYIRVLTLYKGGGLYLDMDILTLKAFQGPVFRNCLVYENAAKDTIGNSVLHLERGHHLSG 177
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV--H 300
+R L + + + +GP + ++ C+ + C N + R+F+PV
Sbjct: 178 ELIRLLAEEYDPEAYVYHGPDAIAEIMNRVCRLVAGNPKSNECNNVRLLSHRYFHPVAAM 237
Query: 301 WEHWADYLNETNAPATMSLF--RDSYALHVWNSFTKRVPVKLGSE-QPYAQIARRYCP 355
+ H + N+ N +LF ++S+ LH+WNS + P+ + ++ Q A +AR +CP
Sbjct: 238 FSHML-FQNDGNMSDVETLFEIKESFGLHLWNSISLHHPINVDNKNQIVAILAREHCP 294
>gi|170070880|ref|XP_001869744.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
gi|167866776|gb|EDS30159.1| lactosylceramide 4-alpha-galactosyltransferase [Culex
quinquefasciatus]
Length = 263
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/197 (40%), Positives = 113/197 (57%), Gaps = 5/197 (2%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVY-VVVIASVRNRTRNPLIDRLYE 146
+IFF+ ++ + ++LT RQACSIESAA NP +++ + V AS N N I L
Sbjct: 56 NIFFILSTLSPRGEIKLTARQACSIESAARTNPDWKIFPLFVTASWFNALNNEFISPLLR 115
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y+N+H+ VDL + TPL + + A+ S +P+ H SD+LR + LYKYGGTYLD D
Sbjct: 116 YENIHLRTVDLATFALGTPLEDLFARHALQKSSYPVEHTSDVLRLLVLYKYGGTYLDTDV 175
Query: 207 IVIKSLESLH-NYAGAESSSVVAAGVIHLD---KDHWLSGAALRELRDNFKTTEWGANGP 262
+V KS + L N+ G+E VA GVI+L H + A + +L NF T W AN P
Sbjct: 176 VVRKSFDLLQPNFLGSEGHGYVANGVINLQATGDGHRFAEACINDLAQNFNGTVWAANVP 235
Query: 263 GVLTRLLKAECKPQSYA 279
++TR L+ C A
Sbjct: 236 FLVTRNLRRFCNVSKVA 252
>gi|241999872|ref|XP_002434579.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215497909|gb|EEC07403.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 344
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 135/276 (48%), Gaps = 13/276 (4%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE 146
S FF+E+S + + RQAC++ESA + +P + + +++ A + T PL++ L
Sbjct: 64 KSFFFVESSRSSC----INFRQACAVESATLHHPSMTIRLLLTAKESHLTVCPLLEALKL 119
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
N+ + ++D +F +PLH +Y++ + S + +SH+SD +R++ ++KYGG Y DLD
Sbjct: 120 IGNLKVEKLDADSFFAESPLHLWYSRSSWNASRYKISHLSDAIRFLLVWKYGGIYCDLDI 179
Query: 207 IVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
+V + L N G E GV+ DK H + E + +W NGPGV+
Sbjct: 180 VVKRRFGHLRNSVGEEEPGAPVCGVLIFDKRHPFIKTCIEEFSKGYDPKKWAQNGPGVIK 239
Query: 267 RLLKA-ECKPQSYAHNIISCRNFT-----IYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
R L C Q I+ C + T ++ FY V ++ W +
Sbjct: 240 RALSNYTCNRQ--LSGILDCTDGTGTRVAVHTEEAFYAVPYQKWRLFFERKYVDIVRRAT 297
Query: 321 RDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ SY +H+WN+ + + ++GS Y + CPR
Sbjct: 298 KKSYLVHIWNALSHQKDARVGSGSVY-DLEAHNCPR 332
>gi|45775247|gb|AAS77221.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 436
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 124/251 (49%), Gaps = 8/251 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTK 334
YA+HVWN ++
Sbjct: 312 YAVHVWNKKSQ 322
>gi|344296150|ref|XP_003419772.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Loxodonta africana]
Length = 353
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 129/272 (47%), Gaps = 8/272 (2%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P +V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESRVVVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV + +DLG F TPL ++ + L +SD R ++K+GG YLD
Sbjct: 133 LGCFPNVQMRPLDLGALFHGTPLAAWHAAVRQRWEPYRLPVLSDAARIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+KSL +L N G +S V+ + ++ H +++ ++ WG GP
Sbjct: 193 TDFIVLKSLRNLTNTLGTQSRYVLNGAFLAFERRHKFMALCMQDFVAHYNGWVWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S +C T P FYP+ W++W Y + + L +
Sbjct: 253 LLTRVFKKWCGIRSLGEP-RACHGVTTLPREAFYPIPWQNWKRYFEDVSPEELTRLLNST 311
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN ++ + S+ AQ+ RYCP
Sbjct: 312 YAVHVWNKKSQGTRFEATSQALLAQLHARYCP 343
>gi|45775245|gb|AAS77220.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 436
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 124/251 (49%), Gaps = 8/251 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDS 323
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N L +
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDINPEELPRLLSAT 311
Query: 324 YALHVWNSFTK 334
YA+HVWN ++
Sbjct: 312 YAVHVWNKKSQ 322
>gi|321456797|gb|EFX67897.1| hypothetical protein DAPPUDRAFT_63673 [Daphnia pulex]
Length = 302
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 138/297 (46%), Gaps = 41/297 (13%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQN 149
FF+ETS + L RQAC++ES A NP + V V+ + N + L +Y N
Sbjct: 6 FFIETSGS----ATLNFRQACAVESLAHHNPNLTVNVLFMGGRINTSLVTLKMLKEKYDN 61
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
+H+ +L Y TPL +Y + + +SH+SD LR++TL+KYGG Y DLD I +
Sbjct: 62 IHLFSFNLDDYMAGTPLQYWYHCNGWRDGPFHVSHLSDGLRFLTLHKYGGYYFDLDVISV 121
Query: 210 KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTE------------- 256
+ + L N+ AES + +GV+H + + + A+++ N++ E
Sbjct: 122 RPVTDLRNFVAAESDDYLGSGVLHAEFKNPVMELAVKDFAANYRQVEPIEIIFPPKYLNL 181
Query: 257 --------WGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADY 307
WG NGP +L R+LK+ CK + + +SC F + F PV DY
Sbjct: 182 NFWRRSDVWGHNGPALLLRVLKSWCKAKDLLEMDYVSCHGFNVLHYSSFCPV------DY 235
Query: 308 LNETNA-----PATMS----LFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
T PA S L +H WN T P+ S Q Y +AR +CP
Sbjct: 236 SVATKEFFIHRPANQSRPFWLTDQVVGIHTWNKLTYNKPIYKNSTQRYTWLARNHCP 292
>gi|321451401|gb|EFX63067.1| hypothetical protein DAPPUDRAFT_4415 [Daphnia pulex]
Length = 267
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 87/272 (31%), Positives = 137/272 (50%), Gaps = 11/272 (4%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQN 149
FF+ETS + + L+ RQ+C++ES A+ N + VYV+ + N + L +Y N
Sbjct: 1 FFIETSGSGS----LSYRQSCAVESLALHNQNLTVYVLFVNVQINSSVITLQKLRGKYGN 56
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
+ ++ ++L Y T L +Y + ++++S+ LR +TL KYGG Y DLDF+ +
Sbjct: 57 IRLISINLDDYMAGTALEYWYHCIHWKKGPYHVNNLSNGLRLLTLAKYGGYYFDLDFVFV 116
Query: 210 KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLL 269
+SL N+ A+ + V GVIH + + A+ DNF WG NGP ++ R+L
Sbjct: 117 RSLTYYRNFVAAQDNYDVNNGVIHAELKSPIIELAMPNFVDNFSPWVWGHNGPTLIYRVL 176
Query: 270 KAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYL-----NETNAPATMSLFRDS 323
K C + + + SCR F I P F+PVH+ + NET A L
Sbjct: 177 KNWCNVDNVKSMDSASCRGFNILPRESFFPVHYTDVKELFIQRMENETEAMPDW-LTDTV 235
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+H WN +K P+ + Q YA++ R CP
Sbjct: 236 VGVHTWNKISKSQPIYKSARQDYARLVRDNCP 267
>gi|449277564|gb|EMC85677.1| Lactosylceramide 4-alpha-galactosyltransferase, partial [Columba
livia]
Length = 347
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 133/272 (48%), Gaps = 10/272 (3%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP---LIDRLY 145
+FF+ETS T CS+ESAA +PG +V V++ + P L
Sbjct: 79 VFFVETSEQTNPSYLFT----CSVESAARAHPGTRVVVLMKGLAKGNVSLPSHWAFSLLS 134
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+ NV I +DL F TPL +Y+Q + +SD R ++K+GG YLD D
Sbjct: 135 CFPNVEIRPLDLAELFSGTPLAKWYSQPEHQKEPYFFPVLSDACRIAIMWKFGGIYLDTD 194
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVL 265
FIV+K+L++L N G ++ + + H +++ DN+ WG GP ++
Sbjct: 195 FIVLKNLKNLTNALGLQTHDELNGAFLSFKPKHEFMELCMQDFVDNYNGWIWGRQGPELV 254
Query: 266 TRLLKAECKPQSYAHNIISCRNFTIYPPRF--FYPVHWEHWADYLNETNAPATMSLFRDS 323
TR+ + C + + +I C+ + PP FYP+ W++W + +L +++
Sbjct: 255 TRVFRKWCSISNIQNGMI-CKGVSALPPETYAFYPIPWQNWKKLFEAIGSSELHNLLKNT 313
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YA+HVWN + +++ S+ AQ+ ++CP
Sbjct: 314 YAVHVWNKKSHGTRLEITSQAFLAQLYSQFCP 345
>gi|346466429|gb|AEO33059.1| hypothetical protein [Amblyomma maculatum]
Length = 285
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 138/271 (50%), Gaps = 12/271 (4%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEY 147
+FFLET A ++ R AC+IESAA+ NP V+++ I +R+ +D L
Sbjct: 14 ELFFLET----AGASCISDRAACAIESAALRNPDYTVWLLTILDMRD---CRFLDNLLYL 66
Query: 148 QNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
N ++++L + +++ L +Y +D + S + ++H+SD LR + L+KYGG Y DLD +
Sbjct: 67 PNFRTLKIELNQLVKDSVLVHWYIKDDWIRSPFRINHLSDALRMLILWKYGGVYADLDVL 126
Query: 208 VIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTR 267
++KS L N E V V+ ++ H L E +++ +W NGP +L R
Sbjct: 127 ILKSFGQLRNVVAREHFPDVGNSVMVFERKHPFLLRCLEEFSWTYRSHKWAYNGPRLLER 186
Query: 268 LLKAECKPQSYAHN--IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
+L C P++ ++ C T+ P FYPV + W + M +SYA
Sbjct: 187 VLAWFC-PRNLLGKLPLVQCSGLTVLPNTAFYPVSYLQWRKTFLRNSTVDVMRTTTESYA 245
Query: 326 LHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+H+WNS+++ + GS Y + + CP+
Sbjct: 246 IHLWNSYSRSTKTERGS--AYDVLRKALCPK 274
>gi|321460396|gb|EFX71438.1| hypothetical protein DAPPUDRAFT_227867 [Daphnia pulex]
Length = 529
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 137/271 (50%), Gaps = 9/271 (3%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQ 148
IFF ETS +L LRQ C++ESAA NPG + V + A R +P ++ L Y
Sbjct: 253 IFFHETSGRS----DLGLRQTCTVESAAKNNPGRPIQVFMTAD-RLDYSSPWLEILQTYP 307
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
V IV V+ YF NTPL +Y + TS++ + H+SD +R +TL K GG Y+DLDF+
Sbjct: 308 QVSIVLVEPRLYFANTPLADWYNEGEWRTSIYNIVHLSDYIRVLTLLKGGGMYMDLDFVS 367
Query: 209 IKSLES--LHNYAGAESSSV--VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
+K + L N+ E++ + ++ V+HL++ H L + L + ++ +GP +
Sbjct: 368 LKPFDEKFLWNFFNIETAEMKLLSNSVLHLERGHRLIEEMIHRLVKYYDVDDYMWHGPSM 427
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
++ ++ C + N +C + + P F P+ W ++ + SY
Sbjct: 428 ISNIMSKFCGVKRGQPNSNNCTDVRLLPHYNFAPISNTEWETLFSDATTENLAQIKNGSY 487
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+H W + P+ L S Q YA +AR +CP
Sbjct: 488 GVHCWGGKSVGHPLDLQSNQIYAVLAREHCP 518
>gi|449482164|ref|XP_002189509.2| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Taeniopygia guttata]
Length = 377
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 134/284 (47%), Gaps = 9/284 (3%)
Query: 75 AFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN 134
A D P ++FF+ETS + + +CS+ES A +P +V V++ +
Sbjct: 82 ALDIAAGSSPPTGNVFFVETS----EQTSPSYLFSCSVESVARRHPTSRVVVLMKGLAKG 137
Query: 135 RTRNP---LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRY 191
T P L + NV I +DL + F TPL +++Q L +SD R
Sbjct: 138 NTSLPKHWAFSLLSCFPNVEIRPLDLTKLFSGTPLALWFSQPQRQQEPHFLHVLSDACRI 197
Query: 192 VTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDN 251
V ++K+GG YLD DFIV+K+LE+L N G + + + H +++ N
Sbjct: 198 VLMWKFGGIYLDTDFIVLKNLENLTNALGIQDDHELNGAFLSFKAKHKFIELCMQDFVQN 257
Query: 252 FKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNET 311
+ WG GPG+LTR+ K C + ++C+ + YP+ W+ W
Sbjct: 258 YNGWVWGHQGPGLLTRVFKKWCSLGTLKS--MNCKGVSALAQEVVYPIPWQDWKKLFEAV 315
Query: 312 NAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+A L +++YA+H+WN + +++ S+ AQ+ ++CP
Sbjct: 316 SALELEKLLKNTYAVHIWNKLSHGTKLEIPSQALLAQLYAQFCP 359
>gi|321472682|gb|EFX83651.1| hypothetical protein DAPPUDRAFT_25102 [Daphnia pulex]
Length = 271
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 136/274 (49%), Gaps = 12/274 (4%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE-Y 147
IFF ETS L RQ C+IES A NPG V +++ + + P +D L E Y
Sbjct: 3 IFFHETSGRG----NLNFRQCCAIESVAKHNPGRPVQLLISGDRLDDSTGPWMDILKEHY 58
Query: 148 QNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
NV + VD YF +PL +Y + S + +H+SD +R V+LY++GG Y+DLD++
Sbjct: 59 ANVAVFSVDNDNYFSGSPLQSWYEKGEWRDSQFRTAHLSDYIRLVSLYRHGGLYMDLDYV 118
Query: 208 VIKSLES--LHNYAGAESS--SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
V+K L+ LHN E + + GV+H + H L +R L + ++ +GP
Sbjct: 119 VLKPLDEKLLHNVLLVEGADGKQLNNGVMHFEPGHRLIKELIRYLAAEYDPEDYYLHGPT 178
Query: 264 VLTRLLKAECKPQS--YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
LT + C + C + ++ + F P+ W Y E + ++S+
Sbjct: 179 ALTNVYIRLCSNGTGRIKRKSSVCPDVSLLSYKHFCPIGPPFWHLYFEEASR-QSLSMIN 237
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
SY +H+WN + P+++G+ Q YA +A CP
Sbjct: 238 SSYGVHLWNFLSSNEPIRMGTRQLYAILAAENCP 271
>gi|426227172|ref|XP_004007696.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Ovis
aries]
Length = 466
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 129/273 (47%), Gaps = 10/273 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE 146
+IFFLETS D CS+ESAA +P V V ++ S+
Sbjct: 190 GNIFFLETS----DRTNPNFLFMCSVESAARAHPESLV-VGLMNSLXXXXXXXXXXLGLS 244
Query: 147 ----YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
+ NV + +DL F+ TPL +Y + L +SD R L+K+GG YL
Sbjct: 245 LLGCFPNVQMHPLDLEELFRETPLAAWYAAVQRRWEPYLLPVLSDASRIALLWKFGGIYL 304
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D DFIV+K L +L N G +S V+ + ++ H +R+ ++ WG GP
Sbjct: 305 DTDFIVLKDLRNLTNALGTQSRYVLNGAFLAFERHHEFMAQCMRDFVAHYNGWIWGPQGP 364
Query: 263 GVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRD 322
+LTR+ K C +S + + +CR T PP FYP+ W++W Y + + L
Sbjct: 365 QLLTRVFKKWCSIRSLSES-RACRGVTTLPPEAFYPIPWQNWKKYFEDISPQELTRLLNA 423
Query: 323 SYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
++A+HVWN ++ + S AQ+ RYCP
Sbjct: 424 TFAVHVWNKKSQGTHFEATSRALLAQLHARYCP 456
>gi|25452793|sp|Q9N289.1|A4GAT_PONPY RecName: Full=Lactosylceramide 4-alpha-galactosyltransferase;
AltName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
AltName: Full=Alpha-1,4-galactosyltransferase; AltName:
Full=Globotriaosylceramide synthase; Short=Gb3 synthase;
AltName: Full=UDP-galactose:beta-D-galactosyl-beta1-R
4-alpha-D-galactosyltransferase
gi|7593038|dbj|BAA94506.1| alpha-1,4-N-acetylglucosaminyltransferase/alpha-
1,4-galactosyltransferase [Pongo pygmaeus]
Length = 218
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 108/209 (51%), Gaps = 1/209 (0%)
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
+ NV ++ +DL F++TPL +YT + L +SD R ++K+GG YLD DF
Sbjct: 1 FPNVQMLPLDLRELFRDTPLADWYTAVQGRWEPYLLPVLSDASRIALMWKFGGIYLDTDF 60
Query: 207 IVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
IV+K+L +L N G +S V+ + + H +R+ D++ WG GP +LT
Sbjct: 61 IVLKNLRNLTNVLGTQSRYVLNGAFLAFQRRHEFMALCMRDFVDHYNGWIWGHQGPQLLT 120
Query: 267 RLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYAL 326
R+ K C +S A + +CR T PP FYP+ W+ W Y + + L +YA+
Sbjct: 121 RVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDISPEELPRLLNATYAV 179
Query: 327 HVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
HVWN ++ + S AQ+ RYCP
Sbjct: 180 HVWNKKSQGTRFEATSRALLAQLHARYCP 208
>gi|346471937|gb|AEO35813.1| hypothetical protein [Amblyomma maculatum]
Length = 382
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 128/267 (47%), Gaps = 12/267 (4%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQ 148
I+F+E+S + L R+ACS+ESAA+ NP + V +++ S+ N I L
Sbjct: 96 IWFVESSLRNE--TCLNEREACSVESAALRNPTLTVNLLITGSMANNCST--IRTLSILP 151
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
N +D+ FQ TPL +Y T+ ++ MSD LR++ L+K GG YLDLD IV
Sbjct: 152 NFRATIIDVRTEFQGTPLEPWYKAGTWKTAKNKVADMSDALRWLILWKRGGIYLDLDVIV 211
Query: 209 IKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
+K L+ L N E S + +K H GA + T WG+ GP V +
Sbjct: 212 LKPLKDLKNGGAFEPSGFPGTAAMFFEKQHPFLGAVHEACIREYNNTAWGSCGPTVFNNV 271
Query: 269 LKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHV 328
K S I+ P FY +++ +W + + + + R+S+ +HV
Sbjct: 272 YKRWTTGSSSPVRIL--------PTESFYTINYGYWHMFFSTKHTAEVLHAVRNSFGVHV 323
Query: 329 WNSFTKRVPVKLGSEQPYAQIARRYCP 355
WN +++ V++GS Y +AR CP
Sbjct: 324 WNKMSRKGSVQVGSGTAYDILARFNCP 350
>gi|195118535|ref|XP_002003792.1| GI21149 [Drosophila mojavensis]
gi|193914367|gb|EDW13234.1| GI21149 [Drosophila mojavensis]
Length = 789
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 152/283 (53%), Gaps = 10/283 (3%)
Query: 84 VPDNSIFFLETSC---THADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL 140
P +IFF E+S + E+ QAC+IESAA+ NP QV+V+ P+
Sbjct: 497 TPGRTIFFSESSQYLDISYNIQEMRAHQACAIESAALHNPNFQVFVLFACRTYLEKPAPI 556
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
ID L Y+N+ ++++G Y +TP+ + + S + + HMSDLLR +TLY+YGG
Sbjct: 557 IDALLSYKNIKFRELNIGTYILDTPITDWLKLNDFYRSSFLMHHMSDLLRLLTLYRYGGI 616
Query: 201 YLDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKD---HWLSGAALRELRDNFKTTE 256
YLD+D +++++ E LH NYA + ++ ++ ++ L+ H L+ L++ R +
Sbjct: 617 YLDMDVMLLRTFEDLHLNYACSMANQSISNSILGLEPKGFGHQLAKWFLQDFRKSLVKDA 676
Query: 257 WGANGPGVLTRLLKAECKPQS---YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNA 313
+ + L R+L+ C ++ + C+ F ++ FY V + +L+ T A
Sbjct: 677 FDSTEHKFLLRVLQEVCSTRNVTLMVRDPKRCKGFRVFSISEFYKVPLKQSDLFLDPTRA 736
Query: 314 PATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ ++S +H+WN F+ + P+K+ S+ Y Q+A ++CP+
Sbjct: 737 SEVLERIKNSRLIHIWNRFSAKWPLKIDSKAAYMQLAAKHCPK 779
>gi|321471306|gb|EFX82279.1| hypothetical protein DAPPUDRAFT_49127 [Daphnia pulex]
Length = 342
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 144/305 (47%), Gaps = 35/305 (11%)
Query: 82 LDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP-- 139
LD + FFLETS + L +RQAC++ES A NP + V V+ + N +
Sbjct: 32 LDDSRANAFFLETSGSGT----LNIRQACAVESLAFHNPNLIVNVLFVGGQINAGSSSSS 87
Query: 140 ---LID--RLYE-YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVT 193
++D +L E Y N+H + +DL Y + L +Y + +SH+SD LR++T
Sbjct: 88 SSWIVDARKLAETYANIHPIDLDLDEYLAGSLLEKWYHCTDWRKGPYHVSHLSDGLRFLT 147
Query: 194 LYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFK 253
L KYGG Y DLD ++++S+ N+ E+S + +GVIH D L ++++ N+
Sbjct: 148 LNKYGGYYFDLDVLILQSVTQFRNFVALENSKDMGSGVIHADLGSPLMALSVQDFAANYS 207
Query: 254 TTEWGANGPGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYL---- 308
W NGP +L R+L+ C + + I C+ F + P F PVH+
Sbjct: 208 AGLWTHNGPLLLLRVLRRWCNVDDLQSMDYIRCQGFHVLPSSSFCPVHYTQLGRLFDNQQ 267
Query: 309 ------------NETNAPATMSLFRDSY------ALHVWNSFTKRVPVKLGSEQPYAQIA 350
NET+ T+ R + +H WN + P+ S Q Y ++A
Sbjct: 268 QQQQRRRRRPDNNETSTDETIMTTRPEWLTDQVLGIHTWNKLSYNRPIHKNSTQTYNRLA 327
Query: 351 RRYCP 355
R +CP
Sbjct: 328 RHHCP 332
>gi|387913810|gb|AFK10514.1| alpha-1-4-N-acetylglucosaminyltransferase-like protein
[Callorhinchus milii]
Length = 337
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 135/291 (46%), Gaps = 15/291 (5%)
Query: 73 DFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV 132
D FD+ + +PD I F++T+ + +E T C++ESAA NP VY +
Sbjct: 41 DVDFDNSSL--IPDPGIMFVQTT----NEMEPTPLALCAVESAARENPDKPVYYFMKGLS 94
Query: 133 RNRTRNPL-----IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSD 187
N ++ P L +NV +V + L FQNTPL +Y Q + + +S+
Sbjct: 95 GNLSQYPFPQYFAAPMLSSIKNVVLVPLKLKALFQNTPLSFWYQQVNSSREQYWIHVLSN 154
Query: 188 LLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRE 247
R L+KYGG YLD D I +K L + N+ ++ +S+ + H + +
Sbjct: 155 ACRITLLWKYGGIYLDTDIISLKPL-NFTNFICSQGNSIANNAALGFQNQHQFMWDCMGD 213
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNI-ISCRNFTIYPPRFFYPVHWEHWAD 306
N+ WG GPG+++R+LK C+ + + + C + PR+FYP+ + W
Sbjct: 214 FVTNYNGQIWGQQGPGLISRVLKQWCQSDNLDKLLDLQCNGISFLSPRYFYPIAFAEWQR 273
Query: 307 YLNETNAPATMSLFRDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCP 355
+ N S F D+ +H+WN K + V GS ++ RYCP
Sbjct: 274 FFQPWNKNDIESFFPDTKGVHIWNFMNKGQQKRVVAGSGTLIERLFLRYCP 324
>gi|45775243|gb|AAS77219.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 348
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 113/229 (49%), Gaps = 8/229 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN 312
+LTR+ K C +S A + +CR T PP FYP+ W+ W Y + N
Sbjct: 253 LLTRVFKKWCSIRSLAES-RACRGVTTLPPEAFYPIPWQDWKKYFEDIN 300
>gi|348581582|ref|XP_003476556.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Cavia
porcellus]
Length = 336
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 134/277 (48%), Gaps = 19/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + E +C++ESAA P Q + + + + T+ NP
Sbjct: 52 SIVFLETS----ERREPPPLVSCAVESAAKTYPE-QPVIFFMKGLNDSTQVPPNSSNPGF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +DL R F++TPL +YT+ + L SD R ++KYGG Y
Sbjct: 107 SLLSAIDNVFLFPLDLNRLFEDTPLFSWYTRINSNAERYWLHVSSDASRLAIIWKYGGIY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A+SS V + GV H A + +++ + WG G
Sbjct: 167 MDTDVISIRPIPD-ENFLAAQSSKVSSNGVFGFSPHHPFLWACMENFVEHYNSGIWGNQG 225
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+LK CK + + + + C+N + P FYP+ + W Y + + F
Sbjct: 226 PRLITRMLKLWCKLRDFKEVSDLKCQNMSFLHPHRFYPISYSQWKSYYEVWDPEPS---F 282
Query: 321 RDSYALHVWNSFT--KRVPVKLGSEQPYAQIARRYCP 355
+SYALH+WN KR VK GS + R+YCP
Sbjct: 283 NNSYALHLWNHMNQGKRTVVK-GSNTLAENLFRKYCP 318
>gi|321459171|gb|EFX70227.1| hypothetical protein DAPPUDRAFT_328228 [Daphnia pulex]
Length = 357
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 141/288 (48%), Gaps = 27/288 (9%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNP-----------GVQVYVVVIASVRNR 135
+IFFLETS LT RQACS+ESAA NP G+Q + A + +
Sbjct: 66 QNIFFLETSGESC----LTARQACSVESAARSNPNAIISVHMENSGLQNQKEIKAKLAGQ 121
Query: 136 TRNPLID-RLY-EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVT 193
TRN I RL+ E+ NV +VQ DL ++ +NT L + + S+ PL+H SD +R
Sbjct: 122 TRNCAITHRLFKEWSNVKLVQEDLIQHLRNTTLWRLQEKSLLTQSIHPLTHRSDAMRVAM 181
Query: 194 LYKYGGTYLDLDFIVIKSLESLHNYAGAES--SSVVAAGVIHLDKDHWLSGAALRELRDN 251
L+KYGG YLDLD +V++ L L N G + V GV+ + H ++ +
Sbjct: 182 LWKYGGIYLDLDCLVLRPLYCLQNTVGLVDFLADWVENGVMAFEAGHPFLQFLMKYMVFA 241
Query: 252 FKTTEWGANGPGVLTRLLKAEC-KPQSYAHNIISCRN--FTIYPPRFFYPVHWEHWADYL 308
FK E+ + GP LT +K C + + A CRN + PPR FY ++ +
Sbjct: 242 FKPEEYISLGPATLTDSIKYFCDRTELPAEEWFMCRNSSMILQPPRAFYAINNRRQNAFY 301
Query: 309 NETNAPATMSLFRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCP 355
+ P+ R SY H++++ R VP K Y +A+ +CP
Sbjct: 302 HPEADPSDYEDLRHSYLSHIYDAGNGRSVPKK----SLYGMLAQEFCP 345
>gi|255522891|ref|NP_001157349.1| alpha 1,4-N-Acetylglucosaminyltransferase [Equus caballus]
Length = 341
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 141/301 (46%), Gaps = 17/301 (5%)
Query: 64 NCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
+C+ S + A D L SI FLETS + +E + +C++ESAA + P Q
Sbjct: 28 SCLFSCLPPSKALRDPEALLGQGRSIVFLETS----ERMEPSPLVSCAVESAARIYPE-Q 82
Query: 124 VYVVVIASVRNRTRNP------LIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT 177
V ++ + + T+ P L NV + +D+ R F++TPL +Y Q T
Sbjct: 83 PVVFLMKGLNDSTQLPPNSTASAFSLLSAIDNVFLFPLDMKRLFEDTPLSSWYAQVNSST 142
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKD 237
L SD R ++KYGG Y+D D I I+ + N+ A+SS + + GV
Sbjct: 143 ERHWLYVSSDACRLAVIWKYGGVYMDTDVISIRPIPD-ENFLAAQSSKISSNGVFGFLPR 201
Query: 238 HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFF 296
H + +++ + WG GP ++TR+L+ CK + + + C N + P+ F
Sbjct: 202 HPFLWGCMENFVEHYNSAIWGHQGPHLMTRMLRVWCKLGDFQEVSDLRCLNLSFLHPQRF 261
Query: 297 YPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCP 355
YP+ + W Y + + F DSYALH+WN K V GS + R++CP
Sbjct: 262 YPIPFRQWRRYYEVWDTDPS---FNDSYALHLWNYMNKEGRTVVQGSNTLAENLYRKHCP 318
Query: 356 R 356
R
Sbjct: 319 R 319
>gi|321452055|gb|EFX63533.1| hypothetical protein DAPPUDRAFT_67050 [Daphnia pulex]
Length = 281
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 130/272 (47%), Gaps = 10/272 (3%)
Query: 67 LSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV 126
+ G+ + +F I+ L D I+F ET+ L LRQ C++ES A N V +
Sbjct: 16 VKGLNQSSSFHSISLLS-DDKRIYFHETTGRD----HLNLRQLCAVESTAKENSNRSVQM 70
Query: 127 VVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMS 186
++ N T PL L +Y N+ ++ +D+ YF TPL +Y + A S + H S
Sbjct: 71 FFQSNHVNLTVGPLAHILEKYPNIFVILIDVRDYFNQTPLEDWYLRGAWRQSPYKTEHFS 130
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLES--LHNYAGAESSSVVAAGVIHLDKDHWLSGAA 244
D +R ++ YK GG Y+DLDF+ +K + N+ E SV+ H KDH +
Sbjct: 131 DYIRILSSYKGGGMYMDLDFVALKPFDDNIFWNFVPEEDDSVLTGSSFHFQKDHPIVRKM 190
Query: 245 LRELRDNFKTTEWGANGPGVLTRLLKAECKPQS--YAHNIISCRNFTIYPPRFFYPVHWE 302
+ L ++ EW +GP + ++ C+ ++ + + C + + P + YP +
Sbjct: 191 MTYLASSYHPKEWSYSGPAMFQSVVLKFCQRRAPLPTYPVFLCPDIRVLPKTYLYPYKFA 250
Query: 303 HWADYLNETNAPATMSLFRDSYALHVWNSFTK 334
W +N ++F+ SYA+H +N +K
Sbjct: 251 EWKRMFRFSNVSGDDAIFQ-SYAVHTYNKLSK 281
>gi|395832859|ref|XP_003789470.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Otolemur
garnettii]
Length = 341
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 137/278 (49%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL------I 141
SI FLETS + +E +C++ESAA + P Q V + + N T+ PL +
Sbjct: 52 SIVFLETS----ERMEPPPLVSCAVESAAKIYPK-QPVVFFMKGLNNSTQLPLNSTYPAL 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV + +D+ R F++TPL +YT +A W L SD R ++KYGG
Sbjct: 107 SLLSAIDNVFLFPLDMKRLFEDTPLFSWYTHINASAERNW-LHVSSDASRLAIIWKYGGV 165
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I IK + N+ A++S + GV H A + +++ + WG
Sbjct: 166 YMDTDVISIKPIPE-ENFLAAQASQYSSNGVFGFLPRHPFLWACMENFIEHYNSGIWGNQ 224
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+L+ CK + + + C N + P+ FYP+ + W Y + + T
Sbjct: 225 GPNLMTRMLRVWCKLGDFQDLSDLRCLNLSFLHPQRFYPISYPEWRRYYDVWD---TDPN 281
Query: 320 FRDSYALHVWNSFT-KRVPVKLGSEQPYAQIARRYCPR 356
F DSYALH+WN +R V GS + R++CPR
Sbjct: 282 FNDSYALHLWNYMNHERRAVVRGSNTLAENLYRKHCPR 319
>gi|321443393|gb|EFX60079.1| hypothetical protein DAPPUDRAFT_72860 [Daphnia pulex]
Length = 231
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 108/216 (50%), Gaps = 5/216 (2%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L Y N+ ++ +D+ YF TPL +Y A S + H SD +R ++ YK GG Y+D
Sbjct: 2 LENYPNIFVILIDVRDYFNQTPLEDWYLHGAWRQSPYKTEHFSDYIRILSSYKGGGMYMD 61
Query: 204 LDFIVIKSLES--LHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LDF+ +K + N+ E SV+ H KDH + + L ++ EW +G
Sbjct: 62 LDFVALKPFDDNIFWNFVPEEDDSVLTGSSFHFQKDHPIVRKMMTYLASSYHPKEWSYSG 121
Query: 262 PGVLTRLLKAECKPQS--YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
P + +L C+ ++ + + C + + P + YP + W +N ++
Sbjct: 122 PAMFQSVLLKYCQRRAPLPTYPVFLCPDIRVLPKTYLYPYKFAEWKRMFRFSNVSGDDAI 181
Query: 320 FRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
F+ SYA+H +N +K PV +GS Q Y++IAR +CP
Sbjct: 182 FQ-SYAVHTYNKLSKNEPVYVGSNQLYSKIARLHCP 216
>gi|444707307|gb|ELW48590.1| Alpha-1,4-N-acetylglucosaminyltransferase [Tupaia chinensis]
Length = 343
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 130/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS D + +C++ESAA + PG Q + + N T+ P +
Sbjct: 54 SIVFLETS-ERTDPPPMV---SCAVESAAKIYPG-QPVAFFMKGLNNSTQLPSNATPPAL 108
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ F++TPL +YT+ + L SD R ++KYGG Y
Sbjct: 109 SLLSAIDNVFLFPLDMRTLFEDTPLFSWYTRVNVSAERNWLHVSSDAARLAVIWKYGGIY 168
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + GV H + +++ + WG G
Sbjct: 169 MDTDVISIRPIPE-ENFLAAQASQYSSNGVFGFRARHPFLWGCMENFVEHYNSDIWGQQG 227
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P +LTR+LK C+P + + C N + P+ FYP+ + W Y + T F
Sbjct: 228 PALLTRMLKLWCRPGDFQELRDLQCPNVSFLHPQRFYPISYPEWRRYYEVWD---TEPSF 284
Query: 321 RDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CPR
Sbjct: 285 NDSYALHLWNHMNQEGRAVVRGSRTLLENLYRKHCPR 321
>gi|351708297|gb|EHB11216.1| Alpha-1,4-N-acetylglucosaminyltransferase, partial [Heterocephalus
glaber]
Length = 333
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 135/278 (48%), Gaps = 21/278 (7%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + +E +C++ESAA + P Q + + ++N T+ P
Sbjct: 52 SIVFLETS----ERLEPPPLVSCAVESAAKIYPE-QPVIFFMKGLKNSTQLPSNSSYPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV +D+ R F+NTPL +YT ++ + W L SD R ++KYGG
Sbjct: 107 SLLSGIDNVFFFPLDMNRLFENTPLFSWYTHINSKVERHW-LHVSSDASRLAIIWKYGGV 165
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A++S V + GV H + +++ + WG
Sbjct: 166 YMDTDIISIRPIPK-ENFLAAQASQVSSNGVFGFLPRHPFLWECMENFVEHYNSHIWGNQ 224
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+L+ CK + + + C+N + P+ FYP+ + W Y +
Sbjct: 225 GPRLITRMLRVWCKLRDFQDLGDLRCQNISFLHPQRFYPISYSKWRRYYEVWDPEPN--- 281
Query: 320 FRDSYALHVWNSFT--KRVPVKLGSEQPYAQIARRYCP 355
F DSYALH+WN KR +K GS + R+YCP
Sbjct: 282 FNDSYALHLWNHMNQGKRTVIK-GSNTLVENLYRKYCP 318
>gi|241860748|ref|XP_002416293.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215510507|gb|EEC19960.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 299
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 125/269 (46%), Gaps = 15/269 (5%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEY 147
+I+F+E+S + +L QAC+IESA++ NP + ++ + I L
Sbjct: 35 NIWFIESSGSQ----DLPFIQACAIESASLHNPAYFITLLTTGQLAASCEYCTI--LQTL 88
Query: 148 QNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
N ++ D + F ++P+ +Y S + + H+SD LRYVTL+ +GG YLDLD I
Sbjct: 89 PNFRSIRTDKEQAFLSSPMEAWYGSKMYNHSKYRVEHLSDALRYVTLWWHGGIYLDLDVI 148
Query: 208 VIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTR 267
++SL SL N E S ++ DK H +++ + + TEW GP +L
Sbjct: 149 TMRSLHSLTNGLVLEESGRPTNSILIFDKRHRFLKTVMKKCAEVYNPTEWTTCGPNLLQS 208
Query: 268 LLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
L Y S +N T F + W+ W + +T A R+SY +H
Sbjct: 209 L---------YQSGGSSAQNLTFLKAETFLAIGWKRWKWFFEQTRTTAVFQEVRESYGVH 259
Query: 328 VWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ N +K ++GS Y +A CPR
Sbjct: 260 LSNHLSKHTTFQIGSGCAYELLAVWNCPR 288
>gi|300795892|ref|NP_001179713.1| alpha-1,4-N-acetylglucosaminyltransferase [Bos taurus]
gi|296491021|tpg|DAA33119.1| TPA: alpha-1,4-N-acetylglucosaminyltransferase-like [Bos taurus]
Length = 341
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 136/278 (48%), Gaps = 17/278 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN------RTRNPL 140
+SI FLETS + +E T +C++ESAA + P Q ++++ + N + +P
Sbjct: 51 HSIVFLETS----ERMEPTPLVSCAVESAAKVYPE-QPVLLLMKGLNNSMQLPPNSTSPA 105
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
+ L NV + +D+ F++TPL +YTQ T + L SD R ++KYGG
Sbjct: 106 LSLLSAIDNVFLFPLDMKSLFEDTPLFSWYTQINSSTERFWLHISSDASRLAFIWKYGGV 165
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A+ S + GV H + +N+ WG
Sbjct: 166 YMDTDVISIRPIPE-DNFLAAQKSRFSSNGVFGFLPHHPFLWQCMENFVENYNPNIWGHQ 224
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TRLL+ CK + + + + C NF+ P+ FYP+ + W Y + +
Sbjct: 225 GPELMTRLLRVWCKLRDFQEVSDLKCLNFSFLHPQRFYPISFRAWRRYYEVWDTEPS--- 281
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F +SYALH+WN + V GS + R++CP+
Sbjct: 282 FNNSYALHLWNHMNQEGKAVVRGSNTLVEHLYRKHCPK 319
>gi|301625841|ref|XP_002942110.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Xenopus (Silurana) tropicalis]
Length = 346
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 133/283 (46%), Gaps = 17/283 (6%)
Query: 83 DVPDNS-------IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNR 135
DVPD I+F+ETS + + C++ESA +P QV +++ +
Sbjct: 61 DVPDGKNHSPTGRIYFVETSERMSPNAQFM----CAVESAVRTHPDTQVTILMRGLYQQH 116
Query: 136 TRNP--LIDRLYE-YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYV 192
P L RL+ + NV + +D R F +TPL +Y+ L +SD R
Sbjct: 117 LPRPPNLAFRLFRCFPNVDVAPLDFERLFADTPLSSWYSAVEGHKEATDLPILSDASRLA 176
Query: 193 TLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNF 252
L+KYGG YLD DF+V+K L +L N G +S+ + + + H +++ D++
Sbjct: 177 ILWKYGGVYLDTDFVVLKRLTNLANSMGTQSTYTLNGAFLSFARGHKFIELCMKDFTDSY 236
Query: 253 KTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN 312
+G GP +LTR+ K C + SCR ++ P FYP+ W++W Y +
Sbjct: 237 NFWLYGHQGPQLLTRVFKRWCS-IRRLRDRRSCRGVSVLPQEAFYPIEWQNWRKYFELIS 295
Query: 313 APATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
R++YA+HVWN +K + G+ Q+ + CP
Sbjct: 296 PSDLKGFLRNTYAVHVWNKKSKDSRPEPGTF--LDQLQSQCCP 336
>gi|426218246|ref|XP_004003360.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Ovis aries]
Length = 341
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 134/277 (48%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + +E T C++ESAA + P Q ++++ + N + +P +
Sbjct: 52 SIVFLETS----ERMEPTPLVCCAVESAAKVYPE-QPVLLLMKGLNNSMQLPPNSTSPAL 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ F++TPL +YT+ T + L SD R ++KYGG Y
Sbjct: 107 SLLSAIDNVFLFPLDMKSLFEDTPLFSWYTRINSSTERFWLHISSDASRLAFIWKYGGVY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A+ S + GV H + +N+ WG G
Sbjct: 167 MDTDVISIRPIPE-DNFLAAQKSQFSSNGVFGFLPHHPFLWECMENFVENYNPRIWGHQG 225
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TRLL+ CK + + + + C NF+ P+ FYP+ + W Y + + F
Sbjct: 226 PELMTRLLRVWCKLRDFQEVSDLKCLNFSFLHPQRFYPISFRAWRRYYEVWDTEPS---F 282
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CP+
Sbjct: 283 NDSYALHLWNHMNQEGKAVVRGSNTLVENLYRKHCPK 319
>gi|440894193|gb|ELR46704.1| Alpha-1,4-N-acetylglucosaminyltransferase [Bos grunniens mutus]
Length = 341
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 137/278 (49%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN------RTRNPLI 141
SI FLETS + +E T +C++ESAA + P Q ++++ + N + +P +
Sbjct: 52 SIVFLETS----ERMEPTPLVSCAVESAAKVYPE-QPVLLLMKGLNNSMQLPPNSTSPAL 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV + +D+ F++TPL +YTQ ++ + W L SD R ++KYGG
Sbjct: 107 SLLSAIDNVFLFPLDMKSLFEDTPLFSWYTQINSSIERFW-LHISSDASRLAFIWKYGGV 165
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A+ S + GV H + +N+ WG
Sbjct: 166 YMDTDVISIRPIPE-DNFLAAQKSRFSSNGVFGFLPHHPFLWQCMENFVENYNPNIWGHQ 224
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TRLL+ CK + + + + C NF+ P+ FYP+ + W Y + +
Sbjct: 225 GPELMTRLLRVWCKLRDFQEVSDLKCLNFSFLHPQRFYPISFRAWRRYYEVWDTEPS--- 281
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F +SYALH+WN + V GS + R++CP+
Sbjct: 282 FNNSYALHLWNHMNQEGKAVVRGSNTLVEHLYRKHCPK 319
>gi|410971312|ref|XP_003992114.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Felis catus]
Length = 341
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 142/303 (46%), Gaps = 21/303 (6%)
Query: 64 NCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
+C+ S + A D L + I F+ETS + +E T +C++ESAA P
Sbjct: 28 SCLFSCLPPQKAQQDPEALLGHGHGIVFIETS----ERMEPTPLVSCAVESAAKTYPEKP 83
Query: 124 VYVVVIASVRNRTR------NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAIL 176
V V + + TR P L NV + +D+ R F++TPL +YT +A +
Sbjct: 84 V-VFFMKGLNTTTRLPSNSTYPAFSLLSAIDNVFLFPLDMKRLFEDTPLRSWYTHINASV 142
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
S W L SD R ++KYGG Y+D D I I+ + N+ A+SS + GV
Sbjct: 143 ESNW-LHVSSDASRLAIIWKYGGIYMDTDVISIRPIPE-ENFLAAQSSQYSSNGVFGFLP 200
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRF 295
H + +++ + WG GP ++TR+L+ CK + + + C N + P+
Sbjct: 201 HHPFLWECMENFVEHYNSDIWGNQGPSLMTRMLRLWCKLRDFQEVSDFRCLNLSFLHPQR 260
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRY 353
FYP+ + W Y + + F DSYALH+WN R V+ GS + R++
Sbjct: 261 FYPISYPEWRRYYEVWDKDLS---FNDSYALHLWNYMNNEGRTVVR-GSNTLVENLCRKH 316
Query: 354 CPR 356
CPR
Sbjct: 317 CPR 319
>gi|321462016|gb|EFX73043.1| hypothetical protein DAPPUDRAFT_325682 [Daphnia pulex]
Length = 626
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 149/315 (47%), Gaps = 45/315 (14%)
Query: 76 FDDITRLD---VPDN----------SIFFLETSCTHADGVELTLRQACSIESAAMMNPGV 122
F DIT D V +N +IFF+ETS LT RQAC IESAA NP
Sbjct: 17 FSDITESDSALVTENEHLVKLNKGLNIFFIETSGRSC----LTARQACGIESAARANP-- 70
Query: 123 QVYVVVIASVRNRTRN-PLIDRLY--------------EYQNVHIVQVDLGRYFQNTPLH 167
+ + + +N N P ID + ++QNV IV+ DL Y + TP
Sbjct: 71 -MATITLYMEKNSIVNLPKIDNVQKMKFECDITGALFKQFQNVRIVRGDLLDYLEGTPFW 129
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSV- 226
FY S PL H SD++R L+K GG YLDLD IV++ L+SL+N G +
Sbjct: 130 EFYRTGPFNQSTTPLVHRSDVIRVALLWKNGGVYLDLDCIVMRPLDSLNNTVGTVRDFIP 189
Query: 227 --VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSY-AHNII 283
+ GV+ H L ++ + F++ + + GP LT + C A+ +
Sbjct: 190 NWIENGVMAFTAGHPLLRFLMKSMILAFRSDNYLSLGPPALTEAILEFCNRNDLPANKWL 249
Query: 284 SC-RNFTIY--PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKL 340
SC RN +++ P FY + + + PA ++S+ H+++S ++R K+
Sbjct: 250 SCWRNSSLFIQPADSFYAIGSGRADAFYHPEVDPADWIKLKNSFLSHIYDSGSRR---KI 306
Query: 341 GSEQPYAQIARRYCP 355
S YAQ+AR+YCP
Sbjct: 307 PSSSLYAQLARKYCP 321
>gi|449277566|gb|EMC85679.1| Lactosylceramide 4-alpha-galactosyltransferase [Columba livia]
Length = 223
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 103/206 (50%), Gaps = 1/206 (0%)
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
+ I +DL F TPL +Y+Q + +SD R ++K+GG YLD DFIV+
Sbjct: 1 MEIRPLDLAELFSGTPLAKWYSQPEHQKEPYFFPVLSDACRITIMWKFGGIYLDTDFIVL 60
Query: 210 KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLL 269
K+L++L N G +S V+ + H +++ DN+K+ W GP +LTR+
Sbjct: 61 KNLKNLTNALGLQSQDVLNGAFLSFKPKHEFIELCMQDFVDNYKSWIWAHQGPQLLTRVF 120
Query: 270 KAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVW 329
K C + +I C+ FYP+ W++W + +L +++YA+H+W
Sbjct: 121 KKWCSISNIQSGMI-CKGVGALSREAFYPIPWQNWKKLFEAIGSSELHNLLKNTYAVHIW 179
Query: 330 NSFTKRVPVKLGSEQPYAQIARRYCP 355
N + +++ S+ AQ+ ++CP
Sbjct: 180 NKLSHDARLEITSQALLAQLYSQFCP 205
>gi|281343685|gb|EFB19269.1| hypothetical protein PANDA_012514 [Ailuropoda melanoleuca]
Length = 341
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 140/301 (46%), Gaps = 19/301 (6%)
Query: 65 CVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQV 124
CV S + + D L + I F+ETS + +E T +C++ESAA + PG Q
Sbjct: 29 CVFSCLPPQKSQQDPEALLGQGHGIVFIETS----ERMEPTPLVSCAVESAAKIYPG-QP 83
Query: 125 YVVVIASVRNRTR------NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILT 177
+ N TR P L NV +D+ + F++TPL +YT +A +
Sbjct: 84 VAFFMKGFHNTTRLPSNSTYPGFSLLSAIDNVFFFPLDMEKLFEDTPLLSWYTHINASVE 143
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKD 237
W L SD R ++K+GG Y+D D I I+ + N+ A+SS + GV
Sbjct: 144 RNW-LHVSSDASRLAIIWKFGGIYMDTDVISIRPIPE-ENFLAAQSSRYSSNGVFGFLPH 201
Query: 238 HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFF 296
H + +++ + WG GP ++TR+L+ CK + + + C N + P+ F
Sbjct: 202 HPFLWECMENFVEHYNSEIWGNQGPNLMTRMLRLWCKLGDFQEVSDLRCLNLSFLHPQRF 261
Query: 297 YPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCP 355
YP+ ++ W Y + + F +SYALH+WN K V GS + R++CP
Sbjct: 262 YPISYQEWKRYYEVWDRDLS---FNNSYALHLWNYMNKEGRAVVRGSNTLVENLCRKHCP 318
Query: 356 R 356
R
Sbjct: 319 R 319
>gi|344242647|gb|EGV98750.1| Alpha-1,4-N-acetylglucosaminyltransferase [Cricetulus griseus]
Length = 339
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI F+ETS + VE +C++ESAA + P Q + + + N T+ +P
Sbjct: 52 SIVFIETS----ERVEPPPLVSCAVESAAKIYPE-QPIIFFMKWLSNATQLTPNASSPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV V +D+ + F++TPL +YT+ T L SD R ++KYGG Y
Sbjct: 107 SLLSAIDNVFFVPLDMKKLFEDTPLFSWYTKVNSSTEKHWLHVSSDASRLAIIWKYGGIY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD D I I+ + N+ A+ S + GV H A + +++ + WG G
Sbjct: 167 LDTDVISIRPIPE-ENFLAAQGSQHSSNGVFGFLPHHPFLWACMENFVEHYSSGIWGNQG 225
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ C+ Q + + C N + P+ FYP+ + W Y + T F
Sbjct: 226 PLLMTRMLRVWCRLQDFQELGDLKCLNISFLHPQRFYPIPYPQWRRYYQVWD---TEPSF 282
Query: 321 RDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN K R V+ GS + R++CP+
Sbjct: 283 NDSYALHLWNFMNKEGRTVVR-GSNTLVENLYRKHCPK 319
>gi|335299592|ref|XP_003358616.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Sus
scrofa]
Length = 340
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 136/278 (48%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + E + +C++ESAA + P Q ++++ + N T+ +P +
Sbjct: 51 SIVFLETS----ERTEPSPLVSCAVESAAKVYPE-QPVLLLLKGLTNSTQLLSNSTSPAL 105
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV + +D+ F++TPL +YT+ ++ W L SD R ++KYGG
Sbjct: 106 SLLSAIDNVFLFPLDMKSLFEDTPLFSWYTRINSSAERNW-LYVSSDASRLAIIWKYGGV 164
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A+ S + GV H + +N+ + WG
Sbjct: 165 YMDSDVISIRPIPE-ENFLAAQESQFSSNGVFGFLPHHPFLWQCMENFVENYNSDIWGHQ 223
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + +
Sbjct: 224 GPYLMTRILRVWCKLRDFQEVSDLRCMNLSFLHPQRFYPISYPAWKRYYEVWDTEPS--- 280
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F DSYALH+WN + V GS + R+YCPR
Sbjct: 281 FNDSYALHLWNHMNREGRAVIRGSNTLVENLYRKYCPR 318
>gi|301775918|ref|XP_002923381.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like
[Ailuropoda melanoleuca]
Length = 390
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 140/302 (46%), Gaps = 19/302 (6%)
Query: 64 NCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
CV S + + D L + I F+ETS + +E T +C++ESAA + PG Q
Sbjct: 77 GCVFSCLPPQKSQQDPEALLGQGHGIVFIETS----ERMEPTPLVSCAVESAAKIYPG-Q 131
Query: 124 VYVVVIASVRNRTR------NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAIL 176
+ N TR P L NV +D+ + F++TPL +YT +A +
Sbjct: 132 PVAFFMKGFHNTTRLPSNSTYPGFSLLSAIDNVFFFPLDMEKLFEDTPLLSWYTHINASV 191
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
W L SD R ++K+GG Y+D D I I+ + N+ A+SS + GV
Sbjct: 192 ERNW-LHVSSDASRLAIIWKFGGIYMDTDVISIRPIPE-ENFLAAQSSRYSSNGVFGFLP 249
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRF 295
H + +++ + WG GP ++TR+L+ CK + + + C N + P+
Sbjct: 250 HHPFLWECMENFVEHYNSEIWGNQGPNLMTRMLRLWCKLGDFQEVSDLRCLNLSFLHPQR 309
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYC 354
FYP+ ++ W Y + + F +SYALH+WN K V GS + R++C
Sbjct: 310 FYPISYQEWKRYYEVWDRDLS---FNNSYALHLWNYMNKEGRAVVRGSNTLVENLCRKHC 366
Query: 355 PR 356
PR
Sbjct: 367 PR 368
>gi|432108439|gb|ELK33189.1| Alpha-1,4-N-acetylglucosaminyltransferase [Myotis davidii]
Length = 341
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 137/278 (49%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + E + +C++ESAA + PG Q V + + + T+ P
Sbjct: 52 SIVFLETS----ERTEPSPLVSCAVESAAKVYPG-QPVVFFMKGLSSSTQLPPNATYPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV +V +D+ F++TPL +YTQ +A W L SD R ++KYGG
Sbjct: 107 SLLSAIDNVFLVPLDMKSLFEDTPLSSWYTQINASAERNW-LHVSSDASRLAIIWKYGGV 165
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + + N+ A++S + GV H + +++ + WG
Sbjct: 166 YMDTDIISIRPIPA-ENFLAAQASRYSSNGVFGFLPRHAFLWQCMENFVEHYNSYIWGNQ 224
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y E +P
Sbjct: 225 GPDLMTRMLRVWCKLEDFQELSDLRCLNVSFLHPQRFYPISYPEWRRYY-EVWSPEPS-- 281
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F +SYALH+WN + V GS + R++CPR
Sbjct: 282 FNNSYALHLWNYMNQEGKAVVRGSNTLVENLYRKHCPR 319
>gi|354480744|ref|XP_003502564.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like
[Cricetulus griseus]
Length = 448
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI F+ETS + VE +C++ESAA + P Q + + + N T+ +P
Sbjct: 52 SIVFIETS----ERVEPPPLVSCAVESAAKIYP-EQPIIFFMKWLSNATQLTPNASSPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV V +D+ + F++TPL +YT+ T L SD R ++KYGG Y
Sbjct: 107 SLLSAIDNVFFVPLDMKKLFEDTPLFSWYTKVNSSTEKHWLHVSSDASRLAIIWKYGGIY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
LD D I I+ + N+ A+ S + GV H A + +++ + WG G
Sbjct: 167 LDTDVISIRPIPE-ENFLAAQGSQHSSNGVFGFLPHHPFLWACMENFVEHYSSGIWGNQG 225
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ C+ Q + + C N + P+ FYP+ + W Y + T F
Sbjct: 226 PLLMTRMLRVWCRLQDFQELGDLKCLNISFLHPQRFYPIPYPQWRRYYQVWD---TEPSF 282
Query: 321 RDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN K R V+ GS + R++CP+
Sbjct: 283 NDSYALHLWNFMNKEGRTVVR-GSNTLVENLYRKHCPK 319
>gi|116734845|ref|NP_001070892.1| alpha-1,4-N-acetylglucosaminyltransferase [Mus musculus]
gi|109730907|gb|AAI15616.1| Alpha-1,4-N-acetylglucosaminyltransferase [Mus musculus]
gi|109732015|gb|AAI15615.1| Alpha-1,4-N-acetylglucosaminyltransferase [Mus musculus]
gi|148689068|gb|EDL21015.1| mCG18102 [Mus musculus]
Length = 341
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 135/278 (48%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN------RTRNPLI 141
SI F+ETS + VE +C++ESAA + P Q + + +R+ T P
Sbjct: 52 SIMFIETS----ERVEPPPMVSCAVESAAKIYPE-QPIIFFMKGLRDSVQLTSNTSYPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV V +D+ R F++TPL +YT+ T L SD R ++KYGG Y
Sbjct: 107 SLLSAINNVFFVPLDMERLFKDTPLFSWYTKVNSSTEKHWLHVSSDAARLAIIWKYGGIY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I ++ + N+ A+ S + GV H A + +++ +T WG G
Sbjct: 167 MDTDVISLQPIPE-ENFLAAQGSRHSSNGVFGFLPHHPFLWACMENFVEHYDSTIWGNQG 225
Query: 262 PGVLTRLLKAECKPQSYAHNI--ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
P ++TR+L+ C+ + + H + + C N + P+ FYP+ + W Y + +
Sbjct: 226 PQLMTRMLRVWCRLKDF-HGLGDLKCLNISFLHPQRFYPIPYPQWKRYYQVWDKEPS--- 281
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F +SYALH+WN K V GS+ + +++CP+
Sbjct: 282 FNESYALHLWNYMNKEGKTVVRGSKTLVENLYQKHCPK 319
>gi|403278886|ref|XP_003931013.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Saimiri
boliviensis boliviensis]
Length = 340
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 127/275 (46%), Gaps = 15/275 (5%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E +C++ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCAVESAAKIYPERPVAFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L E NV + +D+ R F++TPL +YTQ L SD R ++KYGG Y+
Sbjct: 107 LLSEIDNVFLFPLDMKRLFEDTPLFSWYTQINTSAERNWLHISSDASRLAIIWKYGGIYM 166
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D D I I+ + N+ A+ S + GV H + +N+ + WG GP
Sbjct: 167 DTDVISIRPIPE-ENFLAAQDSQYSSNGVFGFLPHHPFLWECMENFVENYNSDIWGHQGP 225
Query: 263 GVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
G++TR+L+ CK + + + + C N + P+ FYP+ + W Y + F
Sbjct: 226 GLMTRMLRVWCKLEDFQELSDLRCLNMSFLHPQRFYPISYPEWRRYYEVWDTDPN---FN 282
Query: 322 DSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCP 355
DSYALH+WN + V GS + +++CP
Sbjct: 283 DSYALHLWNYMNQEGRAVVSGSNTLVENLYQKHCP 317
>gi|91754027|dbj|BAE93228.1| alpha 1,4-N-Acetylglucosaminyltransferase [Equus caballus]
Length = 276
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 122/256 (47%), Gaps = 13/256 (5%)
Query: 109 ACSIESAAMMNPGVQVYVVVIASVRNRTRNP------LIDRLYEYQNVHIVQVDLGRYFQ 162
+C++ESAA + P Q V ++ + + T+ P L NV + +D+ R F+
Sbjct: 4 SCAVESAARIYPE-QPVVFLMKGLNDSTQLPPNSTASAFSLLSAIDNVFLFPLDMKRLFE 62
Query: 163 NTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAE 222
+TPL +Y Q T L SD R ++KYGG Y+D D I I+ + N+ A+
Sbjct: 63 DTPLSSWYAQVNSSTERHWLYVSSDACRLAVIWKYGGIYMDTDVISIRPIPD-ENFLAAQ 121
Query: 223 SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-N 281
SS + + GV H + +++ + WG GP ++TR+L+ CK + +
Sbjct: 122 SSKISSNGVFGFLPRHPFLWGCMENFVEHYNSAIWGHQGPDLMTRMLRVWCKLGDFQEVS 181
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV-PVKL 340
+ C N + P+ FYP+ + W Y + + F DSYALH+WN K V
Sbjct: 182 DLRCLNLSFLHPQRFYPIPFRQWRRYYEVWDTDPS---FNDSYALHLWNYMNKEGRTVVQ 238
Query: 341 GSEQPYAQIARRYCPR 356
GS + R++CPR
Sbjct: 239 GSNTLAENLYRKHCPR 254
>gi|291399845|ref|XP_002716610.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Oryctolagus
cuniculus]
Length = 407
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 140/303 (46%), Gaps = 21/303 (6%)
Query: 64 NCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
C+ S + + + L P SI FLETS + E +C++ESAA + PG Q
Sbjct: 28 GCLFSCLPPLKSQEGPEALLSPGRSIVFLETS----ERTEPPPLVSCAVESAARIYPG-Q 82
Query: 124 VYVVVIASVRNRTR------NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAIL 176
+ + N T+ +P L NV + +D+ ++TPL +YT + +
Sbjct: 83 PVAFFMKGLGNPTQLPSNSTHPAFSLLSAMDNVFLFPLDMKSLLEDTPLFSWYTHVNTSV 142
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
W L SD R ++KYGG Y+D D I I+ + N+ A++S + GV
Sbjct: 143 QRNW-LHVSSDACRLAIIWKYGGIYMDTDVISIRPIPE-ANFLAAQASRYSSNGVFGFLP 200
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRF 295
H + +++ + WG GP ++TR+L+ CK + + + C N + P+
Sbjct: 201 HHPFLWGCMENFVEHYNSAIWGHQGPDLMTRMLRVWCKLGDFQELSDLRCLNLSFLHPQR 260
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRY 353
FYP+ + W Y + T F DSYALH+WN + R V+ GS + R++
Sbjct: 261 FYPISYPEWRRYYQVWD---TEPSFNDSYALHLWNYMNREGRAVVR-GSHTLVESLFRKH 316
Query: 354 CPR 356
CPR
Sbjct: 317 CPR 319
>gi|427791343|gb|JAA61123.1| Putative secreted protein, partial [Rhipicephalus pulchellus]
Length = 291
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 142/292 (48%), Gaps = 21/292 (7%)
Query: 15 LTAVCLTLSFLLGVHIFLSPRFLNHFVFRVKQPLRYIEYWRSGYRVQNINCVLSGVEKDF 74
LTA+ +TL F + + + LS R L + P + G+R Q L ++
Sbjct: 12 LTAL-ITLVFFVLLTVTLSARHLGY------DPRALLSGASDGFRQQ----CLGEQDQRK 60
Query: 75 AFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN 134
D L+ +IFFLET A + R ACSIESAA+ +P V+++ I +R+
Sbjct: 61 DIDRHANLNDSTKNIFFLET----AGASCINERAACSIESAALRHPHYTVWLLTILDMRD 116
Query: 135 RTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTL 194
PL + L + N ++ +DL +++ L +Y +D S + ++H+SD LR + L
Sbjct: 117 C--RPLRN-LQQLPNFRLLNIDLNSMVKDSVLVHWYLKDDWNHSPFRVNHLSDALRLLVL 173
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKT 254
+KYGG Y D+D + ++S L N E V V+ D+ H L E +K+
Sbjct: 174 WKYGGVYADMDVLTLRSFSELRNVVSRELFPDVGNSVLVFDRGHPFLLRCLEEFSRTYKS 233
Query: 255 TEWGANGPGVLTRLLKAECKPQSYAHN--IISCRNFTIYPPRFFYPVHWEHW 304
+W NGP +L R+L C P++ ++ C T+ P FYP+++ W
Sbjct: 234 RKWAHNGPRLLERVLSWFC-PRNLLGKLPLVECSGVTVLPGTAFYPINYMEW 284
>gi|431916950|gb|ELK16706.1| Alpha-1,4-N-acetylglucosaminyltransferase [Pteropus alecto]
Length = 340
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 136/278 (48%), Gaps = 21/278 (7%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNP------LID 142
I FLETS + VE + +C++ESAA + P Q + + + + P
Sbjct: 52 IVFLETS----ERVEPSPLVSCAVESAARVYPE-QPVAFFMKGLSDSMQLPPNSTYTAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +++ R F++TPL +YTQ +A W L SD R T++KYGG Y
Sbjct: 107 LLSAIDNVFLFPLNMKRLFEDTPLFSWYTQINASAQRNW-LYVSSDASRLATIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + GV H + +++ + WG G
Sbjct: 166 MDTDIISIRPIPE-ENFLAAQASRDSSNGVFGFLPHHSFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK +++ + C NF+ P+ FYP+ + W Y + + F
Sbjct: 225 PILMTRMLRVWCKLKNFQEVSDFRCSNFSFLHPQRFYPISYPQWRRYYEVWDTDPS---F 281
Query: 321 RDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCPR 356
+DSY LH+WN + R V+ GS + R++CPR
Sbjct: 282 KDSYGLHLWNYMNREGRTVVR-GSNTLVENLYRKHCPR 318
>gi|355666115|gb|AER93428.1| alpha 1,4-galactosyltransferase [Mustela putorius furo]
Length = 297
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 114/234 (48%), Gaps = 9/234 (3%)
Query: 79 ITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRN 138
++ +P N IFFLETS D CS+ESAA +P +V V++
Sbjct: 70 LSSTPLPGN-IFFLETS----DRTNPNFLFMCSVESAARAHPESRVVVLMKGLPGGNASL 124
Query: 139 PL---IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLY 195
P + L + NVH++ +DL F++TPL +Y + L +SD R ++
Sbjct: 125 PRHLGLSLLGCFPNVHLLPLDLEELFRDTPLAAWYVAWRHRWEPYVLPVLSDASRIALMW 184
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTT 255
K+GG YLD DFIV+++L++L N G +S V+ + D+ H + + ++
Sbjct: 185 KFGGIYLDTDFIVLRNLQNLTNTLGTQSRYVLNGAFLAFDRHHEFMALCMHDFVAHYNGW 244
Query: 256 EWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLN 309
WG GP +LTR+ K C +S + +CR T P FYP+ W+ W Y
Sbjct: 245 IWGHQGPQLLTRVFKKWCSIRSL-DDSHACRGVTALPSEAFYPIPWQDWRRYFQ 297
>gi|321460953|gb|EFX71990.1| hypothetical protein DAPPUDRAFT_111182 [Daphnia pulex]
Length = 407
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 137/285 (48%), Gaps = 34/285 (11%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNP--GVQVYVVVIASVRNRTRNPL------ 140
IFF ETS EL+LRQ C++ESAA NP +Q+++ + + P
Sbjct: 102 IFFHETSGRS----ELSLRQCCAVESAARHNPDRNIQLFLGSSDNCGDGINRPFRKIFSS 157
Query: 141 -----IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLY 195
+ L Y N+ +V ++ G YF TP +Y + S + + H+SD +R +TLY
Sbjct: 158 QFSSWLKVLSRYTNLSVVYLNEGYYFSGTPFQDWYRKGEWRKSPFKMGHLSDFIRILTLY 217
Query: 196 KYGGTYLDLDFIVIKSLESL--HNYAGAESSSVVAAG--VIHLDKDHWLSGAALRELRDN 251
K GG Y+DLD + +K+ + +NY E++ + G ++H ++ H ++ +R L +
Sbjct: 218 KGGGMYMDLDIMTLKTFHGIMFNNYLVYENAKMDTIGNSIMHFERGHQITIELIRLLSEE 277
Query: 252 FKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNET 311
+ + +GP + ++ C + N C + + P R+F+PV
Sbjct: 278 YDPEAYVYHGPDAIAEVMNRVCGLVAGNPNSNKCGDVKLLPHRYFHPVA----------- 326
Query: 312 NAPATMSLFR-DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
A + LF+ D + WNS ++ + + S Q ++ +AR +CP
Sbjct: 327 -AMFSHMLFKNDINGTNQWNSISRNERIDIYSNQVFSVLAREHCP 370
>gi|296228007|ref|XP_002759627.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Callithrix
jacchus]
Length = 340
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 126/275 (45%), Gaps = 15/275 (5%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRN-----RTRNPLID 142
I FLETS + +E +C++ESAA + P V + + + P
Sbjct: 51 GIVFLETS----ERIEPPHLVSCAVESAAKIYPERPVAFFMKGLTDSTLTPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L E NV + +D+ R F++TPL +YTQ L SD R ++KYGG Y+
Sbjct: 107 LLSEIDNVFLFPLDMKRLFEDTPLFSWYTQINTSAERNWLHISSDASRLAIIWKYGGIYM 166
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D D I I+ + N+ A+ S + GV H + +N+ + WG GP
Sbjct: 167 DTDVISIRPIPE-ENFLAAQDSQYSSNGVFGFLPHHPFLWECMENFVENYNSDIWGNQGP 225
Query: 263 GVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
++TR+L+ CK + + + + C N + P+ FYP+ + W Y + + F
Sbjct: 226 DLMTRMLRVWCKLEDFRELSDLRCLNMSFLHPQRFYPISYPEWRRYYEVWDTDPS---FN 282
Query: 322 DSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCP 355
DSYALH+WN + V GS + +++CP
Sbjct: 283 DSYALHLWNYMNQEGRAVVSGSNTLVENLYQKHCP 317
>gi|355747029|gb|EHH51643.1| hypothetical protein EGM_11064 [Macaca fascicularis]
Length = 340
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 131/278 (47%), Gaps = 21/278 (7%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E T +C++ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPTHLVSCAVESAAKIYPERPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL---WPLSHMSDLLRYVTLYKYGG 199
L NV + +D+ R F++TPL +Y Q I TS+ W L SD R ++KYGG
Sbjct: 107 FLSAIDNVFLFPLDMKRLFEDTPLFSWYNQ--INTSIERNW-LHISSDASRLAVIWKYGG 163
Query: 200 TYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
Y+D D I I+ + N+ A+ S + GV H + +++ + WG
Sbjct: 164 IYMDTDVISIRPIPE-ENFLAAQGSRYSSNGVFGFLPQHPFLWKCMENFVEHYNSDIWGN 222
Query: 260 NGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + +
Sbjct: 223 QGPDLMTRMLRVWCKLEDFQELSDLRCLNISFLHPQRFYPISYPEWRRYYEVWDTDPS-- 280
Query: 319 LFRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCP 355
F DSYALH+WN + V GS + RR+CP
Sbjct: 281 -FNDSYALHLWNYMNQEGRAVVRGSNTLVENLYRRHCP 317
>gi|213624347|gb|AAI70971.1| hypothetical protein LOC549416 [Xenopus (Silurana) tropicalis]
Length = 334
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 133/276 (48%), Gaps = 17/276 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT--RNPL--ID 142
N I F+ET+ D ++ C+IESAA + V V + +R+ T R+ L +
Sbjct: 55 NGIIFIETT----DRMKPPSLVLCTIESAARVYRHRPV-VFFMEGLRDITAMRDALKRLP 109
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L + NVH+ + + R F TPL +Y + + SD R ++++GG Y+
Sbjct: 110 TLSSFHNVHLFPLQMERLFHGTPLRRWYEKVNPERERYWTHVSSDGCRLALIWRHGGIYM 169
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D DFI ++ + + N+ AESS + G+ L H + + N++ +WG GP
Sbjct: 170 DSDFISMRPIPT-GNFLAAESSDFSSNGIFGLTPRHSFAWKGMESFVRNYRGAKWGHQGP 228
Query: 263 GVLTRLLKAEC-KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNE-TNAPATMSLF 320
+ TR+LK C P+ + + C + +I + FYP+ +E W+ Y N P F
Sbjct: 229 QLFTRVLKQYCIAPRFQSTEDVKCGDISILKVKRFYPIPYESWSRYYEVWQNVPK----F 284
Query: 321 RDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
RDSYALH+WN + + GS + + YCP
Sbjct: 285 RDSYALHLWNYLNRGQKTMVPGSNTLVEHLYQLYCP 320
>gi|297672066|ref|XP_002814135.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Pongo abelii]
Length = 340
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 131/278 (47%), Gaps = 19/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT------RNPLI 141
I FLETS + +E +CS+ESAA + P V V + + + T P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCSVESAAKIYPEWPV-VFFMKGLTDSTLMPSNSTYPAF 105
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV + +D+ R F++TPL +Y Q +A W L SD R ++KYGG
Sbjct: 106 SFLSAIDNVFLFPLDMKRLFEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGI 164
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A++S + GV H + +++ + WG
Sbjct: 165 YMDTDVISIRPIPE-ENFLAAQASRYSSNGVFGFLPHHPFLWECMENFVEHYNSDIWGNQ 223
Query: 261 GPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + +
Sbjct: 224 GPELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYREWRRYYEVWDTDPS--- 280
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F DSYALH+WN + V GS + R++CPR
Sbjct: 281 FNDSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|345788968|ref|XP_542803.3| PREDICTED: uncharacterized protein LOC485683 [Canis lupus
familiaris]
Length = 644
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 134/280 (47%), Gaps = 21/280 (7%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPL 140
+ I F+ETS + +E T +C++ESAA + P Q + + N TR P
Sbjct: 50 HGIVFIETS----ERMEPTPLVSCAVESAAKIYP-RQPVAFFMKGLCNTTRLPSNSTYPG 104
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGG 199
L NV + +D+ R F++TPL +YT + S W L SD R ++KYGG
Sbjct: 105 FSLLSAIDNVFLFPLDMKRLFEDTPLLSWYTHINTSAESNW-LHVSSDASRLAIIWKYGG 163
Query: 200 TYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
Y+D D I I+ + N+ A+SS + GV H + +++ + WG
Sbjct: 164 IYMDTDVISIRPIPE-ENFLAAQSSRYSSNGVFGFLPHHPFLWECMENFVEHYNSEIWGN 222
Query: 260 NGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + M
Sbjct: 223 QGPNLMTRMLRLWCKLRDFQEVSDLRCLNLSFLHPQRFYPISYPEWRRYYEVWD----MD 278
Query: 319 L-FRDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCPR 356
L F DSYALH+WN K V GS + R++CPR
Sbjct: 279 LSFNDSYALHLWNYMNKEGRTVVRGSNTLVDNLCRKHCPR 318
>gi|62859955|ref|NP_001016662.1| alpha-1,4-N-acetylglucosaminyltransferase precursor [Xenopus
(Silurana) tropicalis]
Length = 334
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 132/276 (47%), Gaps = 17/276 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT--RNPL--ID 142
N I F+ET+ D ++ C+IESAA + V V + +R+ T R+ L +
Sbjct: 55 NGIIFIETT----DRMKPPSLVLCTIESAARVYRHRPV-VFFMEGLRDITAMRDALKRLP 109
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L + NVH+ + + R F TPL +Y + + SD R ++++GG Y+
Sbjct: 110 TLSSFHNVHLFPLQMERLFHGTPLRRWYEKVNPERERYWTHVSSDGCRLALIWRHGGIYM 169
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D DFI ++ + + N+ AESS + G+ L H + + N++ +WG GP
Sbjct: 170 DSDFISMRPIPT-GNFLAAESSDFSSNGIFGLTPRHSFAWKGMESFVRNYRGAKWGHQGP 228
Query: 263 GVLTRLLKAEC-KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNE-TNAPATMSLF 320
+ TR+LK C P+ + + C + +I + FYP+ +E W Y N P F
Sbjct: 229 QLFTRVLKQYCIAPRFQSTEDVKCGDISILKVKRFYPIPYESWRRYYEVWQNVPK----F 284
Query: 321 RDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
RDSYALH+WN + + GS + + YCP
Sbjct: 285 RDSYALHLWNYLNRGQKTMVPGSNTLVEHLYQLYCP 320
>gi|109049205|ref|XP_001114660.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Macaca
mulatta]
gi|402861452|ref|XP_003895105.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Papio anubis]
gi|355560008|gb|EHH16736.1| hypothetical protein EGK_12072 [Macaca mulatta]
Length = 340
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 131/278 (47%), Gaps = 21/278 (7%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E T +C++ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPTHLVSCAVESAAKIYPERPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL---WPLSHMSDLLRYVTLYKYGG 199
L NV + +D+ R F++TPL +Y Q I TS+ W L SD R ++KYGG
Sbjct: 107 FLSAIDNVFLFPLDMKRLFEDTPLFSWYNQ--INTSIERNW-LHISSDASRLAVIWKYGG 163
Query: 200 TYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
Y+D D I I+ + N+ A+ S + GV H + +++ + WG
Sbjct: 164 IYMDTDVISIRPIPE-ENFLAAQGSRYSSNGVFGFLPQHPFLWKCMENFVEHYNSDIWGN 222
Query: 260 NGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + +
Sbjct: 223 QGPDLMTRMLRVWCKLEDFQELSDLRCLNISFLHPQRFYPISYPEWRRYYEVWDTDPS-- 280
Query: 319 LFRDSYALHVWNSFTKRV-PVKLGSEQPYAQIARRYCP 355
F DSYALH+WN + V GS + R++CP
Sbjct: 281 -FNDSYALHLWNYMNQEGRAVVRGSNTLVENLYRKHCP 317
>gi|114589404|ref|XP_516775.2| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Pan
troglodytes]
Length = 340
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 129/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS D +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----DRMEPPHLVSCSVESAAKIYPEWPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R ++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLDMKRLLEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + G+ H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGIFGFLPHHPFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + + F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYREWRRYYEVWDTDPS---F 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CPR
Sbjct: 282 NDSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|397503850|ref|XP_003822530.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Pan paniscus]
Length = 340
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 128/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS D +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----DRMEPPHLVSCSVESAAKIYPEWPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R ++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLDMKRLLEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + G+ H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGIFGFLPHHPFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYREWRRYYEVWDTDPN---F 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CPR
Sbjct: 282 NDSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|395519271|ref|XP_003763774.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Sarcophilus
harrisii]
Length = 344
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 132/279 (47%), Gaps = 22/279 (7%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE- 146
SI FLET+ D +E + +C++ESAA + P + + ++ + N + + L++ Y
Sbjct: 52 SIVFLETT----DRLEPSPLVSCAVESAARIYPERSI-MFLMKGLDNNSTHLLLNATYRG 106
Query: 147 ------YQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGG 199
+NV +D+ + TPL +YTQ ++ W + SD R ++KYGG
Sbjct: 107 FSLLSAIENVFFFPLDMETLLKETPLSLWYTQVNSSAEKSW-IHISSDACRLAFIWKYGG 165
Query: 200 TYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
Y+D D I I+ + N+ A++S + GV H + +++ WG
Sbjct: 166 IYMDTDIISIRPIPE-DNFLAAQASKFSSNGVFGFHHHHSFIWDCMENFIEHYNGDIWGN 224
Query: 260 NGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP ++TR+LK C + + C N + P+ FYP+ + W Y + +
Sbjct: 225 QGPELMTRMLKLSCNLTDFQEVKDLRCPNLSFLHPQRFYPISYPEWKRYYEVWDVEPS-- 282
Query: 319 LFRDSYALHVWNSFT--KRVPVKLGSEQPYAQIARRYCP 355
F DSYALH+WN KR ++ GS + RRYCP
Sbjct: 283 -FNDSYALHLWNYMNQEKRAVIR-GSNTLAENLYRRYCP 319
>gi|301617468|ref|XP_002938166.1| PREDICTED: hypothetical protein LOC100493555 [Xenopus (Silurana)
tropicalis]
Length = 675
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 136/312 (43%), Gaps = 28/312 (8%)
Query: 58 YRVQNINCVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAA- 116
YR Q S + + + T + N I F+ET+ D ++ C+IESAA
Sbjct: 23 YRTQYTTLFKSVIYPESIYISPTDVLKEGNGIIFVETT----DRMKPPSLVLCAIESAAR 78
Query: 117 --------MMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHG 168
G+Q ++ + R R P L + NV+ + + + F +TPL
Sbjct: 79 VYKDRPVVFFMKGLQDITIIQDELEARQRFPT---LSSFDNVYFFPLQMDKLFNDTPLMP 135
Query: 169 FYTQ-DAILTSLWPLSHMS-DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSV 226
+Y + + W +H+S D R ++KYGG Y+D DFI ++ + + N+ + S
Sbjct: 136 WYKKVNPKFERFW--THISADGCRLALIWKYGGIYMDSDFISMRPIPDV-NFLAGQCSQF 192
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISC 285
+ GV L H S ++ N+ WG GP + TR LK C P + + C
Sbjct: 193 SSNGVFGLSHHHNFSLKSMENFVQNYNGAIWGNQGPHLFTRTLKTFCTIPDFKSTEDVKC 252
Query: 286 RNFTIYPPRFFYPVHWEHWADYLNET-NAPATMSLFRDSYALHVWNSFTKRVPVKL-GSE 343
N + P+ FYP+ ++ W Y + N P F DSYALH WN K + G
Sbjct: 253 GNISFSNPKRFYPIPFKDWKRYYDVCPNVPT----FNDSYALHFWNFMNKEQKTMVPGDN 308
Query: 344 QPYAQIARRYCP 355
+ ++YCP
Sbjct: 309 TLIEHLYKQYCP 320
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 86/177 (48%), Gaps = 9/177 (5%)
Query: 183 SHMS-DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLS 241
+H+S D R ++K+GG Y+D D I ++ + + N+ A+ S + GV L H S
Sbjct: 361 THVSADGCRLALVWKHGGIYMDSDIISMRPIPDV-NFLAAQYSQSSSNGVFGLSHHHNFS 419
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISCRNFTIYPPRFFYPVH 300
++ N+ WG GP + TR LK C PQ ++ + C N + P+ FYP+
Sbjct: 420 WKSMENFVQNYNGAIWGNQGPQLFTRTLKTFCTIPQFKSNEDVKCGNISFLNPKRFYPIP 479
Query: 301 WEHWADYLNE-TNAPATMSLFRDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
+E W Y + N P F DSYALH+WN K + G + ++YCP
Sbjct: 480 YEAWKRYYDVWPNVPT----FNDSYALHLWNFMNKEQKTMVPGKNTLIEHLYKQYCP 532
>gi|332232270|ref|XP_003265327.1| PREDICTED: LOW QUALITY PROTEIN:
alpha-1,4-N-acetylglucosaminyltransferase [Nomascus
leucogenys]
Length = 340
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 130/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCSVESAAKIYPEWAVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +++ R F++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLEMKRLFEDTPLFSWYNQINASAERNW-LHVSSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + GV H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGVFGFLPHHPFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + + F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYPEWRRYYEVWDTDPS---F 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CPR
Sbjct: 282 NDSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|426342257|ref|XP_004037767.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Gorilla
gorilla gorilla]
Length = 340
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 130/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCSVESAAKIYPEWPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R ++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLDMKRLLEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + G+ H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGIFGFLPHHPFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ ++ W Y + + F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYQEWRRYYEVWDTDPS---F 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
DSYALH+WN + V GS + R++CPR
Sbjct: 282 NDSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|334329613|ref|XP_003341243.1| PREDICTED: LOW QUALITY PROTEIN:
alpha-1,4-N-acetylglucosaminyltransferase-like
[Monodelphis domestica]
Length = 353
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 130/279 (46%), Gaps = 18/279 (6%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL----- 140
D SI FLET+ D +E + +C++ESAA + P + ++ N PL
Sbjct: 50 DRSIVFLETT----DQLEPSPLVSCAVESAARIYPERPIIFLMKGLDNNSIHLPLNATYP 105
Query: 141 -IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYG 198
L +NV + +D+ + TPL +Y Q ++ W + SD R +++YG
Sbjct: 106 AFSLLSAIKNVFLFPLDMETLLKETPLFSWYAQVNSSAEKSW-VHISSDACRLAFIWRYG 164
Query: 199 GTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG 258
G Y+D D I I+S+ N+ A++S + G++ H + +++ WG
Sbjct: 165 GIYMDTDIISIRSIPE-DNFLAAQASKFSSNGILGFHHHHPFIWQCMENFIEHYNGDIWG 223
Query: 259 ANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
GP ++TR+LK C + + C F+ P+ FYP+ + +W Y + +
Sbjct: 224 HQGPDLMTRMLKLWCNLTDFQEVRDLRCPYFSFLHPQRFYPISYPNWKLYYQVWDVEPS- 282
Query: 318 SLFRDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRYCP 355
F DSYALH+WN + + V GS + R+YCP
Sbjct: 283 --FNDSYALHLWNYMNQEKKAVIRGSNTLAENLHRKYCP 319
>gi|449277567|gb|EMC85680.1| Lactosylceramide 4-alpha-galactosyltransferase [Columba livia]
Length = 174
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 89/162 (54%), Gaps = 1/162 (0%)
Query: 194 LYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFK 253
++K+GG YLD DFIV+K+L++L N G +S V+ + H +++ +N+
Sbjct: 1 MWKFGGIYLDTDFIVLKNLKNLTNVLGTQSKYVLNGAFLSFQPKHKFIELCMQDFVENYN 60
Query: 254 TTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNA 313
+ WG GP +LTR+ K C +S + SC+ + P FYP+ W+ W Y ++
Sbjct: 61 SWIWGHQGPQLLTRVFKKWCSIRSLRSS-TSCKGVSALPREAFYPIRWQDWKKYFEVVSS 119
Query: 314 PATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
LF ++YA+HVWN ++ +++ S+ AQ+ ++CP
Sbjct: 120 LELQHLFHNTYAVHVWNKKSQGTRLEIPSQALLAQLHSQFCP 161
>gi|321471329|gb|EFX82302.1| hypothetical protein DAPPUDRAFT_49282 [Daphnia pulex]
Length = 325
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 139/317 (43%), Gaps = 44/317 (13%)
Query: 70 VEKDFAFDDITRLDVPDNS----IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVY 125
++ FA I L D I F+ETS L RQAC++ESAA N + +Y
Sbjct: 9 IQGQFAPGQIGELCREDGGGEKRIIFIETSGEKC----LRPRQACAVESAARTNRDMSIY 64
Query: 126 VVVIASVRNRTRNPLIDR-------------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQ 172
+ + + NP D+ L NV++++ DL ++ +TPL Y
Sbjct: 65 LYM--ATERPPGNPEADKGEGLQRDCKTMEVLESLANVYVMRSDLPKHLVDTPLESLYFG 122
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAE---SSSVVAA 229
+ S + H+SD LR LYK+GG YLDLD +V++SL L N AG S +
Sbjct: 123 GKLKGSDYAFQHLSDALRVALLYKHGGIYLDLDVVVLRSLRCLRNTAGHTFILGESSIEN 182
Query: 230 GVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS----------YA 279
G + D+ H L +R ++ ++K E GP L+R + C S
Sbjct: 183 GFMAFDRGHKLLKFFMRWMQRSYKPNERSVIGPNGLSRAFQMLCNHPSKVISDSVYDFKC 242
Query: 280 HNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS-FTKRVPV 338
H+ + R ++ F+P+ + + E + F SY++HV+ S RVP
Sbjct: 243 HDGVEIR---LHNKTAFHPITYFEQNRFYEENFDEEELDTFSQSYSVHVYGSGHGARVP- 298
Query: 339 KLGSEQPYAQIARRYCP 355
+A +A ++CP
Sbjct: 299 ---QTSLFAFMANQFCP 312
>gi|327267342|ref|XP_003218461.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Anolis
carolinensis]
Length = 341
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 141/306 (46%), Gaps = 22/306 (7%)
Query: 61 QNINCVLSGVEKDFAFDDITRLDVPDN--SIFFLETSCTHADGVELTLRQACSIESAAMM 118
Q+ CV S K + +T DV SI FLET+ + ++ + CS+ESAA +
Sbjct: 25 QDPGCVFSC--KPISKHSLTPKDVMSQGRSIIFLETT----ENLQPSPLVLCSVESAARI 78
Query: 119 NPGVQVYVVVIASVRNRTRN-----PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ- 172
+ ++ N + P + LY +NV I + L FQ+TPL +Y Q
Sbjct: 79 YLDRPIIFLMKGMDNNSMTDLASSCPSLSFLYAMKNVFIFPLKLNILFQDTPLLPWYLQV 138
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVI 232
+A W + +SD +R +++YGG Y+D D I I+ + + N+ A+SS + G+
Sbjct: 139 NATQEKHW-VYIISDAIRLAMVWRYGGIYMDTDVISIRPI-PVTNFLAAQSSQFSSNGIF 196
Query: 233 HLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIY 291
+ H + + +N+ WG GP ++TR+L C + C N +
Sbjct: 197 GFQQYHQFLWDCMEDFVENYNGDIWGNQGPYLITRMLSKLCNLTDFVDTEDQKCHNISFL 256
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFT--KRVPVKLGSEQPYAQI 349
P+ FYP+ + W Y ++ F +SYALH+WN ++ + +GS +
Sbjct: 257 NPQRFYPIPYGLWTKYYEVWDSRPE---FNNSYALHLWNYMNQDQKKNITVGSNTLVENL 313
Query: 350 ARRYCP 355
R YCP
Sbjct: 314 FRTYCP 319
>gi|148231077|ref|NP_001089625.1| alpha-1,4-N-acetylglucosaminyltransferase precursor [Xenopus
laevis]
gi|68533906|gb|AAH99303.1| A4gnt protein [Xenopus laevis]
Length = 339
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 135/286 (47%), Gaps = 25/286 (8%)
Query: 58 YRVQNINCVLSGVEKDFAFD-DITRLDV--PDNSIFFLETSCTHADGVELTLRQACSIES 114
Y+ N V+ + F FD IT DV P N IFF+ET+ D ++L C++ES
Sbjct: 21 YKAANRKTVVKFIP--FLFDTKITPDDVLSPGNGIFFIETT----DRMDLPSLVLCAVES 74
Query: 115 AAMMNPG--VQVYVVVIASVRNRTRNPL----IDRLYEYQNVHIVQVDLGRYFQNTPLHG 168
AA +NP V ++ + + + + L L Y N+++ + + +TPL
Sbjct: 75 AARINPKKPVAFFMKGLPDINSVEEHTLALNSFPTLATYNNIYLFPLRMENLLTDTPLLP 134
Query: 169 FYTQDAILTSL-WPLSHMS-DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSV 226
+Y + + W SH+S D R +YKYGG Y+D D I +K + N+ AESS +
Sbjct: 135 WYDKVNPKNEIHW--SHVSSDASRLALMYKYGGLYMDTDIISLKPVPE-RNFLVAESSRI 191
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKA-ECK-PQSYAHNIIS 284
+ GV D + + + N+ WG GP + TR+LK C P +
Sbjct: 192 SSNGVFGFDSHRDFTWTCMEDFVKNYNGAIWGHQGPALFTRVLKKFYCDIPPFKGDEDLK 251
Query: 285 CRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN 330
C N + PR FYP+ ++W + A T F +SYALH++N
Sbjct: 252 CGNISFLNPRRFYPIECQYWMKFFQVWKAIPT---FNESYALHLFN 294
>gi|301611157|ref|XP_002935120.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 341
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 136/305 (44%), Gaps = 27/305 (8%)
Query: 66 VLSGVEKDFAFDDITRLDV--PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
+LS + F I + N I FLET+ D +E C+IESAA + P
Sbjct: 32 ILSSISNPFTNTSINPEGILKEGNGIIFLETT----DRMEPPSLVLCAIESAARVYPDRP 87
Query: 124 VYVVVIA---------SVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPL-HGFYTQD 173
V + + + R P + L QNV+I + + F++TPL F D
Sbjct: 88 VVFFMKGLTDINTEEDEKQAKKRFPSLSSL---QNVYIFPLKMEELFKDTPLLKWFLKAD 144
Query: 174 AILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIH 233
+ W + ++SD R +++YGG Y D D I ++ + + N+ AE + V
Sbjct: 145 PKHETYW-IHNLSDGCRMAMMWRYGGFYFDSDVISMRPIPEI-NFLTAEHDQTSGSSVFG 202
Query: 234 LDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYP 292
L H + +L + N+ WG GP + TR+LK C+ ++ + I C N +
Sbjct: 203 LTPHHSFAWTSLNDFVQNYNGDAWGNQGPTLFTRVLKQSCELSAFKSLDNIVCGNISFLH 262
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN--SFTKRVPVKLGSEQPYAQIA 350
P YP+ + W Y + T F +SYALH+WN + ++ V +GS +
Sbjct: 263 PERIYPIPYGGWKRYFEVWDKTPT---FDNSYALHLWNYMNSVEKKTVVIGSNTLVENLY 319
Query: 351 RRYCP 355
++YCP
Sbjct: 320 KQYCP 324
>gi|321472681|gb|EFX83650.1| hypothetical protein DAPPUDRAFT_239742 [Daphnia pulex]
Length = 327
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 147/299 (49%), Gaps = 34/299 (11%)
Query: 86 DNSIFFLETSCTHADGVELTLRQACSIESAAMMNPG--VQVYV----VVIASVRNRTRN- 138
D IFF ETS +L L+Q C++ESAA N VQ+++ SV + + +
Sbjct: 23 DRRIFFHETSGK----ADLNLQQCCAVESAAKHNHDRLVQLFLRPRSTAGCSVGSESSSP 78
Query: 139 ----------PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDL 188
P ++ L +Y NV V V+ YF +PL +Y + L S ++H+SD
Sbjct: 79 FPSTSTFYSPPWLEVLSQYPNVAAVLVNEDHYFAGSPLQDWYKKGQWLKSPHQVAHLSDY 138
Query: 189 LRYVTLYKYGGTYLDLDFIVIKSLESLH--NYAGAESS--SVVAAGVIHLDKDHWLSGAA 244
+R +TL K GG YLD D + +K+ + H N +SS V++ GV+HL++ H L+
Sbjct: 139 IRILTLNKGGGLYLDTDILTLKAYQGDHFRNCLSYDSSDMGVISNGVMHLERGHRLTVQM 198
Query: 245 LRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHW 304
+R + + + +E +G ++ L+ + C + +C++ + FF+ + +
Sbjct: 199 MRLMAEEYDPSEMVFHGSQAVSFLMHSSCGVVQADPSSNTCKDIHLLSSNFFFLIE-RPF 257
Query: 305 ADYL-----NETNAPATMSLFRDSYALHVWNSFT---KRVPVKLGSEQPYAQIARRYCP 355
+D L N TNA +S + SY +H+WNS +R + S+ +A+ +CP
Sbjct: 258 SDVLYDKLSNSTNAIGILSQIKKSYGVHLWNSEKENRQRPFLAADSDSILGALAQIHCP 316
>gi|241002154|ref|XP_002404889.1| secreted protein, putative [Ixodes scapularis]
gi|215491675|gb|EEC01316.1| secreted protein, putative [Ixodes scapularis]
Length = 276
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 117/253 (46%), Gaps = 11/253 (4%)
Query: 104 LTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQN 163
L RQACSIESAA+ NPG+ + ++ +++ + I L N +++ F +
Sbjct: 27 LNFRQACSIESAALHNPGIPIRLLTTSTLSPNCKYTNI--LKALPNFRAELLEISSAFHS 84
Query: 164 TPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAES 223
TP+ +Y +A S H+SD LRY L+++GG Y+DLD I++K L L N
Sbjct: 85 TPMDSWYRSNAWADSTHKTEHISDALRYTILWRHGGIYMDLDVIMLKPLNGLTNCVVMMD 144
Query: 224 SSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNII 283
+ ++ DKDH A + + ++ GPG+L ++ + S A ++
Sbjct: 145 KNRPNNNIMIFDKDHRFITALMDNCLKGYDPDDYNTCGPGLLQQMYE---DGGSLATDMS 201
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSE 343
R T F + E + + + R+SY +H++NS TK +GS
Sbjct: 202 FLRKET------FLAIDIERSGWFFDRDRTASVFKEVRESYGVHIYNSQTKNRTFDIGSG 255
Query: 344 QPYAQIARRYCPR 356
Y +A CPR
Sbjct: 256 CAYELLAIMNCPR 268
>gi|301611161|ref|XP_002935110.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 345
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 128/281 (45%), Gaps = 23/281 (8%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA---------SVRNRTR 137
N I FLET+ D +E C+IESAA + P V + R + R
Sbjct: 59 NGIIFLETT----DRMEPPSLVLCAIESAARVYPDRPVVFFMKGLTDINSEDDEKRAKER 114
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
P L ++NV+I + + + F NTPL +Y + + + ++SD R +++Y
Sbjct: 115 FP---SLSSFENVYIFPLRMEKLFNNTPLLMWYLKADPKRERYWIHNLSDGCRMAMMWRY 171
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
GG Y D D I ++ + N+ AE+ + V L + + +L + N+ W
Sbjct: 172 GGFYFDADVISMRPIPE-KNFLTAENQHTSGSSVFGLSPHNSFAWTSLNDFVQNYNGDAW 230
Query: 258 GANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT 316
G GP + TR+LK C+ ++ + I C N + P YP+ + W Y + T
Sbjct: 231 GNQGPTLFTRVLKQSCELSAFKSLDNIVCGNISFLHPERIYPIPYGGWKRYFEVWDKIPT 290
Query: 317 MSLFRDSYALHVWNSFT--KRVPVKLGSEQPYAQIARRYCP 355
F +SYALH+WN ++ V +GS + ++YCP
Sbjct: 291 ---FDNSYALHLWNYMNSGEKKTVVIGSNTLVENLYKQYCP 328
>gi|7705859|ref|NP_057245.1| alpha-1,4-N-acetylglucosaminyltransferase [Homo sapiens]
gi|25452797|sp|Q9UNA3.1|A4GCT_HUMAN RecName: Full=Alpha-1,4-N-acetylglucosaminyltransferase;
Short=Alpha4GnT
gi|5726306|gb|AAD48406.1|AF141315_1 alpha-1,4-N-acetylglucosaminyltransferase [Homo sapiens]
gi|111601392|gb|AAI19641.1| Alpha-1,4-N-acetylglucosaminyltransferase [Homo sapiens]
gi|119599490|gb|EAW79084.1| alpha-1,4-N-acetylglucosaminyltransferase [Homo sapiens]
Length = 340
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 128/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCSVESAAKIYPEWPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R ++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLDMKRLLEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + G+ H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGIFGFLPHHPFLWECMENFVEHYNSAIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + T F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYREWRRYYEVWD---TEPSF 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
SYALH+WN + V GS + R++CPR
Sbjct: 282 NVSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|301611159|ref|XP_002935122.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 341
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 134/302 (44%), Gaps = 21/302 (6%)
Query: 66 VLSGVEKDFAFDDITRLDV--PDNSIFFLETSCTHADGVELTLRQACSIESAAMM--NPG 121
+LS + F I + N I FLET+ D +E C+IESAA + +
Sbjct: 32 ILSSISTPFTNTSINPQGILKEGNGIIFLETT----DRMEPPSLVLCAIESAARVYTDRP 87
Query: 122 VQVYVVVIASVRNRTRNPLIDR----LYEYQNVHIVQVDLGRYFQNTPL-HGFYTQDAIL 176
V ++ + + + L +QNV+I + + F++TPL F D
Sbjct: 88 VVFFMKGLTDINTEEDEKQAKKTFPSLSSFQNVYIFPLRMQELFKDTPLLKWFLKADPKK 147
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
W + ++SD R +++YGG Y D D I I+ + + N+ AE + V L
Sbjct: 148 EKFW-IHNLSDGCRMAMMWRYGGFYFDSDVISIRPIPEI-NFLTAEHDQTSGSSVFGLTP 205
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRF 295
H + +L + N+ WG GP + TR+LK C+ ++ + I C N + P
Sbjct: 206 HHSFAWTSLNDFVQNYNGNVWGNQGPTLFTRVLKQSCELSAFKSLDNIVCGNISFLHPER 265
Query: 296 FYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFT--KRVPVKLGSEQPYAQIARRY 353
YP+ + W Y + T F +SYALH+WN ++ V +GS + ++Y
Sbjct: 266 IYPISYGGWKRYFEVWDKIPT---FDNSYALHLWNYMNSGEKKTVVIGSNTLVENLYKQY 322
Query: 354 CP 355
CP
Sbjct: 323 CP 324
>gi|111599768|gb|AAI19640.1| Alpha-1,4-N-acetylglucosaminyltransferase [Homo sapiens]
Length = 340
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 128/277 (46%), Gaps = 17/277 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA-----SVRNRTRNPLID 142
I FLETS + +E +CS+ESAA + P V + + + + P
Sbjct: 51 GIVFLETS----ERMEPPHLVSCSVESAAKIYPEWPVVFFMKGLTDSTPMPSNSTYPAFS 106
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R ++TPL +Y Q +A W L SD R ++KYGG Y
Sbjct: 107 FLSAIDNVFLFPLDMKRLLEDTPLFSWYNQINASAERNW-LHISSDASRLAIIWKYGGIY 165
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + N+ A++S + G+ H + +++ + WG G
Sbjct: 166 MDTDVISIRPIPE-ENFLAAQASRYSSNGIFGFLPHHPFLWECMENFVEHYNSDIWGNQG 224
Query: 262 PGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+L+ CK + + + + C N + P+ FYP+ + W Y + + F
Sbjct: 225 PELMTRMLRVWCKLEDFQEVSDLRCLNISFLHPQRFYPISYREWRRYYEVWDTEPS---F 281
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
SYALH+WN + V GS + R++CPR
Sbjct: 282 NVSYALHLWNHMNQEGRAVIRGSNTLVENLYRKHCPR 318
>gi|321456719|gb|EFX67819.1| hypothetical protein DAPPUDRAFT_63609 [Daphnia pulex]
Length = 301
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 136/293 (46%), Gaps = 34/293 (11%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDR---- 143
+IFF+ETS LT RQAC IESAA NP ++ + + N N L +
Sbjct: 3 NIFFVETSGKSC----LTFRQACGIESAAKANPDARIKLYTEKTGMNLPENDLENDHREF 58
Query: 144 ---------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTL 194
L + N+ IV+ +L + TPL + + S W L H+SD R L
Sbjct: 59 FRCPVTSVLLKQMNNIEIVRENLVELLEETPLWQLHKTGSFNRSSWRLFHLSDAARVALL 118
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS-----SVVAAGVIHLDKDHWLSGAALRELR 249
+K GGTYLD+D IV++ LESL+N G + S V GV+ H ++ +
Sbjct: 119 WKKGGTYLDMDCIVMRPLESLNNTIGTVENGPNVPSWVENGVMAFSAGHPFLHFLMKYMV 178
Query: 250 DNFKTTEWGANGPGVLTRLLKAECKPQSY-AHNIISCRNFT---IYPPRFFYPVHWEHWA 305
F+ + + GP L + C ++ A++ ++CR+ + I PP FY ++
Sbjct: 179 LAFEPDNYISLGPDTLRDAMFYFCNRETLPANHWVNCRHNSSIFIQPPESFYAINNSRME 238
Query: 306 DYLNETNAPATMSLF-RDSYALHVWNSFTKRV--PVKLGSEQPYAQIARRYCP 355
+ P L R+S+ H++ + R+ P L YA++AR+YCP
Sbjct: 239 TFYQPEFDPNDWHLLHRNSFLSHIYGAGHGRIAPPGSL-----YAELARKYCP 286
>gi|293349418|ref|XP_002727173.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Rattus
norvegicus]
gi|293361317|ref|XP_001065156.2| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Rattus
norvegicus]
gi|149018795|gb|EDL77436.1| rCG25233 [Rattus norvegicus]
Length = 342
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 132/278 (47%), Gaps = 18/278 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMM--NPGVQVYVVVIASVRNRTRN----PLI 141
SI F+ETS + +E +C++ESAA + + ++ +++ T N P
Sbjct: 52 SIVFIETS----ERLEPPPLVSCAVESAAKIYQEQPILFFMKGLSNATQMTSNTSSYPAF 107
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L NV V +D+ F++TPL +YT+ ++ W L SD R ++KYGG
Sbjct: 108 SLLSAINNVFFVPLDMKTLFEDTPLFSWYTKVNSSREKHW-LHVSSDASRLAIIWKYGGV 166
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I I+ + N+ A+ S + GV H A + +++ + WG
Sbjct: 167 YMDTDVISIRPIPE-ENFLAAQGSQHSSNGVFGFLPHHPFLWACMENFVEHYNSGIWGNQ 225
Query: 261 GPGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP ++TR+LK C+ + + + C N + P+ FYP+ + W Y + +
Sbjct: 226 GPKLMTRMLKIWCRLKDFQGLGDLKCLNISFLHPQRFYPIPYPEWRRYYQVWDRDLS--- 282
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCPR 356
F DSYALH+WN + V GS + +++CP+
Sbjct: 283 FNDSYALHLWNFMNREGKTVVRGSNSLVENLYQKHCPK 320
>gi|301623964|ref|XP_002941274.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 336
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 135/290 (46%), Gaps = 19/290 (6%)
Query: 75 AFDDITRLDV--PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASV 132
+F I+ DV N I F+ET+ D ++ C+IESAA + V V + +
Sbjct: 43 SFSPISPADVLRDGNGIIFIETT----DKMKPPSLVLCAIESAARVYRHRPV-VFFMEGL 97
Query: 133 RNRT--RNPL--IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDL 188
R+ T R+ L + L + NVH+ + + R F TPL +Y + ++ SD
Sbjct: 98 RDITAMRDALKRLPTLSSFHNVHLFPLQMERLFHGTPLGPWYEKVNPEREIYWTHVSSDG 157
Query: 189 LRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALREL 248
R ++++GG Y+D DFI ++ + + N+ A+SS V + G+ L + + +
Sbjct: 158 CRLALIWRHGGIYMDSDFISMRPIPDV-NFLAAQSSDVSSNGIFGLTPQNTFAWKGMESF 216
Query: 249 RDNFKTTEWGANGPGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADY 307
N++ EWG GP + TR+LK C + + + C + + FYP+ + W Y
Sbjct: 217 VQNYRGAEWGHQGPQLFTRVLKQYCITLRFQSTEDVKCGDISFLNEMRFYPIPYPSWRRY 276
Query: 308 LNE-TNAPATMSLFRDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
N P F DSYALH+WN K + GS + + YCP
Sbjct: 277 YEVWQNVPK----FNDSYALHLWNFMNKEQETMVPGSNTLVEHLYQLYCP 322
>gi|301623962|ref|XP_002941275.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 337
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 128/276 (46%), Gaps = 17/276 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRT--RNPL--ID 142
N I F+ET+ D ++ C+IESAA + V V + +R+ T R+ L +
Sbjct: 58 NGIIFIETT----DRMKPPSLVLCAIESAARVYRHRPV-VFFMEGLRDITAMRDTLKRLP 112
Query: 143 RLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYL 202
L + NVH+ + + R F TPL +Y + ++ SD R ++++GG Y+
Sbjct: 113 TLSSFHNVHLFPLQMERLFHGTPLGPWYEKVNPEREIYWTHVSSDGCRLALIWRHGGIYM 172
Query: 203 DLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
D D I ++ + + N+ A+SS V + G+ L + + + N++ EWG GP
Sbjct: 173 DTDIISMRPIPDV-NFLAAQSSGVSSNGIFGLTPQNTFAWKGMESFVQNYRGAEWGHQGP 231
Query: 263 GVLTRLLKAEC-KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNE-TNAPATMSLF 320
+ TR+LK C Q + + C + + P FYP+ + W Y N P F
Sbjct: 232 QLFTRVLKQYCITLQIQSMEDVKCSDISFLNPMRFYPIPYPSWRRYFEVWQNVPK----F 287
Query: 321 RDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
SYALH+WN K + GS + + YCP
Sbjct: 288 NHSYALHLWNFMNKEQKTMVPGSNTLVEHLYQLYCP 323
>gi|301617470|ref|XP_002938167.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 408
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 129/277 (46%), Gaps = 16/277 (5%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMM--NPGVQVYVVVIASVRNRTRNPLIDR- 143
+ I FLET+ + + + C+IESAA + N V ++ + + I
Sbjct: 118 DGIIFLETT----NSLRPSSLVLCAIESAAHVYHNRPVVFFMKGLTDITTMEEESQIRNT 173
Query: 144 ---LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L Y+NV+I + L F++TPL +Y + + + +SD R ++K+GG
Sbjct: 174 FPTLATYKNVYIFPLRLEEIFEDTPLLPWYKKINVTMEPHWIHVLSDACRLALIWKHGGV 233
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D DFI + + + N+ A+SS+ + GV H+ ++ +N+ WG
Sbjct: 234 YMDTDFISVSPIPDV-NFVAAQSSTESSNGVFGFQLQHYFPWNSMENFVENYNGAVWGHQ 292
Query: 261 GPGVLTRLLKAECK-PQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GP + TR+L+ +C P A + C N + P+ FYP+ + W Y +
Sbjct: 293 GPQLFTRVLERQCDLPTFRALEDLMCGNISFLNPQHFYPIPYPSWKQYY---QVWEKLPN 349
Query: 320 FRDSYALHVWNSFTKR-VPVKLGSEQPYAQIARRYCP 355
F +SY+LH+WN K V G+ + +++CP
Sbjct: 350 FNNSYSLHLWNYMNKENKTVVPGTNTLATHLYQQHCP 386
>gi|225440658|ref|XP_002274455.1| PREDICTED: uncharacterized protein At4g19900-like [Vitis vinifera]
Length = 416
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/228 (30%), Positives = 107/228 (46%), Gaps = 30/228 (13%)
Query: 134 NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPLS-HMSDLL 189
+R PL DR + V V D F++TP ++ + PL+ ++S+LL
Sbjct: 181 DRILKPLQDREFR---VLAVAPDASFLFKDTPAEAWFVEMKSGNKDPGEIPLAQNLSNLL 237
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSG 242
R LYKYGG YLD DFI++ SL N GA+S V+ V+ DK+H L
Sbjct: 238 RLAVLYKYGGVYLDTDFIILNKFSSLRNTIGAQSIDPVSGNWSRLNNAVLIFDKNHPLVY 297
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+ E F +WG NGP +++R++ + Y NFT+ PP FYPV W
Sbjct: 298 KFIEEFALTFDGNKWGHNGPYLVSRVVNRVARRPGY--------NFTVLPPMAFYPVDWN 349
Query: 303 HWADYLNETNAPAT--------MSLFRDSYALHVWNSFTKRVPVKLGS 342
DY T + L +++Y +H+WN + ++ ++ GS
Sbjct: 350 RIGDYFPRPKDQVTSKWLETKLLQLGKEAYGVHLWNKQSSKLKIEPGS 397
>gi|147853779|emb|CAN83821.1| hypothetical protein VITISV_030953 [Vitis vinifera]
Length = 413
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/228 (30%), Positives = 107/228 (46%), Gaps = 30/228 (13%)
Query: 134 NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPLS-HMSDLL 189
+R PL DR + V V D F++TP ++ + PL+ ++S+LL
Sbjct: 178 DRILKPLQDREFR---VLAVAPDASFLFKDTPAEAWFVEMKSGNKDPGEIPLAQNLSNLL 234
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSG 242
R LYKYGG YLD DFI++ SL N GA+S V+ V+ DK+H L
Sbjct: 235 RLAVLYKYGGVYLDTDFIILNKFSSLRNTIGAQSIDPVSGNWSRLNNAVLIFDKNHPLVY 294
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+ E F +WG NGP +++R++ + Y NFT+ PP FYPV W
Sbjct: 295 KFIEEFALTFDGNKWGHNGPYLVSRVVNRVARRPGY--------NFTVLPPMAFYPVDWN 346
Query: 303 HWADYLNETNAPAT--------MSLFRDSYALHVWNSFTKRVPVKLGS 342
DY T + L +++Y +H+WN + ++ ++ GS
Sbjct: 347 RIGDYFPRPKDQVTSKWLETKLLQLGKEAYGVHLWNKQSSKLKIEPGS 394
>gi|297740224|emb|CBI30406.3| unnamed protein product [Vitis vinifera]
Length = 495
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/228 (30%), Positives = 107/228 (46%), Gaps = 30/228 (13%)
Query: 134 NRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPLS-HMSDLL 189
+R PL DR + V V D F++TP ++ + PL+ ++S+LL
Sbjct: 260 DRILKPLQDREFR---VLAVAPDASFLFKDTPAEAWFVEMKSGNKDPGEIPLAQNLSNLL 316
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSG 242
R LYKYGG YLD DFI++ SL N GA+S V+ V+ DK+H L
Sbjct: 317 RLAVLYKYGGVYLDTDFIILNKFSSLRNTIGAQSIDPVSGNWSRLNNAVLIFDKNHPLVY 376
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+ E F +WG NGP +++R++ + Y NFT+ PP FYPV W
Sbjct: 377 KFIEEFALTFDGNKWGHNGPYLVSRVVNRVARRPGY--------NFTVLPPMAFYPVDWN 428
Query: 303 HWADYLNETNAPAT--------MSLFRDSYALHVWNSFTKRVPVKLGS 342
DY T + L +++Y +H+WN + ++ ++ GS
Sbjct: 429 RIGDYFPRPKDQVTSKWLETKLLQLGKEAYGVHLWNKQSSKLKIEPGS 476
>gi|321460518|gb|EFX71560.1| hypothetical protein DAPPUDRAFT_60032 [Daphnia pulex]
Length = 206
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 96/184 (52%), Gaps = 4/184 (2%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQN 149
FF+E+S A L RQAC++ES A+ NP + V V+ N + + + + Y N
Sbjct: 11 FFIESSGNGA----LGYRQACAVESLALHNPNLTVNVLFTDVKINTSLDTVQKLVKNYAN 66
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
V ++ +++ Y T + +Y + + ++++S+ LR +T+YK+GG Y DLD I +
Sbjct: 67 VQLMSINVDEYMAGTLIEHWYQCTNWRSGSYHVNNLSNALRLLTVYKFGGYYFDLDIISV 126
Query: 210 KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLL 269
+ + S N+ A +V VIH D H A+ NF+ WG NGP ++ R+L
Sbjct: 127 RPVTSYRNFVAAVDREIVNNNVIHADAKHPFIELAIDNFVTNFRPDLWGNNGPALIFRVL 186
Query: 270 KAEC 273
K C
Sbjct: 187 KKWC 190
>gi|118789097|ref|XP_555204.2| AGAP008261-PA [Anopheles gambiae str. PEST]
gi|116123057|gb|EAL39605.2| AGAP008261-PA [Anopheles gambiae str. PEST]
Length = 347
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 139/281 (49%), Gaps = 14/281 (4%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIAS---VRN--RTRNPLI 141
+S++F+E+S + + RQAC+IESAA NP ++ +V+ AS + N + R P +
Sbjct: 61 SSVYFIESSAPFRRIITIGPRQACAIESAARANP-LKKIIVLFASWNPITNPSQVRFPDL 119
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L + NVH +DL R+ Q TP+ D + ++S++LR V LYKYGG Y
Sbjct: 120 PTLAGFGNVHFRWLDLNRFAQGTPVEAVIRSDMLHERPNGAEYLSEILRLVLLYKYGGIY 179
Query: 202 LDLDFIVIKSLESLH-NYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA- 259
LDLD + +K+L+ ++ N+ GAE+ +V VI L + + A R L DNFK +
Sbjct: 180 LDLDVVTLKTLDFVNANFFGAETERLVGTSVIGLRRGGFGELFAERCL-DNFKYFDEQKN 238
Query: 260 --NGPGVLTRLLKAECKPQSYAHNIIS-CRN-FTIYPPRFFYPVHWEHWADYLNETNAPA 315
NG +LT + C+ + + + CR ++ F+P + + +
Sbjct: 239 IRNGSFLLTYQVVQTCETLTLQEILDNGCRGMLQVHRRSIFHPFDETNVGMMFDPSRLEE 298
Query: 316 TMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ +H+ + + ++ V G Y IA+ YCPR
Sbjct: 299 AKDRLAHAMTVHMLHRTSGKMRVA-GGTTGYQLIAQNYCPR 338
>gi|301625550|ref|XP_002941968.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 337
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 130/295 (44%), Gaps = 28/295 (9%)
Query: 73 DFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVI--- 129
+F DD+ P N IFF+ET+ D L L C++ESAA +NP V +
Sbjct: 41 NFTLDDVLS---PGNGIFFIETT-DRMDPPSLVL---CAVESAARINPDRPVAFFMKGLP 93
Query: 130 ----ASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHM 185
A +NR RN L Y N++ + + +TPL +Y + +
Sbjct: 94 DINSAEGQNRARNSF-PTLAPYNNIYFFPLRMELLLSDTPLLPWYQKVNPEKEVHWTHVS 152
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAAL 245
SD R +YKYGG Y+D+D I ++ + + N+ AESS + + GV D + +
Sbjct: 153 SDASRLALMYKYGGLYMDIDVISLRPV-PVENFLVAESSQISSNGVFGFDSHRDFTWTCM 211
Query: 246 RELRDNFKTTEWGANGPGVLTRLLKA-ECK-PQSYAHNIISCRNFTIYPPRFFYPVHWEH 303
+ N+ G GP + TR+ K C P + C N + PR FYP+ W
Sbjct: 212 EDFVKNYNGAIRGHQGPALFTRVFKQFYCDIPPFKGDEDLKCGNISFLNPRRFYPIDWMK 271
Query: 304 WADYLNETNAPATMSLFRDSYALHVWNS---FTKRVPVKLGSEQPYAQIARRYCP 355
+ D A F SYALH++NS + +RV V GS + + CP
Sbjct: 272 FFDIWKAIPA------FNKSYALHLFNSAHRYKRRVMVP-GSNTLVEHLYIQNCP 319
>gi|346464647|gb|AEO32168.1| hypothetical protein [Amblyomma maculatum]
Length = 284
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 103/220 (46%), Gaps = 12/220 (5%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQ 148
I+F+E+S + L RQAC++ESAA+ NP + V +++ S+ N I L
Sbjct: 77 IWFVESSLRNETC--LNERQACAVESAALRNPTLTVNLLITGSMANNCST--IRTLSTLP 132
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
N +D+ FQ TPL+ +Y TS + MSD LR++ L+K G YLDLD IV
Sbjct: 133 NFRTTIIDVQTEFQGTPLYPWYKTGTWKTSKNKVEDMSDALRWLVLWKRGXIYLDLDVIV 192
Query: 209 IKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
+K+L+ L N E+ + H GA L + TT+WG+ GP + +
Sbjct: 193 LKALKDLKNGGAYETPGFPGTAALFFQNQHPFLGAVLESCIREYDTTKWGSCGPTLFYHV 252
Query: 269 LKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYL 308
LK S I+ + FY +++ W +
Sbjct: 253 LKRWGSGSSSPVRILRTES--------FYAINYNDWQKFF 284
>gi|255581057|ref|XP_002531344.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
gi|223529042|gb|EEF31028.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
Length = 413
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 110/228 (48%), Gaps = 31/228 (13%)
Query: 135 RTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL----WPLS-HMSDLL 189
R PL+D ++ V V DL F+NTP + Q+ + + PLS ++S+L+
Sbjct: 178 RILKPLVDVGFK---VAAVTPDLQFLFKNTPAE-IWLQEMMSGNKDPGEIPLSQNLSNLI 233
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSG 242
R +YKYGG Y+D DFI +KS + L N GA+S V+ V+ DK+H L
Sbjct: 234 RLAVIYKYGGIYIDTDFIFLKSFKGLRNSIGAQSIDAVSRNWTRLNNAVLVFDKNHPLMY 293
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+ E F +WG NGP +++R++ Y NFT+ PP+ FYPV W
Sbjct: 294 KFIEEFAATFDGNKWGHNGPYLVSRVVARVAGRPEY-------NNFTVLPPKAFYPVDWN 346
Query: 303 HWADYLNETNAPAT--------MSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + A + L ++Y LH+WN + R+ ++ GS
Sbjct: 347 RIGGFFKKPEDQAASRWVKAKLLQLSGETYGLHLWNKQSCRIRIEEGS 394
>gi|427781955|gb|JAA56429.1| Putative secreted protein [Rhipicephalus pulchellus]
Length = 306
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 121/277 (43%), Gaps = 17/277 (6%)
Query: 89 IFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIA----SVRNRTRNPLIDRL 144
I FLET H +T R AC++ESAA +NP VY++ +A ++++ + L
Sbjct: 22 ITFLETG-GHG---RITARMACAVESAARLNPDWTVYLLSVAHGEEALKDNATSSFSQLL 77
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
NV + + FQ TPL +Y + S +P+ H++D LR +YK GG YLD+
Sbjct: 78 TSIPNVVMGTIKPQEVFQGTPLESWYESGILNKSAYPVEHLADALRLAVVYKEGGVYLDI 137
Query: 205 DFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
D IV++SL+SL A +V + + R +K EW + G
Sbjct: 138 DVIVMRSLDSLPPCVCQAPVNGGDMVGNAFFAFHRGDPFLLYLMETARKVYKPREWSSIG 197
Query: 262 PGVLTRLLKAECKPQS----YAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM 317
P +L + A C+ ++ H FT+ P F P+ W + +
Sbjct: 198 PLLLRQATLARCRAKTVKRILGHRCGGDEGFTVMPHWIFMPISAGDWKLHFAANASRQVW 257
Query: 318 SLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
+ SY +H +N+ + + G Y + A YC
Sbjct: 258 IMSAGSYVIHFYNALSSKTHAVPGCF--YREAAELYC 292
>gi|301611163|ref|XP_002935115.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 348
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 129/278 (46%), Gaps = 17/278 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMM--NPGVQVYVVVIASVRN----RTRNPL 140
N + FLET+ D +E C+IESAA + + V ++ + + + +
Sbjct: 70 NGVIFLETT----DRMEPPSLVLCAIESAARVYTDRPVVFFMKGLTDINSEEDEKQAKEK 125
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L ++NV+I + + F++TPL +Y + + + ++SD R +++YGG
Sbjct: 126 FPSLSSFENVYIFPLRMEELFKDTPLSEWYKKGNERWEPYWIHNLSDACRMAMIWRYGGI 185
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y D D I I+ + N+ A+S+ + V L + + L + N++ WG
Sbjct: 186 YFDADVISIRPIPE-KNFLTAQSTDTSGSSVFGLTPHNKFAWKCLNDFVLNYRGDIWGHQ 244
Query: 261 GPGVLTRLLKAEC-KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL 319
GPG+ TR+LK C P + I C N + P YP+ +++W Y + +
Sbjct: 245 GPGLFTRVLKPLCVMPDFKVIDDIICGNISCLKPERIYPIPYQNWKKYFEVWDQVPS--- 301
Query: 320 FRDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCP 355
F ++Y +H+WN + + V GS + R+YCP
Sbjct: 302 FNNTYGVHLWNYMNRDEKKSVVPGSNTLVDHLYRQYCP 339
>gi|224140397|ref|XP_002323569.1| predicted protein [Populus trichocarpa]
gi|222868199|gb|EEF05330.1| predicted protein [Populus trichocarpa]
Length = 279
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 108/227 (47%), Gaps = 31/227 (13%)
Query: 135 RTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPLS-HMSDLLR 190
R PL+DR ++ V + DL F+NTP ++ + PL+ ++S+L+R
Sbjct: 52 RILKPLLDRKFK---VAAITPDLSFLFKNTPAETWFEEIKSGNKDPGEIPLAQNLSNLIR 108
Query: 191 YVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA------GVIHLDKDHWLSGAA 244
LYK+GG YLD DFIV+KS L N GA+S V + V+ D +H L
Sbjct: 109 LAVLYKFGGIYLDTDFIVLKSFADLRNAIGAQSIDVSKSWTRLNNAVLVFDMNHPLLLKF 168
Query: 245 LRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHW 304
+ E F +WG NGP +++R+++ Y NFT+ PP FYPV W
Sbjct: 169 IEEFASTFDGNKWGHNGPYLVSRVVQKVAGRPGY--------NFTVLPPMAFYPVGWNRI 220
Query: 305 ADYLNE---------TNAPATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + NA + L ++Y LH+WN + + ++ GS
Sbjct: 221 GGFFKKPVNKVESRWVNA-KLLQLSGETYGLHLWNRQSSKFSIEEGS 266
>gi|449440279|ref|XP_004137912.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Cucumis sativus]
Length = 414
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 107/227 (47%), Gaps = 35/227 (15%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ---DAILTSLWPLS-HMSDLLRYVTL 194
PL+D ++ Q + DL F+NTP+ ++ + PL+ ++S+L+R L
Sbjct: 184 PLLDHGFKIQ---AIAPDLPLLFKNTPVEAWFDEMKSGKKDPGQIPLAQNLSNLMRLAVL 240
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS-------SVVAAGVIHLDKDHWLSGAALRE 247
YKYGG Y+D DFIV+KS L N GA+S +++ V+ DK H L +
Sbjct: 241 YKYGGVYIDTDFIVLKSFMGLKNSIGAQSIDPVTKNWTILNNAVLVFDKKHPLLEKFMEN 300
Query: 248 LRDNFKTTEWGANGPGVLTRLLK----AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH 303
NF + WG NGP +++R++ A KP N TI PP FYPV W
Sbjct: 301 FASNFDGSRWGHNGPFLVSRVIAKITGARAKPGF---------NVTILPPAAFYPVDWIK 351
Query: 304 WADYLNETNAPATMS--------LFRDSYALHVWNSFTKRVPVKLGS 342
+ + A S L ++Y +H+WN +K ++ GS
Sbjct: 352 IGELFKKPGNRAVESWAKAKLDQLNNETYGIHLWNKQSKSYVIQKGS 398
>gi|321471304|gb|EFX82277.1| hypothetical protein DAPPUDRAFT_49182 [Daphnia pulex]
Length = 271
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 104/222 (46%), Gaps = 13/222 (5%)
Query: 148 QNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
N+ V L + + L +Y + +SH+SD LR++TL KYGG Y DLD +
Sbjct: 41 NNIQFATVSLDEFIAGSLLEKWYHCTDWRKGPYHVSHLSDGLRFLTLNKYGGYYFDLDVL 100
Query: 208 VIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTE---WGANGPGV 264
+++ + N+ + VA+ IH D ++ + AL+ N++ + W NGP +
Sbjct: 101 MVRPVTYYSNFVASPDDYNVASNSIHADLNNPVMELALKNFVANYRQVKADIWAHNGPAL 160
Query: 265 LTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADY--------LNETNAPA 315
L R+LK C A N + C+ F + P F PVH+ +W + + ET+
Sbjct: 161 LLRVLKKWCNTSDVASMNYVRCKGFHVLPRPTFNPVHFSNWDELFVQQQRSTIKETSTTG 220
Query: 316 TMSLFRDSY-ALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
T D +H N + + S Q Y ++AR++CPR
Sbjct: 221 TPKWLTDQVIGVHTCNKMSNGGLLFKNSTQGYIRLARQHCPR 262
>gi|356494971|ref|XP_003516354.1| PREDICTED: uncharacterized protein At4g19900-like [Glycine max]
Length = 420
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 125/272 (45%), Gaps = 39/272 (14%)
Query: 107 RQACSIESAAMMNPGVQVYVVVIASVRN-----RTRNPLIDRLYEYQNVHIVQVDLGRYF 161
R+ S+ES +P + +++++ + R PL+DR ++ Q DL
Sbjct: 150 RELLSVESIFKNHP--KACLIILSRTLDSRHGYRILKPLLDRGFKVQ---ATAPDLSFLV 204
Query: 162 QNTPLHGFYTQ---DAILTSLWPLS-HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
+ TP+ ++ + PLS ++S+L+R LYKYGG Y+D DFIV+K L L N
Sbjct: 205 KGTPVEAWFRELRKGRKDPGEIPLSQNLSNLIRLAVLYKYGGIYIDTDFIVLKPLTGLRN 264
Query: 218 YAGAESSSV-------VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLK 270
GA+S ++ + V+ D H L + E F +WG NGP +++R++K
Sbjct: 265 SIGAQSMNLDSKHWTRLNNAVLIFDIGHQLLHRFINEFALTFDGNKWGHNGPYLVSRVIK 324
Query: 271 AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT--------MSLFRD 322
K + NFT+ PP FYPV W + + L R
Sbjct: 325 RLGKRHDF--------NFTVLPPMAFYPVDWNKINGLFMKPKTQEESKWVEAKLLQLRRK 376
Query: 323 SYALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
+Y +H+WN + R+ ++ GS ++A YC
Sbjct: 377 TYGIHLWNKHSSRLTIEEGS--IMGRLASDYC 406
>gi|301617466|ref|XP_002938165.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 356
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 142/302 (47%), Gaps = 34/302 (11%)
Query: 72 KDFAFDDITRLDV--PDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV--V 127
++ F I+ +D+ N I F+ET+ D ++ C+IESAA +VY
Sbjct: 57 ENLNFLTISPVDILSKGNGIIFVETT----DRMKPPSLVLCAIESAA------RVYKDRP 106
Query: 128 VIASVRNRTRNPLIDRLYEYQ----------NVHIVQVDLGRYFQNTPLHGFYTQ-DAIL 176
V+ ++ +R L++ E Q N++ + + + F+ TPL +Y + +
Sbjct: 107 VVFFMKGLSRINLVNDELEVQKSFPTLSYLDNIYFLPLRMEEVFRGTPLLPWYMKINPKK 166
Query: 177 TSLWPLSHMS-DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLD 235
W +H+S D R ++K+GG Y+D D I ++ + + N+ A+SS + G+ L
Sbjct: 167 EKHW--THVSSDGCRLALIWKHGGIYMDTDIISLRPIPDV-NFLAAQSSQFSSNGIFGLF 223
Query: 236 KDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISCRNFTIYPPR 294
H S ++ N+ T WG GP + TR+L +C P + + C N + P+
Sbjct: 224 PHHNFSWRSMENFVQNYNGTIWGHQGPQLFTRVLGQDCVIPPFKSTEDVVCGNISFLNPQ 283
Query: 295 FFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRY 353
FYP+ + W Y E T F DSYALH+WN + + + GS + ++Y
Sbjct: 284 RFYPIPYPEWRKYYEEWKDYPT---FNDSYALHLWNYMNQEQRTIIPGSNTLIDHLYKQY 340
Query: 354 CP 355
CP
Sbjct: 341 CP 342
>gi|449483691|ref|XP_004156661.1| PREDICTED: lactosylceramide 4-alpha-galactosyltransferase-like
[Cucumis sativus]
Length = 414
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 107/227 (47%), Gaps = 35/227 (15%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ---DAILTSLWPLS-HMSDLLRYVTL 194
PL+D ++ Q + DL F+NTP+ ++ + PL+ ++S+L+R L
Sbjct: 184 PLLDHGFKIQ---AIAPDLPLLFKNTPVEAWFDEMKSGKKDPGQIPLAQNLSNLMRLAVL 240
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS-------SVVAAGVIHLDKDHWLSGAALRE 247
YKYGG Y+D DFIV+KS L N GA+S +++ V+ DK H L +
Sbjct: 241 YKYGGVYIDTDFIVLKSFMGLKNSIGAQSIDPVTKNWTILNNAVLVFDKKHPLLEKFMEN 300
Query: 248 LRDNFKTTEWGANGPGVLTRLLK----AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH 303
NF + WG NGP +++R++ A KP N T+ PP FYPV W
Sbjct: 301 FASNFDGSRWGHNGPFLVSRVIAKITGARAKPGF---------NVTVLPPAAFYPVDWIK 351
Query: 304 WADYLNETNAPATMS--------LFRDSYALHVWNSFTKRVPVKLGS 342
+ + A S L ++Y +H+WN +K ++ GS
Sbjct: 352 IGELFKKPGNRAVESWAKAKLDQLNNETYGIHLWNKQSKSYVIQKGS 398
>gi|297827371|ref|XP_002881568.1| hypothetical protein ARALYDRAFT_903014 [Arabidopsis lyrata subsp.
lyrata]
gi|297327407|gb|EFH57827.1| hypothetical protein ARALYDRAFT_903014 [Arabidopsis lyrata subsp.
lyrata]
Length = 404
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 122/260 (46%), Gaps = 36/260 (13%)
Query: 107 RQACSIESAAMMNPGVQVYVVVIASVRNRTRN----PLIDRLYEYQNVHIVQVDLGRYFQ 162
R+ ++ES +P ++V S+ ++ + PLIDR Y+ V D+ +
Sbjct: 139 RELLAVESVFKSHPR-GCLMIVSGSMESQQGDSILKPLIDRGYK---VFAATPDISLLLE 194
Query: 163 NTPLHGFYTQDAILT---SLWPL-SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
NTP ++ + PL ++S+L R LYKYGG YLD DFIV +S + L N
Sbjct: 195 NTPAKSWFQEMKSCKRDPGRIPLHQNLSNLARLAILYKYGGVYLDTDFIVTRSFKGLRNS 254
Query: 219 AGAESS--------SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLK 270
G ++ + V+ +K H L + + E F +WG NGP ++TR+++
Sbjct: 255 IGVQTLLEGDSKNWKTLNNAVLIFEKHHPLVYSFMEEFASTFDGNKWGHNGPCLVTRVVQ 314
Query: 271 AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYL------NETNAPAT--MSLFRD 322
A I NFT+ PP FYP +W N++ T + L R+
Sbjct: 315 -------RARETIG-DNFTVLPPVAFYPFNWLDIPRLFQTPRSSNDSTLLKTDLVKLNRE 366
Query: 323 SYALHVWNSFTKRVPVKLGS 342
SY LH+WN FT+++ + GS
Sbjct: 367 SYGLHLWNKFTRKLKIGKGS 386
>gi|321471326|gb|EFX82299.1| hypothetical protein DAPPUDRAFT_316981 [Daphnia pulex]
Length = 323
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 130/298 (43%), Gaps = 36/298 (12%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDR 143
V + +IFFLETS + L RQAC+IESAA NP +++ V + S P +D
Sbjct: 20 VTEKTIFFLETSGANC----LRPRQACAIESAARTNPDMKIRVHMATSPP--PGKPELDG 73
Query: 144 LY-------------EYQNVHIVQVDLGRYFQNTPLHGFYTQDAIL-TSLWPLSHMSDLL 189
Y NV IV+ DL R+ TPL S + H+SD +
Sbjct: 74 GYGLDANCQSMDVLNRLDNVRIVREDLTRHLLGTPLEALLGGGGQFEKSQFSYQHLSDAV 133
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA----ESSSVVAAGVIHLDKDHWLSGAAL 245
R L+K GG YLDLD +V++SL L N AG E + + GV+ DK H L +
Sbjct: 134 RIAMLHKSGGIYLDLDVVVLRSLGCLRNTAGEVRSPEYKAGIENGVLIFDKGHELLNQYM 193
Query: 246 RELRDNFKTTEWGANGPGVLTRLLKAECK----PQSYAHNIISCR---NFTIYPPRFFYP 298
R + + + GP + + C + CR N T+ FYP
Sbjct: 194 RLMEREYDPLGRESIGPLAFLKAAREFCGFDVCDGCNFGQLWVCRDNWNLTVLYTEAFYP 253
Query: 299 VHWEHWADYLNETNAPAT-MSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ + + + E N P T + + SY +HV+ + ++ Y +A+R+CP
Sbjct: 254 IPFRNRERFY-EPNFPLTELDNLQTSYVVHVYGAGHG---AQVAPSSLYGFLAQRFCP 307
>gi|321471330|gb|EFX82303.1| hypothetical protein DAPPUDRAFT_316974 [Daphnia pulex]
Length = 390
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 131/300 (43%), Gaps = 40/300 (13%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDR 143
V + +IFFLETS + L RQAC+IESAA NP +++ V + S P +D
Sbjct: 87 VTEKTIFFLETSGANC----LRPRQACAIESAARTNPDMKIRVHMATSPP--PGKPELDG 140
Query: 144 LY-------------EYQNVHIVQVDLGRYFQNTPLHGFYTQDAIL-TSLWPLSHMSDLL 189
Y NV IV+ DL R+ TPL S + H+SD +
Sbjct: 141 GYGLDANCQSMDVLNRLDNVRIVREDLTRHLLGTPLEALLGGGGQFEKSQFSYQHLSDAV 200
Query: 190 RYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA----ESSSVVAAGVIHLDKDHWLSGAAL 245
R L+K GG YLDLD +V++SL L N AG E + + GV+ DK H L +
Sbjct: 201 RIAMLHKSGGIYLDLDVVVLRSLGCLRNTAGEVRSPEYKAGIENGVLIFDKGHELLNQYM 260
Query: 246 RELRDNFKTTEWGANGPGVLTRLLKAECK----PQSYAHNIISCR---NFTIYPPRFFYP 298
R + + + GP + + C + CR N T+ FYP
Sbjct: 261 RLMEREYDPLGRESIGPLAFLKAAREFCGFDVCDGCNFGQLWVCRDNWNLTVLYTEAFYP 320
Query: 299 VHWEHWADYLNETNAPAT-MSLFRDSYALHVWNS--FTKRVPVKLGSEQPYAQIARRYCP 355
+ + + + E N P T + + SY +HV+ + + P L Y +A+R+CP
Sbjct: 321 IPFRNRERFY-EPNFPLTELDNLQTSYVVHVYGAGHGAQVTPSSL-----YGFLAQRFCP 374
>gi|356523288|ref|XP_003530272.1| PREDICTED: uncharacterized protein At4g19900-like [Glycine max]
Length = 392
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/260 (27%), Positives = 123/260 (47%), Gaps = 37/260 (14%)
Query: 107 RQACSIESAAMMNPGVQVYVV--VIASVRN-RTRNPLIDRLYEYQNVHIVQVDLGRYFQN 163
R+ SI+S ++P + ++ + ++R+ R P++D ++ Q V DL F+
Sbjct: 122 RELMSIDSVFKVHPKACLVILSRTLDTIRSYRVLKPILDEGFKVQPV---TPDLQFLFKG 178
Query: 164 TPLHGFYTQ------DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
TP + + D SL+ ++S+L+R LYKYGG YLD+DF+V+K + L N
Sbjct: 179 TPAEAWLNELKKGKKDPGQISLF--QNLSNLIRLAVLYKYGGVYLDIDFVVLKPISLLRN 236
Query: 218 YAGAESSSV-------VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLK 270
GA+S + V+ D +H L + E F WG NGP +++R++K
Sbjct: 237 SIGAQSMDAGNKHWTRLNNAVLIFDMNHPLLLRFIDEFVLTFDGNRWGHNGPYLVSRVVK 296
Query: 271 AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT--------MSLFRD 322
+ + NFTI PP FYP W+ + + + L +
Sbjct: 297 RLGEKPGF--------NFTILPPIAFYPADWKKIGGLFRKPKTRSESKLVDAKLLQLSGE 348
Query: 323 SYALHVWNSFTKRVPVKLGS 342
SY +H+WN ++R+ ++ GS
Sbjct: 349 SYGVHLWNKESRRLKIEEGS 368
>gi|297827369|ref|XP_002881567.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297327406|gb|EFH57826.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 750
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 110/237 (46%), Gaps = 35/237 (14%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPLS-HMSDLLRYVTL 194
PLIDR Y+ V D+ +NTP ++ + PL ++S+L R L
Sbjct: 162 PLIDRGYK---VFAATPDISLLLENTPAKSWFQEMKSCKRDPGKIPLQQNLSNLARLAIL 218
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS--------SVVAAGVIHLDKDHWLSGAALR 246
YKYGG YLD DFIV +S + L N GA++ + + V+ +KDH L + +
Sbjct: 219 YKYGGVYLDTDFIVTRSFKGLKNTIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVFSFIE 278
Query: 247 ELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWAD 306
E F +WG NGP ++TR+ + + + NFT+ PP FYP W +
Sbjct: 279 EFASTFDGNKWGHNGPYLVTRVAQRARE--------TTGDNFTVLPPVAFYPFTWLNIPR 330
Query: 307 YL------NETNAPAT--MSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
N++ T + L R+SY LH+WN T+ KL E P + ++ P
Sbjct: 331 LFQTPRSSNDSRILKTDLVKLNRESYGLHLWNKITR----KLKIESPNSTVSDNNPP 383
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 116/261 (44%), Gaps = 39/261 (14%)
Query: 107 RQACSIESAAMMNPGVQVYVVV--IASVRNRT-RNPLIDRLYEYQNVHIVQVDLGRYFQN 163
R+ ++++ NPG + ++ + S R T PL+DR + N+ V +D+ +N
Sbjct: 483 REMLAVDTLFTTNPGACLAILSNSLDSPRGYTILKPLLDRGF---NLIAVTLDIPFLVKN 539
Query: 164 TP----LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
TP L + S+ ++SDL R LYKYGG YLD D I + + L N
Sbjct: 540 TPAEAWLKRLKSGHMDPGSIPLFMNLSDLTRLAVLYKYGGVYLDTDIIFLNDMTGLRNAI 599
Query: 220 GAESS-------SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
GA+S + + V+ D H L L+E F WG N P +++R++K
Sbjct: 600 GAQSIDPGTKRWTRLNNAVMVFDIYHPLMREFLQEYATTFDGNRWGYNSPYLVSRVIKRL 659
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT-----------MSLFR 321
Y N TI+ P FYPV+W + PAT + +
Sbjct: 660 GNKPGY--------NLTIFSPDAFYPVNWIKIQKLFKK---PATTREAKWVEKTVQDMNK 708
Query: 322 DSYALHVWNSFTKRVPVKLGS 342
SY +H+WN T+++ ++ GS
Sbjct: 709 GSYMIHLWNKVTRKIKIEEGS 729
>gi|449266978|gb|EMC77956.1| Alpha-1,4-N-acetylglucosaminyltransferase, partial [Columba livia]
Length = 341
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 120/276 (43%), Gaps = 17/276 (6%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL------I 141
SI FLET+ + +E +CS+ESAA + + ++ + + N T L
Sbjct: 53 SIIFLETT----ERLEPPPLVSCSVESAARIYQDRPI-ILFMKGLTNDTAFDLNSSYAAF 107
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L +NV I + + FQ TPL +Y + + SD R ++KYGG Y
Sbjct: 108 SLLSSMKNVFIFPLQMETIFQETPLLQWYNEVVPEQEKNWVHISSDASRLALIWKYGGIY 167
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+D D I I+ + ++ A+ S + G+ H + + WG G
Sbjct: 168 MDTDVISIRPIPE-ESFLAAQKSRFSSNGIFGFPAHHKFIWDCMENFVLKYNGNIWGNQG 226
Query: 262 PGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF 320
P ++TR+LKA C + SCRN + P+ FYP+ + W Y + F
Sbjct: 227 PFLMTRMLKAICNLTDFKGTEDHSCRNISFLNPQRFYPIPYPAWGRYYEVWDKSPN---F 283
Query: 321 RDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRYCP 355
SYALH+WN + R V GS ++ + YCP
Sbjct: 284 NHSYALHLWNFMNRNRRAVVAGSNTLVEKLYKAYCP 319
>gi|168049041|ref|XP_001776973.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671674|gb|EDQ58222.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 276
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 112/226 (49%), Gaps = 34/226 (15%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTS----LWPLSHMSDLLRYVTL 194
P I+R Y + V ++ F+N P ++ Q T + + +MS+++R L
Sbjct: 45 PFIERGYR---IMAVTPNVISLFENLPAGEWFKQQRDGTGDPGCINFMQNMSNIMRLTVL 101
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D IV+KS + L N GA+S S+ V+ D++H + LRE
Sbjct: 102 YKYGGIYLDSDVIVLKSFDGLRNVVGAQSRSIAVGEWTRLNNAVLVFDREHPVVYEFLRE 161
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW------ 301
F ++WG NGP ++TR+L+ + K Q + + C + ++ P FYP++W
Sbjct: 162 FVATFDGSKWGWNGPYLVTRVLQ-KVKEQQWQN----CSSVSVLPLEAFYPLNWVDIVAF 216
Query: 302 -----EHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGS 342
EH D + M+ + SYA+H+WN + + V+ GS
Sbjct: 217 FHAHSEH--DQRWQEKKLEVMN--QKSYAIHLWNKKSSHLRVEKGS 258
>gi|241851436|ref|XP_002415768.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215509982|gb|EEC19435.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 300
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 125/274 (45%), Gaps = 20/274 (7%)
Query: 97 THADGVELTLRQACSIESAAMMNPGVQVYVVVIA-------SVRNRTRNPLIDRLYEYQN 149
T G LT RQAC ++SAA+ NP V++++++ + R + + L +N
Sbjct: 17 TTEPGRTLTRRQACVVQSAALHNPTFNVHLLLLSHPSQIPVPLDTREDSLFLKPLSRMRN 76
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
VHI + F + LH + + S P H+ D ++ + L+ +GGT+LDLDF+V+
Sbjct: 77 VHIWNLQTTDLFLGSYLHSWLERG---ISGRP-DHLMDAIKVLVLWNFGGTFLDLDFLVL 132
Query: 210 KSLE-SLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
+S + L N + + +K H L G L++ N+ E G VLTR
Sbjct: 133 RSFQHHLDNSVLEYGGGFLTTRFLSFEKGHPLLGVWLKDFNLNYSPDEVVDFGNVVLTRN 192
Query: 269 LKAECKPQSYAH--NIISCRNFTIYPPRFFYPVHWEHWAD-YLNETNAPAT----MSLFR 321
++ C S +C+ + P + FYPVH D + N+ +P MS
Sbjct: 193 VRNLCNVSSLDEVGKRRTCK-VDVIPGKLFYPVHRREADDIFSNDIYSPDASQHLMSRTV 251
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+SYA+ +W+ T + K Y RR CP
Sbjct: 252 NSYAICLWSDITSGIAYKRNYAPSYVAFERRQCP 285
>gi|357606056|gb|EHJ64901.1| hypothetical protein KGM_19366 [Danaus plexippus]
Length = 181
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 96/179 (53%), Gaps = 12/179 (6%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-HNYAGAESSSVVAAGVIHLDKD--- 237
+ D+L+Y++LY++GG +LDLD I+ ++L SL N+A E+++ V G++ + K+
Sbjct: 1 MERTRDVLKYLSLYRFGGIFLDLDIIIARTLGSLARNWAARENANKVGDGILAISKNSIG 60
Query: 238 HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISCRNFTIYPPRFF 296
H ++ AA+R + +K +W V+ +L+ C P + + +C F +Y +FF
Sbjct: 61 HNITDAAIRYIVSIYKNNDWCKESQDVVMGVLQELCSTPDANYMSAATCNGFEVYGSQFF 120
Query: 297 YPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
YP+ + +Y P + +Y H+W T K+ PY+++ARR+CP
Sbjct: 121 YPIEKQSAREYF----VPGEVQDL-SAYIYHLWGDVTN--GYKISKSSPYSKLARRFCP 172
>gi|238479487|ref|NP_001154561.1| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|330254402|gb|AEC09496.1| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 380
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/224 (31%), Positives = 106/224 (47%), Gaps = 31/224 (13%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPL-SHMSDLLRYVTL 194
PL DR Y+ V D+ +NTP ++ + PL ++S+L R L
Sbjct: 150 PLNDRGYK---VFAATPDMSLLLENTPAKSWFQEMKSCKRDPGRIPLHQNLSNLARLAFL 206
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS--------SVVAAGVIHLDKDHWLSGAALR 246
YKYGG YLD DFIV +S + L N GA++ + + V+ +KDH L + +
Sbjct: 207 YKYGGVYLDTDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVYSFIE 266
Query: 247 ELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWAD 306
E F +WG NGP ++TR+ + A I NFT+ PP FYP +W
Sbjct: 267 EFASTFDGNKWGHNGPYLVTRVAQR-------ARETIG-DNFTVLPPVAFYPFNWLDIPR 318
Query: 307 YL------NETNAPAT--MSLFRDSYALHVWNSFTKRVPVKLGS 342
N++ T + L R+SY LH+WN T+++ + GS
Sbjct: 319 LFQTPRGSNDSTLLKTDLVKLNRESYGLHLWNKITRKLKIGKGS 362
>gi|241156892|ref|XP_002407885.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215494248|gb|EEC03889.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 186
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 88/176 (50%), Gaps = 3/176 (1%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLS 241
+ H+SD LR L+ +GG YLDLD +V+ L + N +V+ G++ D+ H
Sbjct: 1 MVHLSDALRLGLLWMHGGIYLDLDVVVLVKLGAFVNSLVQSMDDMVSNGILFFDRYHPFL 60
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNI-ISCRNFTIYPPRFFYPVH 300
G +R L N+ WG NGP ++ + C + SC+ T+ P R+F P++
Sbjct: 61 GDCIRTLVSNYNPHVWGQNGPVLMRSVFLRWCNATVVEDMVEKSCKGVTLLPRRYFLPLN 120
Query: 301 WEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCPR 356
+ + + +++A + DS+ +HV+ S V + YA +ARR+CPR
Sbjct: 121 YSQHSKFFRDSDAEEVWNASADSHIMHVYGSNNADVIAE--PRSVYATVARRHCPR 174
>gi|357505891|ref|XP_003623234.1| Lactosylceramide 4-alpha-galactosyltransferase [Medicago
truncatula]
gi|355498249|gb|AES79452.1| Lactosylceramide 4-alpha-galactosyltransferase [Medicago
truncatula]
Length = 447
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 123/270 (45%), Gaps = 35/270 (12%)
Query: 107 RQACSIESAAMMNPGVQVYVV--VIASVRN-RTRNPLIDRLYEYQNVHI-VQVDLGRYFQ 162
R+ SIES + P + ++ + S+ + P ID+ ++ Q + + L
Sbjct: 186 RETLSIESVFKVQPQACLTILSRTLDSIHGYKILKPFIDKGFKVQAITPNLSFLLKGTLA 245
Query: 163 NTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAE 222
T LH + ++S+L+R LYKYGG Y+D+DFI++K L L N GA+
Sbjct: 246 ETWLHELRKGKKDPGEIPLFQNLSNLIRLAVLYKYGGVYIDIDFILLKPLSGLRNSIGAQ 305
Query: 223 SS-------SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKP 275
S + + V+ DK+H L + E F +WG NGP +++R+++ K
Sbjct: 306 SMDFGTKHWTRLNNAVLIFDKNHPLVLRFINEFALTFDGNKWGHNGPYLVSRVVERLKKR 365
Query: 276 QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS-----------LFRDSY 324
Q + NFTI PP FYPV W + + P T S L +++
Sbjct: 366 QGF--------NFTILPPMAFYPVSWTKIGGFFRK---PKTRSEEKWVEAKLKQLSGETF 414
Query: 325 ALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
+H+WN + + ++ GS A++ +C
Sbjct: 415 GVHLWNKQSSGLVIEEGS--VMARLVSNHC 442
>gi|118094891|ref|XP_426692.2| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Gallus
gallus]
Length = 338
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 128/282 (45%), Gaps = 29/282 (10%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV--VVIASVRNRTRNPLIDR-- 143
SI FLET+ + +E +CS+ESAA ++Y +I ++ T + ++D
Sbjct: 50 SIIFLETT----ERLEPPPLVSCSVESAA------RIYQDRPIILFMKGLTNDTVLDSNS 99
Query: 144 -------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLY 195
L +NV + + + FQ TPL +Y + D W + SD R ++
Sbjct: 100 SYTAFSLLSSMKNVFLFPLQMENVFQETPLLQWYNEVDPEEEKNW-VHVSSDASRLALIW 158
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTT 255
KYGG Y+D D I I+ + ++ A+ S + G+ H + +
Sbjct: 159 KYGGIYMDTDVISIRPIPE-GSFLAAQKSRFSSNGIFGFPAHHKFIWDCMENFVLKYNGN 217
Query: 256 EWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAP 314
WG GP ++TR+LKA C + SC+N + P+ FYP+ + W+ Y + +
Sbjct: 218 IWGNQGPFLMTRMLKAICNLTDFEGVEDHSCQNISFLNPQRFYPIPYPAWSRYYDVWDK- 276
Query: 315 ATMSLFRDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRYCP 355
+ F SYALH+WN + R V GS ++ + YCP
Sbjct: 277 --VPDFNHSYALHLWNFMNRNRKAVVAGSNSLAEKLYKTYCP 316
>gi|301611153|ref|XP_002935118.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like [Xenopus
(Silurana) tropicalis]
Length = 327
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 124/275 (45%), Gaps = 14/275 (5%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
N IFF+ET+ D +E C++ESAA + P + + +R + I
Sbjct: 47 NGIFFVETT----DRMEPPYVTLCAVESAARIYPNRPIAFFMKGLPEDRDGDKAKYRIRT 102
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L Y N+++ + L F +TPL +Y + + SD R ++KYGG Y+D
Sbjct: 103 LSSYDNIYLFPLRLEEVFYDTPLLSWYRKVKPEHESYWTDVTSDASRLALIWKYGGIYMD 162
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
D I ++ + L N+ AES+ V + + H S ++ + N+ + G GP
Sbjct: 163 NDIISVRPV-PLKNFVAAESNDVYSNSIFGCVPHHMFSWRSMEDFVQNYNGSILGHQGPA 221
Query: 264 VLTRLLKAE-CKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
+ R+LK C + + + + C N T+ P FYP+ W Y + F
Sbjct: 222 LFARILKKVFCVLRGFKYTEDVRCGNMTLTNPDRFYPIPESSWKKYYEVVDQ---FRPFS 278
Query: 322 DSYALHVW-NSFTKRVPVKLGSEQPYAQIARRYCP 355
SYA+H++ NS + + GS+ ++ +YCP
Sbjct: 279 SSYAVHLFENSNQGKYNMVPGSKTLVDRLYEQYCP 313
>gi|3335363|gb|AAC27164.1| hypothetical protein [Arabidopsis thaliana]
Length = 736
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 105/221 (47%), Gaps = 31/221 (14%)
Query: 139 PLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILT---SLWPL-SHMSDLLRYVTL 194
PL DR Y+ V D+ +NTP ++ + PL ++S+L R L
Sbjct: 147 PLNDRGYK---VFAATPDMSLLLENTPAKSWFQEMKSCKRDPGRIPLHQNLSNLARLAFL 203
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS--------SVVAAGVIHLDKDHWLSGAALR 246
YKYGG YLD DFIV +S + L N GA++ + + V+ +KDH L + +
Sbjct: 204 YKYGGVYLDTDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVYSFIE 263
Query: 247 ELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWAD 306
E F +WG NGP ++TR+ + A I NFT+ PP FYP +W
Sbjct: 264 EFASTFDGNKWGHNGPYLVTRVAQR-------ARETIG-DNFTVLPPVAFYPFNWLDIPR 315
Query: 307 YL------NETNAPAT--MSLFRDSYALHVWNSFTKRVPVK 339
N++ T + L R+SY LH+WN T+++ ++
Sbjct: 316 LFQTPRGSNDSTLLKTDLVKLNRESYGLHLWNKITRKLKIE 356
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 115/261 (44%), Gaps = 38/261 (14%)
Query: 107 RQACSIESAAMMNPGVQVYVVV--IASVRNRT-RNPLIDRLYEYQNVHIVQVDLGRYFQN 163
R+ +I++ NPG + ++ + S T PL D+ + N+ V +D+ +N
Sbjct: 468 REMLAIDTLFTTNPGACLAILSNSLDSPNGYTILKPLFDQGF---NLIAVTIDIPFLVKN 524
Query: 164 TP----LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
TP L + + S+ ++SDL R LYKYGG YLD D I + + L N
Sbjct: 525 TPAEAWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYGGVYLDTDIIFLNDMTGLRNAI 584
Query: 220 GAESSSVVAA-------GVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
GA+SS V+ D H L L+E F +WG N P +++R++K
Sbjct: 585 GAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATTFDGNKWGYNSPYLVSRVIKRL 644
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT-----------MSLFR 321
Y N TI+ P FYPV+W + PAT + +
Sbjct: 645 GNKPGY-------NNLTIFSPDAFYPVNWIKIQKLFKK---PATTREAKWVEKTVQDMNK 694
Query: 322 DSYALHVWNSFTKRVPVKLGS 342
SY +H+WN T+++ ++ GS
Sbjct: 695 GSYMIHLWNKVTRKIKIEEGS 715
>gi|224059854|ref|XP_002193057.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase [Taeniopygia
guttata]
Length = 338
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 122/281 (43%), Gaps = 27/281 (9%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV--VVIASVRNRTRNPLIDR-- 143
SI FLET+ + +E +CS+ESAA ++Y +I +R T +D
Sbjct: 50 SIIFLETT----ERLEPPPLVSCSVESAA------RIYQDRPIILFMRGLTNETALDMNT 99
Query: 144 -------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYK 196
L +NV + + + FQ TPL +Y Q + SD R ++K
Sbjct: 100 SYAAFSLLSSMKNVFLFPLQMETIFQETPLLQWYNQVVPEQEKNWVHVSSDASRLALIWK 159
Query: 197 YGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTE 256
YGG Y+D D I I+ + ++ A+ S + G+ H + +
Sbjct: 160 YGGIYMDTDVISIRPIPH-ESFLAAQKSRFSSNGIFGFPARHKFIWDCMENFVLKYNGNI 218
Query: 257 WGANGPGVLTRLLKAECKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
WG GP ++TR+LK C + SC+N + P+ FYP+ + W+ Y +
Sbjct: 219 WGNQGPFLMTRMLKTLCNLTDFQGTEDHSCQNISFLNPQRFYPIPYPAWSRYYQVWDKSP 278
Query: 316 TMSLFRDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRYCP 355
+ F SYALH+WN + R V GS ++ + YCP
Sbjct: 279 S---FNHSYALHLWNFMNRNRKVVVAGSNTLAEKLYKTYCP 316
>gi|301611149|ref|XP_002935130.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like isoform 1
[Xenopus (Silurana) tropicalis]
gi|301611151|ref|XP_002935131.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like isoform 2
[Xenopus (Silurana) tropicalis]
Length = 340
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 118/276 (42%), Gaps = 16/276 (5%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQV--YVVVIASVRNRTRNPLIDRL 144
N I F+ET+ D +E CSIESAA P V ++ +A + + L
Sbjct: 51 NGIMFVETT----DRMEPPHLVLCSIESAARAYPDRPVVYFMKGLAEINSDIVRMSYPTL 106
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
Y N++ + + NTPL +Y + T + SD R +YKYGG Y+D
Sbjct: 107 LSYDNIYFFPLRMNVLLNNTPLMPWYEKVNPKTERYWNHVSSDACRLALIYKYGGIYMDT 166
Query: 205 DFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
D I + + N+ AE+S + + V+ H + + + + + T WG GP +
Sbjct: 167 DIITFRPIPE-KNFLAAETSQMTGSAVLAFAPKHTIVWQFMEDFVNGYDGTVWGQQGPLL 225
Query: 265 LTRLL-KAECK-PQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRD 322
R+L + CK P I C F+PV W + T F +
Sbjct: 226 YNRILNRLYCKVPPFKGQEDIMCGTILFLNMERFFPVPGMQWETFFQVCEKLPT---FNN 282
Query: 323 SYALHVW---NSFTKRVPVKLGSEQPYAQIARRYCP 355
SYALH++ NS ++V V GS + ++YCP
Sbjct: 283 SYALHLFNYANSNQRKVMVP-GSNTMVEHLYKKYCP 317
>gi|31324066|gb|AAP47165.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 348
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 90/184 (48%), Gaps = 7/184 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ WG GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWIWGHQGPQ 252
Query: 264 VLTR 267
+LTR
Sbjct: 253 LLTR 256
>gi|301617464|ref|XP_002938164.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like, partial
[Xenopus (Silurana) tropicalis]
Length = 226
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 99/217 (45%), Gaps = 10/217 (4%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L Y N+H+ + +G +NTPL +Y + + SD R +YK+GG Y+D
Sbjct: 2 LLSYDNIHLYPLRMGVLLKNTPLISWYEKIKPKNEIHWTHISSDASRLALIYKFGGLYMD 61
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
D I ++ + + N+ AESS + + GV H + + N+ WG GP
Sbjct: 62 TDMISLRPVPDI-NFLAAESSQISSNGVFGFASHHPFIWTCMEDFVKNYNGAIWGHQGPA 120
Query: 264 VLTRLLKAE-CKPQSY-AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
+ TR+L+ C + A I C N + P FYP+ W Y + +F
Sbjct: 121 LFTRVLQERYCTLFPFEAKEDILCGNISFLNPERFYPIPCSSWKKYF---EVWEELPVFN 177
Query: 322 DSYALHVWNSFTKRVPVKL---GSEQPYAQIARRYCP 355
+SYALH++N + R K+ GS + ++YCP
Sbjct: 178 ESYALHLFN-YANRDEHKVMIPGSNTLVEHLYQQYCP 213
>gi|224285317|gb|ACN40383.1| unknown [Picea sitchensis]
Length = 425
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 113/257 (43%), Gaps = 31/257 (12%)
Query: 107 RQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV--DLGRYFQNT 164
R+ SIES NP ++VI+ + I R + + ++ DL F+ T
Sbjct: 162 RERLSIESVFKWNPSC--CLLVISRTMDSEAGEEILRPFRSRGFRVMAAAPDLRYLFKKT 219
Query: 165 PLHGFYTQ----DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
P G+ + D + ++S++LR LYK+GG Y+D D I+++S L N G
Sbjct: 220 PAEGWLRKVESGDIDPGEVSFAQNLSNILRLAALYKFGGVYIDADVILLRSFSGLKNAIG 279
Query: 221 AESS-------SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAEC 273
A++ + + V+ DK H L ++E F +WG NGP + TR++
Sbjct: 280 AQNRDPQTGRWNRLNNAVLAFDKRHPLLFKFIQEFALTFDGNKWGHNGPYLATRVVTRVA 339
Query: 274 KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYL--------NETNAPATMSLFRDSYA 325
Y F I PP FYPV W Y + + M L ++ YA
Sbjct: 340 NRTGY--------EFKIMPPIAFYPVDWTRIYSYFISPSDRGHAKWRSAKIMQLEKEGYA 391
Query: 326 LHVWNSFTKRVPVKLGS 342
+H+WN ++ + V+ GS
Sbjct: 392 IHLWNKQSRELNVEEGS 408
>gi|15232004|ref|NP_187514.1| lactosylceramide 4-alpha-galactosyltransferase [Arabidopsis
thaliana]
gi|5923667|gb|AAD56318.1|AC009326_5 hypothetical protein [Arabidopsis thaliana]
gi|6403486|gb|AAF07826.1|AC010871_2 unknown protein [Arabidopsis thaliana]
gi|332641188|gb|AEE74709.1| lactosylceramide 4-alpha-galactosyltransferase [Arabidopsis
thaliana]
Length = 411
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 87/174 (50%), Gaps = 23/174 (13%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDK 236
++S+L+R L+K+GG YLD D IV+KS ++L N GA++ V+ V+ DK
Sbjct: 226 NLSNLMRLAYLFKFGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRNWTRLNNAVLIFDK 285
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+H ++ E F WG NGP +++R+ +A Y NFTI P F
Sbjct: 286 NHPFLLKSIEEFALTFNGNVWGHNGPYLVSRVARAVEGTDGY--------NFTILTPPAF 337
Query: 297 YPVHWEHWADYLN----ETNAP----ATMSLFRDSYALHVWNSFTKRVPVKLGS 342
YPV+W E ++ + + + SY LH+WN F+++ ++ GS
Sbjct: 338 YPVNWVEIEKLFKVPRTEKDSKRVQVKVLEMQKRSYGLHLWNKFSRKFEIEQGS 391
>gi|297829434|ref|XP_002882599.1| hypothetical protein ARALYDRAFT_478210 [Arabidopsis lyrata subsp.
lyrata]
gi|297328439|gb|EFH58858.1| hypothetical protein ARALYDRAFT_478210 [Arabidopsis lyrata subsp.
lyrata]
Length = 412
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 86/174 (49%), Gaps = 23/174 (13%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVA-------AGVIHLDK 236
++S+L+R LYK+GG YLD D IV+KS ++L N GA++ V+ V+ DK
Sbjct: 227 NLSNLMRLAYLYKFGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRKWTRLNNAVLIFDK 286
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+H ++ E F WG NGP +++R+ +A Y NFTI P F
Sbjct: 287 NHPFLLKSIEEFALTFNGNVWGHNGPYLVSRVARAVEGTDGY--------NFTIMTPPAF 338
Query: 297 YPVHWEHWADYLN----ETNAP----ATMSLFRDSYALHVWNSFTKRVPVKLGS 342
YPV+W E ++ + + + SY LH+WN F+ + ++ GS
Sbjct: 339 YPVNWVEIEKLFKVPRTEKDSKRVQVKVLEMQKRSYGLHLWNKFSSKFEIEQGS 392
>gi|224078926|ref|XP_002305683.1| predicted protein [Populus trichocarpa]
gi|222848647|gb|EEE86194.1| predicted protein [Populus trichocarpa]
Length = 308
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 85/174 (48%), Gaps = 23/174 (13%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESS-------SVVAAGVIHLDK 236
+MS+LLR LYK+GG Y+D D IV+K+L L N GA+S S + V+ DK
Sbjct: 123 NMSNLLRLALLYKFGGIYMDTDVIVLKTLTKLRNAIGAQSIDLENGKWSRLNNAVLIFDK 182
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
H L + E F +WG NGP +++R++ + + NFT+ PP F
Sbjct: 183 KHPLLFKFIEEFALTFDGNKWGHNGPYLVSRVVSRVNRTPGF--------NFTVLPPSAF 234
Query: 297 YPVHWEHWADYLNETNAPATMSLFR--------DSYALHVWNSFTKRVPVKLGS 342
YPV+W A + R +S+A+H+WN ++++ + GS
Sbjct: 235 YPVNWSRIKSLFKGPEGKAHSTWLRKKLEQIKSESFAVHLWNRQSRKIKAESGS 288
>gi|145360743|ref|NP_181350.2| lactosylceramide 4-alpha-galactosyltransferase [Arabidopsis
thaliana]
gi|330254401|gb|AEC09495.1| lactosylceramide 4-alpha-galactosyltransferase [Arabidopsis
thaliana]
Length = 405
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 115/261 (44%), Gaps = 38/261 (14%)
Query: 107 RQACSIESAAMMNPGVQVYVVV--IASVRNRT-RNPLIDRLYEYQNVHIVQVDLGRYFQN 163
R+ +I++ NPG + ++ + S T PL D+ + N+ V +D+ +N
Sbjct: 137 REMLAIDTLFTTNPGACLAILSNSLDSPNGYTILKPLFDQGF---NLIAVTIDIPFLVKN 193
Query: 164 TP----LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
TP L + + S+ ++SDL R LYKYGG YLD D I + + L N
Sbjct: 194 TPAEAWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYGGVYLDTDIIFLNDMTGLRNAI 253
Query: 220 GAESSSVVAA-------GVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
GA+SS V+ D H L L+E F +WG N P +++R++K
Sbjct: 254 GAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATTFDGNKWGYNSPYLVSRVIKRL 313
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT-----------MSLFR 321
Y N TI+ P FYPV+W + PAT + +
Sbjct: 314 GNKPGY-------NNLTIFSPDAFYPVNWIKIQKLFKK---PATTREAKWVEKTVQDMNK 363
Query: 322 DSYALHVWNSFTKRVPVKLGS 342
SY +H+WN T+++ ++ GS
Sbjct: 364 GSYMIHLWNKVTRKIKIEEGS 384
>gi|168021917|ref|XP_001763487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685280|gb|EDQ71676.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 236
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 90/175 (51%), Gaps = 19/175 (10%)
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTS----LWPLSHMSDLLRYVT 193
P I+R Y V + L F+N P ++ Q T + + +MS+++R
Sbjct: 53 EPFIERGYRIMAVTPNVISL---FENLPAAEWFKQQRDGTGDPGCINFMQNMSNIMRLTV 109
Query: 194 LYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALR 246
LYKYGG YLD D IV+KS + L N GA+S S+ V+ D++H + LR
Sbjct: 110 LYKYGGIYLDSDVIVLKSFDGLRNVVGAQSRSIAVGEWTRLNNAVLVFDREHPVVYEFLR 169
Query: 247 ELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW 301
E F ++WG NGP ++TR+L+ + K Q + + C + ++ P FYP++W
Sbjct: 170 EFVATFDGSKWGWNGPYLVTRVLQ-KVKEQQWQN----CSSVSVLPLEAFYPLNW 219
>gi|147905838|ref|NP_001091207.1| uncharacterized protein LOC100036976 [Xenopus laevis]
gi|120538053|gb|AAI29753.1| LOC100036976 protein [Xenopus laevis]
Length = 339
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 118/279 (42%), Gaps = 18/279 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVV-----IASVRNRTRNPL- 140
N I F+ET+ D L L CSIESAA + P V + I S + + +
Sbjct: 51 NGIMFVETT-DRMDPPHLVL---CSIESAARVYPDRPVAFFMKGLTDINSKEDEDKARMS 106
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
L Y N++ + + + NTPL +Y + T ++ SD R +YKYGG
Sbjct: 107 YPTLLPYDNIYFFPLRMNKLLNNTPLMPWYQKVNPNTEMYWNHVSSDACRLALIYKYGGL 166
Query: 201 YLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
Y+D D I + N+ AE S + + V+ H + + + + + T WG
Sbjct: 167 YMDTDIITFRPCPE-KNFLAAEVSQMTGSAVLAFTPHHTIVWQFMEDFVNGYDGTVWGQQ 225
Query: 261 GPGVLTRLL-KAECK-PQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
GP + R+L K CK P I C F+PV W + T
Sbjct: 226 GPLLYNRILNKFYCKVPPFKGQEDIMCGTILFLNIERFFPVPGMQWKKFFEVCEKLPT-- 283
Query: 319 LFRDSYALHVWNSFTK--RVPVKLGSEQPYAQIARRYCP 355
F +SYALH++N K R + GS+ + ++YCP
Sbjct: 284 -FINSYALHLFNYANKNDRKVMVPGSKTMVELLYKQYCP 321
>gi|326925671|ref|XP_003209034.1| PREDICTED: LOW QUALITY PROTEIN:
alpha-1,4-N-acetylglucosaminyltransferase-like
[Meleagris gallopavo]
Length = 335
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/281 (26%), Positives = 126/281 (44%), Gaps = 30/281 (10%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYV--VVIASVRNRTRNPLIDR-- 143
SI FLET+ + +E +CS+ESAA ++Y +I ++ T + ++D
Sbjct: 50 SIIFLETT----ERLEPPPLVSCSVESAA------RIYQDRPIILFMKGLTNDTVLDSNS 99
Query: 144 -------LYEYQNVHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLY 195
L +NV + + + FQ TPL +Y + D W + SD R ++
Sbjct: 100 SYTAFSLLSSMKNVFLFPLQMENVFQETPLLQWYNEVDPEEEKNW-VHVSSDASRLALIW 158
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTT 255
KYGG Y+D D I I+ + ++ A+ S + G+ H + +
Sbjct: 159 KYGGIYMDTDVISIRPIPE-GSFLAAQKSRFSSNGIFGFPAHHKFIWDCMENFVLKYNGN 217
Query: 256 EWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPA 315
WG GP ++TR+L + + SC+N + P+ FYP+ + W+ Y + +
Sbjct: 218 IWGNQGPFLMTRMLXXXXXFKGVEDH--SCQNISFLNPQRFYPIPYPAWSRYYDVWDK-- 273
Query: 316 TMSLFRDSYALHVWNSFTK-RVPVKLGSEQPYAQIARRYCP 355
+ F SYALH+WN + R V GS ++ + YCP
Sbjct: 274 -VPDFNHSYALHLWNFMNRNRKAVVAGSNSLAEKLYKTYCP 313
>gi|15240929|ref|NP_195745.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis thaliana]
gi|6759448|emb|CAB69853.1| putative protein [Arabidopsis thaliana]
gi|51315396|gb|AAT99803.1| At5g01250 [Arabidopsis thaliana]
gi|110738709|dbj|BAF01279.1| hypothetical protein [Arabidopsis thaliana]
gi|332002934|gb|AED90317.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis thaliana]
Length = 407
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 85/177 (48%), Gaps = 29/177 (16%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAES---SSV----VAAGVIHLDK 236
++S+L+R LYKYGG YLD D IV+KS + L N GA++ SS + V+ DK
Sbjct: 222 NLSNLMRLAYLYKYGGVYLDTDMIVLKSFKGLRNVIGAQTLDPSSTNWTRLNNAVLIFDK 281
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+H L + E F WG NGP +++R+ +A Y NFT+ P F
Sbjct: 282 NHPLLLKFMEEFAKTFNGNIWGYNGPYLVSRVARAVEGSSGY--------NFTVMRPSVF 333
Query: 297 YPVHWEHWADYLNETNAPAT-----------MSLFRDSYALHVWNSFTKRVPVKLGS 342
Y V+W P T + + R+ Y LH+WN F+++ ++ GS
Sbjct: 334 YSVNWLEIKKLF---KVPKTEKDSKWVKTKLLHMQRNGYGLHLWNKFSRKYEIEQGS 387
>gi|412990621|emb|CCO17993.1| alpha-1,4-galactosyltransferase [Bathycoccus prasinos]
Length = 492
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 117/264 (44%), Gaps = 28/264 (10%)
Query: 90 FFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQN 149
FFL + + A + +LR ++ES +PG +V +++ D Y N
Sbjct: 226 FFLAWTTSAA---KFSLRYRRTVESTLKFHPGA-CLIVYSPTMQLDHFQRFWDLGY---N 278
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQ-DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
+ + + D+ + TP +Y D + SH+++++R TL+KYGG YLD D +V
Sbjct: 279 IIVERPDVPYLIRGTPAEAWYQGIDKWKNGEYFFSHITEIIRLATLWKYGGVYLDTDVVV 338
Query: 209 IKSLESLHNYAGAE------SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP 262
++ L++LHN G E + V+ V+ K + E ++ WG NGP
Sbjct: 339 MRELDNLHNAVGTELADERGEAKVLNGAVLAFRKGSTFIHECMVEFNTTYRIDSWGWNGP 398
Query: 263 GVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLF-- 320
++TR+ A PQ I P FYP+HW Y + + +++
Sbjct: 399 QLVTRV--AARFPQG--------PELQILPTIGFYPIHWAKVRKYFTDEDPADQHAVWER 448
Query: 321 --RDSYALHVWNSFTKRVPVKLGS 342
R++Y H WN T ++ GS
Sbjct: 449 MKRETYLFHYWNKITVKLVPTPGS 472
>gi|15219856|ref|NP_176300.1| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|12323337|gb|AAG51645.1|AC018908_11 hypothetical protein; 81821-83128 [Arabidopsis thaliana]
gi|332195650|gb|AEE33771.1| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 435
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 84/162 (51%), Gaps = 20/162 (12%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D I++KSL +LHN GA++ V V+ DK+H L + E
Sbjct: 259 YKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTKKWSRLNNAVLIFDKNHPLLKRFIDE 318
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADY 307
F +WG NGP +++R++ K S S F++ PP FYPV W +
Sbjct: 319 FSRTFNGNKWGHNGPYLVSRVI-TRIKISSS-----SDLGFSVLPPSAFYPVDWTRIKGF 372
Query: 308 ----LNETNA---PATMSLFRDSYALHVWNSFTKRVPVKLGS 342
NE++A L ++++A+H+WN +K++ ++ GS
Sbjct: 373 YRAPTNESDAWLRKRLTHLRKNTFAVHLWNRESKKLRIEEGS 414
>gi|297806103|ref|XP_002870935.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297316772|gb|EFH47194.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 406
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 82/174 (47%), Gaps = 23/174 (13%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAES---SSV----VAAGVIHLDK 236
++S+L+R LYKYGG YLD D IV+KS + L N GA++ SS + V+ DK
Sbjct: 222 NLSNLMRLAYLYKYGGVYLDTDMIVLKSFKGLRNVIGAQTLDPSSTNWTRLNNAVLIFDK 281
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+H L + E F WG NGP +++R+ +A Y NFT+ P F
Sbjct: 282 NHPLLLKFIEEFAKTFNGNIWGYNGPYLVSRVARAVEGSSGY--------NFTVMRPSVF 333
Query: 297 YPVHWEHWADYLNETNAPAT--------MSLFRDSYALHVWNSFTKRVPVKLGS 342
Y V+W + + + Y LH+WN F+++ ++ GS
Sbjct: 334 YSVNWLEIKKLFKVAKTEKDSKWVKIKLLHMRKSGYGLHLWNKFSRKYEIEQGS 387
>gi|321471299|gb|EFX82272.1| hypothetical protein DAPPUDRAFT_223908 [Daphnia pulex]
Length = 401
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/274 (25%), Positives = 115/274 (41%), Gaps = 25/274 (9%)
Query: 104 LTLRQACSIESAAMMNPGVQVYVVV----------------IASVRNRTRNPLIDRLYEY 147
LT RQAC +ESAA NPG+ V V + VR+ N ++
Sbjct: 116 LTPRQACGVESAARANPGMHVKVYLNTHRIGPPGWDDLLRRPGRVRSCAFNDMLLAQDFS 175
Query: 148 QNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
NV +V+ + + ++ +S W + +SD +R + L ++GG YLD D I
Sbjct: 176 SNVQMVRQNFTQMLADSLFRPLVASGNFHSSHWSVVQISDAIRLLLLQQHGGYYLDFDNI 235
Query: 208 VIKSLESLHN-YAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
V + L L N ++ E + G++ +D +H +R L + + + GP
Sbjct: 236 VFRPLHCLRNGFSYLEEHPNIENGIMVMDANHPFLSFLIRYLMQTYDPNKRVSLGPPAFG 295
Query: 267 RLLKAECKP-----QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
+ K C +S H + N T++ P F+PV + + T +
Sbjct: 296 KAFKLFCHVNDPLFKSGLHRCLDNSNLTLFHPDSFFPVRHYELGHFYSTTWPGLDLQKME 355
Query: 322 DSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+Y HV+ S R K+ Y+++AR YCP
Sbjct: 356 RAYLTHVYLSSWGR---KVHPNSLYSRLARHYCP 386
>gi|31324074|gb|AAP47169.1| alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 281
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 87/181 (48%), Gaps = 7/181 (3%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L + NV ++ +DL F++TPL +Y + L +SD R ++K+GG YLD
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYAAVQGRWEPYLLPVLSDASRIALMWKFGGIYLD 192
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
DFIV+K+L +L N G +S V+ + ++ H +R+ D++ + G+ GP
Sbjct: 193 TDFIVLKNLRNLTNVLGTQSRYVLNGAFLAFERRHEFMALCMRDFVDHYNGWDLGSPGPA 252
Query: 264 V 264
Sbjct: 253 A 253
>gi|149447197|ref|XP_001520158.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like, partial
[Ornithorhynchus anatinus]
Length = 207
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 79/172 (45%), Gaps = 6/172 (3%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAAL 245
SD R ++K+GG Y+D D I I+ + S N+ +SS + GV + H +
Sbjct: 14 SDASRLAVVWKFGGIYMDTDVISIRPIPS-ENFLAGQSSRDSSNGVFGFVRHHPFLWQCM 72
Query: 246 RELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHW 304
+N+ WG GP ++TR+L+ C + + C+N + P FYP+ + W
Sbjct: 73 ENFVENYNGDVWGHQGPALMTRVLELWCNLTDFQEVTDLRCQNLSFLHPHRFYPIFYPQW 132
Query: 305 ADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKL-GSEQPYAQIARRYCP 355
+ A F DSYALH+WN + V + GS + R +CP
Sbjct: 133 KLFY---EVWAQQPTFNDSYALHLWNYMNQEKMVMVPGSNTLVENLYRNHCP 181
>gi|297840457|ref|XP_002888110.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297333951|gb|EFH64369.1| alpha 1,4-glycosyltransferase family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 436
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 84/165 (50%), Gaps = 25/165 (15%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D I++KSL +LHN GA++ V V+ DK+H L + E
Sbjct: 259 YKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTRKWSRLNNAVLIFDKNHPLLKRFIDE 318
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRN--FTIYPPRFFYPVHWEHWA 305
F +WG NGP +++R++ NI S + F++ PP FYPV W
Sbjct: 319 FSRTFNGNKWGHNGPYLVSRVIA--------RINISSSSDLGFSVLPPSAFYPVDWTRIK 370
Query: 306 DY----LNETNA----PATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ +ET A L ++++A+H+WN +K++ ++ GS
Sbjct: 371 GFYRAPTSETEANWLRKRLTHLRKNTFAVHLWNRESKKLRIEEGS 415
>gi|357624856|gb|EHJ75470.1| hypothetical protein KGM_15265 [Danaus plexippus]
Length = 226
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 74/129 (57%), Gaps = 8/129 (6%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYE 146
NSIFF +T+C + L +ACSIE+AA ++P VY++++ + + +I ++
Sbjct: 69 NSIFFYQTACRRT----IRLHEACSIEAAARLHPNKNVYIIILNEIS--PHDAVIKQVER 122
Query: 147 YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
Y NV + ++++ + TP + Y D I MSDLL+ +TL+KYGGT L+LD
Sbjct: 123 YSNVKVSRINIESFVNGTPFNDLY--DIIYKVKRRHKVMSDLLKIITLFKYGGTALNLDI 180
Query: 207 IVIKSLESL 215
V KSL L
Sbjct: 181 TVTKSLSKL 189
>gi|255555985|ref|XP_002519027.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
gi|223541690|gb|EEF43238.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
Length = 364
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 125/262 (47%), Gaps = 41/262 (15%)
Query: 107 RQACSIESAAMMNPGVQVYVVVIASVRNRTRN-----PLIDRLYEYQNVHIVQVDLGRYF 161
R+ +IES NP +V+++S + R PL+D+ ++ V ++ D F
Sbjct: 91 RELLAIESLFKSNP--NACLVIVSSSMDSERGSGLLRPLLDKGFK---VASIKPDFNYLF 145
Query: 162 QNTPLHGFYTQ------DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
+NT ++++ D SL ++S+LLR LYK+GGTYLD D IV+KS L
Sbjct: 146 KNTYAESWFSELKKGNVDPGEVSLG--QNLSNLLRLALLYKFGGTYLDTDVIVLKSFGKL 203
Query: 216 HNYAGAESS-------SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
N GA++ S + V+ DK H L ++E F +WG NGP +++R+
Sbjct: 204 RNIIGAQTIDLETGNWSRLNNAVLIFDKKHPLLFKFIQEFALTFNGNKWGHNGPYLVSRV 263
Query: 269 LKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA-------DYLNETNAPATMSLFR 321
+ + NFT+ PP FYPV+W D L+ + +
Sbjct: 264 VSRVSGRPGF--------NFTVLPPSAFYPVNWSRIGSIFRGPRDELHSKWLQRKLEQIK 315
Query: 322 -DSYALHVWNSFTKRVPVKLGS 342
+S A+H+WN ++++ V+ GS
Sbjct: 316 GESLAVHLWNKQSRQIKVENGS 337
>gi|224113967|ref|XP_002316629.1| predicted protein [Populus trichocarpa]
gi|222859694|gb|EEE97241.1| predicted protein [Populus trichocarpa]
Length = 308
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 83/175 (47%), Gaps = 25/175 (14%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESS-------SVVAAGVIHLDK 236
+MS+LLR LYK+GG Y+D D IV+K L N GA++ S + V+ DK
Sbjct: 123 NMSNLLRLALLYKFGGIYMDTDVIVLKRFTKLRNVIGAQTIDLETRNWSRLNNAVLIFDK 182
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLL-KAECKPQSYAHNIISCRNFTIYPPRF 295
H L + E F +WG NGP +++R++ + +P NFT+ PP
Sbjct: 183 KHPLLFKFIEEFALTFDGNKWGHNGPYLVSRVVSRVNGRPGF---------NFTVLPPPA 233
Query: 296 FYPVHWEHWADYLNETNAPATMS--------LFRDSYALHVWNSFTKRVPVKLGS 342
FYPV W + + + +S+A+H+WN ++ + V+ GS
Sbjct: 234 FYPVDWSRIRSFFRGPRDKVHSTWLHEKLEQIKSESFAVHLWNKQSREIKVESGS 288
>gi|301625548|ref|XP_002941965.1| PREDICTED: alpha-1,4-N-acetylglucosaminyltransferase-like, partial
[Xenopus (Silurana) tropicalis]
Length = 251
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 87/189 (46%), Gaps = 6/189 (3%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
L Y N+++ + + ++TPL +Y Q + SD R +YKYGG Y+D
Sbjct: 22 LAPYNNIYLFPLRMDILLKDTPLLPWYQQVNPEKEVHWAHVSSDASRLALMYKYGGLYMD 81
Query: 204 LDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG 263
D I ++ + + N+ AES+ + + GV + + + + N+ WG GP
Sbjct: 82 TDIISLRPV-PVENFLVAESNQLSSNGVFGFNSHRDFTWTCMEDFVKNYNGAIWGHQGPA 140
Query: 264 VLTRLLKA-ECK-PQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFR 321
+ TR+L+ C P + C N + PR FYP+ W + A T F
Sbjct: 141 LFTRVLRQFYCDIPPFKGDEDLKCGNVSFLNPRRFYPIECRFWMRFFEVWKAFPT---FV 197
Query: 322 DSYALHVWN 330
DSYALH++N
Sbjct: 198 DSYALHLFN 206
>gi|356546655|ref|XP_003541739.1| PREDICTED: uncharacterized protein At4g19900-like [Glycine max]
Length = 421
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 79/160 (49%), Gaps = 20/160 (12%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESS-------SVVAAGVIHLDKDHWLSGAALRE 247
YK+GGTY+DLD +V+KS L N GA++ S + V+ DK H L + E
Sbjct: 255 YKFGGTYIDLDVVVLKSFSKLRNTIGAQNFDTKTGKWSRLNNAVLIFDKKHPLLFKFIEE 314
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH---- 303
F +WG NGP +++R++ + NFT+ PP FYPV W
Sbjct: 315 FALTFDGNKWGHNGPYLISRVVSRVSGRPGF--------NFTVLPPSAFYPVDWRGIRSL 366
Query: 304 WADYLNETNAPATMSLFR-DSYALHVWNSFTKRVPVKLGS 342
+ D ++ M R +S+A+H+WN ++++ V GS
Sbjct: 367 FRDEISSKWLINKMEQIRKESFAVHLWNRHSRKLKVVKGS 406
>gi|359497312|ref|XP_003635482.1| PREDICTED: uncharacterized protein At4g19900-like [Vitis vinifera]
gi|296088928|emb|CBI38494.3| unnamed protein product [Vitis vinifera]
Length = 451
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 74/166 (44%), Gaps = 29/166 (17%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YK+GG Y+D DF+V+KS L N GA++ + V+ D+ H L + E
Sbjct: 277 YKFGGIYMDTDFVVLKSFSGLRNVIGAQTMDLATGNWSRLNNAVMIFDEQHPLLLKFIEE 336
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCR---NFTIYPPRFFYPVHWEHW 304
F +WG NGP +++R++ IS R NFT+ PP FYPV W
Sbjct: 337 FALTFNGNKWGHNGPYLVSRVVSK-----------ISGRTGFNFTVLPPPAFYPVDWSKI 385
Query: 305 ADYLNETN--------APATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ A + + S+A+H+WN + + + GS
Sbjct: 386 PSFFKGPRDKPHSKWLAGKLLHVRMQSFAVHLWNRHSNNLKAEKGS 431
>gi|449488506|ref|XP_004158061.1| PREDICTED: uncharacterized protein At4g19900-like [Cucumis sativus]
Length = 413
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 76/163 (46%), Gaps = 23/163 (14%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YK+GG Y+D D I++++ +L N GA++ + V+ DK+H L ++E
Sbjct: 249 YKFGGIYIDTDVIILQNFTNLRNAIGAQTMDLKTGNWSRLNNAVMIFDKNHPLLLQFIKE 308
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADY 307
F +WG NGP +++R++ + + N TI PP FYPV W +
Sbjct: 309 FATTFDGNKWGHNGPYLVSRVISRLNQNSEF--------NLTILPPSAFYPVVWNRIKTF 360
Query: 308 LNETNAPATMS--------LFRDSYALHVWNSFTKRVPVKLGS 342
+ + S ALH+WN+ ++++ V+ GS
Sbjct: 361 FQGPKDAVHLKWIIAKLRHIQTKSLALHLWNNHSRKLQVEKGS 403
>gi|357446543|ref|XP_003593549.1| Lactosylceramide 4-alpha-galactosyltransferase [Medicago
truncatula]
gi|355482597|gb|AES63800.1| Lactosylceramide 4-alpha-galactosyltransferase [Medicago
truncatula]
Length = 439
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 76/162 (46%), Gaps = 23/162 (14%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSV-------VAAGVIHLDKDHWLSGAALRE 247
YK+GG Y+D D I++KS N GA++ V + V+ DK H L + E
Sbjct: 269 YKFGGIYIDADIIIMKSFSKFRNTIGAQNIDVKTKKWSRLNNAVLIFDKKHPLLLKFIEE 328
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADY 307
F +WG NGP +++R++ + Y NF++ PP FYPV W
Sbjct: 329 FALTFDGNKWGHNGPYLISRVVSRVSGREGY--------NFSVVPPSAFYPVDWRGIKSL 380
Query: 308 L----NETNA----PATMSLFRDSYALHVWNSFTKRVPVKLG 341
+E ++ + + ++SYA+H+WN + ++ V G
Sbjct: 381 FRGPGDEIHSKWLVKKMVQIRKESYAVHLWNRQSGKLEVVKG 422
>gi|37726545|gb|AAO39152.1| truncated alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 348
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 80/172 (46%), Gaps = 21/172 (12%)
Query: 84 VPDNSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL--- 140
P N IFFLETS D CS+ESAA +P V V++ P
Sbjct: 75 TPGN-IFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLG 129
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWP---LSHMSDLLRYVTLYKY 197
I L + NV ++ +DL F++TPL +Y A + W L +SD R ++K+
Sbjct: 130 ISLLSCFPNVQMLPLDLRELFRDTPLADWY---AAVQGRWEPYLLPVLSDASRIALMWKF 186
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELR 249
GG YLD DFIV+K+L +L N G +S V L+ W S A R
Sbjct: 187 GGIYLDTDFIVLKNLRNLTNVLGTQSRYV-------LNGASWPSSAGTSSWR 231
>gi|355666118|gb|AER93429.1| alpha-1,4-N-acetylglucosaminyltransferase [Mustela putorius furo]
Length = 149
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 5/143 (3%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAAL 245
SD R ++KYGG Y+D D I I+ + N+ A+SS + GV H +
Sbjct: 11 SDASRLAIIWKYGGIYMDTDVISIRPIPE-ENFLAAQSSRYSSNGVFGFLPHHPFLWDCM 69
Query: 246 RELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH-NIISCRNFTIYPPRFFYPVHWEHW 304
+++ + WG GP ++TR+L+ C+ + + + C N + P+ FYP+ + W
Sbjct: 70 ENFVEHYNSEIWGNQGPSLMTRMLRLWCRLGDFQEVSDLRCLNLSFLHPQRFYPISYPEW 129
Query: 305 ADYLNETNAPATMSLFRDSYALH 327
Y + + F DSYALH
Sbjct: 130 RRYYEVWDRDLS---FNDSYALH 149
>gi|5738362|emb|CAB52870.1| putative protein [Arabidopsis thaliana]
gi|7268785|emb|CAB78991.1| putative protein [Arabidopsis thaliana]
Length = 1302
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 111/285 (38%), Gaps = 34/285 (11%)
Query: 69 GVEKDFAFDDITRLDVPDNSIFFLETSCT-------HADGVELTLRQACSIESAAMMNPG 121
G+E +F D FF + C+ ++ G ++R +ES +
Sbjct: 348 GIEPSLSFSDFM-------DSFFRKEKCSMRVFMVWNSPGWMFSVRHQRGLESLLSQHRD 400
Query: 122 VQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ--DAILTSL 179
V V + RN + Y+ V + +L Q+TP H F + D T
Sbjct: 401 ACVVVFSETVELDFFRNSFVKDSYK---VAVAMPNLDELLQDTPTHVFASVWFDWRKTKF 457
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAE---SSSVVAAGVIHLDK 236
+P +H S+L+R LYKYGG YLD D IV+ SL SL N G E + + V+ +K
Sbjct: 458 YP-THYSELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIGMEDQVAGESLNGAVMSFEK 516
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
L E + NG +LTR+ K ++ N + I P F
Sbjct: 517 KSPFLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNGKNRRMNQ---QELNIRPSSVF 573
Query: 297 YPVHWEHWADYL--------NETNAPATMSLFRDSYALHVWNSFT 333
+P++ + +Y + + +S H WNS T
Sbjct: 574 FPINSQQITNYFAYPAIEDERSQQDESFKKILNESLTFHFWNSVT 618
>gi|255566307|ref|XP_002524140.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
gi|223536607|gb|EEF38251.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ricinus
communis]
Length = 691
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 86/197 (43%), Gaps = 23/197 (11%)
Query: 162 QNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
Q+TP H F T +P +H S+L+R LYKYGG YLD D IV+ L SLHN
Sbjct: 485 QDTPTHVFADVWSQWRSTKFYP-THYSELIRLAALYKYGGIYLDSDIIVLNPLSSLHNTV 543
Query: 220 GAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQ 276
G E + S + V+ K+ L+E + T NG +LTR+ + + +
Sbjct: 544 GLEGQIAGSSLNGAVMAFKKNSPFLMECLKEFYMTYDDTNLRGNGADLLTRVAQKFYRKE 603
Query: 277 SYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS-----------LFRDSYA 325
+ + + + P F+P+ + Y PAT S + +S A
Sbjct: 604 DKSLKQLELK---LQPSYIFFPIGSQDITSYF---TTPATASEKARQDAMFIKILSESLA 657
Query: 326 LHVWNSFTKRVPVKLGS 342
H W+S T + + GS
Sbjct: 658 FHFWSSLTSALIPEPGS 674
>gi|449457819|ref|XP_004146645.1| PREDICTED: uncharacterized protein At4g19900-like [Cucumis sativus]
Length = 393
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 76/155 (49%), Gaps = 27/155 (17%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YK+GG Y+D D I++++ +L N GA++ + V+ DK+H L ++E
Sbjct: 249 YKFGGIYIDTDVIILQNFTNLRNAIGAQTMDLKTGNWSRLNNAVMIFDKNHPLLLQFIKE 308
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADY 307
F +WG NGP +++R++ + + N TI PP FYPV
Sbjct: 309 FATTFDGNKWGHNGPYLVSRVISRLNQNSEF--------NLTILPPSAFYPV-------V 353
Query: 308 LNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + T SL ALH+WN+ ++++ V+ GS
Sbjct: 354 IAKLRHIQTKSL-----ALHLWNNHSRKLQVEKGS 383
>gi|334186714|ref|NP_193724.2| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|223635837|sp|P0C8Q4.1|Y4990_ARATH RecName: Full=Uncharacterized protein At4g19900
gi|332658843|gb|AEE84243.1| alpha 1,4-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 644
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 111/285 (38%), Gaps = 34/285 (11%)
Query: 69 GVEKDFAFDDITRLDVPDNSIFFLETSCT-------HADGVELTLRQACSIESAAMMNPG 121
G+E +F D FF + C+ ++ G ++R +ES +
Sbjct: 348 GIEPSLSFSDFM-------DSFFRKEKCSMRVFMVWNSPGWMFSVRHQRGLESLLSQHRD 400
Query: 122 VQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQ--DAILTSL 179
V V + RN + Y+ V + +L Q+TP H F + D T
Sbjct: 401 ACVVVFSETVELDFFRNSFVKDSYK---VAVAMPNLDELLQDTPTHVFASVWFDWRKTKF 457
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAE---SSSVVAAGVIHLDK 236
+P +H S+L+R LYKYGG YLD D IV+ SL SL N G E + + V+ +K
Sbjct: 458 YP-THYSELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIGMEDQVAGESLNGAVMSFEK 516
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
L E + NG +LTR+ K ++ N + I P F
Sbjct: 517 KSPFLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNGKNRRMN---QQELNIRPSSVF 573
Query: 297 YPVHWEHWADYL--------NETNAPATMSLFRDSYALHVWNSFT 333
+P++ + +Y + + +S H WNS T
Sbjct: 574 FPINSQQITNYFAYPAIEDERSQQDESFKKILNESLTFHFWNSVT 618
>gi|226490855|ref|NP_001149936.1| transferase/ transferase, transferring glycosyl groups [Zea mays]
gi|195635601|gb|ACG37269.1| transferase/ transferase, transferring glycosyl groups [Zea mays]
Length = 464
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 74/160 (46%), Gaps = 22/160 (13%)
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRELRD 250
GG YLD D +V++ L L N GA++ + V+ D+ H L + E
Sbjct: 290 GGIYLDADVVVLRPLSELRNTIGAQAMNDATGDWRRLNNAVMVFDRAHQLVHEFIAEFAA 349
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA----- 305
F ++WG NGP +++R+ + ++ FT+ PPR FYPVHW
Sbjct: 350 AFDGSKWGHNGPYLVSRV-------AARLRHLSPGLAFTVLPPRAFYPVHWSKIGGLFVA 402
Query: 306 --DYLNETNAPATMSLFR-DSYALHVWNSFTKRVPVKLGS 342
D E A + + S+ +H+WN + R+ +++GS
Sbjct: 403 PKDRKGERWVKAKVENIKGQSFGIHLWNRESSRMEMEVGS 442
>gi|413919006|gb|AFW58938.1| transferase/ transferase [Zea mays]
Length = 464
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 74/160 (46%), Gaps = 22/160 (13%)
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRELRD 250
GG YLD D +V++ L L N GA++ + V+ D+ H L + E
Sbjct: 290 GGIYLDADVVVLRPLSELRNTIGAQAMNDATGDWRRLNNAVMVFDRAHQLVHEFIAEFAA 349
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA----- 305
F ++WG NGP +++R+ + ++ FT+ PPR FYPVHW
Sbjct: 350 AFDGSKWGHNGPYLVSRV-------AARLRHLSPGLAFTVLPPRAFYPVHWSKIGGLFVA 402
Query: 306 --DYLNETNAPATMSLFR-DSYALHVWNSFTKRVPVKLGS 342
D E A + + S+ +H+WN + R+ +++GS
Sbjct: 403 PKDRKGERWVKAKVENIKGQSFGIHLWNRESSRMEMEVGS 442
>gi|359473049|ref|XP_002276821.2| PREDICTED: uncharacterized protein At4g19900-like [Vitis vinifera]
Length = 707
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 89/212 (41%), Gaps = 29/212 (13%)
Query: 160 YFQNTPLHGFYTQDAILTSLW--------PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKS 211
+F++ G T I S+W +H S+L+R LYKYGG YLD D IV+K
Sbjct: 492 FFKDFVEKGQNTAAHIFASVWFEWRKTNFYSTHYSELVRLAALYKYGGIYLDSDIIVVKP 551
Query: 212 LESLHNYAGAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL 268
L SL+N G E + S + V+ KD L E + T NG +LTR+
Sbjct: 552 LSSLNNSVGLEDQLAGSSLNGAVMVFRKDSPFIMECLNEFYSTYDDTCLKCNGADLLTRV 611
Query: 269 LKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYL----NETNAPATMSLF---- 320
K ++ + + + P F+P+ + Y ET LF
Sbjct: 612 AKKFLSKENASDKQLE---LLVQPSFIFFPISPHNITRYFTTPATETEKAEQDILFSKIL 668
Query: 321 RDSYALHVWNSFTKR-VPVKLGSEQPYAQIAR 351
+S+ H WNS T +P +P + +AR
Sbjct: 669 NESFTFHFWNSLTSSLIP------EPESLVAR 694
>gi|168063559|ref|XP_001783738.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664744|gb|EDQ51452.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 442
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 97/220 (44%), Gaps = 30/220 (13%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLW---PL--SHMSDLLRYVTLYKYG 198
+ E V +V+ +L +TP F A + W PL H ++LLR LYK+G
Sbjct: 215 IKEGYKVIVVRPNLHELLADTPSDAF----AAILPKWREKPLFYLHYTELLRLAALYKFG 270
Query: 199 GTYLDLDFIVIKSLESLHNYAGAESSS----VVAAGVIHLDKDHWLSGAALRELRDNFKT 254
G YLD+D IV+++L+SLHN G E +S + ++ DK + E + +
Sbjct: 271 GVYLDMDVIVLRALDSLHNTVGTELTSNGELRLNGAILVFDKSSLYLKKCMEEFTNTYNE 330
Query: 255 TEWGANGPGVLTRLLKAECKPQSYAHNIISCRNF----TIYPPRFFYPVHWEHWADYL-- 308
T NG +LTR+ + + N + R F + P F+P+ + +
Sbjct: 331 TLIQWNGADLLTRVANS-----TVLENGSTWRQFPDLLNVQGPFSFFPLDSSRISKFFAA 385
Query: 309 ------NETNAPATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ ++ ++Y +H+WNS T + ++ S
Sbjct: 386 PEDSIQKQRQMKLLTRIYEEAYTVHLWNSLTSNLVPEINS 425
>gi|297737957|emb|CBI27158.3| unnamed protein product [Vitis vinifera]
Length = 1664
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 84/206 (40%), Gaps = 35/206 (16%)
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLW--------PLSHMSDLLRYVTLYKYGGTY 201
V + +L +NT H I S+W +H S+L+R LYKYGG Y
Sbjct: 1410 VAVAMPNLDELLKNTAAH-------IFASVWFEWRKTNFYSTHYSELVRLAALYKYGGIY 1462
Query: 202 LDLDFIVIKSLESLHNYAGAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG 258
LD D IV+K L SL+N G E + S + V+ KD L E + T
Sbjct: 1463 LDSDIIVVKPLSSLNNSVGLEDQLAGSSLNGAVMVFRKDSPFIMECLNEFYSTYDDTCLK 1522
Query: 259 ANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
NG +LTR+ K ++ + + + P F+P+ + Y PAT +
Sbjct: 1523 CNGADLLTRVAKKFLSKENASDKQLE---LLVQPSFIFFPISPHNITRYF---TTPATET 1576
Query: 319 -----------LFRDSYALHVWNSFT 333
+ +S+ H WNS T
Sbjct: 1577 EKAEQDILFSKILNESFTFHFWNSLT 1602
>gi|340750656|ref|ZP_08687494.1| polysaccharide biosynthesis protein [Fusobacterium mortiferum ATCC
9817]
gi|229420286|gb|EEO35333.1| polysaccharide biosynthesis protein [Fusobacterium mortiferum ATCC
9817]
Length = 259
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 79/165 (47%), Gaps = 22/165 (13%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW + +SD R LY+ GG YLD D +IK+++ L + + GAES V++AG+I +
Sbjct: 63 KLW--AFLSDYFRIKVLYEEGGVYLDTDMQIIKNIDKLLSNDFFIGAESEKVISAGIIGV 120
Query: 235 DKDHWLSGAALRELRDNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H ++++ + + W P ++TR++ E Q + IY
Sbjct: 121 IPKH----PLMKKILEFYSVAIWNEPIFTIPDIITRVINREYDFQINEDITKITGSMVIY 176
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVW-NSFTKR 335
PPR+FYP H+ T + D+Y +H W S+T++
Sbjct: 177 PPRYFYPYHF---------TEEFKRECIKDDTYGIHWWGKSWTQK 212
>gi|321456551|gb|EFX67655.1| hypothetical protein DAPPUDRAFT_330838 [Daphnia pulex]
Length = 113
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 53/90 (58%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
+Y N+H++ +L Y +PL +Y + + +SH+SD LR++TL+KYG + DLD
Sbjct: 18 KYNNIHLISFNLDNYMAGSPLKYWYHCNGWRDGPFHVSHLSDGLRFLTLHKYGVYFFDLD 77
Query: 206 FIVIKSLESLHNYAGAESSSVVAAGVIHLD 235
I ++ + L N+ ES + +GV+H D
Sbjct: 78 VISVRPVTDLRNFVATESDDYLGSGVLHAD 107
>gi|357122353|ref|XP_003562880.1| PREDICTED: uncharacterized protein At4g19900-like [Brachypodium
distachyon]
Length = 621
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 107/271 (39%), Gaps = 40/271 (14%)
Query: 106 LRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTP 165
+R +ES + +P VV+ S + + E V + +L + TP
Sbjct: 362 VRHQRGLESLLLHHPDA---CVVMLSETLELEESFQEFVKEGYKVAVAVPNLDELLEGTP 418
Query: 166 LHGFYTQDAILTSLW---------PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
H I S+W PL H S+L+R LY+YGG YLD D IV+K L+SL
Sbjct: 419 AH-------IFASVWYEWRKTIHYPL-HYSELVRLSALYRYGGIYLDSDVIVLKPLKSLQ 470
Query: 217 NYAGA----ESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
N G S + V+ L+K L E + T NG ++TR+++
Sbjct: 471 NCIGTVKQVSRDSSFSGAVLVLEKQSPFLVECLNEFYSTYDDTLLQWNGAELMTRVIRNH 530
Query: 273 CKP-QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYL----NETNAPATMSLF----RDS 323
Q H I + P FYP++ Y NE +LF DS
Sbjct: 531 SDSDQDRGHLAIK-----LEPSVIFYPINSTDITRYFSVPDNEVERAQHDALFSRIVNDS 585
Query: 324 YALHVWNSFTKRVPVKLGSEQPYAQIARRYC 354
H+WNS T + + S +I RYC
Sbjct: 586 TTFHLWNSITSSLVPE--SNSLVERILNRYC 614
>gi|242050594|ref|XP_002463041.1| hypothetical protein SORBIDRAFT_02g036660 [Sorghum bicolor]
gi|241926418|gb|EER99562.1| hypothetical protein SORBIDRAFT_02g036660 [Sorghum bicolor]
Length = 316
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 80/185 (43%), Gaps = 17/185 (9%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA----ESSSVVAAGVIHLDKDHW 239
H S+L+R LY+YGG YLD D IV+K L SL N GA +S A V+ +K
Sbjct: 132 HYSELVRLAALYRYGGIYLDSDVIVLKPLTSLRNSIGATNHVSGNSSFGAAVLAFEKQSP 191
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
L L+E + T NG ++TR++ S A + + P FYP+
Sbjct: 192 LLEECLKEFYSTYDDTLMQWNGAELMTRVIS---NLSSKADENRGYLDIKLEPSVKFYPI 248
Query: 300 HWEHWADYLNE-------TNAPATMS-LFRDSYALHVWNSFTKRVPVKLGSEQPYAQIAR 351
Y +E T+ A S + DS H WN T + + S ++I
Sbjct: 249 SSTDIIRYFSEPDNMVEKTHHDAIFSRIVNDSITFHFWNGITSALVPEPSS--VVSKILN 306
Query: 352 RYCPR 356
RYC R
Sbjct: 307 RYCIR 311
>gi|260447032|emb|CBG76445.1| OO_Ba0013J05-OO_Ba0033A15.32 [Oryza officinalis]
Length = 249
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 77/166 (46%), Gaps = 25/166 (15%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D +V++ L N GA++ V+ D+ H L + E
Sbjct: 69 YKYGGVYLDADVVVLRPFSDLRNAIGAQAVDAATGDWMRLNNAVMVFDRGHPLLREFIAE 128
Query: 248 LRDNFKTTEWGANGPGVLTRLLK---AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHW 304
F ++WG NGP +++R+ +P++ A + T+ PP FYPV W
Sbjct: 129 FAAKFDGSKWGHNGPYLVSRVAAKWRRRRRPEAEA-------DLTVLPPPAFYPVDWNKI 181
Query: 305 A-------DYLNETNAPATM-SLFRDSYALHVWNSFTKRVPVKLGS 342
D +E A + S+ +S+ +H+WN ++ + ++ GS
Sbjct: 182 GGLFVAPKDRKDERWVKAKVESIKGESFGIHLWNRESRSLEMEEGS 227
>gi|168042819|ref|XP_001773884.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674728|gb|EDQ61232.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 424
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 87/213 (40%), Gaps = 28/213 (13%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL--------SHMSDLLRYVTLY 195
L E V +V+ ++ F +TP H ILT+ P H ++LLR LY
Sbjct: 197 LKEGYKVAVVRPNVQELFVDTPSH-------ILTASLPKWKENPLFHLHFTELLRLAALY 249
Query: 196 KYGGTYLDLDFIVIKSLESLHNYAGAE----SSSVVAAGVIHLDKDHWLSGAALRELRDN 251
K+GG YLD+D +V + L SLHN G+E S + V+ +K + E
Sbjct: 250 KFGGIYLDMDMLVSRPLNSLHNTVGSEITVTGESRLNGAVLIFEKSSLFLKKCMEEFTKT 309
Query: 252 FKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLN-- 309
+ T NG +LTR+ + + N I P F+P+ + Y +
Sbjct: 310 YDETLPQYNGADLLTRVANSAFDEKGSTWNQFP-ELLNIQGPFTFFPLTSSGISKYFDAP 368
Query: 310 ------ETNAPATMSLFRDSYALHVWNSFTKRV 336
E + ++ +H+WNS T +
Sbjct: 369 KDDIQKEQQRELLTKISEEAITVHLWNSITSDI 401
>gi|242073792|ref|XP_002446832.1| hypothetical protein SORBIDRAFT_06g023350 [Sorghum bicolor]
gi|241938015|gb|EES11160.1| hypothetical protein SORBIDRAFT_06g023350 [Sorghum bicolor]
Length = 460
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 75/160 (46%), Gaps = 22/160 (13%)
Query: 198 GGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRELRD 250
GG YLD D +V++ L L N GA++ V+ D+ H L + E
Sbjct: 286 GGIYLDADVVVLRPLSDLRNAIGAQAVDEATGDWMRLNNAVMVFDRAHPLLHEFIAEFAA 345
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA----- 305
F ++WG NGP +++R + A + +S FT+ PPR FYPVHW
Sbjct: 346 AFDGSKWGHNGPYLVSR-VAARLRHRSPGP------AFTVLPPRAFYPVHWSKIGGLFVA 398
Query: 306 --DYLNETNAPATMSLFR-DSYALHVWNSFTKRVPVKLGS 342
D ++ A + + +S+ +H+WN + R+ ++ GS
Sbjct: 399 PKDRKDKRWVKAKVENIKGESFGIHLWNRESSRLEMEEGS 438
>gi|27817893|dbj|BAC55659.1| unknown protein [Oryza sativa Japonica Group]
Length = 615
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 97/228 (42%), Gaps = 30/228 (13%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
E V + +L + T H F + + T +PL H S+L+R LYKYGG YLD
Sbjct: 392 EGYKVAVALPNLDELLEGTLTHDFVSVWNEWRKTKYYPL-HYSELVRLAALYKYGGIYLD 450
Query: 204 LDFIVIKSLESLHNYAG-----AESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG 258
D +V+K L +L N G +E+SS A V+ +K+ L+E +
Sbjct: 451 SDVVVLKPLNALRNSIGVVKQVSENSSFSGA-VLAFEKNSPFLAECLKEFHSTYDDELLQ 509
Query: 259 ANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS 318
NG ++TR+++ A + + P FYP+ Y +E ++ +
Sbjct: 510 WNGAELMTRVIR---NMSDKADDNSGHLDIKFEPSVAFYPISSTDITRYFSEADSTDERA 566
Query: 319 --------LFRDSYALHVWNSFTKR-VPVKLGSEQPYAQIAR---RYC 354
+ DS H+WNS T VP +P + + R RYC
Sbjct: 567 QHDALFSRIVNDSTTFHLWNSITSSLVP------EPNSLVERILNRYC 608
>gi|195142098|ref|XP_002012721.1| GI21422 [Drosophila mojavensis]
gi|193906363|gb|EDW05230.1| GI21422 [Drosophila mojavensis]
Length = 165
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 7/101 (6%)
Query: 85 PDNSIFFLETSCTHADG----VELTLRQACSIESAAMMNPGVQVYVVVIASVRN---RTR 137
P SIFF ETSCT D + LT RQACSIESAA+ NP Q++ V +
Sbjct: 64 PGRSIFFHETSCTSPDTASNIMNLTARQACSIESAALNNPNFQIFTVFSCPTYRPLVGRQ 123
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTS 178
L+D + Y+NV+ ++L Y NTP+ + + +L S
Sbjct: 124 KLLVDAIESYENVNFRHLNLRDYAMNTPIEDWVKRGELLNS 164
>gi|340757117|ref|ZP_08693720.1| polysaccharide biosynthesis protein [Fusobacterium varium ATCC
27725]
gi|251834385|gb|EES62948.1| polysaccharide biosynthesis protein [Fusobacterium varium ATCC
27725]
Length = 262
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 99/216 (45%), Gaps = 37/216 (17%)
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N ++L +Y+ + + +L F+N + ++ + LW + +SD R LY+
Sbjct: 25 NSWREKLPDYEIIEWNESNLD--FENEINSNTFLKECLDRKLW--AFLSDYFRMKVLYEN 80
Query: 198 GGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLSGA-----ALRELR 249
GG YLD D +IKSL+ + G ES V++AG+I GA ++++
Sbjct: 81 GGIYLDTDMQIIKSLDEFLKDEFFIGLESEDVISAGII---------GAVPHNEVVKDIM 131
Query: 250 DNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRN--FTIYPPRFFYPVHWEHW 304
D +K W P ++TR+LK + + + II+ ++ IYP +FYP H+
Sbjct: 132 DFYKEDIWNEPIYTIPAIITRVLKKKYSFE-LKNEIINIKDGAVKIYPSNYFYPYHF--- 187
Query: 305 ADYLNETNAPATMSLFRDSYALHVW-NSFTKRVPVK 339
T L +Y +H W S+ K+ +K
Sbjct: 188 ------TEEFQYSKLTEKTYGIHWWGKSWGKKKDLK 217
>gi|115459564|ref|NP_001053382.1| Os04g0529700 [Oryza sativa Japonica Group]
gi|113564953|dbj|BAF15296.1| Os04g0529700 [Oryza sativa Japonica Group]
gi|215768309|dbj|BAH00538.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195258|gb|EEC77685.1| hypothetical protein OsI_16740 [Oryza sativa Indica Group]
gi|222629252|gb|EEE61384.1| hypothetical protein OsJ_15553 [Oryza sativa Japonica Group]
Length = 464
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 19/163 (11%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D +V++ L N GA++ V+ D+ H L + E
Sbjct: 284 YKYGGVYLDADVVVLRPFSDLRNAIGAQAVDASTGDWMRLNNAVMVFDRGHPLLREFIAE 343
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA-- 305
F ++WG NGP L+ + + T+ PP FYPV W
Sbjct: 344 FAAKFDGSKWGHNGP----YLVSRVAARWRRRRRPEAEADLTVLPPAAFYPVDWNKIGGL 399
Query: 306 -----DYLNETNAPATM-SLFRDSYALHVWNSFTKRVPVKLGS 342
D E A + S+ +S+ +H+WN ++ + ++ GS
Sbjct: 400 FVAPKDRKGERWVKAKVESIKGESFGIHLWNRESRSLEMEEGS 442
>gi|21742092|emb|CAD41203.1| OSJNBa0074L08.14 [Oryza sativa Japonica Group]
gi|32492275|emb|CAE03856.1| OSJNBa0081C01.2 [Oryza sativa Japonica Group]
gi|116312022|emb|CAJ86378.1| OSIGBa0155K17.5 [Oryza sativa Indica Group]
Length = 445
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 19/163 (11%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDKDHWLSGAALRE 247
YKYGG YLD D +V++ L N GA++ V+ D+ H L + E
Sbjct: 265 YKYGGVYLDADVVVLRPFSDLRNAIGAQAVDASTGDWMRLNNAVMVFDRGHPLLREFIAE 324
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA-- 305
F ++WG NGP L+ + + T+ PP FYPV W
Sbjct: 325 FAAKFDGSKWGHNGP----YLVSRVAARWRRRRRPEAEADLTVLPPAAFYPVDWNKIGGL 380
Query: 306 -----DYLNETNAPATM-SLFRDSYALHVWNSFTKRVPVKLGS 342
D E A + S+ +S+ +H+WN ++ + ++ GS
Sbjct: 381 FVAPKDRKGERWVKAKVESIKGESFGIHLWNRESRSLEMEEGS 423
>gi|381186189|ref|ZP_09893763.1| polysaccharide biosynthesis protein CpsM [Flavobacterium frigoris
PS1]
gi|379651863|gb|EIA10424.1| polysaccharide biosynthesis protein CpsM [Flavobacterium frigoris
PS1]
Length = 268
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 81/174 (46%), Gaps = 13/174 (7%)
Query: 162 QNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES-LHN--Y 218
+NT L + + A + W +SD +R LY+ GG YLD D +++KSL+ L N +
Sbjct: 50 KNTDLSHPFVKHAYESQKWAF--VSDYIRLKVLYENGGIYLDTDMMLLKSLDDFLENECF 107
Query: 219 AGAESSSVVAAGVIHLDKDHWLSGAALR--ELRDNFKTTEWGANGPG-VLTRLLKAECKP 275
GAE ++V+ G+I K + + + D K T W ++T+L + +
Sbjct: 108 FGAEEENIVSCGIIGAVKQNKFIKECIEHYDFIDLEKKTNWHKTVVTIIITKLFRTKYHF 167
Query: 276 QSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVW 329
I+ TIYP FYP +E D +N + DSYALH+W
Sbjct: 168 YGVFDKRINYDTITIYPINIFYPFPYEKKEDIVN-----YKQYIKPDSYALHLW 216
>gi|168005762|ref|XP_001755579.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693286|gb|EDQ79639.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 22/188 (11%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSS----VVAAGVIHLDKDHW 239
H ++LLR L+KYGG +LD+D I+ + L ++HN G+ S V+ + DK
Sbjct: 1149 HYTELLRIAALHKYGGVWLDMDMILARPLPTIHNVLGSTVSESGEWVLNGAFMSFDKSSS 1208
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
A + E + T G NG +L R+ A + + ++ + P F+P+
Sbjct: 1209 FLKACIEEFVATYDETSLGWNGADLLNRV--ASNATRRGGKTWLESKHLQVLEPVAFFPL 1266
Query: 300 HWEHWADYLNETNAPAT-----------MSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQ 348
Y AP + ++ + S+ H+WNS T R + GS +
Sbjct: 1267 SRHDIIRYF---AAPKSHQDKVEQKQMLTAILKKSHGTHLWNSVTGRHVPEPGS--LVEK 1321
Query: 349 IARRYCPR 356
+ R+C R
Sbjct: 1322 LLNRFCLR 1329
>gi|344296634|ref|XP_003420011.1| PREDICTED: LOW QUALITY PROTEIN:
alpha-1,4-N-acetylglucosaminyltransferase-like
[Loxodonta africana]
Length = 275
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 74/151 (49%), Gaps = 12/151 (7%)
Query: 88 SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTR------NPLI 141
SI FLETS + +E +C++ESAA + P Q V + + N T+ P
Sbjct: 52 SIVFLETS----ERMEPPPLVSCAVESAARVYPE-QPVVFFMKGLNNSTQLPPHATYPAF 106
Query: 142 DRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
L NV + +D+ R F++TPL +YTQ T L SD R ++KYGG Y
Sbjct: 107 SLLSAIDNVFLFPLDMKRLFEDTPLFSWYTQINTSTERNWLHVSSDASRLAIIWKYGGIY 166
Query: 202 LDLDFIVIKSLESLHNYAGAESSSVVAAGVI 232
+D D I I+ + N+ A++S + G +
Sbjct: 167 MDTDVISIRPIPE-ENFLAAQASQDSSNGGV 196
>gi|404366458|ref|ZP_10971841.1| hypothetical protein FUAG_01658 [Fusobacterium ulcerans ATCC 49185]
gi|313689308|gb|EFS26143.1| hypothetical protein FUAG_01658 [Fusobacterium ulcerans ATCC 49185]
Length = 262
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 84/171 (49%), Gaps = 17/171 (9%)
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N ++L +Y+ + + +L F+N + + ++ + LW + +SD R LY+
Sbjct: 25 NSWREKLPDYEIIEWNESNLD--FENEINNNAFLKECLDRKLW--AFLSDYFRMKVLYEN 80
Query: 198 GGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKT 254
GG YLD D +IKSL+ + G ES V++AG+I H ++++ D +K
Sbjct: 81 GGIYLDTDMQIIKSLDEFLEDDFFIGLESEDVISAGIIGAVPHH----EVVKDILDFYKE 136
Query: 255 TEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRNFTI--YPPRFFYPVH 300
W P ++TR+LK + + II ++ +I YP +FYP H
Sbjct: 137 DIWNEPIYTIPAIITRVLKKKYSFE-LKDEIIKIKDGSIKMYPSNYFYPYH 186
>gi|373497615|ref|ZP_09588138.1| hypothetical protein HMPREF0402_02011 [Fusobacterium sp. 12_1B]
gi|371962980|gb|EHO80554.1| hypothetical protein HMPREF0402_02011 [Fusobacterium sp. 12_1B]
Length = 262
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 84/171 (49%), Gaps = 17/171 (9%)
Query: 138 NPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKY 197
N ++L +Y+ + + +L F+N + + ++ + LW + +SD R LY+
Sbjct: 25 NSWREKLPDYEIIEWNESNLD--FENEINNNTFLKECLDRKLW--AFLSDYFRMKVLYEN 80
Query: 198 GGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKT 254
GG YLD D +IKSL+ + G ES V++AG+I H ++++ D +K
Sbjct: 81 GGIYLDTDMQIIKSLDEFLEDDFFIGLESEDVISAGIIGAVPHH----EVVKDILDFYKE 136
Query: 255 TEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRNFTI--YPPRFFYPVH 300
W P ++TR+LK + + +I ++ +I YP +FYP H
Sbjct: 137 DIWNEPIYTIPAIITRVLKKKYSFE-LKDEVIKIKDGSIKMYPSNYFYPYH 186
>gi|356577111|ref|XP_003556671.1| PREDICTED: uncharacterized protein At4g19900-like [Glycine max]
Length = 603
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 98/251 (39%), Gaps = 38/251 (15%)
Query: 105 TLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNT 164
T+R +ES +P V V + ++ + Y+ V + +L ++
Sbjct: 343 TVRHQRGLESLLFHHPDACVVVFSETVELDFFKDSFVKDGYK---VAVAMPNLDELLKDM 399
Query: 165 PLHGFYTQDAILTSLW--------PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
P H I S+W +H S+L+R LYKYGG YLD D IV+K + L+
Sbjct: 400 PAH-------IFASVWFEWKKTNFYSTHYSELIRLAALYKYGGIYLDSDIIVLKPISFLN 452
Query: 217 NYAGAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAEC 273
N G E + S + V+ + L E + T NG +LTR+ +
Sbjct: 453 NSVGMEGHGAGSALNGAVMSFPRHSLFVKECLEEFYMTYDDTSLRGNGADLLTRVARKYL 512
Query: 274 KPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMS-----------LFRD 322
++ + + + P F+PV ++ Y AP T + + +
Sbjct: 513 GDENKS---VKHLELKVEPSYIFFPVSSQNITRYF---IAPTTETEKAQQDVLLENILHN 566
Query: 323 SYALHVWNSFT 333
S H WNS T
Sbjct: 567 SLTFHFWNSVT 577
>gi|302795919|ref|XP_002979722.1| glycosyltransferase CAZy family GT32-like protein [Selaginella
moellendorffii]
gi|300152482|gb|EFJ19124.1| glycosyltransferase CAZy family GT32-like protein [Selaginella
moellendorffii]
Length = 536
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 83/174 (47%), Gaps = 22/174 (12%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA-------GVIHLDK 236
++ ++LR + LY++GG YLD D +V+ SL +L N GA++ V V+ ++
Sbjct: 339 NLGNILRLLLLYRFGGIYLDSDVLVLGSLANLSNSIGAQTEDSVTGEWQRLNNAVLAFER 398
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYP---- 292
H + + + E F ++WG NGP + TR+L + + ++ R +YP
Sbjct: 399 RHPVLHSFIHEFALTFNGSKWGHNGPYLATRVLDRARRTGTVPCGVVRTR--ALYPVTWN 456
Query: 293 --PRFFYPVHWEHWADYLNETNAPATMSLFR--DSYALHVWNSFTKRVPVKLGS 342
P F V E + E + R +S A+H+WN T+ + V+ GS
Sbjct: 457 HIPPLFRGVEGERGRAWREE-----KLRWLRSGESLAIHLWNKQTRGLRVERGS 505
>gi|326533290|dbj|BAJ93617.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 614
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 100/249 (40%), Gaps = 37/249 (14%)
Query: 106 LRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTP 165
+R +ES +P V VV+ ++ T + + Y+ V + L + TP
Sbjct: 356 VRHQRGLESLLQQHPDACV-VVLSETLELETFHEFVKEGYK---VAVAVPSLDELLEGTP 411
Query: 166 LHGFYTQDAILTSLW---------PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
H I S+W PL H S+L+R LY+YGG YLD D IV+K L+S
Sbjct: 412 TH-------IFASVWYEWRKTINYPL-HYSELVRLAALYRYGGIYLDSDVIVLKPLKSFR 463
Query: 217 NYAGA----ESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
N G S + V+ +K L+E + T NG ++TR+++
Sbjct: 464 NTIGTVKEVSRGSSFSGAVLAFEKQSPFLLECLKEWYSTYDDTLMQWNGAELMTRVIRNH 523
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM----SLFRD----SY 324
S ++ + P FYP++ Y E ++ A +LF S
Sbjct: 524 SDSDSNREHL----EIQLEPSFTFYPINSTDINWYFLEPDSAAERAQHDALFSKILNYST 579
Query: 325 ALHVWNSFT 333
H WNS T
Sbjct: 580 TFHFWNSIT 588
>gi|326519134|dbj|BAJ96566.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 606
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 100/249 (40%), Gaps = 37/249 (14%)
Query: 106 LRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTP 165
+R +ES +P V VV+ ++ T + + Y+ V + L + TP
Sbjct: 348 VRHQRGLESLLQQHPDACV-VVLSETLELETFHEFVKEGYK---VAVAVPSLDELLEGTP 403
Query: 166 LHGFYTQDAILTSLW---------PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
H I S+W PL H S+L+R LY+YGG YLD D IV+K L+S
Sbjct: 404 TH-------IFASVWYEWRKTINYPL-HYSELVRLAALYRYGGIYLDSDVIVLKPLKSFR 455
Query: 217 NYAGA----ESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
N G S + V+ +K L+E + T NG ++TR+++
Sbjct: 456 NTIGTVKEVSRGSSFSGAVLAFEKQSPFLLECLKEWYSTYDDTLMQWNGAELMTRVIRNH 515
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATM----SLFRD----SY 324
S ++ + P FYP++ Y E ++ A +LF S
Sbjct: 516 SDSDSNREHL----EIQLEPSFTFYPINSTDINWYFLEPDSAAERAQHDALFSKILNYST 571
Query: 325 ALHVWNSFT 333
H WNS T
Sbjct: 572 TFHFWNSIT 580
>gi|168056104|ref|XP_001780062.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668560|gb|EDQ55165.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 428
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 90/204 (44%), Gaps = 20/204 (9%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLW--PLSHM--SDLLRYVTLYKYGGTY 201
E V +V+ +L +TP F A+L L PL H+ ++LLR LY++GG Y
Sbjct: 203 EGYKVAVVRPNLQELLADTPSDVF---SAVLPKLKEKPLFHLHITELLRLAALYRFGGIY 259
Query: 202 LDLDFIVIKSLESLHNYAGAESSS----VVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
LD+D +V++ +++L N G+E ++ ++ V+ +K + E + T
Sbjct: 260 LDMDVLVLRPMDNLRNTLGSEITANGNLRLSGAVLVFEKSSLFLKKCMEEFTRTYDETLD 319
Query: 258 GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWA--------DYLN 309
NG +LTR+ + + + I P F+P+ + D +
Sbjct: 320 QYNGADLLTRVANSTVDEEGTTWTKLP-HLLKIQGPSTFFPLDSSGISKFFAAPKDDIVK 378
Query: 310 ETNAPATMSLFRDSYALHVWNSFT 333
E + + ++ +H+WNS T
Sbjct: 379 EKQRNLLIRISEEAITVHLWNSVT 402
>gi|449490682|ref|XP_004158676.1| PREDICTED: uncharacterized protein At4g19900-like isoform 1
[Cucumis sativus]
Length = 631
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 83/207 (40%), Gaps = 37/207 (17%)
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAILTSLW--------PLSHMSDLLRYVTLYKYGGT 200
V + +L ++TP H F S+W +H S+L+R LYKYGG
Sbjct: 414 KVAVAMPNLDELLKDTPTHKF-------ASIWFEWKKTEFYSTHYSELVRLAALYKYGGI 466
Query: 201 YLDLDFIVIKSLESLHNYAGAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
YLD D +V+K L SLHN G E + S + V+ ++E + +
Sbjct: 467 YLDSDIVVLKPLSSLHNSVGMEDQLAGSSLNGAVMAFRMHSPFIMECMKEYYSTYDDRSF 526
Query: 258 GANGPGVLTRL---LKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADY----LNE 310
NG +LTR+ +E + + T+ P F+P+ ++ Y +
Sbjct: 527 RWNGAELLTRVANRFSSEVPAEQF--------ELTVQPSFAFFPIASQNITRYFAVPVGA 578
Query: 311 TNAPATMSLFR----DSYALHVWNSFT 333
T L + +S H WNS T
Sbjct: 579 TEKAEHECLLKKILEESVTFHFWNSLT 605
>gi|340756837|ref|ZP_08693442.1| polysaccharide biosynthesis protein [Fusobacterium varium ATCC
27725]
gi|251834103|gb|EES62666.1| polysaccharide biosynthesis protein [Fusobacterium varium ATCC
27725]
Length = 243
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 69/166 (41%), Gaps = 26/166 (15%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW + +SD +R LY+YGG YLD D +IK+L L + G E+ +++ G++
Sbjct: 62 KLW--AFLSDYVRLRVLYQYGGIYLDTDMEIIKNLYDLLETDFFTGYENDEIISFGILGC 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGANG---PGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H + ++ D + W ++ +LT +LK E Y + IY
Sbjct: 120 IPQH----KIIEKMLDFYDNKIWDSDMYIITNILTEILKEE-----YGDKLFETSGIKIY 170
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N L ++Y +H W + P
Sbjct: 171 PKEYFYPY---------NHDEEFTEKCLTENTYGIHWWGKSWGKNP 207
>gi|241839172|ref|XP_002415223.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215509435|gb|EEC18888.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 61
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 39/57 (68%)
Query: 161 FQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
+ TPL + A+ +S + + H++D+LR +YKYGG YLDLD +V++SLE LHN
Sbjct: 5 LEGTPLSSWNFSGAMRSSRYRVVHLADVLRLAVVYKYGGVYLDLDTVVLRSLEDLHN 61
>gi|241160624|ref|XP_002408775.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
gi|215494394|gb|EEC04035.1| lactosylceramide 4-alpha-galactosyltransferase, putative [Ixodes
scapularis]
Length = 54
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 38/54 (70%)
Query: 164 TPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
TPL + A+ +S +P HM+D+LR +YKYGG YLDLD ++++SL+ LHN
Sbjct: 1 TPLSTWNFSGAMRSSSYPAVHMADVLRLAVVYKYGGVYLDLDIVMLRSLQDLHN 54
>gi|358465737|ref|ZP_09175639.1| hypothetical protein HMPREF9093_00098 [Fusobacterium sp. oral taxon
370 str. F0437]
gi|357069769|gb|EHI79645.1| hypothetical protein HMPREF9093_00098 [Fusobacterium sp. oral taxon
370 str. F0437]
Length = 243
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW ++D +R LY YGG YLD D +IK + SL N + G E+ ++ G++ +
Sbjct: 62 QLWAF--VADYVRVKILYNYGGIYLDTDMEIIKDISSLLNSDLFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E + + +NI IY
Sbjct: 120 IPKH----RVFKKMYEFYQNEIWKSPLHIVTNILTDILEEEYQGRYKENNI------DIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + +D+YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTEACIKKDTYAIHWWGKSWKKNP 206
>gi|159479690|ref|XP_001697923.1| hypothetical protein CHLREDRAFT_159170 [Chlamydomonas reinhardtii]
gi|158274021|gb|EDO99806.1| predicted protein [Chlamydomonas reinhardtii]
Length = 431
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 64/126 (50%), Gaps = 11/126 (8%)
Query: 149 NVHIVQVDLGRYFQNTPLHGFYTQDAI---LTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
VH+V+ + Q+TP+ +Y + + W SH++DL+R+ +YK+GG Y+D D
Sbjct: 110 GVHLVRYGVAEVLQDTPVGSWYVEKRVELEAGKYW-FSHVTDLMRFALVYKHGGIYMDTD 168
Query: 206 FIVIK-----SLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGAN 260
+V++ ++ L G SS V+ + H L+++ +++ TEW +
Sbjct: 169 VLVMRPISPANVNKLVRAVG--DSSCFECAVMFFEAGHSYLFEVLKQIPRHYRGTEWISA 226
Query: 261 GPGVLT 266
GP LT
Sbjct: 227 GPKALT 232
>gi|408409876|ref|ZP_11181143.1| Glycosyltransferase in exopolysaccharide biosynthesis
[Lactobacillus sp. 66c]
gi|407875928|emb|CCK82949.1| Glycosyltransferase in exopolysaccharide biosynthesis
[Lactobacillus sp. 66c]
Length = 241
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 33/185 (17%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW + D LR LY GG YLD D VIKS + L N + G ES+ + G+I
Sbjct: 63 KLWAFA--VDYLRLYVLYNEGGIYLDTDVEVIKSFDDLLNQEAFIGMESTGFIGTGIIGS 120
Query: 235 DKDHWLSGAALRELRDNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
+ A++E D + W N P + T++L+ E + N IY
Sbjct: 121 EP----KNPAIKEFLDFYTQKIWNVDYYNNPIIFTKVLEEEPFVRD---------NINIY 167
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS---FTKRVPVKLGSEQPYAQ 348
P +F P + E Y+N T ++Y +H +N+ ++ V L ++Q +
Sbjct: 168 PIEYFSPYNPETDKCYVNVT---------ENTYTIHWYNANWNVKRKGYVFLNTKQYRSP 218
Query: 349 IARRY 353
+AR +
Sbjct: 219 VARSF 223
>gi|414887163|tpg|DAA63177.1| TPA: hypothetical protein ZEAMMB73_977519, partial [Zea mays]
Length = 325
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 62/130 (47%), Gaps = 7/130 (5%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
E V + +L ++TP H F + + T + L H S+L+R LYKYGG YLD
Sbjct: 157 EGYKVAVAVPNLDELLESTPTHVFASVWYEWRQTKYYHL-HYSELVRLAALYKYGGIYLD 215
Query: 204 LDFIVIKSLESLHNYAGA----ESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA 259
D I++K L SL N GA +S V+ +K L LRE + T
Sbjct: 216 SDVIILKPLTSLRNSIGATNHVSGNSRFGGAVLAFEKQSPLLEECLREFYSTYDDTLVQW 275
Query: 260 NGPGVLTRLL 269
NG ++TR++
Sbjct: 276 NGAELMTRVI 285
>gi|357475201|ref|XP_003607886.1| hypothetical protein MTR_4g084060 [Medicago truncatula]
gi|85719350|gb|ABC75355.1| Glycosyltransferase sugar-binding region containing DXD motif;
Alpha 1,4-glycosyltransferase conserved region [Medicago
truncatula]
gi|355508941|gb|AES90083.1| hypothetical protein MTR_4g084060 [Medicago truncatula]
Length = 576
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 96/246 (39%), Gaps = 26/246 (10%)
Query: 104 LTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQN 163
T+R +ES +P V V + ++ + Y+ + +V +L + +
Sbjct: 315 FTVRYQRGLESLLFHHPNACVVVFSETIELDFFKDSFVKDGYK---IAVVMPNLDQLLEG 371
Query: 164 TPLHGFYTQ--DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
TP + F T + T + +H S+L+R LYKYGG YLD D IV+K + L+N G
Sbjct: 372 TPANIFSTVWFEWRKTKFYS-THYSELIRLAALYKYGGIYLDSDIIVLKPISFLNNSVGM 430
Query: 222 E---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSY 278
E + S + ++ + L E + NG +LTR+ + K
Sbjct: 431 EDQAAGSSLNGALMAFGRHSLFIKECLEEFYMTYDDNSLRWNGADLLTRVAQ---KFVGE 487
Query: 279 AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT-----------MSLFRDSYALH 327
+ I P FYP++ Y AP T + +S H
Sbjct: 488 ENKTIKQLELNKEPSHVFYPINSHDITRYF---VAPTTEMDKAQQDVLLEKILHESLTFH 544
Query: 328 VWNSFT 333
WNS T
Sbjct: 545 FWNSLT 550
>gi|395231629|ref|ZP_10409915.1| polysaccharide biosynthesis protein family protein [Citrobacter sp.
A1]
gi|424730493|ref|ZP_18159089.1| polysaccharide biosynthesis protein family protein [Citrobacter sp.
L17]
gi|394714615|gb|EJF20531.1| polysaccharide biosynthesis protein family protein [Citrobacter sp.
A1]
gi|422895063|gb|EKU34853.1| polysaccharide biosynthesis protein family protein [Citrobacter sp.
L17]
Length = 235
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 72/156 (46%), Gaps = 29/156 (18%)
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESLH---NYAGAESSSVVAAGVIHLDKDHWLSGA 243
D +R + L ++GG YLD D +IK L SL+ NY G ESS ++ GV+ K++W A
Sbjct: 70 DYIRCLILERFGGVYLDADIEMIKDLSSLNEGGNYLGMESSRKISCGVLAFSKNNWFV-A 128
Query: 244 ALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYA------HNIISCRNFTIYPPRFFY 297
L+E + ANG +T+ L E + + + + TIYP +FY
Sbjct: 129 KLKEKVE-------AANGLVEITK-LSTEVLSERFGVLDLDLKESLVLDDITIYPKHYFY 180
Query: 298 PVHWEHWADYLNETNAPATMSLFRD----SYALHVW 329
P + + A +F+D +Y +H W
Sbjct: 181 PYN-------PYDEGAKIDQLMFKDIKPETYCIHHW 209
>gi|338973774|ref|ZP_08629137.1| hypothetical protein CSIRO_2224 [Bradyrhizobiaceae bacterium SG-6C]
gi|338233369|gb|EGP08496.1| hypothetical protein CSIRO_2224 [Bradyrhizobiaceae bacterium SG-6C]
Length = 590
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 87/202 (43%), Gaps = 26/202 (12%)
Query: 153 VQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIK-S 211
V++ R + P+H F+ +D L+ L+ SDL RY+ + K+GG Y+DLD +V+K S
Sbjct: 54 VELVDAREVLSGPIHQFHHKDGDLS----LALHSDLFRYLAIQKFGGWYMDLDIVVLKPS 109
Query: 212 LESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRE---LRDNFKTTEWGAN----GPGV 264
L Y + V A V+ + AA+ E L T+ GA+ GP +
Sbjct: 110 LPEDKIYLAYQEDGVANAAVMKFPAQSPIMTAAIDEAMRLLPEAGTSAPGADHGIVGPAL 169
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
+TRL YA + + + P Y +H + + A S
Sbjct: 170 ITRLAS------EYAIDHL------VRPKSSAYEIHPNEVLMFFDPAQCEAAFQRVASSD 217
Query: 325 ALHVWNSFTK--RVPVKLGSEQ 344
+H+WN + R+P LG +
Sbjct: 218 FVHLWNDLWRALRIPKNLGPPE 239
>gi|416903659|ref|ZP_11930609.1| hypothetical protein B1M_01533 [Burkholderia sp. TJI49]
gi|325529519|gb|EGD06420.1| hypothetical protein B1M_01533 [Burkholderia sp. TJI49]
Length = 258
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 19/153 (12%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKS--LESLHNYAGAESSSVVAAGVIHLDKDHW 239
++H +D R++ K ++D D ++++ L N G E + ++ LD D
Sbjct: 41 MAHFTDYFRFMMFTKTDEIWVDTDILLLRDFDLGDTRNLIGREDPIKICNAILRLDSDDP 100
Query: 240 LSGAALRELRDNFKT-TEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYP 298
+R++ T +WG GP +LT++ + P FFYP
Sbjct: 101 RLHDTIRQIEAMKGTPIQWGETGPLLLTKIYGIDAG----------------LPQNFFYP 144
Query: 299 VHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
VH++ + + +L DSY LH+WN+
Sbjct: 145 VHYDDYYKVFLPEHFDECAALCADSYTLHLWNN 177
>gi|222637301|gb|EEE67433.1| hypothetical protein OsJ_24783 [Oryza sativa Japonica Group]
Length = 645
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 100/258 (38%), Gaps = 60/258 (23%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
E V + +L + T H F + + T +PL H S+L+R LYKYGG YLD
Sbjct: 392 EGYKVAVALPNLDELLEGTLTHDFVSVWNEWRKTKYYPL-HYSELVRLAALYKYGGIYLD 450
Query: 204 LDFIVIKSLESLHNYAG-----AESSSVVAAGVIHLDKDH------WLS----------- 241
D +V+K L +L N G +E+SS A V+ +K+ WLS
Sbjct: 451 SDVVVLKPLNALRNSIGVVKQVSENSSFSGA-VLAFEKNSQLPFKGWLSKPIDQEQCRKR 509
Query: 242 -------------GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNF 288
L+E + NG ++TR+++ A + +
Sbjct: 510 CSNVIKLNESPFLAECLKEFHSTYDDELLQWNGAELMTRVIR---NMSDKADDNSGHLDI 566
Query: 289 TIYPPRFFYPVHWEHWADYLNETNAPATMS--------LFRDSYALHVWNSFTKR-VPVK 339
P FYP+ Y +E ++ + + DS H+WNS T VP
Sbjct: 567 KFEPSVAFYPISSTDITRYFSEADSTDERAQHDALFSRIVNDSTTFHLWNSITSSLVP-- 624
Query: 340 LGSEQPYAQIAR---RYC 354
+P + + R RYC
Sbjct: 625 ----EPNSLVERILNRYC 638
>gi|269121002|ref|YP_003309179.1| glycosyltransferase family protein [Sebaldella termitidis ATCC
33386]
gi|268614880|gb|ACZ09248.1| glycosyltransferase sugar-binding region containing DXD motif
[Sebaldella termitidis ATCC 33386]
Length = 243
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 82/180 (45%), Gaps = 20/180 (11%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +++SD +R + LY++GG Y+D+D +IK+++ L + G E V+ G+
Sbjct: 63 KLW--AYISDYVRVLHLYEHGGIYMDIDMEIIKNIDGLLKNKFFIGYEDEKNVSVGIFGT 120
Query: 235 DKDHWLSGAALRELRDN--FKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYP 292
K H + E DN ++ W P + T +L+ + ++ I + +YP
Sbjct: 121 VKGHIFLKKVI-EFYDNEIWEKPLWTI--PKIFTYILERDFGLKA-GGRYIENEDMKLYP 176
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARR 352
+FYP H +ET ++ ++Y +H WN + V + E + ++
Sbjct: 177 KEYFYPYH-------FSETYTDECIT--ENTYGIHWWNDSWNSMKVSMFMETKHLHGIKK 227
>gi|168481288|gb|ACA24779.1| WffM [Shigella dysenteriae]
Length = 228
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/167 (25%), Positives = 75/167 (44%), Gaps = 24/167 (14%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLSG 242
+DL+R LY++GG YLD D +++ + +L + G E + G++ + WL
Sbjct: 67 ADLIRCDVLYRFGGIYLDTDMELVRDISALRKNIAFIGEEDIDTPSCGILGCEPKFWL-- 124
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+EL+ P +L +L + + +I + ++ TIY ++FYP
Sbjct: 125 --FQELKAAVIKANGMQTIPFLLKNILDLHGVKKIDSQDISTIKDITIYSDKYFYP---- 178
Query: 303 HWADYLNETNAPATMSLFR----DSYALHVWN-----SFTKRVPVKL 340
Y +A + L+R D YA+H W SF +R+ K+
Sbjct: 179 ----YNPYGSAKRSQLLYRYITKDCYAIHHWAKSWKLSFLERIKRKI 221
>gi|449434200|ref|XP_004134884.1| PREDICTED: uncharacterized protein At4g19900-like [Cucumis sativus]
Length = 634
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 69/161 (42%), Gaps = 23/161 (14%)
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLS-----HMSDLLRYVTLYKYGGTYLDL 204
V + +L ++TP H F A + W + H S+L+R LYKYGG YLD
Sbjct: 479 VAVAMPNLDELLKDTPTHKF----ASIWFEWKKTEFYSTHYSELVRLAALYKYGGIYLDS 534
Query: 205 DFIVIKSLESLHNYAGAE---SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
D +V+K L SLHN G E + S + V+ ++E + + NG
Sbjct: 535 DIVVLKPLSSLHNSVGMEDQLAGSSLNGAVMAFRMHSPFIMECMKEYYSTYDDRSFRWNG 594
Query: 262 PGVLTRL---LKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
+LTR+ +E + + T+ P F+P+
Sbjct: 595 AELLTRVANRFSSEVPAEQF--------ELTVQPSFAFFPI 627
>gi|414172427|ref|ZP_11427338.1| hypothetical protein HMPREF9695_00984 [Afipia broomeae ATCC 49717]
gi|410894102|gb|EKS41892.1| hypothetical protein HMPREF9695_00984 [Afipia broomeae ATCC 49717]
Length = 595
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 89/202 (44%), Gaps = 26/202 (12%)
Query: 153 VQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIK-S 211
V++ R + PL+ F+ +D L+ L+ SDL RY+ + K+GG Y+DLD +V+K +
Sbjct: 55 VELADAREVLSGPLYQFHHKDGDLS----LALHSDLFRYLAIQKFGGWYMDLDIVVLKAA 110
Query: 212 LESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRE---LRDNFKTTEWGAN----GPGV 264
L Y + +V A V+ + AA+ E L T GA+ GP +
Sbjct: 111 LPDDKIYLAYQEDGIVNAAVMKFPAGSPIMTAAIDEAMRLLPVAGTAAPGADHGIVGPKL 170
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
+TRL S + I + I P Y +H + + A ++ S
Sbjct: 171 ITRL--------STEYAI----DHLIRPKVSAYEIHPNEVLMFFDPAQCEAALARLASSD 218
Query: 325 ALHVWNSFTK--RVPVKLGSEQ 344
+H+WN + R+P LG +
Sbjct: 219 FVHLWNDLWRALRIPKNLGPPE 240
>gi|414166112|ref|ZP_11422346.1| hypothetical protein HMPREF9696_00201 [Afipia clevelandensis ATCC
49720]
gi|410894872|gb|EKS42658.1| hypothetical protein HMPREF9696_00201 [Afipia clevelandensis ATCC
49720]
Length = 590
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 86/202 (42%), Gaps = 26/202 (12%)
Query: 153 VQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIK-S 211
V++ R + P+H F+ +D L+ L+ SDL RY+ + K+GG Y+DLD +V+K S
Sbjct: 54 VELVDAREVLSGPIHQFHHKDGDLS----LALHSDLFRYLAIQKFGGWYMDLDIVVLKPS 109
Query: 212 LESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRE---LRDNFKTTEWGAN----GPGV 264
L Y + V A V+ + AA+ E L T+ GA+ GP +
Sbjct: 110 LPEDKIYLAYQEDGVANAAVMKFPAQSPIMTAAIDEAMRLLPEAGTSAPGADHGIVGPAL 169
Query: 265 LTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSY 324
+TRL YA + + + P Y +H + + S
Sbjct: 170 ITRLAS------EYAIDHL------VRPKSSAYEIHPNEVLMFFDPAQCELAFQRVASSD 217
Query: 325 ALHVWNSFTK--RVPVKLGSEQ 344
+H+WN + R+P LG +
Sbjct: 218 FVHLWNDLWRALRIPKNLGPPE 239
>gi|159479692|ref|XP_001697924.1| hypothetical protein CHLREDRAFT_151494 [Chlamydomonas reinhardtii]
gi|158274022|gb|EDO99807.1| predicted protein [Chlamydomonas reinhardtii]
Length = 370
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/118 (23%), Positives = 62/118 (52%), Gaps = 3/118 (2%)
Query: 152 IVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVI 209
+V D+ ++TPL ++ ++ + + + SH++D++R+ +YK+GG YLD D I++
Sbjct: 114 LVHYDVHDVLEDTPLGSWFKDKEEKLRSGKYYFSHVTDMMRFALVYKHGGLYLDADVIMM 173
Query: 210 KSLESLH-NYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
+ + H N +++ V++ + H L +++++ +W GP LT
Sbjct: 174 RPISLSHLNAVVRPPHTMIECAVVYFEAGHPFIWQVLTHIKNHYAINDWTTAGPRALT 231
>gi|428178432|gb|EKX47307.1| hypothetical protein GUITHDRAFT_106757 [Guillardia theta CCMP2712]
Length = 500
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/282 (21%), Positives = 113/282 (40%), Gaps = 43/282 (15%)
Query: 109 ACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHG 168
AC IE+ NPG V V S P+ + + V + ++L F TP+
Sbjct: 92 ACKIEAFLNKNPGHTALVHVKNSETLSRSFPV-----KSERVQLRVINLTEDFHATPMEE 146
Query: 169 FYTQDAILTSLWPLSHMSDLLRYVTLYKYGG--TYLDLDFIVIKSLESLHNYA------- 219
++ T+L + D LR +YKYGG Y+DLD + + ++ HN +
Sbjct: 147 WFQSGVWQTALHKALDLCDGLRLAIIYKYGGRSCYVDLDMVSLNRID--HNGSVLVAMDE 204
Query: 220 GAESSSVVAAGV-IHLDKD-------HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKA 271
G S+ G +L D H ++ L +F + + GP + + + +
Sbjct: 205 GQRSTWETPWGAHFYLGTDFFQFPPRHQFVNDLMKSLPSHFSPSGYALLGPSLFSAVYQD 264
Query: 272 ECKPQSYAHNIISCRNFTIYPPRFFYPVHW--------EHWADYLNETNA----PAT--- 316
+C ++ C + TI P+ F+PV+ E + L + + P T
Sbjct: 265 KCLRENETRPSY-CNSMTILEPKAFHPVNMFNRLRRNDEGQGNDLQDNSVVLSFPWTERC 323
Query: 317 ---MSLFRDSYALHVWNSFTKRVPVKLGSEQPYAQIARRYCP 355
+ + S +H+W++ + + L + A+I + CP
Sbjct: 324 TQIKEIIQSSIGMHMWDTKRGSLNLDLSTNSMLAKITQVTCP 365
>gi|340752310|ref|ZP_08689111.1| polysaccharide biosynthesis protein [Fusobacterium sp. 2_1_31]
gi|422317392|ref|ZP_16398749.1| hypothetical protein FPOG_02064 [Fusobacterium periodonticum D10]
gi|229422113|gb|EEO37160.1| polysaccharide biosynthesis protein [Fusobacterium sp. 2_1_31]
gi|404589861|gb|EKA92401.1| hypothetical protein FPOG_02064 [Fusobacterium periodonticum D10]
Length = 243
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 74/166 (44%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW ++D +R LY YGG YLD D +IK + L + + G E+ + ++ G++ +
Sbjct: 62 QLWAF--VADYVRVKILYNYGGIYLDTDMEIIKDISPLLDTDMFLGYENENTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H ++ + ++ W ++ +LT +L+ E + + +NI IY
Sbjct: 120 IPKH----KVFEKMYEFYQDEIWKSSLHIITNILTDILEEEYQGKYKQNNI------NIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + +++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTETCIKKNTYAIHWWGKSWKKNP 206
>gi|302792002|ref|XP_002977767.1| glycosyltransferase CAZy family GT32 [Selaginella moellendorffii]
gi|300154470|gb|EFJ21105.1| glycosyltransferase CAZy family GT32 [Selaginella moellendorffii]
Length = 734
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 59/140 (42%), Gaps = 19/140 (13%)
Query: 150 VHIVQVDLGRYFQNTPLHGFYTQDAILTSLWP--------LSHMSDLLRYVTLYKYGGTY 201
V +V+ +L +NTP F S+W H S+LLR LYKYGG Y
Sbjct: 506 VAVVRPNLEELLENTPAEMF-------ASVWVEWRRVELFYIHYSELLRLAALYKYGGVY 558
Query: 202 LDLDFIVIKSLESLHNYAG----AESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
LD D +V+K L SL N G A+ + + V+ K + E +
Sbjct: 559 LDSDVVVLKPLTSLQNAVGMEALADGKTRLNGAVMAFKKASVFLKECMEEYTATYDDKLL 618
Query: 258 GANGPGVLTRLLKAECKPQS 277
NG +LTR+ + QS
Sbjct: 619 DYNGADLLTRVASSAIPGQS 638
>gi|296116055|ref|ZP_06834675.1| hypothetical protein GXY_09684 [Gluconacetobacter hansenii ATCC
23769]
gi|295977409|gb|EFG84167.1| hypothetical protein GXY_09684 [Gluconacetobacter hansenii ATCC
23769]
Length = 297
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 21/163 (12%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLE--SLHNYAGAESSSVVAAGVIHLDKDHW 239
LSH SDL RY + ++D D ++++ ++ E + V ++ L+++H
Sbjct: 75 LSHFSDLFRYELSRRTDHIWVDADMLMLRRIDLPRWDMLLAKEDPTSVCGAIMRLERNHP 134
Query: 240 LSGAALRE---LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+ E LRD + WGA GP +LT L + Q Y P++F
Sbjct: 135 KLDHLVHETLALRD--RDLVWGATGPRLLTSLFGKQVIMQES------------YAPQYF 180
Query: 297 YPVHW-EHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPV 338
+P+H+ E W L + L +Y LH+WN R+ +
Sbjct: 181 FPIHFREAWKILLPQYRQECE-KLCASAYTLHLWNDQMVRIGI 222
>gi|323344103|ref|ZP_08084329.1| hypothetical protein HMPREF0663_10865 [Prevotella oralis ATCC
33269]
gi|323094832|gb|EFZ37407.1| hypothetical protein HMPREF0663_10865 [Prevotella oralis ATCC
33269]
Length = 254
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 75/174 (43%), Gaps = 39/174 (22%)
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDK 236
L+ +H SD++R + L+KYGG YLD+D I ++ +L N +S V+A ++H +
Sbjct: 75 NKLYHYAHKSDVIRLMILFKYGGIYLDIDTICLRPFTNLLN-----NSCVMAKELLHNGE 129
Query: 237 DHWLSGAAL----------------RELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAH 280
+H L A + + R K W + V ++ AE P +
Sbjct: 130 EHGLCNAVILAEKGSKFIQYWLSTYKHFRSKGKDIYWAEHSVRVPLKI--AELYPNT--- 184
Query: 281 NIISCRNFTIYPPR-FFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFT 333
I P R FFYP +++ L + N F ++Y H+W S +
Sbjct: 185 -------IHIEPERSFFYPSYFQEDLKMLFQKNLS-----FPEAYVFHLWESLS 226
>gi|336418616|ref|ZP_08598889.1| hypohetical Teichoic Acid Biosynthesis Protein [Fusobacterium sp.
11_3_2]
gi|336164462|gb|EGN67368.1| hypohetical Teichoic Acid Biosynthesis Protein [Fusobacterium sp.
11_3_2]
Length = 243
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + +L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYVRVKVLYNYGGIYLDTDMEIIKDITALLDADIFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + ++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITANTYAIHWWGKSWKKNP 206
>gi|209516656|ref|ZP_03265509.1| conserved hypothetical protein [Burkholderia sp. H160]
gi|209502931|gb|EEA02934.1| conserved hypothetical protein [Burkholderia sp. H160]
Length = 288
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 75/172 (43%), Gaps = 41/172 (23%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKS--LESLHNYAGAESSSVVAAGVIHLDKDHW 239
+ H +D R+V + ++D D ++++ L + + G E+SS + ++ LD DH
Sbjct: 73 MQHFTDYFRFVMFTRTDEIWVDTDMLLLRDFDLSAKGDLIGRETSSSICNAMLRLDPDH- 131
Query: 240 LSGAALRELRDNFKTTE-----WGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPR 294
L EL + + + WG GP +LT + + P
Sbjct: 132 ---PRLHELIERVEAMKGTALKWGDTGPRLLTAVYGVKAG----------------LPES 172
Query: 295 FFYPVHWEHW-----ADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLG 341
FYPVH++ + Y +E A L D+Y LH+WN+ R+ VK+G
Sbjct: 173 LFYPVHFDDYYKVFLPRYFDECEA-----LCSDAYTLHLWNN---RI-VKMG 215
>gi|336401210|ref|ZP_08581982.1| hypothetical protein HMPREF0404_01273 [Fusobacterium sp. 21_1A]
gi|336161567|gb|EGN64568.1| hypothetical protein HMPREF0404_01273 [Fusobacterium sp. 21_1A]
Length = 243
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 69/166 (41%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYIRVKVLYNYGGIYLDTDMEIIKDITPLLDADMFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + ++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITGNTYAIHWWGKSWKKNP 206
>gi|321467173|gb|EFX78164.1| hypothetical protein DAPPUDRAFT_105426 [Daphnia pulex]
Length = 397
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 103 ELTLRQACSIESAAMMNPG--VQVYV---VVIASVRNRTRNPLIDRLYEYQNVHIVQVDL 157
L +RQ+CS+ESAA NP +Q+++ + + + + L Y NV IVQ+D
Sbjct: 104 SLNVRQSCSVESAARHNPDRPIQIFMRWKTTSSDDPGNSTDAWLAVLSHYNNVEIVQIDD 163
Query: 158 G-RYFQNTPLHGFYTQD-AILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIV 208
Y NT L+ ++ + LSH D +R +TL+K GG +LD+D ++
Sbjct: 164 DILYSNNTALYNWFKMKWRTANAQQHLSH--DYIRSLTLFKGGGLFLDMDKVM 214
>gi|237744224|ref|ZP_04574705.1| polysaccharide biosynthesis protein [Fusobacterium sp. 7_1]
gi|260494195|ref|ZP_05814326.1| polysaccharide biosynthesis protein [Fusobacterium sp. 3_1_33]
gi|423136979|ref|ZP_17124622.1| hypothetical protein HMPREF9942_00760 [Fusobacterium nucleatum
subsp. animalis F0419]
gi|229431453|gb|EEO41665.1| polysaccharide biosynthesis protein [Fusobacterium sp. 7_1]
gi|260198341|gb|EEW95857.1| polysaccharide biosynthesis protein [Fusobacterium sp. 3_1_33]
gi|371961046|gb|EHO78689.1| hypothetical protein HMPREF9942_00760 [Fusobacterium nucleatum
subsp. animalis F0419]
Length = 243
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 69/166 (41%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYVRVKVLYNYGGIYLDTDMEIIKDITPLLDADMFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + ++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITGNTYAIHWWGKSWKKNP 206
>gi|19704576|ref|NP_604138.1| hypothetical protein FN1241 [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|296327824|ref|ZP_06870362.1| hypothetical Teichoic Acid Biosynthesis protein [Fusobacterium
nucleatum subsp. nucleatum ATCC 23726]
gi|19714868|gb|AAL95437.1| polysaccharide biosynthesis protein [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|296155080|gb|EFG95859.1| hypothetical Teichoic Acid Biosynthesis protein [Fusobacterium
nucleatum subsp. nucleatum ATCC 23726]
Length = 243
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 69/166 (41%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYVRVKVLYNYGGIYLDTDMEIIKDITPLLDADMFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + ++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITGNTYAIHWWGKSWKKNP 206
>gi|332292699|ref|YP_004431308.1| glycosyltransferase family protein [Krokinobacter sp. 4H-3-7-5]
gi|332170785|gb|AEE20040.1| glycosyltransferase sugar-binding region containing DXD motif
[Krokinobacter sp. 4H-3-7-5]
Length = 258
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 78/180 (43%), Gaps = 21/180 (11%)
Query: 162 QNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---Y 218
NT + Q AI W + ++D++R L GG YLD D +++KSL+ N +
Sbjct: 42 NNTDCSLPFVQFAIRQKKW--AFVADVIRLDVLINQGGIYLDTDMLLVKSLDEYLNSKLF 99
Query: 219 AGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG-------PGVLTRLLKA 271
G E+ ++ +I D+ ++++R + +G + P +L+ K
Sbjct: 100 IGVEAPHMINGAIIGSVCDN----RYIKKVRRYYDQINFGVDFNFHNSTIPIILSETFKN 155
Query: 272 ECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+ N T+YP + FYP+ H D N N L D+ A+H+WN+
Sbjct: 156 MFGKDLNILEPTTIDNVTMYPSQVFYPLPNSHKMDIKNYKNY-----LSEDTAAVHLWNA 210
>gi|262037806|ref|ZP_06011248.1| glycosyltransferase [Leptotrichia goodfellowii F0264]
gi|261748278|gb|EEY35675.1| glycosyltransferase [Leptotrichia goodfellowii F0264]
Length = 251
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/163 (22%), Positives = 69/163 (42%), Gaps = 24/163 (14%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL------ESLHNYAGAESSSVVAAGV 231
LW +++SD +R +Y++ G Y+D D +IK + E++ + G E ++ G+
Sbjct: 63 KLW--AYVSDYIRVHYMYEHSGIYVDTDMEIIKDITPLITKENMKFFIGYEDEKHISVGI 120
Query: 232 IHLDKDHWLSGAALRELRDNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNIISCRNF 288
D+ L++L + ++ W P + T + + + N +
Sbjct: 121 FGTDR----HNEVLKDLTEFYEKEIWEKPLWTIPKIFTYIFEKKYGLTDKRENTLKKGEI 176
Query: 289 TIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
TIYP +FYP E P ++ ++Y +H WN
Sbjct: 177 TIYPKEYFYPYG-------FKEKYTPQCIT--ENTYGIHWWND 210
>gi|254302907|ref|ZP_04970265.1| possible glycosyltransferase [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
gi|148323099|gb|EDK88349.1| possible glycosyltransferase [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length = 243
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 69/166 (41%), Gaps = 27/166 (16%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYVRVKILYNYGGIYLDTDMEIIKDITPLLDTDMFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
P +FYP N + ++YA+H W K+ P
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITANTYAIHWWGKSWKKNP 206
>gi|159479688|ref|XP_001697922.1| hypothetical protein CHLREDRAFT_151492 [Chlamydomonas reinhardtii]
gi|158274020|gb|EDO99805.1| predicted protein [Chlamydomonas reinhardtii]
Length = 467
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/155 (23%), Positives = 74/155 (47%), Gaps = 13/155 (8%)
Query: 120 PGVQVYVVVIASVRNRTRNPLIDRL-YEYQNVHIVQVDLGRYFQNTPLHGFYTQD--AIL 176
PG V ++A N+ ++ L + + V++ + + Q+TPL +Y + +
Sbjct: 88 PGAHGGVYLLA---NQLSEEKMEELGWRRRGVYLTRYSVEDVLQDTPLSSWYVEKRAELE 144
Query: 177 TSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA-----ESSSVVAAGV 231
+ SH++DL+R+ +YK+GG Y+D D +V++ + HN+ S+ V
Sbjct: 145 AGKYWFSHVTDLMRFALVYKHGGIYMDTDVLVMRPIS--HNHVNKLVRALSDSNWFECAV 202
Query: 232 IHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
+ H L+++ +++ +W + GP LT
Sbjct: 203 MFFQAGHPYLFEVLKQIPRHYRAVDWISAGPKALT 237
>gi|186471471|ref|YP_001862789.1| hypothetical protein Bphy_6719 [Burkholderia phymatum STM815]
gi|184197780|gb|ACC75743.1| hypothetical protein Bphy_6719 [Burkholderia phymatum STM815]
Length = 303
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 68/164 (41%), Gaps = 25/164 (15%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKS--LESLHNYAGAESSSVVAAGVIHLDKDHW 239
++H +D RY ++D D +++K L N G E++ ++ LD +
Sbjct: 73 MAHFTDYFRYAMFTHTEEIWVDTDMLLLKDFDLRESFNMIGRETAESACTAILRLDPNDA 132
Query: 240 LSGAALRELRDNFKT--TEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFY 297
A +R + D K +WG GP +LT + Y P +FY
Sbjct: 133 RLSALIRRV-DAIKELPIKWGDTGPRLLTAVYGVA----------------DTYAPPYFY 175
Query: 298 PVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLG 341
PVH++ + A ++Y LH+WN+ RV VK+G
Sbjct: 176 PVHFDDYYKVFLPRFADECAKQCSNAYTLHLWNN---RV-VKMG 215
>gi|322510950|gb|ADX06263.1| glycotransferase family protein [Organic Lake phycodnavirus 2]
Length = 443
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 65/139 (46%), Gaps = 25/139 (17%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL----HNYAGAESSS----VV 227
+ + ++ +D++R+ LY+YGG +LD D I + ++ + +AG E + ++
Sbjct: 33 IAEMKEINGQADIIRWEILYEYGGVFLDADSICVDKIDDVLMNCKCFAGWEHETLRKGLI 92
Query: 228 AAGVIHLDKDHWLSGAALRELRDNF----KT--TEWGANGPGVLTRLLKAECKPQSYAHN 281
A G + H L A+ ++ N KT W + GPG+LTR+ +N
Sbjct: 93 ATGTMGFPPKHPLVKEAIEWIKQNCVDYDKTGLMAWQSVGPGLLTRM-----------YN 141
Query: 282 IISCRNFTIYPPRFFYPVH 300
+ TI+P F P+H
Sbjct: 142 SGHYDDMTIFPSYTFLPIH 160
>gi|367066733|gb|AEX12645.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066743|gb|AEX12650.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066747|gb|AEX12652.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066749|gb|AEX12653.1| hypothetical protein 2_6009_01 [Pinus taeda]
Length = 110
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 27/94 (28%), Positives = 48/94 (51%), Gaps = 16/94 (17%)
Query: 257 WGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN---- 312
WG NGP ++TR++ + K + + I PP FYPV W H + Y + +
Sbjct: 5 WGHNGPYLVTRVVTSLPKQ--------TLDDIKIMPPAAFYPVDWGHISSYFSSPSDRGH 56
Query: 313 ----APATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + L +++YA+H+WN ++ V ++ GS
Sbjct: 57 LKWISAKIIQLEKEAYAIHLWNKQSRGVNMEEGS 90
>gi|367066737|gb|AEX12647.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066745|gb|AEX12651.1| hypothetical protein 2_6009_01 [Pinus taeda]
Length = 110
Score = 49.7 bits (117), Expect = 0.002, Method: Composition-based stats.
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 16/94 (17%)
Query: 257 WGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN---- 312
WG NGP ++TR++ + P+ +I I PP FYPV W H + Y + +
Sbjct: 5 WGHNGPYLVTRVVTS--LPKQTRDDI------KIMPPAAFYPVDWGHISSYFSSPSDRGH 56
Query: 313 ----APATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + L +++YA+H+WN ++ V ++ GS
Sbjct: 57 LKWISAKIIQLEKEAYAIHLWNKQSRGVNMEEGS 90
>gi|367066731|gb|AEX12644.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066735|gb|AEX12646.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066739|gb|AEX12648.1| hypothetical protein 2_6009_01 [Pinus taeda]
gi|367066741|gb|AEX12649.1| hypothetical protein 2_6009_01 [Pinus taeda]
Length = 110
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 16/96 (16%)
Query: 255 TEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN-- 312
WG NGP ++TR++ + K +F I PP FYPV W H + Y + +
Sbjct: 3 NRWGHNGPYLVTRVVTSLPKQTR--------DDFKIMPPAAFYPVDWGHISSYFSSPSDR 54
Query: 313 ------APATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ + L +++YA+H+WN ++ V ++ GS
Sbjct: 55 GHLKWISAKIIQLEKEAYAIHLWNKQSRGVNMEEGS 90
>gi|289766677|ref|ZP_06526055.1| polysaccharide biosynthesis protein [Fusobacterium sp. D11]
gi|289718232|gb|EFD82244.1| polysaccharide biosynthesis protein [Fusobacterium sp. D11]
Length = 199
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 66/158 (41%), Gaps = 27/158 (17%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHL 234
LW +SD +R LY YGG YLD D +IK + L + + G E+ ++ G++ +
Sbjct: 62 KLWAF--VSDYVRVKVLYNYGGIYLDTDMEIIKDITPLLDADMFLGYENEDTMSFGIVGV 119
Query: 235 DKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
H +++ + ++ W + +LT +L+ E H N IY
Sbjct: 120 IPKH----KVFKKMYEFYQNEIWKSPLHIVTSILTEILEKE------YHGKYRENNINIY 169
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVW 329
P +FYP N + ++YA+H W
Sbjct: 170 PREYFYP---------FNHDEEFTKDCITGNTYAIHWW 198
>gi|428942839|ref|ZP_19015802.1| polysaccharide biosynthesis protein [Klebsiella pneumoniae VA360]
gi|426297873|gb|EKV60325.1| polysaccharide biosynthesis protein [Klebsiella pneumoniae VA360]
Length = 219
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 93/243 (38%), Gaps = 62/243 (25%)
Query: 96 CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV 155
C G EL + IES ++ NP Y ++I +N + +
Sbjct: 8 CIWLGGRELPALEKKCIESWSIYNPD---YKIIIWDEKN------------------LVI 46
Query: 156 DLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
D Y + A L W ++ +DL R LY+YGG YLD D I S ++L
Sbjct: 47 DDPEY-----------KSAYLKKQW--AYCADLARMRLLYEYGGIYLDTDMEAIASFDNL 93
Query: 216 HNY---AGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAE 272
Y AG E ++ ++ +K H + E++ + +T P ++T +
Sbjct: 94 LKYSFFAGKEDDEMINGAILGAEKGHEFVFSIYEEVKKSLRTN--FIPIPRIITYI---- 147
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRD----SYALHV 328
Y N + ++ I+ FYP N +P L++D + A+H
Sbjct: 148 -----YQKN-TNLKDIKIFEKEVFYP---------FNPYASPIKQLLYKDIQKNTVAIHH 192
Query: 329 WNS 331
W+
Sbjct: 193 WSK 195
>gi|297607468|ref|NP_001060020.2| Os07g0567300 [Oryza sativa Japonica Group]
gi|255677892|dbj|BAF21934.2| Os07g0567300 [Oryza sativa Japonica Group]
Length = 605
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 53/104 (50%), Gaps = 9/104 (8%)
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
+ E V + +L + T H F + + T +PL H S+L+R LYKYGG Y
Sbjct: 390 VKEGYKVAVALPNLDELLEGTLTHDFVSVWNEWRKTKYYPL-HYSELVRLAALYKYGGIY 448
Query: 202 LDLDFIVIKSLESLHNYAG-----AESSSVVAAGVIHLDKDHWL 240
LD D +V+K L +L N G +E+SS A V+ +K+ L
Sbjct: 449 LDSDVVVLKPLNALRNSIGVVKQVSENSSFSGA-VLAFEKNSQL 491
>gi|125558841|gb|EAZ04377.1| hypothetical protein OsI_26519 [Oryza sativa Indica Group]
Length = 619
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 52/102 (50%), Gaps = 9/102 (8%)
Query: 146 EYQNVHIVQVDLGRYFQNTPLHGFYT--QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLD 203
E V + +L + T H F + + T +PL H S+L+R LYKYGG YLD
Sbjct: 368 EGYKVAVALPNLDELLEGTLTHDFVSVWNEWRKTKYYPL-HYSELVRLAALYKYGGIYLD 426
Query: 204 LDFIVIKSLESLHNYAG-----AESSSVVAAGVIHLDKDHWL 240
D +V+K L +L N G +E+SS A V+ +K+ L
Sbjct: 427 SDVVVLKPLNALRNSIGVVKQVSENSSFSGA-VLAFEKNSQL 467
>gi|295677034|ref|YP_003605558.1| hypothetical protein BC1002_1986 [Burkholderia sp. CCGE1002]
gi|295436877|gb|ADG16047.1| conserved hypothetical protein [Burkholderia sp. CCGE1002]
Length = 288
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/162 (22%), Positives = 64/162 (39%), Gaps = 27/162 (16%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIK--SLESLHNYAGAESSSVVAAGVIHLDKDHW 239
++H +D R+V K ++D D ++++ L + + G E+ S + ++ LD D
Sbjct: 73 IAHFTDYFRFVMFTKTDEIWVDTDMLLLRDFDLNAKGDLIGRETPSSICTALLRLDPDQ- 131
Query: 240 LSGAALRELRDNFKTT-----EWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPR 294
L EL + + +WG GP LT + P
Sbjct: 132 ---PRLHELIERVEAMKGTAIKWGDTGPRPLTAVYGVRAG----------------LPES 172
Query: 295 FFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV 336
FYPVH+ + +L D+Y LH+WN+ R+
Sbjct: 173 LFYPVHFNDYYKVFLPRYFEECAALCSDAYTLHLWNNRVVRM 214
>gi|170747599|ref|YP_001753859.1| hypothetical protein Mrad2831_1169 [Methylobacterium radiotolerans
JCM 2831]
gi|170654121|gb|ACB23176.1| hypothetical protein Mrad2831_1169 [Methylobacterium radiotolerans
JCM 2831]
Length = 292
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/160 (23%), Positives = 69/160 (43%), Gaps = 16/160 (10%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL--ESLHNYAGAESSSVVAAGVIHLDKDHW 239
LSH SD RY + ++D D ++++ L E ES + + ++ L+
Sbjct: 71 LSHFSDFFRYNLFDRTSHIWIDADMLMLRPLDFEVATEIIAMESPTSICGAIMRLNPGEK 130
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
L + + + WG GP +LT + E + AH PP F+P+
Sbjct: 131 LDLILNKIDKAKDRDLIWGETGPRLLTEIF-GEQSIRKVAH-----------PPNDFFPI 178
Query: 300 -HWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPV 338
H + W +L + S ++S+++H+WN+ R+ +
Sbjct: 179 SHNDFWRVFL-PSERDWCESRCKESFSVHLWNNIVDRLGI 217
>gi|227537844|ref|ZP_03967893.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242458|gb|EEI92473.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 235
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 70/162 (43%), Gaps = 32/162 (19%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA-GAESSSVVAAGVIHLDKD---- 237
S SD RY+ LYK GG ++D+D + +K + Y ES +V G I K
Sbjct: 72 SSFSDWFRYMLLYKKGGWWVDMDSVCLKYFDFEEEYCFSLESDDIVNNGFIKAPKKAEFL 131
Query: 238 ----HWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPP 293
H++ L + W GP + T +LK Q + I ++
Sbjct: 132 KDCIHYIDTKGLDNVM-------WLNFGPFLFTNVLK-----QYDSSAFIKSKD------ 173
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKR 335
+F PV+W+ ++ P +S+ +SY++H+W+ ++
Sbjct: 174 -YFCPVNWQD----TDKLIQPPLISISEESYSIHLWHEMWRQ 210
>gi|345891544|ref|ZP_08842385.1| hypothetical protein HMPREF1022_01045 [Desulfovibrio sp.
6_1_46AFAA]
gi|345048082|gb|EGW51927.1| hypothetical protein HMPREF1022_01045 [Desulfovibrio sp.
6_1_46AFAA]
Length = 273
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 60/125 (48%), Gaps = 10/125 (8%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES--LHN-YAGAESSSVVAAGVIHLDKDHW 239
+++SD +R T++ +GG YLD D V K ++ LH+ + G ++ V K
Sbjct: 69 AYISDYIRIKTIFDHGGIYLDTDVTVYKKFDNLLLHDMFIGNAINNFPEMAVFGAKK--- 125
Query: 240 LSGAALRELRDNFKTTEWGANG---PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
S L+++ + +K W + + ++++ + II ++ IYPP +F
Sbjct: 126 -SNLILKKINNFYKKDIWESPFFIITNIFKKIIQYDFHLNLNIDGIIHSKDIVIYPPEYF 184
Query: 297 YPVHW 301
+P H+
Sbjct: 185 HPFHY 189
>gi|258646555|ref|ZP_05734024.1| polysaccharide biosynthesis protein CpsM [Dialister invisus DSM
15470]
gi|260403969|gb|EEW97516.1| polysaccharide biosynthesis protein CpsM [Dialister invisus DSM
15470]
Length = 248
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 73/163 (44%), Gaps = 11/163 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSV 226
Y ++A + W +SD +R LY+YGG Y+D D V+KSL L Y +G ES +
Sbjct: 36 YCREAYESKKWAF--VSDYVRLKVLYEYGGFYMDTDVEVVKSLNPLRIYDAVSGYESKTH 93
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKAECKPQSYAHNI 282
+ G + +++ G L++ + + G+ +T+L + + +
Sbjct: 94 IQTGTLGACRNNEWIGMLLKDYDERHFIRKDGSLDLKTNVQSITKLTLKRYRMKLLGQKV 153
Query: 283 ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYA 325
+N I P + HW+ +++T+ T+ F+ S+
Sbjct: 154 TFGKNMVILPFDYLCAKHWD--TGVIDKTSNTFTIHHFKGSWV 194
>gi|443696091|gb|ELT96871.1| hypothetical protein CAPTEDRAFT_203237 [Capitella teleta]
Length = 363
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 66/146 (45%), Gaps = 21/146 (14%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHLDKDHWLSG 242
SDL R TL GG YLDLD IV++S + L N++ G E+ V +GV+ ++
Sbjct: 196 SDLERVTTLMDKGGIYLDLDVIVVRSFDPLRNHSCTVGLETRDKVNSGVLVCHRNSPFLR 255
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
L ++K W N V L AE P+ Y H + P R P
Sbjct: 256 LWLEHYIADYKVWMWNYNAGWVPAYL--AERYPE-YIH---------LEPNRLQRPN--- 300
Query: 303 HWADYLNETNAPATMSLFRDSYALHV 328
D ++ P + + +RD+YA+H+
Sbjct: 301 --GDEIDMIWGPESFN-WRDNYAVHL 323
>gi|358058647|dbj|GAA95610.1| hypothetical protein E5Q_02266 [Mixia osmundae IAM 14324]
Length = 1109
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 21/173 (12%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKS-LESLHNYAGAESSSVVAAGVIHLDKDHWLS 241
SH++D LR+ L++YGGTYLD+D ++++ + + GA+ S+ LD+D
Sbjct: 751 SHLTDFLRFSFLHRYGGTYLDMDAPIVRAPPDPAMEFIGADYSTEAEDLSWTLDEDRMYL 810
Query: 242 GAALRELRDNF----KTTEWGANG-------PGVLTRLLKAECKPQSYAHNIISCRNFTI 290
+ R + + +E +G V R + + KP+ + + TI
Sbjct: 811 APGVMRFRRGWTMFREISEHAFSGIYSPECFNCVGPRAITSYIKPRRRQYEL---GGLTI 867
Query: 291 YPPRFFYPVHWEHWADYLNETNAPATMSL-----FRDSYALHVWNSFTKRVPV 338
P YP +W H A L E L R+S+++H++ T + +
Sbjct: 868 LPSNILYPKNWVH-ARELVEVRDRYVAELELREISRESWSIHLFGKMTNHLKI 919
>gi|198426779|ref|XP_002122051.1| PREDICTED: hypothetical protein [Ciona intestinalis]
Length = 538
Score = 45.8 bits (107), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 9/93 (9%)
Query: 177 TSLW-----PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVA 228
T++W + H SD+ R L + GG Y+D D +V+KSL+SL N G E+ +A
Sbjct: 351 TTIWGHPVQKVEHQSDVARLEILLETGGIYMDDDVVVLKSLDSLRNNEMVLGEENYDALA 410
Query: 229 AGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+I + W +D F T+W +
Sbjct: 411 NSIIMASPNSWFLKKWFTYYKD-FNDTKWSESS 442
>gi|265762939|ref|ZP_06091507.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|383117700|ref|ZP_09938443.1| hypothetical protein BSHG_0154 [Bacteroides sp. 3_2_5]
gi|251946953|gb|EES87235.1| hypothetical protein BSHG_0154 [Bacteroides sp. 3_2_5]
gi|263255547|gb|EEZ26893.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 239
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 72/177 (40%), Gaps = 23/177 (12%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT++A W +SD R+ LY GG YLD D ++KSL H+++G ES
Sbjct: 51 YTKEAAAKGKWAF--VSDYARFYILYHNGGIYLDTDVQILKSLNPFLKHHSFSGFESKDR 108
Query: 227 VAAGVI-HLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHN--II 283
VA G+I +K L + + + G + + E + N +
Sbjct: 109 VAPGLILGAEKGCSLMKKLMNSYHERHFINQNGLLNEKTVVSYMTEELVSEGLILNGELQ 168
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH----VWNSFTKRV 336
+ +F +YP FF P E + + ++Y++H W S T R+
Sbjct: 169 NIHDFIVYPIDFFSPKSLE-----------TGKLKITSNTYSIHHYAGSWMSNTSRL 214
>gi|300926478|ref|ZP_07142273.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Escherichia coli MS 182-1]
gi|422956597|ref|ZP_16969071.1| hypothetical protein ESQG_00566 [Escherichia coli H494]
gi|432719151|ref|ZP_19954120.1| hypothetical protein WCK_02772 [Escherichia coli KTE9]
gi|450218039|ref|ZP_21895755.1| glycosyltransferase family protein [Escherichia coli O08]
gi|300417497|gb|EFK00808.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Escherichia coli MS 182-1]
gi|371599513|gb|EHN88298.1| hypothetical protein ESQG_00566 [Escherichia coli H494]
gi|431262963|gb|ELF54952.1| hypothetical protein WCK_02772 [Escherichia coli KTE9]
gi|449318072|gb|EMD08148.1| glycosyltransferase family protein [Escherichia coli O08]
Length = 218
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 162 QNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---Y 218
N+P+ Y ++A+ + + +D +R+ L GG YLD D +IK + L N +
Sbjct: 42 DNSPVSVPYVKNALEDKRYAFA--ADYVRFYALKTMGGVYLDTDMELIKDISPLLNNKFF 99
Query: 219 AGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSY 278
ES ++ A ++ +KD ++EL+ +T + P +LT + +
Sbjct: 100 TAKESEELINAAIMGSEKDGRFVNLVIQELQ--LRTGHTYESIPKILTDIFNTGNGFKDE 157
Query: 279 AHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN-----SFT 333
TIY +FYP + + + NE + +YA+H W +F
Sbjct: 158 T---------TIYDKEYFYP--YNPFDNKRNEVKQLFYCDITPHTYAIHHWQQSWHYTFL 206
Query: 334 KRV 336
+RV
Sbjct: 207 ERV 209
>gi|312621481|ref|YP_004023094.1| glycosyltransferase sugar-binding region containing dxd motif
[Caldicellulosiruptor kronotskyensis 2002]
gi|312201948|gb|ADQ45275.1| glycosyltransferase sugar-binding region containing DXD motif
[Caldicellulosiruptor kronotskyensis 2002]
Length = 242
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 12/121 (9%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
+SD +R +LY YGG Y+D D V+K+L+ + G ES++ G++ +K H
Sbjct: 63 VSDYVRIWSLYNYGGIYMDTDVEVLKNLDRFLCHRFFTGFESNNFAVTGIMGSEKGH--- 119
Query: 242 GAALRELRDNFKTTEWGAN----GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFY 297
L+EL + + ++ AN +++ L+K + + ++ + IYP +F
Sbjct: 120 -PFLKELLEWYNGQDFDANKLKTNTQIISNLMKKYGLELNGKYQVLE-HDMHIYPKEWFC 177
Query: 298 P 298
P
Sbjct: 178 P 178
>gi|262037139|ref|ZP_06010631.1| glycosyltransferase [Leptotrichia goodfellowii F0264]
gi|261748827|gb|EEY36174.1| glycosyltransferase [Leptotrichia goodfellowii F0264]
Length = 254
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 70/163 (42%), Gaps = 25/163 (15%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL------ESLHNYAGAESSSVVAAGV 231
LW ++++D R LYK GG Y+D+D ++K E++ + G ES + G+
Sbjct: 65 KLW--AYVADYARVHYLYKTGGIYMDIDMEIVKDFSELTQNENIDFFTGFESDDGIGMGL 122
Query: 232 IHLDKDHWLSGAALRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRN- 287
+ L+E+ D ++ W ++ P V +LK++ + +N
Sbjct: 123 FGVSP----QSKFLKEIMDFYEDEIWKSSLFTIPQVTKHILKSKFSCDLSKKEVKDDKNG 178
Query: 288 FTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN 330
IYP FYP +L +M + +YA+H WN
Sbjct: 179 IYIYPKESFYP--------FLPHEKFSESM-ITDKTYAIHWWN 212
>gi|443719224|gb|ELU09498.1| hypothetical protein CAPTEDRAFT_200369 [Capitella teleta]
Length = 444
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 46/101 (45%), Gaps = 4/101 (3%)
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSVVAA 229
+ I +W S S++ R V L +YGG YLDLD +++KS + L Y G E+ V
Sbjct: 267 EPIKNPVWGTSQ-SNVDRLVVLMEYGGIYLDLDVLIVKSFDPLRKYPCTVGLENPERVCG 325
Query: 230 GVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLK 270
G+I D + ++K W N V T L +
Sbjct: 326 GIIVCAADSLFLNLWMEHYIFDYKIWTWAYNSGLVPTHLAR 366
>gi|363581952|ref|ZP_09314762.1| glycosyl transferase family protein [Flavobacteriaceae bacterium
HQM9]
Length = 253
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 35/60 (58%), Gaps = 4/60 (6%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKD-HWLS 241
+D +R +Y YGG YLD D V+K+ ++L + + GAE V+ AGV +K+ WL
Sbjct: 65 ADFIRLYAVYHYGGIYLDTDIEVVKNFDNLLDRPYFIGAEGEGVIEAGVFGAEKNADWLK 124
>gi|443709458|gb|ELU04130.1| hypothetical protein CAPTEDRAFT_219581 [Capitella teleta]
Length = 296
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSVVAAGVIHLDKDHW 239
H +D+ R L ++GG YLDLD I+++S +SL + GAES ++ +G+I D
Sbjct: 130 EHKADVARIQVLLRHGGIYLDLDVIILRSFDSLLKHDVTMGAESPDLLGSGLILSKADST 189
Query: 240 LSGAALRELRDNFKTTEW 257
L R NF +W
Sbjct: 190 FLKLWLETYR-NFTDGDW 206
>gi|313243629|emb|CBY42303.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 156 DLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
D + TP + + ++ LTS+ H SD+ R L K+GG Y+D D +++KSL+ L
Sbjct: 405 DKLKILPRTPPNSIWGKE--LTSV---EHQSDITRLHILLKFGGIYIDDDVLILKSLDEL 459
Query: 216 HNYA---GAESSSVVAAGVIHLDKDHWLSGAALRELR 249
+ G E+ +A +I K+ W L E R
Sbjct: 460 RSKEIVLGEENYDALANSIILAKKNTWFMKRWLWEYR 496
>gi|451818014|ref|YP_007454215.1| mannosyltransferase OCH1 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
gi|451783993|gb|AGF54961.1| mannosyltransferase OCH1 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
Length = 242
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 69/152 (45%), Gaps = 15/152 (9%)
Query: 162 QNTPLHGF-YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HN 217
+N + F +TQ A W ++D R LY YGG YLD D V+K+L+
Sbjct: 44 KNFKIEEFDFTQKAYKAKKWAF--VADYCRIWVLYNYGGIYLDTDMEVLKTLDEFITNEA 101
Query: 218 YAGAESSSVVAAGVIHLD-KDHWLSGAALRELRDNFKTTEWGAN-GPGVLTRLLKAECKP 275
+AG E ++ A ++ K+ ++ + +N ++ + G + ++
Sbjct: 102 FAGIERDKIINAAILGAKAKNDFIKNVI--DYYNNINFIDYLDDLGKMAIPLIITELATH 159
Query: 276 QSYAHN--IISCRN-FTIYPPRFFYPVH--WE 302
+ Y N I + RN TIYP +FYP + WE
Sbjct: 160 EGYKGNKKIENVRNIITIYPKEYFYPKNHSWE 191
>gi|404449067|ref|ZP_11014058.1| mannosyltransferase OCH1-like enzyme [Indibacter alkaliphilus LW1]
gi|403765171|gb|EJZ26053.1| mannosyltransferase OCH1-like enzyme [Indibacter alkaliphilus LW1]
Length = 252
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 61/149 (40%), Gaps = 24/149 (16%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSVVAAGVIHLDKDHWLS 241
++D+ R LY+ GG YLD D +V+K L ++ E +++AG+I +
Sbjct: 64 VTDVCRLHALYQEGGIYLDTDMLVLKDFTHLMDFDFIISEEKEGLISAGIIGSTPRN--- 120
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW 301
L +L +K + N P + L + ++ IYP +FYP+ +
Sbjct: 121 -PTLSDLLSQYKVLRFNINSPLDIPSFLTSN----------LNRNKIKIYPAAYFYPLPF 169
Query: 302 EHWADYLNETNAPATMSLFRDSYALHVWN 330
P T YA+H+WN
Sbjct: 170 SKKGQDYKSYIHPET-------YAVHLWN 191
>gi|443707265|gb|ELU02943.1| hypothetical protein CAPTEDRAFT_197966 [Capitella teleta]
Length = 957
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 40/174 (22%)
Query: 166 LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAE 222
L+G +D + + SH S++ R L +YGG YLDLD +V KS + L + G E
Sbjct: 212 LYGEKIKDPVFYT----SH-SNVDRLKILTEYGGIYLDLDVLVTKSFDDLRQHDCVVGLE 266
Query: 223 SSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNI 282
+ +I +K+ L D+++ EW N V RL K
Sbjct: 267 LPDRICNSIILCNKESPYLMMWLNSFLDDYQVEEWSYNSGKVPYRLAK------------ 314
Query: 283 ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSL-------FRDSYALHVW 329
R+ VH E AD +N N + +R++YA+H+W
Sbjct: 315 -----------RYPNLVHVE--ADTMNRPNFKELHKIWGSVIYDWRNNYAIHLW 355
>gi|406988495|gb|EKE08478.1| hypothetical protein ACD_17C00148G0003, partial [uncultured
bacterium]
Length = 257
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 22/132 (16%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH----NYAGAE------SSSVVAAGV 231
+ SD+LRY L ++GG Y+D DF I+ +SLH Y G E + + + +
Sbjct: 133 VGEKSDILRYEILSQFGGLYVDTDFECIQPFDSLHFLCDFYVGLETPHAKDTEAAIGNAL 192
Query: 232 IHLDKDHWLSGAALREL--RDNFKTTE--WGANGPGVLTRLLKAECKPQSYAHNIISCRN 287
I H + A L + + KT + +GPG L CK S HNI
Sbjct: 193 IGSIPHHPILQACLHRIAKKKAGKTADEVQDISGPGCLRYAFFKCCKKNSL-HNI----- 246
Query: 288 FTIYPPRFFYPV 299
+P FFYP+
Sbjct: 247 --AFPYTFFYPL 256
>gi|404372246|ref|ZP_10977545.1| hypothetical protein CSBG_00443 [Clostridium sp. 7_2_43FAA]
gi|404301234|gb|EEH96817.2| hypothetical protein CSBG_00443 [Clostridium sp. 7_2_43FAA]
Length = 233
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 58/137 (42%), Gaps = 11/137 (8%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
+T++A W +SD R LY YGG YLD D ++K L+ L ++ G E
Sbjct: 53 FTKEAYEDKKWAF--VSDYCRLWVLYNYGGIYLDTDIEILKPLDDLLDNKSFTGVEEEDQ 110
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL-LKAECKPQSYAHNIISC 285
+A G+ + G L E + +E+ N G+ + + K Y N S
Sbjct: 111 IAFGIWGCRRQDKFLGEIL-EYYNLINYSEYKGNLQGLAIPIHITNMAKKLGYIKNQDSI 169
Query: 286 R----NFTIYPPRFFYP 298
+ +YP +FYP
Sbjct: 170 SYYGDDVVVYPKDYFYP 186
>gi|410100604|ref|ZP_11295563.1| hypothetical protein HMPREF1076_04741 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215320|gb|EKN08322.1| hypothetical protein HMPREF1076_04741 [Parabacteroides goldsteinii
CL02T12C30]
Length = 249
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 63/136 (46%), Gaps = 7/136 (5%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSV 226
Y ++A W ++D++R + GG Y+D D V+KSL+SL +Y +G ES +
Sbjct: 51 YVREAYEARKWAF--ITDVVRLYAMVTEGGIYMDTDVEVLKSLDSLLSYDAVSGFESETR 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRD-NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISC 285
+ G++ K H L LRE +FK + + +TR+ K +N
Sbjct: 109 IPTGLMACRKGHPLFEELLREYDGIHFKRFDGSLDMTTNVTRITNTCLKYGFVPNNQKQT 168
Query: 286 RN-FTIYPPRFFYPVH 300
N FT+ P + P+
Sbjct: 169 VNGFTLLPKDYLCPIE 184
>gi|421613139|ref|ZP_16054228.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica SH28]
gi|408496019|gb|EKK00589.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica SH28]
Length = 261
Score = 44.7 bits (104), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 31/161 (19%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAES---------SSVVAAGVI 232
+D +RY L++YGG Y+DLDFI +KSL+ L + G E +++ ++
Sbjct: 69 ADAIRYFILHRYGGMYVDLDFICLKSLDPLLDGRACVVGMEPPQHCRHHRVPNLLCNALM 128
Query: 233 HLDKDHWLSGAALRELRDNFKTTE-----WGANGPGVLTRLLKAECKPQSYAHNIISCRN 287
H +R L + E + GP ++TR+L P S
Sbjct: 129 AAASGHPFFERVIRRLPEFVHHVENNEPILSSTGPIMMTRVLADLETPVS---------- 178
Query: 288 FTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHV 328
T+ P RF YP+ A Y + +S D++A+H+
Sbjct: 179 VTVLPSRFLYPLTMHQAAQYRLAGHVGVDLS---DAFAIHL 216
>gi|32475986|ref|NP_868980.1| surface protein Sur1 [Rhodopirellula baltica SH 1]
gi|417300261|ref|ZP_12087483.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica WH47]
gi|32446529|emb|CAD76365.1| similar to surface protein Sur1 [Rhodopirellula baltica SH 1]
gi|327543456|gb|EGF29878.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica WH47]
Length = 261
Score = 44.7 bits (104), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 31/161 (19%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAES---------SSVVAAGVI 232
+D +RY L+++GG Y+DLDFI +KSL+ L + G E +++ ++
Sbjct: 69 ADAIRYFILHRFGGMYVDLDFICLKSLDPLLDGRACVVGMEPPQHCRHHRVPNLLCNALM 128
Query: 233 HLDKDHWLSGAALRELRDNFKTTE-----WGANGPGVLTRLLKAECKPQSYAHNIISCRN 287
H +R L + E + GP ++TR+L P+S
Sbjct: 129 AAAPGHPFFERVIRRLPEFVHHVENNEPILSSTGPIMMTRVLADLETPES---------- 178
Query: 288 FTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHV 328
T+ P RF YP+ A Y + +S D++A+H+
Sbjct: 179 VTVLPSRFLYPLTMHQAAQYRLAGHVGVDLS---DAFAIHL 216
>gi|358347757|ref|XP_003637919.1| hypothetical protein MTR_111s0003, partial [Medicago truncatula]
gi|355503854|gb|AES85057.1| hypothetical protein MTR_111s0003, partial [Medicago truncatula]
Length = 160
Score = 44.7 bits (104), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 56/155 (36%), Gaps = 47/155 (30%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKT 254
YK+GG Y+D D I++KS N GA++ V K
Sbjct: 28 YKFGGIYIDADIIIMKSFSKFRNTIGAQNIDV--------------------------KN 61
Query: 255 TEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETN-- 312
E + +L+ + Y NF++ PP FYPV W
Sbjct: 62 KEMES-----FEQLVSRVSGREGY--------NFSVVPPSAFYPVDWRGIKSLFRGPGDE 108
Query: 313 ------APATMSLFRDSYALHVWNSFTKRVPVKLG 341
+ + ++SYA+H+WN + ++ V G
Sbjct: 109 IHSKWLVKKMVQIRKESYAVHLWNRQSGKLEVVKG 143
>gi|440718930|ref|ZP_20899366.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica SWK14]
gi|436435750|gb|ELP29570.1| mannosyl phosphorylinositol ceramide synthase SUR1 [Rhodopirellula
baltica SWK14]
Length = 261
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 31/161 (19%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAES---------SSVVAAGVI 232
+D +RY L+++GG Y+DLDFI +KSL+ L + G E +++ ++
Sbjct: 69 ADAIRYFILHRFGGMYVDLDFICLKSLDPLLDGRACVVGMEPPQHCRHHRVPNLLCNALM 128
Query: 233 HLDKDHWLSGAALRELRDNFKTTE-----WGANGPGVLTRLLKAECKPQSYAHNIISCRN 287
H + +R L + E + GP ++TR+L P S
Sbjct: 129 AAAPGHPFFESVIRRLPEFVHHVENNEPILSSTGPIMMTRVLADLETPAS---------- 178
Query: 288 FTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHV 328
T+ P RF YP+ A Y + +S D++A+H+
Sbjct: 179 VTVLPSRFLYPLTMHQAAQYRLAGHVGVDLS---DAFAIHL 216
>gi|326435635|gb|EGD81205.1| hypothetical protein PTSG_11240 [Salpingoeca sp. ATCC 50818]
Length = 1023
Score = 44.7 bits (104), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 117/294 (39%), Gaps = 51/294 (17%)
Query: 75 AFDDITRL---DVPDN--------SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
AFD + RL VPD +IFF+ TS H D R A +ESA + P +
Sbjct: 544 AFDAVERLLEGGVPDGCDPTSDICNIFFVWTS--HRDTWSFLNRLA--VESALRVFPRAR 599
Query: 124 VYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL- 182
V +V N + L V++ ++ R + G + + A+ L
Sbjct: 600 VIIV-----SNTLPVTFFNSLQASHRVYVWRIVPTRLVRAGVAGGRWLRAALREQGPHLP 654
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH----NYAG--------------AESS 224
+H SD LRYV LYKYGG + D D + + + H N+ G +S+
Sbjct: 655 THQSDFLRYVVLYKYGGLFSDTDLVWLDASPLAHAIGRNFLGKIDSRPILPRCPWCVDST 714
Query: 225 SVVAAGVIHLDKDHWLSGAALRELRD-NFKTTEWGANGPGVLTRLLKAECKPQSYAHNII 283
+A GV+ H + + L ++ + ++ A GP ++T+ A P ++I
Sbjct: 715 WYLANGVLRFQARHQMLASILGQIDTLAYDRSDRLAIGPHLVTKTFNAMQDP-----SVI 769
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
++P +H Y+ A M S A+HV+ + K P
Sbjct: 770 LIDEHVLFPLPGPAVLH------YMMHIPPRARMQHMLSSAAVHVFEATYKHAP 817
>gi|443695949|gb|ELT96741.1| hypothetical protein CAPTEDRAFT_196653 [Capitella teleta]
Length = 341
Score = 44.3 bits (103), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 4/92 (4%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHLDKDHW 239
+ SDL R L ++GG YLD+D +VIKS + L Y G E+ V G+I D
Sbjct: 173 TSQSDLDRLAILMEHGGIYLDMDVLVIKSFDPLRKYPCTIGLENPQRVCGGIIVCAADSV 232
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKA 271
+ ++K W A G++ R + A
Sbjct: 233 FLNLWIEHFLFDYKMWTW-AYMSGIVPRKIAA 263
>gi|365834115|ref|ZP_09375563.1| hypothetical protein HMPREF0454_00381 [Hafnia alvei ATCC 51873]
gi|364570304|gb|EHM47920.1| hypothetical protein HMPREF0454_00381 [Hafnia alvei ATCC 51873]
Length = 234
Score = 44.3 bits (103), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 61/146 (41%), Gaps = 13/146 (8%)
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLSGA 243
D +R LY+YGG YLD D +IK + L + G E + G+I K
Sbjct: 75 DYIRCKILYEYGGIYLDSDIELIKDISILCENEAFVGLEKIGRASCGIIGAKKGF----D 130
Query: 244 ALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEH 303
+ L + + P +LT++L S + I TIYP +FYP +
Sbjct: 131 TMLHLMNEVVKAKGLVEMPVLLTKVLGEH--DFSLGNEIQFIGKMTIYPVHYFYP--YNP 186
Query: 304 WADYLNETNAPATMSLFRDSYALHVW 329
+ N+ + + D+YA+H W
Sbjct: 187 YGG--NDVDQLMYKDVKEDTYAIHHW 210
>gi|405960923|gb|EKC26793.1| hypothetical protein CGI_10013225 [Crassostrea gigas]
Length = 423
Score = 44.3 bits (103), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 8/69 (11%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSV---VAAGVIHLDKD- 237
+ H +D+ R + L +YGG YLD D ++++SL++L NY S +V ++ G+I +
Sbjct: 255 IEHQADVARLLILKEYGGIYLDTDEVILRSLDNLLNYTFTLSHAVDNNLSNGLILASPNA 314
Query: 238 ----HWLSG 242
HWL G
Sbjct: 315 TFISHWLDG 323
>gi|319902752|ref|YP_004162480.1| glycosyltransferase [Bacteroides helcogenes P 36-108]
gi|319417783|gb|ADV44894.1| glycosyltransferase sugar-binding region containing DXD motif
[Bacteroides helcogenes P 36-108]
Length = 252
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 78/161 (48%), Gaps = 21/161 (13%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAES-SSVVAAGVIHLDKDHWLS 241
+D +R+ LY YGG YLD D ++KS SL N + G E+ S ++ +I +K
Sbjct: 65 ADYIRFYALYHYGGIYLDSDVEMLKSFNSLLNNKTFIGYENVSGLLEPAIIGAEK----G 120
Query: 242 GAALRELRDNFKTTEWGANG----PGVLTRLLKAECK--PQSYAHN--IISCRNFTIYPP 293
+ +++ + +++ + A G P +++ L E P + +I+ + + PP
Sbjct: 121 ASWCKKMMNYYESRHFNAMGMKVAPLIISELFAEEITDYPSTPVRKPTLIAGGDILLCPP 180
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFR---DSYALHVWNS 331
+F P+ + AD + + T+S R ++Y +H +N
Sbjct: 181 EYFSPIKID--ADKNYKKTSKDTISNLRLNPNTYCIHRFNG 219
>gi|182677089|ref|YP_001831235.1| glycosyl transferase family protein [Beijerinckia indica subsp.
indica ATCC 9039]
gi|182632972|gb|ACB93746.1| glycosyltransferase sugar-binding region containing DXD motif
[Beijerinckia indica subsp. indica ATCC 9039]
Length = 712
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 73/187 (39%), Gaps = 32/187 (17%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL--------HNYAGAESSSVVAA--GVI 232
+H +D++R L +YGG YLDLD I ++ L L AG+ + + +
Sbjct: 120 AHKADVVRMAVLCRYGGIYLDLDTICVRPLTPLFRGKPVMGQELAGSHADRPIQGLCNAV 179
Query: 233 HLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYP 292
+ + L E +F +EW N VL + A+ P + TI P
Sbjct: 180 MIAPPNSLFFKTWWEAYRSFDDSEW--NRHSVLLPYVIAQTMPD----------DITILP 227
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS-----FTKRVPVKLGSEQ-PY 346
P F+ W D F D+Y H+W S ++ + SE+ Y
Sbjct: 228 PSAFFEPTW----DEAGLKTLFHADQRFPDAYCHHLWESRSWTDISRFNEDSIRSEETTY 283
Query: 347 AQIARRY 353
IARRY
Sbjct: 284 NHIARRY 290
>gi|404404787|ref|ZP_10996371.1| glycosyl transferase family protein [Alistipes sp. JC136]
Length = 248
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDK 236
+D +R+ LY YGG YLD D V++S + L +AGAE++ + A V+ +K
Sbjct: 65 ADYIRFYALYNYGGIYLDSDVEVLRSFDPLLGLPYFAGAETAGTIEAAVLGAEK 118
>gi|302835884|ref|XP_002949503.1| hypothetical protein VOLCADRAFT_89803 [Volvox carteri f.
nagariensis]
gi|300265330|gb|EFJ49522.1| hypothetical protein VOLCADRAFT_89803 [Volvox carteri f.
nagariensis]
Length = 342
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/112 (25%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
Query: 163 NTPLHGFYTQDA--ILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIK--SLESLHNY 218
TPL +Y +A + + + SH +DL+R+ +Y++GG YLD D +V++ S + ++
Sbjct: 112 ETPLFDWYQSNATKLRSGRFWFSHATDLMRFALIYRHGGQYLDSDVLVMRPISPDKINKL 171
Query: 219 AGAESSS-VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLL 269
+++ S V++ H L + + +W GP LT +
Sbjct: 172 VRSKADSRYFECAVVYFTARHPFLYDVLMHITQVYNAVDWITAGPKPLTTIF 223
>gi|422671535|ref|ZP_16730901.1| YD repeat-containing protein, partial [Pseudomonas syringae pv.
aceris str. M302273]
gi|330969275|gb|EGH69341.1| YD repeat-containing protein, partial [Pseudomonas syringae pv.
aceris str. M302273]
Length = 298
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV---DLG 158
E ++ IE+ +++NP Q+ + + +R ++ L +NVHI + +
Sbjct: 41 AEPLMKHLGGIENTSVLNPDYQLTLHLHLDLRKGDETEIVAALRGRRNVHISDLREEEFF 100
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLD 205
R FQ TP + Y +D S SD+LRY + YGG Y D D
Sbjct: 101 RKFQKTPSYEIY-KDLYGDDPRHYSAASDVLRYSLINHYGGIYSDTD 146
>gi|335357411|ref|ZP_08549281.1| glycosyltransferase [Lactobacillus animalis KCTC 3501]
Length = 259
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 79/171 (46%), Gaps = 39/171 (22%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL----------HNYA 219
YT A W S +SD L + +Y+YGG YLD D ++++L+ + N +
Sbjct: 51 YTAQAAKVGKW--SFVSDYLGFDVVYRYGGIYLDTDVELLQTLDHILENDAFFAIEENES 108
Query: 220 GAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW-GANG-------PGVLTRLL-K 270
GAE + + G K+H L L++LRD + E+ A+G P T+LL K
Sbjct: 109 GAEVAPGLGFGA---KKNHPL----LKQLRDMYNDLEFIKADGSLNTVAIPVYTTQLLEK 161
Query: 271 AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNE--TNAPATMSL 319
K Q + + TIYP +F P+ D+L T +P T+S+
Sbjct: 162 LGFKRQD---KLQKLGDVTIYPTTYFAPM------DFLTGQITLSPETISI 203
>gi|313216171|emb|CBY37529.1| unnamed protein product [Oikopleura dioica]
Length = 260
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 8/89 (8%)
Query: 164 TPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---G 220
TP + + ++ LTS+ H SD+ R L K+GG YLD D +++KSL+ + G
Sbjct: 74 TPPNSIWGKE--LTSV---EHQSDITRLHILLKFGGIYLDDDVLILKSLDEFRSKEIVLG 128
Query: 221 AESSSVVAAGVIHLDKDHWLSGAALRELR 249
E+ +A +I K+ W L E R
Sbjct: 129 EENYDALANSIILAKKNTWFMKRWLWEYR 157
>gi|423599004|ref|ZP_17575004.1| hypothetical protein III_01806 [Bacillus cereus VD078]
gi|401235988|gb|EJR42454.1| hypothetical protein III_01806 [Bacillus cereus VD078]
Length = 246
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 68/166 (40%), Gaps = 23/166 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT++A + +SD +R LY YGG YLD D V KS + L H++ G E +
Sbjct: 57 YTREAYEKGKYAF--VSDYVRVHVLYYYGGIYLDTDVEVFKSFDDLLHHHSFWGFEQENY 114
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKA-ECKPQSYAHN 281
+A I K + L L D + G ++T +L+ E K
Sbjct: 115 IATSTIGARKGNQLIKVFLDSYTDKKFIKQDGTYDDLTNVAIVTEILQNLELKLNGEYQE 174
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
I F YP +F P DY+N + +D+YA+H
Sbjct: 175 IKGAGVF--YPQTYFSPY------DYIN-----CRKFITKDTYAMH 207
>gi|313224983|emb|CBY20775.1| unnamed protein product [Oikopleura dioica]
Length = 560
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHLDKDH 238
+ H SD+ R L K+GG YLD D +++KSL+ + G E+ +A +I K+
Sbjct: 387 VEHQSDITRLHILLKFGGIYLDDDVLILKSLDEFRSKEIVLGEENYDALANSIILAKKNT 446
Query: 239 WLSGAALRELR 249
W L E R
Sbjct: 447 WFMKRWLWEYR 457
>gi|418072469|ref|ZP_12709740.1| glycosyltransferase [Lactobacillus rhamnosus R0011]
gi|421768419|ref|ZP_16205130.1| polysaccharide biosynthesis protein CpsM [Lactobacillus rhamnosus
LRHMDP2]
gi|421773534|ref|ZP_16210177.1| polysaccharide biosynthesis protein CpsM [Lactobacillus rhamnosus
LRHMDP3]
gi|423079807|ref|ZP_17068475.1| hypothetical protein HMPREF0541_02495 [Lactobacillus rhamnosus ATCC
21052]
gi|9909364|gb|AAG01982.1|AF238861_2 hypothetical protein [Lactobacillus rhamnosus]
gi|357537246|gb|EHJ21272.1| glycosyltransferase [Lactobacillus rhamnosus R0011]
gi|357545614|gb|EHJ27583.1| hypothetical protein HMPREF0541_02495 [Lactobacillus rhamnosus ATCC
21052]
gi|411181862|gb|EKS49022.1| polysaccharide biosynthesis protein CpsM [Lactobacillus rhamnosus
LRHMDP3]
gi|411186092|gb|EKS53217.1| polysaccharide biosynthesis protein CpsM [Lactobacillus rhamnosus
LRHMDP2]
Length = 246
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----------------YAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L N + GAES +
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLNQKMVWGFQYDNSILTSFIGAESKT 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G +L+D+ + +L RL + + Q A I
Sbjct: 120 PLLAEILDV-----YAGRKFADLQDDMQKMTSNPVVTKILMRLYPDFRMNGQRQELAPGI 174
Query: 283 ISCRNFTIYPPRFF 296
T+YP +F
Sbjct: 175 ------TVYPRDYF 182
>gi|348685520|gb|EGZ25335.1| hypothetical protein PHYSODRAFT_486818 [Phytophthora sojae]
Length = 358
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 57/139 (41%), Gaps = 33/139 (23%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----------------YAGAESSSVVA 228
++D+ RY L+ GG Y+D DF +K E LH GA SS++
Sbjct: 138 LADMARYALLHSVGGLYIDADFECLKPFEELHRDNNLFLSTEPLAHSVLLEGATSSALCN 197
Query: 229 AGVIHL-DKDHWLSGAALRELRDNFKTT-EWGANGPGVLT--RLLKAECKPQSYAHNIIS 284
A + + WL ++ DN K G P LT R++K ++Y
Sbjct: 198 ALMASVPGHPFWL------QVLDNIKVKFAEGVRDPVSLTGPRIVK-----ETYLTATPE 246
Query: 285 CRNFTIYPPRFFYP--VHW 301
+YPP +FYP HW
Sbjct: 247 AAEVIVYPPEYFYPDIAHW 265
>gi|228934993|ref|ZP_04097824.1| hypothetical protein bthur0009_34470 [Bacillus thuringiensis
serovar andalousiensis BGSC 4AW1]
gi|228824893|gb|EEM70694.1| hypothetical protein bthur0009_34470 [Bacillus thuringiensis
serovar andalousiensis BGSC 4AW1]
Length = 246
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 67/166 (40%), Gaps = 23/166 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT++A + +SD +R LY YGG YLD D V KS + L H++ G E +
Sbjct: 57 YTREAYEKGKYAF--VSDYVRVHVLYYYGGIYLDTDVEVFKSFDDLLHHHSFWGFEQENY 114
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKA-ECKPQSYAHN 281
+A I K + L L D G ++T +L+ E K
Sbjct: 115 IATSTIGAKKGNQLIKVFLDSYTDKKFIKRDGTYNDLTNVAIVTEILQNLELKLNGQYQE 174
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
I F YP +F P DY+N + +D+YA+H
Sbjct: 175 IKGAGVF--YPQTYFSPY------DYIN-----CRKFITKDTYAMH 207
>gi|384488022|gb|EIE80202.1| hypothetical protein RO3G_04907 [Rhizopus delemar RA 99-880]
Length = 905
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 58/278 (20%), Positives = 108/278 (38%), Gaps = 47/278 (16%)
Query: 96 CTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV 155
C +D T R ++E + PG + +++ S+ + R Y Q V+ +
Sbjct: 557 CWTSDPATFTDRHWRALEMVWIHEPGAAI-IMMSNSLPENFFDAYTRRGYNIQVVNFNKE 615
Query: 156 DLGR---YFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIK-S 211
+L + YF D + H++D +R + LY YGGTY+D+D + I+
Sbjct: 616 NLLKWHWYFGPGTQDWLQEWDRWEQGKFFYWHLTDYIRCLLLYNYGGTYMDMDALWIRVP 675
Query: 212 LESLHNYAGAESSSVVAAGVIHLDKDHWLSGAAL----------------RELRD----- 250
S + G++ SSV H D++ L L RE+ +
Sbjct: 676 PNSSQEFIGSDYSSV------HSDREWTLDDRGLYLPQGLMRFKRGWKLFREMAEGAFSV 729
Query: 251 -NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYP-----VHWEHW 304
N+ + GP +T ++ +++ +I P YP +H
Sbjct: 730 YNYDPECFNCGGPKAITSYVRER-------RSVLEAGGLSILPREVLYPFSYLEIHKLLQ 782
Query: 305 ADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGS 342
+ L E + + S+ +H++ T ++PV+ GS
Sbjct: 783 PNPLAEQDMKTKIEPL--SWNIHLFGKMTNKLPVQSGS 818
>gi|229553199|ref|ZP_04441924.1| glycosyltransferase [Lactobacillus rhamnosus LMS2-1]
gi|258540192|ref|YP_003174691.1| glycosyltransferase [Lactobacillus rhamnosus Lc 705]
gi|385835828|ref|YP_005873603.1| tcdA/TcdB catalytic glycosyltransferase domain protein
[Lactobacillus rhamnosus ATCC 8530]
gi|229313495|gb|EEN79468.1| glycosyltransferase [Lactobacillus rhamnosus LMS2-1]
gi|257151868|emb|CAR90840.1| Glycosyltransferase [Lactobacillus rhamnosus Lc 705]
gi|355395320|gb|AER64750.1| tcdA/TcdB catalytic glycosyltransferase domain protein
[Lactobacillus rhamnosus ATCC 8530]
Length = 246
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----------------YAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L N + GAES +
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLNQKMVWGFQYDNSILTSFIGAESKT 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G +L+D+ + +L RL + + Q A I
Sbjct: 120 PLLAEILDV-----YAGRKFPDLQDDMQKMTSNPVVTKILMRLYPDFRMNGQRQELAPKI 174
Query: 283 ISCRNFTIYPPRFF 296
T+YP +F
Sbjct: 175 ------TVYPRDYF 182
>gi|291230161|ref|XP_002735037.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 415
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 68/153 (44%), Gaps = 29/153 (18%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSVVAAGVIHLDKDHW 239
+H SD+LR L +YGG YLDLD V KS + L Y + ++AG+I KD
Sbjct: 226 NHKSDILRLEVLTQYGGIYLDLDIWVFKSTDHLRYYDYTTNRAGPARLSAGIIFTAKD-- 283
Query: 240 LSGAALRELRDNFK---------TTEWGANGPGVLTR-LLKAE----CKP----QSYAHN 281
LR ++++ T +G N + LL E CK Q Y
Sbjct: 284 --SKFLRLFHESYRDFDMECYACTPIYGQNELAFKNQELLHVELFKFCKKMDGFQVYDDL 341
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAP 314
++ ++++ PR Y +H++HW N+ P
Sbjct: 342 YVNPMDWSL--PR--YAIHFDHWGVTRNQQEFP 370
>gi|209542970|ref|YP_002275199.1| hypothetical protein Gdia_0794 [Gluconacetobacter diazotrophicus
PAl 5]
gi|209530647|gb|ACI50584.1| hypothetical protein Gdia_0794 [Gluconacetobacter diazotrophicus
PAl 5]
Length = 261
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/158 (20%), Positives = 68/158 (43%), Gaps = 17/158 (10%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA--GAESSSVVAAGVIHLDKDHW 239
+ H SD+ RY + K G ++D+D +++ + A E + +++++
Sbjct: 41 MGHFSDIFRY-RMMKTGLAWVDMDVLMMSDDKIFDKPAIVPLEDDRTINGAILYIENVPI 99
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
L ++ +T WG GP +LTR+L + P + + FYP+
Sbjct: 100 LRHLIDETMKSMDRTLRWGETGPLLLTRILFEQMNPSGFTDMAV------------FYPI 147
Query: 300 -HWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRV 336
H++ + L E + RD+ +H++N+ R+
Sbjct: 148 PHYDIYKVLLPEFRDECAEAC-RDAITIHLFNNAIVRM 184
>gi|196038296|ref|ZP_03105605.1| EpsQ [Bacillus cereus NVH0597-99]
gi|196030704|gb|EDX69302.1| EpsQ [Bacillus cereus NVH0597-99]
Length = 246
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 67/166 (40%), Gaps = 23/166 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT++A + +SD +R LY YGG YLD D V KS + L H++ G E +
Sbjct: 57 YTREAYEKGKYAF--VSDYVRVHVLYYYGGIYLDTDVEVFKSFDDLLHHHSFWGFEQENY 114
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKA-ECKPQSYAHN 281
+A I K + L L D G ++T +L+ E K
Sbjct: 115 IATSTIGARKGNQLIKVFLDSYTDKKFIKRDGTYNDLTNVAIVTEILQNLELKLNGQYQE 174
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
I F YP +F P DY+N + +D+YA+H
Sbjct: 175 IKGAGVF--YPQTYFSPY------DYIN-----CRKFITKDTYAMH 207
>gi|403515724|ref|YP_006656544.1| glycosyltransferase in exopolysaccharide biosynthesis
[Lactobacillus helveticus R0052]
gi|403081162|gb|AFR22740.1| glycosyltransferase in exopolysaccharide biosynthesis
[Lactobacillus helveticus R0052]
Length = 175
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 66/158 (41%), Gaps = 30/158 (18%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAES-----SSVVAAGVIHLDK 236
MSD LR L+ YGG YLD D V+K + L + G E+ + +G+I K
Sbjct: 1 MSDYLRLWILFNYGGIYLDTDVEVLKKFDPLLKNEGFIGFEAGDKKIGEYIGSGIIGAQK 60
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
+ +++L D +K W + + + Y + +N TIYP +F
Sbjct: 61 GN----ETIKKLLDFYKEEIWNS------QEYVNTIIYKKIYLRDNNIFKNMTIYPRNYF 110
Query: 297 YPVHWEHWADYLNETNAPATMSLF---RDSYALHVWNS 331
P + AT++ +DSY +H +N+
Sbjct: 111 AP---------YSPCETQATINHVVENKDSYTIHWFNA 139
>gi|317478727|ref|ZP_07937880.1| glycosyltransferase sugar-binding region DXD domain-containing
protein [Bacteroides sp. 4_1_36]
gi|316905062|gb|EFV26863.1| glycosyltransferase sugar-binding region DXD domain-containing
protein [Bacteroides sp. 4_1_36]
Length = 243
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 63/149 (42%), Gaps = 16/149 (10%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAE--SSSVVAAGVIHLDKDHW 239
+SD +R LY+YGG YLD D ++KS L + G E +V+A+ VI + H
Sbjct: 64 VSDYVRLKALYEYGGIYLDTDIKILKSFNPLLKNEGFIGFEDVKGNVIASCVIAAKQLHP 123
Query: 240 LSGAALRELRDNFKTTEWGANGPGVL---TRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
++ +F N V+ RL+K + I IYP +F
Sbjct: 124 FIQECMQYYNQDFTIEIINKNEANVIDITQRLIKKGMQLGGGEQVI---NEMHIYPREYF 180
Query: 297 YPVH-WEHWADYLNETNAPATMSLFRDSY 324
P+ W +W N+T + LF S+
Sbjct: 181 CPMDFWGNW----NKTANTYCIHLFNGSW 205
>gi|326437845|gb|EGD83415.1| hypothetical protein PTSG_04023 [Salpingoeca sp. ATCC 50818]
Length = 1759
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 82/189 (43%), Gaps = 26/189 (13%)
Query: 75 AFDDITRL---DVPDN--------SIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQ 123
AFD + RL VPD +IFF+ TS H D R A +ESA + P +
Sbjct: 538 AFDAVERLLEGGVPDGCNPTSDICNIFFVWTS--HRDTWSFLNRLA--VESALRIFPRAR 593
Query: 124 VYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPL- 182
V +V N + L V++ ++ R + G + + A+ L
Sbjct: 594 VIIV-----SNTLPVTFFNSLQASHRVYVWRIVPTRLVRAGVAGGRWLRAALREQGPHLP 648
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH----NYAGA-ESSSVVAAGVIHLDKD 237
+H SD LRYV LYKYGG + D D + + + H N+ G +S ++A +D
Sbjct: 649 THQSDFLRYVVLYKYGGLFSDTDLVWLDASPLAHAIGRNFLGKIDSRPILARCPWCVDST 708
Query: 238 HWLSGAALR 246
+L+ LR
Sbjct: 709 WYLANGVLR 717
>gi|451927416|gb|AGF85294.1| hypothetical protein glt_00485 [Moumouvirus goulette]
Length = 247
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVA-------AGVIHLD 235
+ +D RY LY YGG YLD+D + K+LESL Y + +V +I
Sbjct: 68 AQKADFARYAILYTYGGIYLDMDMVCRKNLESLLQYNFFLTPQIVPLFTKSYLNAIIGSR 127
Query: 236 KDHWLSGAALREL--RDNFKTTEWGANGPGVL 265
K+H + + + R N K + G G+
Sbjct: 128 KNHPIFKIIFKNIFFRKNLKNNIRASTGTGLF 159
>gi|66044304|ref|YP_234145.1| YD repeat-containing protein [Pseudomonas syringae pv. syringae
B728a]
gi|63255011|gb|AAY36107.1| YD repeat protein [Pseudomonas syringae pv. syringae B728a]
Length = 927
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 102 VELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV---DLG 158
E ++ IE+ +++NP Q+ + + +R ++ L +NVHI + +
Sbjct: 670 AEPLMKHLGGIENTSVLNPDYQLTLHLHLDLRKGDETEIVAALRGRRNVHISDLREEEFF 729
Query: 159 RYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL 212
R FQ TP + Y +D S SD+LRY + YGG Y D D + + +
Sbjct: 730 RKFQKTPSYEIY-KDLYGDDPRHYSAASDVLRYSLINHYGGIYSDTDDMFKRGV 782
>gi|315225042|ref|ZP_07866860.1| polysaccharide biosynthesis protein CpsM(V) [Capnocytophaga
ochracea F0287]
gi|420158854|ref|ZP_14665666.1| hypothetical protein HMPREF1319_0140 [Capnocytophaga ochracea str.
Holt 25]
gi|314945017|gb|EFS97048.1| polysaccharide biosynthesis protein CpsM(V) [Capnocytophaga
ochracea F0287]
gi|394763093|gb|EJF45239.1| hypothetical protein HMPREF1319_0140 [Capnocytophaga ochracea str.
Holt 25]
Length = 240
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 87/189 (46%), Gaps = 30/189 (15%)
Query: 162 QNTPLHGF-YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN--- 217
N PL F Y + A+ + ++++D++R L +YGG Y+D D V+KSL+ +
Sbjct: 42 NNFPLDDFPYAKQALEAKKY--AYITDIVRLYVLKEYGGIYMDTDVEVLKSLDIFLDEPA 99
Query: 218 YAGAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGP------GVLTRLLKA 271
++G E++ ++ G++ K + L+ E N P ++T L+K
Sbjct: 100 FSGFENNKLLPTGLMGAKKGSKWAKDMLKYYEGKSFLDE--RNNPILIPNTIIITNLMKN 157
Query: 272 ECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH---- 327
+ + + I T YP +F P DY+ E ++L +++Y +H
Sbjct: 158 KGILINNTYQKIEGY-VTFYPSEYFCP------KDYVTE-----EINLTKNTYCIHHFSG 205
Query: 328 VWNSFTKRV 336
W S+++R+
Sbjct: 206 SWRSYSQRL 214
>gi|402493404|ref|ZP_10840157.1| glycosyl transferase family protein [Aquimarina agarilytica ZC1]
Length = 259
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKD-HWLS 241
+D +R +Y YGG Y+D D V+KS + L + G E ++ AG+ +K+ WL
Sbjct: 65 ADFIRLYAVYNYGGIYMDTDIEVVKSFDDLLKRPYFIGTEGKGIIEAGIFGAEKNADWLK 124
>gi|423260760|ref|ZP_17241662.1| hypothetical protein HMPREF1055_03939 [Bacteroides fragilis
CL07T00C01]
gi|423266897|ref|ZP_17245879.1| hypothetical protein HMPREF1056_03566 [Bacteroides fragilis
CL07T12C05]
gi|387774521|gb|EIK36631.1| hypothetical protein HMPREF1055_03939 [Bacteroides fragilis
CL07T00C01]
gi|392699431|gb|EIY92609.1| hypothetical protein HMPREF1056_03566 [Bacteroides fragilis
CL07T12C05]
Length = 237
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 16/129 (12%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLE---SLHNYAGAESSSVVAAGVIHLDKDHW 239
+ ++D +R LY+ GG YLD D + KSL S + G E + +++ GV+ K++
Sbjct: 49 AFVADYVRLKVLYEQGGIYLDTDIQLYKSLNPFLSNEAFMGFERNEILSMGVMGFKKNN- 107
Query: 240 LSGAALRELRD------NFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISCRNFTIYP 292
++EL D N + + T+ L + ++ A II N IYP
Sbjct: 108 ---KIIKELLDYYDQEFNLNIVNKLESNANITTQFLSEKYGLSRNNAKQIIE--NINIYP 162
Query: 293 PRFFYPVHW 301
FF P+ +
Sbjct: 163 KTFFNPMDY 171
>gi|321471396|gb|EFX82369.1| hypothetical protein DAPPUDRAFT_223867 [Daphnia pulex]
Length = 239
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 3/89 (3%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA---ESSSVVAAGVI 232
L+ W H SD+ R L +YGG YLD D VI+SL+ + A + + +
Sbjct: 54 LSDGWRFHHGSDIGRIRVLMQYGGIYLDNDCFVIRSLDKYRKFECALNWDDKQFMGTQTL 113
Query: 233 HLDKDHWLSGAALRELRDNFKTTEWGANG 261
K+ L +DN+++ +W N
Sbjct: 114 IAHKNSRFIKLWLESYKDNYRSDKWYYNA 142
>gi|302528592|ref|ZP_07280934.1| predicted protein [Streptomyces sp. AA4]
gi|302437487|gb|EFL09303.1| predicted protein [Streptomyces sp. AA4]
Length = 5049
Score = 42.7 bits (99), Expect = 0.25, Method: Composition-based stats.
Identities = 38/151 (25%), Positives = 67/151 (44%), Gaps = 16/151 (10%)
Query: 120 PGVQVYVVVIASVRNRTRNPLID--RLYEYQNVHIVQ-VDLGRYFQNTPLHGFYTQDAIL 176
P Q + + + +PL D + + + H VQ +++ F T H TQ+ L
Sbjct: 4280 PRWQTELALTTAAPEEGPDPLHDVRDMLRWASEHRVQLINVDEVF--TSEHPMRTQEFYL 4337
Query: 177 TSLWP-----LSHMSDLLRYVTLYKYGGTYLDLDFIV------IKSLESLHNYAGAESSS 225
+ L + SD+LR ++++GG Y D D +V +++ ES YA
Sbjct: 4338 SELAKQTGPGFAAASDILRLELMHRFGGLYTDGDNVVHALDDVVRAAESDAGYAVHRVGG 4397
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTE 256
+A + K H + + L +LRDN+ T+
Sbjct: 4398 NIANSAFTMSKGHPFAASYLDQLRDNYGQTQ 4428
>gi|283797668|ref|ZP_06346821.1| polysaccharide biosynthesis protein CpsM [Clostridium sp. M62/1]
gi|291074674|gb|EFE12038.1| hypothetical protein CLOM621_07701 [Clostridium sp. M62/1]
Length = 237
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT++A W + ++D +R LY+ GG Y+D D VIK L+ L ++G E
Sbjct: 51 YTKEAYEEKKW--AFITDYVRLKVLYESGGIYMDTDVEVIKPLDDLLTEPGFSGFELPDK 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRD 250
V G++ +K++ G L +D
Sbjct: 109 VPTGIMACEKENKFIGELLSLYKD 132
>gi|149196384|ref|ZP_01873439.1| putative glycosyltransferase [Lentisphaera araneosa HTCC2155]
gi|149140645|gb|EDM29043.1| putative glycosyltransferase [Lentisphaera araneosa HTCC2155]
Length = 227
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 19/150 (12%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHLDKDHWLS 241
+SD R LY+ GG YLD D V+KS + L N G E+ + +A I + H
Sbjct: 64 VSDYARGHALYEMGGIYLDTDVEVLKSFDDLLNNRSLWGFEAGNYIATSTIGSEAKHEYI 123
Query: 242 GAALRELRD-NFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFY 297
A L + +D NF ++ + V+T LK + + +I N + P RFF
Sbjct: 124 KAYLEQYKDRNFLNSDNSLDLTTNVRVVTNYLKNKGLSLNGTKQVIEVDNIVL-PERFFS 182
Query: 298 PVHWEHWADYLNETNAPATMSLFRDSYALH 327
P M+ +DSY +H
Sbjct: 183 PY-----------DPRVGRMTRSKDSYTIH 201
>gi|153807294|ref|ZP_01959962.1| hypothetical protein BACCAC_01572 [Bacteroides caccae ATCC 43185]
gi|149130414|gb|EDM21624.1| hypothetical protein BACCAC_01572 [Bacteroides caccae ATCC 43185]
Length = 244
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAE-SSSVVAAGVIHLDKDHWLS 241
SD +R LY YGG YLD D V+KS + L + + G E S SV+ A I +K H L
Sbjct: 65 SDYIRAYALYNYGGIYLDCDVEVLKSFDDLLHLPYFIGKEKSESVIEAATIGFEKGHPLM 124
Query: 242 GAALRELRD 250
L +D
Sbjct: 125 KYLLDYYQD 133
>gi|417837730|ref|ZP_12483968.1| polysaccharide biosynthesis protein CpsM [Lactobacillus johnsonii
pf01]
gi|338761273|gb|EGP12542.1| polysaccharide biosynthesis protein CpsM [Lactobacillus johnsonii
pf01]
Length = 275
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 17/145 (11%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----------YA 219
Y Q+A W +SD R L +YGG Y D D +IKS++ + +
Sbjct: 53 YVQEAYKAKKWAF--VSDYARIAVLAEYGGLYFDTDMELIKSIDDIIQDGSFINLEKIFN 110
Query: 220 GAESSSVVAAGVI-HLDK-DHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS 277
G ++ + A GV+ HL+ + ++ R N T + G V L+K S
Sbjct: 111 GVQAPGMSAMGVVPHLELFEDLVNIYKKRHFILNDGTYDETPIGSYVNEILIKNGV---S 167
Query: 278 YAHNIISCRNFTIYPPRFFYPVHWE 302
+++NI YP RFF P+ +E
Sbjct: 168 FSNNITKWNKINFYPARFFSPMTFE 192
>gi|189465056|ref|ZP_03013841.1| hypothetical protein BACINT_01400 [Bacteroides intestinalis DSM
17393]
gi|189437330|gb|EDV06315.1| hypothetical protein BACINT_01400 [Bacteroides intestinalis DSM
17393]
Length = 327
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 79/188 (42%), Gaps = 16/188 (8%)
Query: 72 KDFAFDDITRLDVPDNSIFFLETSCTHA--DGVELTLRQACSIESAAMMNPGVQVYVVVI 129
KD ++ + +S E + HA G L+QA SI+S + + +
Sbjct: 33 KDLYLSKVSDIQREQSSQLLTEKTIFHAYWHGT-FGLKQAFSIKSLLCTQNMESIEIWLW 91
Query: 130 ASVRNR----TRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHM 185
N NP + L + + I+ ++ NTP F + + L+
Sbjct: 92 LDSENGYAQIESNPHLQELKGH--IKILSWNVENEISNTP---FCKIKTYINAPKNLAAK 146
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGA-ESSSVVAAGVIHLDKDHWLS 241
D R ++LYKYGG Y DLD + +K L H + A E S + +++L K+ +++
Sbjct: 147 GDDFRIISLYKYGGLYFDLDVMFLKDFTPLLKGHEFVYAWEYQSYANSAILYLRKNSYIT 206
Query: 242 GAALRELR 249
++L+
Sbjct: 207 NYLAKKLQ 214
>gi|293376519|ref|ZP_06622747.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|325845460|ref|ZP_08168751.1| hypothetical protein HMPREF9402_0083 [Turicibacter sp. HGF1]
gi|292644745|gb|EFF62827.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|325488479|gb|EGC90897.1| hypothetical protein HMPREF9402_0083 [Turicibacter sp. HGF1]
Length = 199
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 8/76 (10%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSSVVAAGVIHLDKD 237
L +D++RY LY+YGG Y+D+D K++E L + G E ++ ++ + K
Sbjct: 62 LRQKADIVRYEVLYQYGGVYVDIDMECFKNIEPLLQEGEFFVGTEDDFYLSNELMAVTKK 121
Query: 238 HWLSGAALRELRDNFK 253
H L ++EL D K
Sbjct: 122 HDL----IKELLDGLK 133
>gi|313148899|ref|ZP_07811092.1| glycosyltransferase family 32 [Bacteroides fragilis 3_1_12]
gi|313137666|gb|EFR55026.1| glycosyltransferase family 32 [Bacteroides fragilis 3_1_12]
Length = 231
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDK 236
+D +R+ LY YGG YLD D V+K +SL + G E+S + A VI +K
Sbjct: 65 ADYIRFFALYHYGGIYLDADVEVLKDFKSLLIEKQFLGEEASGDIEAAVIGAEK 118
>gi|449126192|ref|ZP_21762485.1| hypothetical protein HMPREF9723_02529 [Treponema denticola OTK]
gi|448938384|gb|EMB19315.1| hypothetical protein HMPREF9723_02529 [Treponema denticola OTK]
Length = 260
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 63/138 (45%), Gaps = 14/138 (10%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT +A + + + +SD +R+ LYKYGG Y D D VIK ++ + + G E+
Sbjct: 51 YTSEAYRSKKY--AFVSDYVRFDILYKYGGIYFDTDVEVIKPIDKIIEKGAFFGMETVGT 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNF-KTTEWGANGPGVLT----RLLKAECKPQSYAHN 281
V G L + E+ +++ K++ + NG LT R+ CK N
Sbjct: 109 VNPG---LGIAAFAGDPLYAEILESYEKSSFFKPNGKPDLTTIVERVTGILCKHGFLKEN 165
Query: 282 IIS-CRNFTIYPPRFFYP 298
I +N IYP +F P
Sbjct: 166 KIQLVKNINIYPIDYFNP 183
>gi|393783845|ref|ZP_10372016.1| hypothetical protein HMPREF1071_02884 [Bacteroides salyersiae
CL02T12C01]
gi|392668287|gb|EIY61788.1| hypothetical protein HMPREF1071_02884 [Bacteroides salyersiae
CL02T12C01]
Length = 249
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDK 236
+D +R LY YGG YLD D V+KS + L + + GAE++ + A +I +K
Sbjct: 65 ADYIRLYALYNYGGIYLDSDVEVLKSFDPLLDLPYFVGAENAGTIEAAIIGAEK 118
>gi|423250576|ref|ZP_17231591.1| hypothetical protein HMPREF1066_02601 [Bacteroides fragilis
CL03T00C08]
gi|423253902|ref|ZP_17234832.1| hypothetical protein HMPREF1067_01476 [Bacteroides fragilis
CL03T12C07]
gi|392651533|gb|EIY45195.1| hypothetical protein HMPREF1066_02601 [Bacteroides fragilis
CL03T00C08]
gi|392654460|gb|EIY48107.1| hypothetical protein HMPREF1067_01476 [Bacteroides fragilis
CL03T12C07]
Length = 254
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 58/129 (44%), Gaps = 16/129 (12%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLE---SLHNYAGAESSSVVAAGVIHLDKDHW 239
+ ++D +R LY+ GG YLD D + KSL S + G E +++ GV+ K++
Sbjct: 66 AFVADYVRLKVLYEQGGIYLDTDIQLYKSLNPFLSNEAFMGFERDEILSMGVMGFKKNN- 124
Query: 240 LSGAALRELRD------NFKTTEWGANGPGVLTRLLKAECK-PQSYAHNIISCRNFTIYP 292
++EL D N + + T+ L + ++ A II N IYP
Sbjct: 125 ---KIIKELLDYYDQEFNLNIVNKLESNANITTQFLSEKYGLSRNNAKQIIE--NINIYP 179
Query: 293 PRFFYPVHW 301
FF P+ +
Sbjct: 180 KTFFNPMDY 188
>gi|358058072|dbj|GAA96051.1| hypothetical protein E5Q_02712 [Mixia osmundae IAM 14324]
Length = 505
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
SHM+D++R L KYGG YLD D IV +S + L
Sbjct: 255 SHMADVIRMEALLKYGGIYLDSDVIVTRSFDEL 287
>gi|449115785|ref|ZP_21752245.1| hypothetical protein HMPREF9726_00230 [Treponema denticola H-22]
gi|448955271|gb|EMB36038.1| hypothetical protein HMPREF9726_00230 [Treponema denticola H-22]
Length = 260
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHW 239
+ +SD +R+ LYKYGG Y D D VIK ++ + + G E+ V G L +
Sbjct: 62 AFVSDYVRFDILYKYGGIYFDTDVEVIKPIDKIIEKGAFFGMETVGTVNPG---LGIAAF 118
Query: 240 LSGAALRELRDNF-KTTEWGANGPGVLT----RLLKAECKPQSYAHNIIS-CRNFTIYPP 293
E+ +++ K++ + NG LT R+ CK N I +N IYP
Sbjct: 119 AGDPLYAEILESYEKSSFFKPNGKPDLTTIVERVTGILCKHGFLKENKIQLVKNINIYPI 178
Query: 294 RFFYP 298
+F P
Sbjct: 179 DYFNP 183
>gi|449131205|ref|ZP_21767421.1| hypothetical protein HMPREF9724_02086 [Treponema denticola SP37]
gi|448940038|gb|EMB20949.1| hypothetical protein HMPREF9724_02086 [Treponema denticola SP37]
Length = 260
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 63/138 (45%), Gaps = 14/138 (10%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YT +A + + + +SD +R+ LYKYGG Y D D VIK ++ + + G E+
Sbjct: 51 YTSEAYRSKKY--AFVSDYVRFDILYKYGGIYFDTDVEVIKPIDKIIEKGAFFGMETIGT 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNF-KTTEWGANGPGVLT----RLLKAECKPQSYAHN 281
V G L + E+ +++ K++ + NG LT R+ CK N
Sbjct: 109 VNPG---LGIAAFAGDPLYAEILESYEKSSFFKPNGKPDLTTIVERVTGILCKHGFLKEN 165
Query: 282 IIS-CRNFTIYPPRFFYP 298
I +N IYP +F P
Sbjct: 166 KIQLVKNINIYPIDYFNP 183
>gi|389748290|gb|EIM89467.1| hypothetical protein STEHIDRAFT_73632 [Stereum hirsutum FP-91666
SS1]
Length = 742
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D I+++ E L Y GA
Sbjct: 480 LSDIARFVLTHRFGGVYLDADMIILRDWEELWGYKGA 516
>gi|406864910|gb|EKD17953.1| capsule polysaccharide biosynthesis protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 931
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSS 225
+ +T + + H SDL+R+ L KYGG Y D+ F+ I L++L N + S
Sbjct: 81 EGTITGTYAVQHTSDLVRWPLLLKYGGVYTDVGFMQIGDLDALWNKTIGDPES 133
>gi|321471346|gb|EFX82319.1| hypothetical protein DAPPUDRAFT_241514 [Daphnia pulex]
Length = 620
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 62/153 (40%), Gaps = 28/153 (18%)
Query: 125 YVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQ-----------------NTPLH 167
YVV+ A++RN D Y + N+ V D G+Y++ P
Sbjct: 375 YVVLKAAMRNHRP----DHFYYHTNMKNVTYD-GKYWEWVRKDEPLWSRIRVKYLEAPTE 429
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA---ESS 224
F + L+ W H SD+ R L +YGG YLD D VI+SL+ + A
Sbjct: 430 IFGQK---LSDGWRFHHGSDIGRIRVLMQYGGIYLDNDCFVIRSLDKYRKFECALNWGDK 486
Query: 225 SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
+ + K+ L +DN+++ +W
Sbjct: 487 QFMGTQTLIAHKNSRFIKLWLESYKDNYRSDKW 519
>gi|331088497|ref|ZP_08337411.1| hypothetical protein HMPREF1025_00994 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330407837|gb|EGG87328.1| hypothetical protein HMPREF1025_00994 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 236
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 77/201 (38%), Gaps = 56/201 (27%)
Query: 162 QNTPLHG-FYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HN 217
QN +H Y ++A + + +SD+ R+ L K+GG Y D D VIK L
Sbjct: 41 QNFDIHNNRYVEEAYQSKKYAF--VSDVARFYALEKFGGLYFDTDVEVIKDFAPLLDQEA 98
Query: 218 YAGAESSSVVAAGVIHLDKD----------HWLSGAALRELRDNFKTT-----------E 256
+AG E+ + G++ K+ ++ G + +F TT +
Sbjct: 99 FAGFETDEYINPGLVLWVKEKNNRLMKEMLNYYEGLSFIREDGSFNTTTICIYFTEILKQ 158
Query: 257 WGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPAT 316
+G G G + R C N TIYP +F P + +
Sbjct: 159 YGLIGNGKMQR-----------------CGNITIYPKEYFCPFN-----------DQTGV 190
Query: 317 MSLFRDSYALHVWNSFTKRVP 337
++ ++YA+H W S T P
Sbjct: 191 LNKTENTYAIH-WYSKTWMEP 210
>gi|257870145|ref|ZP_05649798.1| glycosyltransferase [Enterococcus gallinarum EG2]
gi|357050402|ref|ZP_09111601.1| hypothetical protein HMPREF9478_01584 [Enterococcus saccharolyticus
30_1]
gi|257804309|gb|EEV33131.1| glycosyltransferase [Enterococcus gallinarum EG2]
gi|355381439|gb|EHG28564.1| hypothetical protein HMPREF9478_01584 [Enterococcus saccharolyticus
30_1]
Length = 208
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 28/166 (16%)
Query: 145 YEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDL 204
YE + +DL ++ G Y Q+A W +SD++R L GG Y+D
Sbjct: 34 YEIKRWDESTIDLSKF-------GSYLQEAYKQEEWAF--VSDVVRLYALVTEGGIYMDT 84
Query: 205 DFIVIKSLESLHNYA---GAESSSVVAAGVI-----HLDKDHWLSGAALRELRDNFKTTE 256
D V+K L+ L NY G E + ++ G+I H + W RE KT +
Sbjct: 85 DVEVVKPLDDLLNYEAFMGFEIETKISTGIIGAVPHHPFMEEWYHDYDDREFVRAEKTED 144
Query: 257 WGANGPGVLTRLLKAECKPQSYAHNI----ISCRNFTIYPPRFFYP 298
V+T +++ Q Y + + + I+P F P
Sbjct: 145 -------VVTNVIRVTELLQEYGLQLNNQRQTVKGVEIFPQMTFSP 183
>gi|422341937|ref|ZP_16422877.1| glycosyltransferase [Treponema denticola F0402]
gi|325474005|gb|EGC77193.1| glycosyltransferase [Treponema denticola F0402]
Length = 260
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHW 239
+ +SD +R+ LYKYGG Y D D VIK ++ + + G E+ V G L +
Sbjct: 62 AFVSDYVRFDILYKYGGIYFDTDVEVIKPIDKIIEKGAFFGMETVGTVNPG---LGIAAF 118
Query: 240 LSGAALRELRDNF-KTTEWGANGPGVLT----RLLKAECKPQSYAHNIIS-CRNFTIYPP 293
E+ +++ K++ + NG LT R+ CK N I +N IYP
Sbjct: 119 AGDPLYAEILESYEKSSFFKPNGKHDLTTIVERVTGILCKHGFLKENKIQLVKNINIYPI 178
Query: 294 RFFYP 298
+F P
Sbjct: 179 DYFNP 183
>gi|282856082|ref|ZP_06265368.1| Eps7I [Pyramidobacter piscolens W5455]
gi|282586103|gb|EFB91385.1| Eps7I [Pyramidobacter piscolens W5455]
Length = 243
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
+T++A W + +SD LR LY+ GG YLD D V+K L+ + G E
Sbjct: 52 FTREAAAMKKW--AFVSDFLRLYALYRRGGVYLDSDVEVLKPLDGFLRHRAFTGFERPGC 109
Query: 227 VAAGVIHLDKDH-WLSG 242
G++ + H W+ G
Sbjct: 110 PVTGIMGAEAGHPWIKG 126
>gi|392571145|gb|EIW64317.1| hypothetical protein TRAVEDRAFT_138986 [Trametes versicolor
FP-101664 SS1]
Length = 700
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 26/37 (70%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SDL+R+V ++YGG YLD+D + ++ E L + GA
Sbjct: 455 LSDLVRFVLCHRYGGVYLDVDMLFLRDWEELWGWTGA 491
>gi|321475275|gb|EFX86238.1| hypothetical protein DAPPUDRAFT_313308 [Daphnia pulex]
Length = 385
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVI 232
L+ W L H SD+ R T+ KYGG YLD D V+++L+ Y G + + + VI
Sbjct: 166 LSDGWRLFHGSDIARIRTMMKYGGIYLDNDVYVVQNLDKYRKYEIAMGWDEGQFLGSQVI 225
>gi|187735637|ref|YP_001877749.1| glycosyl transferase family protein [Akkermansia muciniphila ATCC
BAA-835]
gi|187425689|gb|ACD04968.1| Glycosyl transferase, family 31 [Akkermansia muciniphila ATCC
BAA-835]
Length = 562
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 67/186 (36%), Gaps = 44/186 (23%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVI-KSLESLHNYAGAESSSVVAAGVIHLDKDH-- 238
L+ SD R L + GG ++D+D I + L + + E + VVA G + H
Sbjct: 70 LAPFSDWFRMKFLSQEGGFWVDMDVICLGDELPASPLWFCREWAEVVAVGAMAFPPGHSV 129
Query: 239 ----------------WLSGAALR----------ELRDNFKTTEWGANGPGVLTRLLKAE 272
W S +R ++ D + WG GP +TR L+
Sbjct: 130 PATLCRLAEDPALRVPWDSPEEVRAKEELLRRVPDIADRRRQVPWGFCGPTGMTRALR-- 187
Query: 273 CKPQSYAHNIISCRNFT-IYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
C F P YPV W W D N + A L +++ +H+W
Sbjct: 188 -----------HCGLFDRAAPSSHMYPVPWTRWRDCYNGSIRLAGPEL-SNAWCVHLWGE 235
Query: 332 FTKRVP 337
+R P
Sbjct: 236 MARREP 241
>gi|120400348|gb|ABM21403.1| glycosyltransferase [Lactobacillus johnsonii]
Length = 241
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 28/159 (17%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
++D +R LY GG Y+D D +IKSL++L N+ E V GV +
Sbjct: 65 LTDYVRLDVLYNEGGVYMDTDVKLIKSLDNLVEKGNFMSFEKVGRVNTGVGFASEK---G 121
Query: 242 GAALRELRDNFKTTEWGANGPG--------VLTRLLKAECKPQSYAHN-IISCRNFTIYP 292
+ ++E D + T + + TR+L Y HN + +NFTIYP
Sbjct: 122 NSIIKENLDYYDTHSFLDKNNNFIPEICVKITTRILAKY--GLDYTHNELQQVKNFTIYP 179
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+F P + +++ ++Y +H++ S
Sbjct: 180 SEYFSP-----------KKMGTNKITITNNTYGIHLYAS 207
>gi|81429191|ref|YP_396192.1| teichoic acid/polysaccharide biosynthesis protein [Lactobacillus
sakei subsp. sakei 23K]
gi|78610834|emb|CAI55885.1| Putative teichoic acid/polysaccharide biosynthesis protein
[Lactobacillus sakei subsp. sakei 23K]
Length = 282
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGT 200
I RLY Q+V I+ + F P + QD I L P +H+SDL+R L K+GGT
Sbjct: 76 IKRLYPKQHVQIITLANLSDFITFPT---FLQDKIDAGLIPYAHLSDLVRVALLTKHGGT 132
Query: 201 YLD 203
++D
Sbjct: 133 WID 135
>gi|326428380|gb|EGD73950.1| hypothetical protein PTSG_05645 [Salpingoeca sp. ATCC 50818]
Length = 270
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 40/171 (23%), Positives = 68/171 (39%), Gaps = 42/171 (24%)
Query: 159 RYFQNTPLHGFYTQDAILTSLWP-LSHMSDLLRYVTLYKYGGTYLDLDFIVI--KSLESL 215
R ++ + + +++ + +L P ++ +D+ RY LY+YGG ++D D + I KSL+ L
Sbjct: 51 RLWRTEDIEALHLENSDIFALEPHPANKADIARYEILYRYGGVFIDADSMWINGKSLDPL 110
Query: 216 HNYAGAES------------------SSVVAAGVIHLDKDHWLS-------GAALRELRD 250
A + ++A GVI + H G R+LR
Sbjct: 111 LQQASQTGLFFGEEVRPGKMTAPSTVTPILAVGVIGCAQHHPAMAHVVRTLGRVYRDLRF 170
Query: 251 NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW 301
W A GP T L+ K T++P +F+P W
Sbjct: 171 TQGVARWEATGPDFATAALRGVPK--------------TVFPTFYFFPGGW 207
>gi|373856976|ref|ZP_09599719.1| glycosyltransferase sugar-binding region containing DXD motif
[Bacillus sp. 1NLA3E]
gi|372453222|gb|EHP26690.1| glycosyltransferase sugar-binding region containing DXD motif
[Bacillus sp. 1NLA3E]
Length = 244
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 58/135 (42%), Gaps = 32/135 (23%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDH 238
+ +SD +R LY YGG YLD D V KS + L ++ G E + +A I K +
Sbjct: 65 FAFVSDYVRVHALYYYGGIYLDTDVEVFKSFDPLLHHDSFWGFEQENFIATSTIGASKGN 124
Query: 239 WLSGAALRELRDNFKTTEWGANGP-------GVLTRLLK----------------AECKP 275
L ++ D++ ++ NG ++T +LK A P
Sbjct: 125 SL----IKIFLDSYNGKKFNENGTNNELTNVAIITEILKNKGLKMDGTFQEMPGIATFFP 180
Query: 276 QSY--AHNIISCRNF 288
Q Y ++ I+CRNF
Sbjct: 181 QPYFSPYDYINCRNF 195
>gi|145356106|ref|XP_001422280.1| hypothetical protein OSTLU_28420 [Ostreococcus lucimarinus CCE9901]
gi|144582521|gb|ABP00597.1| hypothetical protein OSTLU_28420 [Ostreococcus lucimarinus CCE9901]
Length = 296
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 27/38 (71%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
+L PL+ +DLLRY LY GG ++D D +++KSL+ L
Sbjct: 127 ALKPLAFKADLLRYCILYTEGGAWMDDDILLVKSLDEL 164
>gi|321249833|ref|XP_003191591.1| snoRNA binding protein [Cryptococcus gattii WM276]
gi|317458058|gb|ADV19804.1| snoRNA binding protein, putative [Cryptococcus gattii WM276]
Length = 1287
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R++ Y++GG YLD D I ++ E L NY G
Sbjct: 1023 LSDMARFIVTYRFGGIYLDADTIFLRDWEELWNYRG 1058
>gi|257126764|ref|YP_003164878.1| hypothetical protein Lebu_2025 [Leptotrichia buccalis C-1013-b]
gi|257050703|gb|ACV39887.1| conserved hypothetical protein [Leptotrichia buccalis C-1013-b]
Length = 256
Score = 41.6 bits (96), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 31/169 (18%)
Query: 179 LWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL-------------ESLHNYAGAESSS 225
LW +++SD +R +Y+ G Y+D D +IK L ++ + G E
Sbjct: 64 LW--AYVSDYMRVHFMYENSGIYVDTDMEIIKDLTPILEGKDEFSKNRKMNFFIGYEDEK 121
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNI 282
++ G+ K L++++D ++ W P + T + + N
Sbjct: 122 HISVGIFGTTK----HNEVLKDIKDFYENDIWKKPIWTIPKIFTYTFEKKYNLSEKRENA 177
Query: 283 ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+ I+P +FYP ++ Y ++ P T Y +H WN
Sbjct: 178 LKNGEIVIFPKEYFYPYGFKE--VYSDDCIKPET-------YGIHWWND 217
>gi|68642799|emb|CAI33148.1| putative glycosyl transferase [Streptococcus pneumoniae]
gi|68642826|emb|CAI33168.1| putative glycosyl transferase [Streptococcus pneumoniae]
Length = 239
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 79/180 (43%), Gaps = 19/180 (10%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GA-ESSS 225
+ Q A W + +SD R +Y+ GG YLD D +I++L+ L +A GA + +
Sbjct: 52 FVQTAYENKAW--AFVSDYARLRIIYENGGIYLDTDVELIRNLDELLEHAAFFGAHQING 109
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDN--FKTTEWGANGPGVLTRLLKAECKPQSYAHNII 283
+VA G+ K L L DN F + +L + + Q + NI+
Sbjct: 110 LVATGLGFGSKKGTKILKELLRLYDNTDFDPCKKNELACPILNSFVFTDFGYQP-SSNIV 168
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSE 343
FTIYP +F P++ A A L +Y++H +++ V V++ ++
Sbjct: 169 QTEYFTIYPAEYFDPIY----------VGANARNLLSEKTYSIHHYSASWTPVRVRIKNK 218
>gi|87301680|ref|ZP_01084520.1| hypothetical protein WH5701_03359 [Synechococcus sp. WH 5701]
gi|87283897|gb|EAQ75851.1| hypothetical protein WH5701_03359 [Synechococcus sp. WH 5701]
Length = 242
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 169 FYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ Q AI W + SD+LR L ++GG YLD D + I+SLE L Y
Sbjct: 53 LFAQQAIALKQWAFA--SDVLRLWVLVQHGGVYLDADVLAIRSLEPLLGY 100
>gi|402217818|gb|EJT97897.1| hypothetical protein DACRYDRAFT_24830 [Dacryopinax sp. DJM-731 SS1]
Length = 873
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 26/37 (70%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GGTYLD D I+++ E L + GA
Sbjct: 608 LSDMARFVLCHRFGGTYLDADTILLRDWEELWGWKGA 644
>gi|403168049|ref|XP_003889755.1| hypothetical protein, variant [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375167327|gb|EHS63386.1| hypothetical protein, variant [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 469
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 114 SAAMMNPGVQVYVVVIASVRNRTRNPLI-DRLYEYQN-VHIVQVDLGRYFQNTPLHGFYT 171
S + P Q+ + S R+ +PL+ LY Y++ + + +DL ++TPLH +
Sbjct: 214 SPNAIQPISQLILWTNPSTRSLENDPLLAPLLYRYRDRLSLRIIDLLSLSKHTPLHEHHL 273
Query: 172 QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
++I L DL+R + L+ YGG ++D+D +V++ L L
Sbjct: 274 LNSIFDKKAWLD--GDLVRVLVLWHYGGLWIDMDNLVLRDLRVL 315
>gi|331230177|ref|XP_003327753.1| hypothetical protein PGTG_08520 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309306743|gb|EFP83334.1| hypothetical protein PGTG_08520 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 495
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 114 SAAMMNPGVQVYVVVIASVRNRTRNPLI-DRLYEYQN-VHIVQVDLGRYFQNTPLHGFYT 171
S + P Q+ + S R+ +PL+ LY Y++ + + +DL ++TPLH +
Sbjct: 214 SPNAIQPISQLILWTNPSTRSLENDPLLAPLLYRYRDRLSLRIIDLLSLSKHTPLHEHHL 273
Query: 172 QDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
++I L DL+R + L+ YGG ++D+D +V++ L L
Sbjct: 274 LNSIFDKKAWLD--GDLVRVLVLWHYGGLWIDMDNLVLRDLRVL 315
>gi|265766791|ref|ZP_06094620.1| glycosyltransferase [Bacteroides sp. 2_1_16]
gi|263253168|gb|EEZ24644.1| glycosyltransferase [Bacteroides sp. 2_1_16]
Length = 231
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 15/100 (15%)
Query: 152 IVQVDLGRYFQNTPLHGFYTQDA----ILTSLW--------PLSHMSDLLRYVTLYKYGG 199
+V+ + + +N P + F DA +++LW + +D +R+ LY YGG
Sbjct: 19 LVEKCINSWKKNLPEYDFVLWDANRVDTISNLWLKQAYENKKYAFAADYIRFYALYYYGG 78
Query: 200 TYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDK 236
YLD D V+K+ L + G E+ + A VI +K
Sbjct: 79 IYLDADVEVLKTFNDLLDQKQFLGEEAGGDIEAAVIGAEK 118
>gi|402084907|gb|EJT79925.1| hypothetical protein GGTG_05007 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 350
Score = 41.2 bits (95), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 73/176 (41%), Gaps = 33/176 (18%)
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHW 239
W ++HM+D+LR L + GG YLD D V++ ++L GA + G + + +
Sbjct: 163 WDMAHMADVLRLTVLLEEGGIYLDADAFVLRPFDTL--LKGARDVVMGHEGGVRMGLTNA 220
Query: 240 L----SGAALRE----LRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIY 291
+ +GA + + +F T W + + RL Y + C T+
Sbjct: 221 VIMSKAGAPFIKRWLGMYTSFDKTLWNEHSVKLPRRL------EDQYPDEL--C---TLS 269
Query: 292 PPRFFYPVHWEHWADYL------------NETNAPATMSLFRDSYALHVWNSFTKR 335
P F++P+ E D++ +ET +LF D LH W+ +R
Sbjct: 270 PSAFYWPMWTEGSVDWMHEPLSDKEAAEVDETMRKNDGALFGDQLVLHAWSHPARR 325
>gi|385815192|ref|YP_005851583.1| Glycosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
2038]
gi|325125229|gb|ADY84559.1| Glycosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
2038]
Length = 233
Score = 41.2 bits (95), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R LY+ GG YLD D +V+K L N + G E++
Sbjct: 51 YVKQAYQAKKWAF--VSDYVRAKALYEMGGIYLDTDVMVLKDFAGLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A +I +K H
Sbjct: 109 LSAAIIAAEKGH 120
>gi|104773592|ref|YP_618572.1| glycosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
ATCC 11842]
gi|116513586|ref|YP_812492.1| mannosyltransferase OCH1 related enzyme [Lactobacillus delbrueckii
subsp. bulgaricus ATCC BAA-365]
gi|418030018|ref|ZP_12668535.1| Hexosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
CNCM I-1632]
gi|418035294|ref|ZP_12673751.1| Hexosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
CNCM I-1519]
gi|103422673|emb|CAI97286.1| Glycosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
ATCC 11842]
gi|116092901|gb|ABJ58054.1| Mannosyltransferase OCH1 related enzyme [Lactobacillus delbrueckii
subsp. bulgaricus ATCC BAA-365]
gi|354688481|gb|EHE88520.1| Hexosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
CNCM I-1632]
gi|354690283|gb|EHE90231.1| Hexosyltransferase [Lactobacillus delbrueckii subsp. bulgaricus
CNCM I-1519]
Length = 233
Score = 41.2 bits (95), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R LY+ GG YLD D +V+K L N + G E++
Sbjct: 51 YVKQAYQAKKWAF--VSDYVRAKALYEMGGIYLDTDVMVLKDFAGLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A +I +K H
Sbjct: 109 LSAAIIAAEKGH 120
>gi|260891469|ref|ZP_05902732.1| hypohetical Teichoic Acid Biosynthesis Protein [Leptotrichia
hofstadii F0254]
gi|260858852|gb|EEX73352.1| hypohetical Teichoic Acid Biosynthesis Protein [Leptotrichia
hofstadii F0254]
Length = 210
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 31/169 (18%)
Query: 179 LWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLE-------------SLHNYAGAESSS 225
LW +++SD +R +Y+ G Y+D D +IK L ++ + G E
Sbjct: 19 LW--AYVSDYMRVHFMYENSGIYVDTDMEIIKDLTPILEGKDEISKNGKMNFFIGYEDEK 76
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA---NGPGVLTRLLKAECKPQSYAHNI 282
++ G+ K L++++D ++ W P + T + + N
Sbjct: 77 HISVGIFGTTK----HNEVLKDIKDFYENDIWKKPIWTIPKIFTYTFEKKYNLSEKRENA 132
Query: 283 ISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+ I+P +FYP ++ Y ++ P T Y +H WN
Sbjct: 133 LKNGEIVIFPKEYFYPYGFKE--VYSDDCIKPET-------YGIHWWND 172
>gi|300812027|ref|ZP_07092479.1| conserved hypothetical protein [Lactobacillus delbrueckii subsp.
bulgaricus PB2003/044-T3-4]
gi|300496992|gb|EFK32062.1| conserved hypothetical protein [Lactobacillus delbrueckii subsp.
bulgaricus PB2003/044-T3-4]
Length = 233
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R LY+ GG YLD D +V+K L N + G E++
Sbjct: 51 YVKQAYQAKKWAF--VSDYVRAKALYEMGGIYLDTDVMVLKDFAGLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A +I +K H
Sbjct: 109 LSAAIIAAEKGH 120
>gi|313123167|ref|YP_004033426.1| mannosyltransferase och1 related enzyme [Lactobacillus delbrueckii
subsp. bulgaricus ND02]
gi|312279730|gb|ADQ60449.1| Mannosyltransferase OCH1 related enzyme [Lactobacillus delbrueckii
subsp. bulgaricus ND02]
Length = 233
Score = 41.2 bits (95), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R LY+ GG YLD D +V+K L N + G E++
Sbjct: 51 YVKQAYQAKKWAF--VSDYVRAKALYEMGGIYLDTDVMVLKDFAGLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A +I +K H
Sbjct: 109 LSAAIIAAEKGH 120
>gi|237721036|ref|ZP_04551517.1| glycosyltransferase family 32 protein [Bacteroides sp. 2_2_4]
gi|298481022|ref|ZP_06999217.1| glycosyltransferase [Bacteroides sp. D22]
gi|229449871|gb|EEO55662.1| glycosyltransferase family 32 protein [Bacteroides sp. 2_2_4]
gi|298273045|gb|EFI14611.1| glycosyltransferase [Bacteroides sp. D22]
Length = 250
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLE---SLHNYAGAESSSVVAAGVIHLDK 236
+D +R LY YGG YLD D V+KS + L + GAE++ + A +I +K
Sbjct: 65 ADYIRMYALYHYGGIYLDSDVEVLKSFDEFLKLPYFVGAENAGTIEAAIIGAEK 118
>gi|408410569|ref|ZP_11181778.1| Putative uncharacterized protein [Lactobacillus sp. 66c]
gi|408410790|ref|ZP_11181990.1| Putative uncharacterized protein [Lactobacillus sp. 66c]
gi|409349920|ref|ZP_11233217.1| Putative uncharacterized protein [Lactobacillus equicursoris CIP
110162]
gi|407875031|emb|CCK83796.1| Putative uncharacterized protein [Lactobacillus sp. 66c]
gi|407875252|emb|CCK83584.1| Putative uncharacterized protein [Lactobacillus sp. 66c]
gi|407877792|emb|CCK85275.1| Putative uncharacterized protein [Lactobacillus equicursoris CIP
110162]
Length = 233
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y + A W +SD +R LY GG YLD D +V+ S ++L + G E++
Sbjct: 51 YVEQAYQAKKWAF--VSDYVRAKALYDMGGIYLDTDVMVLDSFDNLLSDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGV 264
++A +I +K H L+ L D T + G GV
Sbjct: 109 LSAAIIGCEKGHPLASDILHYYDDLDFTFDQGDQLAGV 146
>gi|422845153|ref|ZP_16891863.1| glycosyltransferase [Lactobacillus delbrueckii subsp. lactis DSM
20072]
gi|325684634|gb|EGD26789.1| glycosyltransferase [Lactobacillus delbrueckii subsp. lactis DSM
20072]
Length = 233
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R LY+ GG YLD D +V+K L N + G E++
Sbjct: 51 YVKQAYQAKKWAF--VSDYVRAKALYEMGGIYLDTDVMVLKDFAGLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A +I +K H
Sbjct: 109 LSAAIIAAEKGH 120
>gi|212533711|ref|XP_002147012.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210072376|gb|EEA26465.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 330
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 180 WPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES-LHN 217
W +SH++D LR L++ GG YLD D I +KS ++ LHN
Sbjct: 142 WQVSHLADALRLDILHREGGIYLDADVIALKSFDTLLHN 180
>gi|145356264|ref|XP_001422353.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582594|gb|ABP00670.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 292
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 27/38 (71%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
+L PL+ +DLLRY LY GG ++D D +++KSL+ L
Sbjct: 123 ALKPLAFKADLLRYCILYTEGGAWMDDDILLVKSLDEL 160
>gi|427785167|gb|JAA58035.1| Putative secreted protein [Rhipicephalus pulchellus]
Length = 340
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+ H SD++R L KYGG YLD D V+KSL+ Y A
Sbjct: 144 IQHASDIVRIRVLRKYGGIYLDSDSYVVKSLDKYRRYEAA 183
>gi|54307464|ref|YP_128484.1| hypothetical protein PBPRA0243 [Photobacterium profundum SS9]
gi|46911884|emb|CAG18682.1| hypothetical protein PBPRA0243 [Photobacterium profundum SS9]
Length = 484
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 141 IDRLYEYQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSL--WPLSHMSDLLRYVTLYKYG 198
I R Y++ +I VD+ + ++ L FY I+ L + L+ SD+ R LY+YG
Sbjct: 172 IKRRYKHIGSYIDLVDINYFIKDVFLSNFYYDCYIMEILLRYNLAAASDITRLSILYRYG 231
Query: 199 GTYLDLD 205
GTY+D+D
Sbjct: 232 GTYIDVD 238
>gi|383125340|ref|ZP_09945984.1| hypothetical protein BSIG_5103 [Bacteroides sp. 1_1_6]
gi|382983366|gb|EES65909.2| hypothetical protein BSIG_5103 [Bacteroides sp. 1_1_6]
Length = 235
Score = 40.8 bits (94), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLSG 242
+D +R+ LY YGG YLD D V+KS L H + G E+ + A VI +K
Sbjct: 65 ADYIRFYALYYYGGIYLDADVEVLKSFNPLLGEHYFLGEEAGGDIEAAVIGAEKGSSWVK 124
Query: 243 AALRELRD 250
L RD
Sbjct: 125 ECLDYYRD 132
>gi|320170422|gb|EFW47321.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 421
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 26/34 (76%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
SHMSDL+R + +Y+ GG Y+DLD ++++ L L+
Sbjct: 195 SHMSDLIRTILMYQCGGIYMDLDSVLLRPLHFLN 228
>gi|417990271|ref|ZP_12630757.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus casei A2-362]
gi|410535542|gb|EKQ10160.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus casei A2-362]
Length = 246
Score = 40.8 bits (94), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L L ++ GAE+
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLKQRMVWGFQYDNSILTSFIGAEAKE 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G EL+++ + +L +L + K Q A I
Sbjct: 120 PLLAKILDV-----YAGRRFPELQNDMQNMTSNPVVTKILMKLYPEFRMNGKQQELASGI 174
Query: 283 ISCRNFTIYPPRFF 296
+ +YP +F
Sbjct: 175 L------VYPRDYF 182
>gi|227534094|ref|ZP_03964143.1| glycosyltransferase [Lactobacillus paracasei subsp. paracasei ATCC
25302]
gi|227188275|gb|EEI68342.1| glycosyltransferase [Lactobacillus paracasei subsp. paracasei ATCC
25302]
Length = 247
Score = 40.8 bits (94), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L L ++ GAE+
Sbjct: 61 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLKQRMVWGFQYDNSILTSFIGAEAKE 120
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G EL+++ + +L +L + K Q A I
Sbjct: 121 PLLAKILDV-----YAGRRFPELQNDMQNMTSNPVVTKILMKLYPEFRMNGKQQELASGI 175
Query: 283 ISCRNFTIYPPRFF 296
+ +YP +F
Sbjct: 176 L------VYPRDYF 183
>gi|418005700|ref|ZP_12645684.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus casei UW1]
gi|418015043|ref|ZP_12654626.1| polysaccharide biosynthesis protein [Lactobacillus casei Lpc-37]
gi|410546132|gb|EKQ20401.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus casei UW1]
gi|410552362|gb|EKQ26389.1| polysaccharide biosynthesis protein [Lactobacillus casei Lpc-37]
Length = 246
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L L ++ GAE+
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLKQRMVWGFQYDNSILTSFIGAEAKE 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G EL+++ + +L +L + K Q A I
Sbjct: 120 PLLAKILDV-----YAGRRFPELQNDMQNMTSNPVVTKILMKLYPEFRMNGKQQELASGI 174
Query: 283 ISCRNFTIYPPRFF 296
+ +YP +F
Sbjct: 175 L------VYPRDYF 182
>gi|418008547|ref|ZP_12648410.1| polysaccharide biosynthesis protein [Lactobacillus casei UW4]
gi|410546475|gb|EKQ20731.1| polysaccharide biosynthesis protein [Lactobacillus casei UW4]
Length = 246
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L L ++ GAE+
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLKQRMVWGFQYDNSILTSFIGAEAKE 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G EL+++ + +L +L + K Q A I
Sbjct: 120 PLLAKILDV-----YAGRRFPELQNDMQNMTSNPVVTKILMKLYPEFRMNGKQQELASGI 174
Query: 283 ISCRNFTIYPPRFF 296
+ +YP +F
Sbjct: 175 L------VYPRDYF 182
>gi|378767881|ref|YP_005196351.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis LMG 5342]
gi|365187364|emb|CCF10314.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis LMG 5342]
Length = 333
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 18/109 (16%)
Query: 203 DLDFIVIKSLESLHNYA-GAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG-AN 260
+LD IV + L +YA + VV GV WL +RE KT + N
Sbjct: 139 ELDLIVTD--DGLQHYALQRDREIVVVDGVRRFGNGWWLPAGPMRERASRLKTVDAVIVN 196
Query: 261 G-------------PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
G PG T LL E +P S HN+++ +PPRFF
Sbjct: 197 GGEPQAGEIAMQLRPGKATHLLSGEQRPLSTFHNVVAMAGIG-HPPRFF 244
>gi|386015275|ref|YP_005933556.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis AJ13355]
gi|327393338|dbj|BAK10760.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis AJ13355]
Length = 333
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 18/109 (16%)
Query: 203 DLDFIVIKSLESLHNYA-GAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG-AN 260
+LD IV + L +YA + VV GV WL +RE KT + N
Sbjct: 139 ELDLIVTD--DGLQHYALQRDREIVVVDGVRRFGNGWWLPAGPMRERASRLKTVDAVIVN 196
Query: 261 G-------------PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
G PG T LL E +P S HN+++ +PPRFF
Sbjct: 197 GGEPQAGEIAMQLRPGKATHLLSGEQRPLSTFHNVVAMAGIG-HPPRFF 244
>gi|291616911|ref|YP_003519653.1| LpxK [Pantoea ananatis LMG 20103]
gi|291151941|gb|ADD76525.1| LpxK [Pantoea ananatis LMG 20103]
Length = 333
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 18/109 (16%)
Query: 203 DLDFIVIKSLESLHNYA-GAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG-AN 260
+LD IV + L +YA + VV GV WL +RE KT + N
Sbjct: 139 ELDLIVTD--DGLQHYALQRDREIVVVDGVRRFGNGWWLPAGPMRERASRLKTVDAVIVN 196
Query: 261 G-------------PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
G PG T LL E +P S HN+++ +PPRFF
Sbjct: 197 GGEPQAGEIAMQLRPGKATHLLSGEQRPLSTFHNVVAMAGIG-HPPRFF 244
>gi|239629871|ref|ZP_04672902.1| conserved hypothetical protein [Lactobacillus paracasei subsp.
paracasei 8700:2]
gi|301066992|ref|YP_003789015.1| hypothetical protein LCAZH_1954 [Lactobacillus casei str. Zhang]
gi|417981197|ref|ZP_12621870.1| polysaccharide biosynthesis protein [Lactobacillus casei 12A]
gi|417983981|ref|ZP_12624612.1| polysaccharide biosynthesis protein [Lactobacillus casei 21/1]
gi|417999675|ref|ZP_12639881.1| polysaccharide biosynthesis protein [Lactobacillus casei T71499]
gi|418002780|ref|ZP_12642888.1| polysaccharide biosynthesis protein [Lactobacillus casei UCD174]
gi|239527483|gb|EEQ66484.1| conserved hypothetical protein [Lactobacillus paracasei subsp.
paracasei 8700:2]
gi|300439399|gb|ADK19165.1| hypothetical protein LCAZH_1954 [Lactobacillus casei str. Zhang]
gi|410523093|gb|EKP98024.1| polysaccharide biosynthesis protein [Lactobacillus casei 12A]
gi|410527030|gb|EKQ01906.1| polysaccharide biosynthesis protein [Lactobacillus casei 21/1]
gi|410538606|gb|EKQ13156.1| polysaccharide biosynthesis protein [Lactobacillus casei T71499]
gi|410543361|gb|EKQ17731.1| polysaccharide biosynthesis protein [Lactobacillus casei UCD174]
Length = 246
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D IV K L L ++ GAE+
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIVKKDLAPFLKQRMVWGFQYDNSILTSFIGAEAKE 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G EL+++ + +L +L + K Q A I
Sbjct: 120 PLLAKILDV-----YAGRRFPELQNDMQNMTSNPVVTKILMKLYPEFRMNGKQQELASGI 174
Query: 283 ISCRNFTIYPPRFF 296
+ +YP +F
Sbjct: 175 L------VYPRDYF 182
>gi|336410730|ref|ZP_08591205.1| hypothetical protein HMPREF1018_03222 [Bacteroides sp. 2_1_56FAA]
gi|335944092|gb|EGN05918.1| hypothetical protein HMPREF1018_03222 [Bacteroides sp. 2_1_56FAA]
Length = 231
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDK 236
+D +R+ LY YGG YLD D V+K +SL G E+S + A VI +K
Sbjct: 65 ADYIRFFALYHYGGIYLDADVEVLKDFKSLLIEKQLLGEEASGDIEAAVIGAEK 118
>gi|386079983|ref|YP_005993508.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis PA13]
gi|354989164|gb|AER33288.1| tetraacyldisaccharide 4'-kinase LpxK [Pantoea ananatis PA13]
Length = 333
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 18/109 (16%)
Query: 203 DLDFIVIKSLESLHNYA-GAESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWG-AN 260
+LD IV + L +YA + VV GV WL +RE KT + N
Sbjct: 139 ELDLIVTD--DGLQHYALQRDREIVVVDGVRRFGNGWWLPAGPMRERASRLKTVDAVIVN 196
Query: 261 G-------------PGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
G PG T LL E +P S HN+++ +PPRFF
Sbjct: 197 GGEPQAGEIAMQLRPGKATHLLSGEQRPLSTFHNVVAMAGIG-HPPRFF 244
>gi|441432379|ref|YP_007354421.1| glycosyltransferase [Acanthamoeba polyphaga moumouvirus]
gi|440383459|gb|AGC01985.1| glycosyltransferase [Acanthamoeba polyphaga moumouvirus]
Length = 248
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY LY YGG YLD+D + K+LESL Y
Sbjct: 68 AQKADFARYAILYTYGGIYLDMDMVCRKNLESLLQY 103
>gi|399022485|ref|ZP_10724561.1| mannosyltransferase OCH1-like enzyme [Chryseobacterium sp. CF314]
gi|398084803|gb|EJL75475.1| mannosyltransferase OCH1-like enzyme [Chryseobacterium sp. CF314]
Length = 219
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 19/100 (19%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA----------GVIHLD 235
+D RY LYK GG YLD+D + K L++L + +V++ G+I +
Sbjct: 70 ADFFRYAILYKKGGIYLDVDSAITKPLKNL---IKEDDEAVISKERHPDLFVQWGLI-FN 125
Query: 236 KDHWLSGAALRELRDNFKTTEW-----GANGPGVLTRLLK 270
K+H L + DN KT + GP V ++ ++
Sbjct: 126 KNHPFLKKTLELMLDNIKTHRYPHDIHSTTGPTVFSKAIQ 165
>gi|390601146|gb|EIN10540.1| hypothetical protein PUNSTDRAFT_43418 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 767
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSS 225
D +T L H SDL+R+ L KYGG Y D+ I I +L+ L N A +S
Sbjct: 466 DGKITGHHALQHTSDLVRFPLLLKYGGVYADVGMIQIGNLDRLWNETIANPAS 518
>gi|449551029|gb|EMD41993.1| hypothetical protein CERSUDRAFT_90589 [Ceriporiopsis subvermispora
B]
Length = 761
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SDL R+V ++YGG YLD D I ++ E L + GA
Sbjct: 524 LSDLARFVLCHRYGGIYLDADTIFLRDWEELWGWKGA 560
>gi|379705073|ref|YP_005203532.1| Polysaccharide biosynthesis protein CpsM(V) [Streptococcus
infantarius subsp. infantarius CJ18]
gi|374681772|gb|AEZ62061.1| Polysaccharide biosynthesis protein CpsM(V) [Streptococcus
infantarius subsp. infantarius CJ18]
Length = 245
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGV-IHLDKDHWL 240
+SD R +Y GG YLD D ++K+L+ L H Y G E V G+ +K H
Sbjct: 64 VSDYARLDIIYNEGGFYLDTDVELLKALDDLTSEHCYMGMEQVGRVNTGLGFGAEKGHPF 123
Query: 241 SGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVH 300
+++ D+ + + T LL ++ ++ IS + ++YP FF P +
Sbjct: 124 IKENMQQYEDSSFNCKMLETCVDITTNLLLSKGLLVENSYQRIS--DVSVYPTDFFCPFN 181
Query: 301 WEHWADYLNETNAPATMSLFRDSYALHVWNS 331
+ M + +++Y++H ++S
Sbjct: 182 MQ-----------TQEMGITKNTYSIHHYDS 201
>gi|298705556|emb|CBJ34172.1| surface protein Sur1 [Ectocarpus siliculosus]
Length = 201
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSSV 226
SD+LRY L ++GG Y+D+DF+ + S + LH YAG ++
Sbjct: 114 FGQKSDILRYEVLLRHGGLYVDVDFLCLGSFDDLHKRYEFYAGVSNTGT 162
>gi|443686210|gb|ELT89561.1| hypothetical protein CAPTEDRAFT_212320 [Capitella teleta]
Length = 418
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 4/63 (6%)
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAA 229
+ I +W S S++ R V L +YGG YLDLD ++++S + L Y G ES +
Sbjct: 241 EPIKNPVWGTSQ-SNVDRLVVLMEYGGIYLDLDVLIVQSFDPLRKYPCTLGLESPVKICG 299
Query: 230 GVI 232
+I
Sbjct: 300 SII 302
>gi|393248112|gb|EJD55619.1| hypothetical protein AURDEDRAFT_109879 [Auricularia delicata
TFB-10046 SS5]
Length = 783
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 26/37 (70%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GGTYLD D ++++ E L + GA
Sbjct: 514 LSDMARFVLCHRFGGTYLDADTLLMRDWEELWGWKGA 550
>gi|371944926|gb|AEX62747.1| uncharacterized glycosyl transferase [Moumouvirus Monve]
Length = 119
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY LY YGG YLD+D + K+LESL Y
Sbjct: 68 AQKADFARYAILYTYGGIYLDMDMVCRKNLESLLQY 103
>gi|395334585|gb|EJF66961.1| hypothetical protein DICSQDRAFT_48274 [Dichomitus squalens LYAD-421
SS1]
Length = 700
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 27/37 (72%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SDL+R+V +++GG YLD+D + ++ E L ++G+
Sbjct: 456 LSDLVRFVLCHRFGGVYLDVDMLFLRDWEELWGWSGS 492
>gi|409078053|gb|EKM78417.1| hypothetical protein AGABI1DRAFT_76024 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 770
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V ++YGG YLD D I ++ E L + GA
Sbjct: 529 LSDMARFVLCHRYGGVYLDADTIFLRDWEELWGWKGA 565
>gi|390944068|ref|YP_006407829.1| mannosyltransferase OCH1-like enzyme [Belliella baltica DSM 15883]
gi|390417496|gb|AFL85074.1| mannosyltransferase OCH1-like enzyme [Belliella baltica DSM 15883]
Length = 256
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 22/149 (14%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHWLS 241
+SD+ R L + GG YLD D + +KS L N + G + G+I K+H +
Sbjct: 64 VSDVARLYALQQEGGIYLDTDMLFVKSFSDLINQEFFLGDHIQGKIGLGIIGSIKNHNII 123
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW 301
L + E+ P ++ L + + ++ +Y P +FY + +
Sbjct: 124 DKVLSY----YINQEFNLLTPTLIPDLFDQLLQEER--------KSIRVYEPDYFYALPF 171
Query: 302 EHWADYLNETNAPATMSLFRDSYALHVWN 330
+ E + ++ +SYA+H+WN
Sbjct: 172 DQ-----KEKDYQGFLT--NNSYAVHLWN 193
>gi|150007925|ref|YP_001302668.1| glycosyl transferase family protein [Parabacteroides distasonis
ATCC 8503]
gi|255013388|ref|ZP_05285514.1| glycosyl transferase family protein [Bacteroides sp. 2_1_7]
gi|410103760|ref|ZP_11298681.1| hypothetical protein HMPREF0999_02453 [Parabacteroides sp. D25]
gi|423331576|ref|ZP_17309360.1| hypothetical protein HMPREF1075_01373 [Parabacteroides distasonis
CL03T12C09]
gi|149936349|gb|ABR43046.1| glycosyltransferase family 32 [Parabacteroides distasonis ATCC
8503]
gi|409230146|gb|EKN23014.1| hypothetical protein HMPREF1075_01373 [Parabacteroides distasonis
CL03T12C09]
gi|409236489|gb|EKN29296.1| hypothetical protein HMPREF0999_02453 [Parabacteroides sp. D25]
Length = 250
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLES---LHNYAGAESSSVVAAGVIHLDK 236
+D +R LY +GG YLD D ++KSL++ L + GAE+S + A ++ +K
Sbjct: 65 ADYIRLYALYHHGGIYLDSDVEILKSLDNFLELPYFVGAETSGTIEAAILGAEK 118
>gi|426194054|gb|EKV43986.1| hypothetical protein AGABI2DRAFT_209653 [Agaricus bisporus var.
bisporus H97]
Length = 769
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V ++YGG YLD D I ++ E L + GA
Sbjct: 528 LSDMARFVLCHRYGGVYLDADTIFLRDWEELWGWKGA 564
>gi|295426524|ref|ZP_06819174.1| polysaccharide biosynthesis protein CpsM(V) [Lactobacillus
amylolyticus DSM 11664]
gi|295063892|gb|EFG54850.1| polysaccharide biosynthesis protein CpsM(V) [Lactobacillus
amylolyticus DSM 11664]
Length = 233
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D +V+ +LESL H + G E+
Sbjct: 51 YVEQAYKAKKWAF--VSDYIRAKAIYEQGGIYLDTDVLVLDNLESLLDNHAFVGFENKEN 108
Query: 227 VAAGVIHLDKDH 238
V +K H
Sbjct: 109 PFTAVFGAEKHH 120
>gi|451311485|gb|AGF34240.1| putative glycosyl transferase WcrL [Streptococcus pneumoniae]
Length = 240
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 17/140 (12%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSS 225
+ QDA W +SD R +Y+ GG YLD D VIK+L+ L + + +
Sbjct: 52 FVQDAYRDKAWAF--VSDYARLKIIYENGGIYLDTDVEVIKNLDKLLENKAFFGVHQVNH 109
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG------PGVLTRLLKAECKPQSYA 279
+VA G+ + + L+EL + + E+ + P + T + K +Y+
Sbjct: 110 LVATGLGFGSEK---GTSILKELLNLYDEIEFDLSKKDELLCPELNTPVFKR--LGYTYS 164
Query: 280 HNIISCRNFTIYPPRFFYPV 299
++ FTIYP +F P+
Sbjct: 165 DCVVKNEYFTIYPEEYFDPI 184
>gi|392576673|gb|EIW69803.1| hypothetical protein TREMEDRAFT_43487 [Tremella mesenterica DSM
1558]
Length = 768
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R+V +++GG YLD D I ++ E L NY G
Sbjct: 524 LSDMARFVLTHRFGGIYLDADTIFLRDWEELWNYRG 559
>gi|330718137|ref|ZP_08312737.1| putative glycosyltransferase [Leuconostoc fallax KCTC 3537]
Length = 165
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y Q+A+ ++W + +SD +R L + GG YLD D ++KS + L + G ES
Sbjct: 53 YVQEALQENMW--AFVSDYVRLDILNREGGIYLDTDVEILKSFDELLKNKAFIGRESKYA 110
Query: 227 VAAGVIHLD-KDHWLS 241
+ VI + K WL
Sbjct: 111 FSTAVIASEPKQRWLQ 126
>gi|332298679|ref|YP_004440601.1| glycosyltransferase family protein [Treponema brennaborense DSM
12168]
gi|332181782|gb|AEE17470.1| glycosyltransferase sugar-binding region containing DXD motif
[Treponema brennaborense DSM 12168]
Length = 261
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 76/180 (42%), Gaps = 26/180 (14%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHW 239
+ +SD R+ LY+YGG Y D D VIK L+ + +AG E + + AG L
Sbjct: 62 AFVSDYARFDILYRYGGIYFDTDVEVIKPLDDILERGAFAGVECAGALNAG---LGIASP 118
Query: 240 LSGAALRELRD-----NFKTTEWGANGPGVLTRLLKAECKPQ-SYAHNIISCRNFTIYPP 293
+ +E+ D NF + + V+TR+ K + I + TIYP
Sbjct: 119 AASPIYKEILDSYEKSNFLKQDGTQDLTTVVTRVSDIFRKHGFTDKDEIQTVAGITIYPT 178
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN-SFTKRVPVKLGSEQP--YAQIA 350
+F P E + + +++Y +H ++ S+T + K ++ YAQ
Sbjct: 179 EYFCPKSLE-----------TGIIRITKNTYTIHHYDGSWTSKAQKKYDGKRKKHYAQFG 227
>gi|366087418|ref|ZP_09453903.1| glycosyltransferase [Lactobacillus zeae KCTC 3804]
Length = 246
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 56/134 (41%), Gaps = 30/134 (22%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES----------------LHNYAGAESSS 225
L + +D LRY LY+YGG YLD D I+ K L L ++ GAE+ +
Sbjct: 60 LGYAADELRYAVLYQYGGFYLDTDMIIKKDLAPFLDHKMVWGFQYDNSILTSFIGAEAKT 119
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRL---LKAECKPQSYAHNI 282
+ A ++ + +G +L+D+ + +L RL + + Q I
Sbjct: 120 PLLAEILDV-----YAGRKFPDLQDDMQKMTSNPVVTKILMRLYPDFRMNGQRQELESGI 174
Query: 283 ISCRNFTIYPPRFF 296
I +YP +F
Sbjct: 175 I------VYPRDYF 182
>gi|405117966|gb|AFR92741.1| snoRNA binding protein [Cryptococcus neoformans var. grubii H99]
Length = 1216
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R++ +++GG YLD D I ++ E L NY G
Sbjct: 952 LSDMARFIVTHRFGGIYLDADTIFLRDWEELWNYRG 987
>gi|356557675|ref|XP_003547140.1| PREDICTED: uncharacterized protein At4g19900-like [Glycine max]
Length = 255
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 8/54 (14%)
Query: 195 YKYGGTYLDLDFIVIKSLESLHNYAGAE--------SSSVVAAGVIHLDKDHWL 240
YK+GGTY+DLD +V+KS L N G S S + V+ DK+ L
Sbjct: 202 YKFGGTYIDLDVVVLKSFSKLRNTIGGHRTLMSKPASGSSLNNAVLIFDKEKRL 255
>gi|299133438|ref|ZP_07026632.1| glycosyltransferase sugar-binding protein containing DXD motif
[Afipia sp. 1NLS2]
gi|298591274|gb|EFI51475.1| glycosyltransferase sugar-binding protein containing DXD motif
[Afipia sp. 1NLS2]
Length = 589
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 49/108 (45%), Gaps = 9/108 (8%)
Query: 144 LYEYQNVHIVQ--VDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
LY Y V V+L + PL F ++ W + SDL RYV L K+GG Y
Sbjct: 42 LYAYDKSLAVPDGVELRDAAEVLPLDIFNRYNSGNAKAW--ARHSDLFRYVMLEKFGGWY 99
Query: 202 LDLDFIVI-KSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALREL 248
DLD + + +L Y G + + + AG++ K A L+EL
Sbjct: 100 ADLDIVCLADALPQGEVYFGRANEARIFAGLLKFPK----GAAVLKEL 143
>gi|58259067|ref|XP_566946.1| snoRNA binding protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134107129|ref|XP_777877.1| hypothetical protein CNBA5740 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260575|gb|EAL23230.1| hypothetical protein CNBA5740 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223083|gb|AAW41127.1| snoRNA binding protein, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1382
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R++ +++GG YLD D I ++ E L NY G
Sbjct: 1118 LSDMARFIVTHRFGGIYLDADTIFLRDWEELWNYRG 1153
>gi|386337607|ref|YP_006033776.1| glycosyltransferase in exopolysaccharide biosynthesis
[Streptococcus gallolyticus subsp. gallolyticus ATCC
43143]
gi|334280243|dbj|BAK27817.1| glycosyltransferase in exopolysaccharide biosynthesis
[Streptococcus gallolyticus subsp. gallolyticus ATCC
43143]
Length = 242
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 47/103 (45%), Gaps = 6/103 (5%)
Query: 178 SLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHL 234
LW + MSD LR L + GG YLD D V+K + L Y G E + V G+I
Sbjct: 63 KLW--AFMSDYLRLWILEREGGIYLDTDVQVLKPFDDLLKYPMFLGYEMNDYVGTGIIGA 120
Query: 235 DKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQS 277
+K + + A L D ++ N P + ++L+ QS
Sbjct: 121 EKGNPVIKALLNFYADKIWNVDYYIN-PVIFGQILEENPSYQS 162
>gi|392591132|gb|EIW80460.1| hypothetical protein CONPUDRAFT_137658 [Coniophora puteana
RWD-64-598 SS2]
Length = 829
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 26/37 (70%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD++R++ L+++GG Y+D D I ++ E L + GA
Sbjct: 594 LSDMVRFILLHRFGGIYVDADTIFLRDWEELWGWTGA 630
>gi|423132639|ref|ZP_17120286.1| hypothetical protein HMPREF9715_00061 [Myroides odoratimimus CIP
101113]
gi|423328253|ref|ZP_17306060.1| hypothetical protein HMPREF9711_01634 [Myroides odoratimimus CCUG
3837]
gi|371650016|gb|EHO15490.1| hypothetical protein HMPREF9715_00061 [Myroides odoratimimus CIP
101113]
gi|404605156|gb|EKB04769.1| hypothetical protein HMPREF9711_01634 [Myroides odoratimimus CCUG
3837]
Length = 241
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 14/87 (16%)
Query: 169 FYTQDAILTSLWPLSH----------MSDLLRYVTLYKYGGTYLDLDFIVIKSLES-LHN 217
F+T+D +P + +SD+ R L +YGG YLD D ++K+L+ LH+
Sbjct: 38 FWTEDNFDLDRYPFARQALECKKYAFVSDVCRLYVLKQYGGIYLDTDVEILKNLDCFLHH 97
Query: 218 --YAGAESSSVVAAGVIHLDKD-HWLS 241
++G E ++ V G++ KD W+S
Sbjct: 98 EAFSGFEDNNSVPTGIMASIKDGRWVS 124
>gi|428309950|ref|YP_007120927.1| outer membrane cobalamin receptor protein [Microcoleus sp. PCC
7113]
gi|428251562|gb|AFZ17521.1| outer membrane cobalamin receptor protein [Microcoleus sp. PCC
7113]
Length = 732
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 7/93 (7%)
Query: 66 VLSGVEKDFAFDDITRLDVPDNSIFFLET-SCTHADGVELTLRQACSIESAAMMNPGVQV 124
LS V A ++ D ++F L T T A ++L LRQ + E + +NP V
Sbjct: 444 TLSTVPNRIALNETENRDRFHTALFALNTWKLTDAFQIDLGLRQNFNSEFGSYLNPSVGT 503
Query: 125 YVVVIASVRNR------TRNPLIDRLYEYQNVH 151
V SV R RNP +D+LY Y VH
Sbjct: 504 RWAVTPSVAVRGSWVSVQRNPGLDQLYVYDTVH 536
>gi|238587372|ref|XP_002391454.1| hypothetical protein MPER_09115 [Moniliophthora perniciosa FA553]
gi|215456133|gb|EEB92384.1| hypothetical protein MPER_09115 [Moniliophthora perniciosa FA553]
Length = 279
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GGTYLD D I ++ E L + GA
Sbjct: 121 LSDMARFVLCHRFGGTYLDADTIFLRDWEELWGWKGA 157
>gi|428181073|gb|EKX49938.1| hypothetical protein GUITHDRAFT_67557, partial [Guillardia theta
CCMP2712]
Length = 111
Score = 39.7 bits (91), Expect = 2.1, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 28/43 (65%), Gaps = 4/43 (9%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAG 220
L SDLLR ++++GG Y+D+DF +KS + LH+ YAG
Sbjct: 52 LGAKSDLLRLEVVWRFGGLYIDIDFECLKSFDVLHDHLDFYAG 94
>gi|223993615|ref|XP_002286491.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977806|gb|EED96132.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 866
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 28/42 (66%)
Query: 174 AILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
++ SL P ++ +D RY+ LYK GG Y+D+D ++ +L+S
Sbjct: 597 SVFDSLLPGAYKADFFRYLVLYKEGGIYVDVDVMLDTNLDSF 638
>gi|393218398|gb|EJD03886.1| hypothetical protein FOMMEDRAFT_140007 [Fomitiporia mediterranea
MF3/22]
Length = 766
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 9/75 (12%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA-----ESSSVVAAGVIHLDKDHW 239
+SD+ R+V +++GG YLD D ++++ E L + GA V+HL+K
Sbjct: 522 LSDMARFVLCHRFGGIYLDADTLLLRDWEELWGWKGAFAYRWSRLETYNTAVLHLNKGSA 581
Query: 240 LSG----AALRELRD 250
L ALR L D
Sbjct: 582 LGTFLFRTALRNLLD 596
>gi|228917876|ref|ZP_04081413.1| glycosyltransferase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841812|gb|EEM86922.1| glycosyltransferase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 247
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 35/168 (20%), Positives = 64/168 (38%), Gaps = 19/168 (11%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
+ ++A W +SD +R LY YGG YLD D V+K + +++G E
Sbjct: 58 FVKEAYEEKKWAF--VSDYVRLWALYNYGGIYLDTDVEVLKGFDVFLEHDSFSGFEDDIT 115
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHN---II 283
+ G+I K+ L+ + E G ++ C Q + N +
Sbjct: 116 IPTGIIGAMKNSQAINMFLKYYDNRSFYLENGDLDTTTNVEIITEICSDQGFEPNGNYQV 175
Query: 284 SCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNS 331
IYP +F P ++ + + +++Y +H +N
Sbjct: 176 LNNGMHIYPKEYFCPKEYK-----------TGILRVTKNTYCIHHFNG 212
>gi|19114646|ref|NP_593734.1| mannosyltransferase Imt3 [Schizosaccharomyces pombe 972h-]
gi|1723483|sp|Q10323.1|IMT3_SCHPO RecName: Full=Inositol phosphoceramide mannosyltransferase 3;
AltName: Full=IPC mannosyltransferase 3
gi|1213260|emb|CAA93694.1| mannosyltransferase Imt3 [Schizosaccharomyces pombe]
Length = 356
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG---AESSSVVAAGVIHLDKDH 238
+D++RY LYKYGG YLDLD ++L+ L +Y S S ++ V+ K H
Sbjct: 123 ADVVRYFILYKYGGIYLDLDVGCNRTLDPLLHYPAWVRRTSPSGISNNVMGFAKGH 178
>gi|414163161|ref|ZP_11419408.1| hypothetical protein HMPREF9697_01309 [Afipia felis ATCC 53690]
gi|410880941|gb|EKS28781.1| hypothetical protein HMPREF9697_01309 [Afipia felis ATCC 53690]
Length = 594
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 78/207 (37%), Gaps = 32/207 (15%)
Query: 144 LYEYQNVHIVQ--VDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTY 201
LY Y +V V+L PL + +A W + SDL RYV L ++GG Y
Sbjct: 42 LYAYDKSLVVPDGVELHDAADVLPLDILHRYNAGNPDAW--ARHSDLFRYVMLERFGGWY 99
Query: 202 LDLDFIVIKS-LESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALREL-----RDNFKTT 255
DLD + + L + Y G S AG++ K ALRE+ R +
Sbjct: 100 ADLDIVCLADRLPQVEMYFGRSSGMRAFAGLLKFPKGF----PALREIIPEAERILSEAG 155
Query: 256 EWGAN------GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLN 309
+ AN G +L+R LK H C P + Y + + + +
Sbjct: 156 DVQANKTRAVIGTPLLSRALK--------QHGFNDC----AAPTQDAYAIPYNEALAFFD 203
Query: 310 ETNAPATMSLFRDSYALHVWNSFTKRV 336
+ A DS H+WN R+
Sbjct: 204 PSQCEALEVRLADSTFTHLWNGAWNRL 230
>gi|440720125|ref|ZP_20900545.1| YD repeat-containing protein [Pseudomonas syringae BRIP34876]
gi|440726252|ref|ZP_20906507.1| YD repeat-containing protein [Pseudomonas syringae BRIP34881]
gi|440366411|gb|ELQ03492.1| YD repeat-containing protein [Pseudomonas syringae BRIP34876]
gi|440366856|gb|ELQ03931.1| YD repeat-containing protein [Pseudomonas syringae BRIP34881]
Length = 926
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 4/105 (3%)
Query: 111 SIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV---DLGRYFQNTPLH 167
I++ +++NP Q+ + + +R + L +NVHI + + R FQ TP +
Sbjct: 678 GIKNTSVLNPDYQLTLHLHLDLRKGDETEIASALRGRRNVHISDLREEEFFRKFQKTPSY 737
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL 212
Y +D S SD+LRY + YGG Y D D + + +
Sbjct: 738 EIY-KDLYGDDPRHYSAASDVLRYSLINHYGGIYSDTDDMFKRGV 781
>gi|442803639|ref|YP_007371788.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Clostridium stercorarium subsp. stercorarium DSM 8532]
gi|442739489|gb|AGC67178.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Clostridium stercorarium subsp. stercorarium DSM 8532]
Length = 242
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 76/176 (43%), Gaps = 19/176 (10%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVIHLDKDHW 239
+ +SD R+ L +YGG YLD+D ++K ++ L NY G ES V G+I +
Sbjct: 62 AFVSDFARFYVLNQYGGIYLDVDVELVKPIDDLLNYKTVLGFESLGRVNPGLILASEPAT 121
Query: 240 LSGAALRELRDNFKTTEWGANGPGVLTRL-------LKAE-CKPQSYAHNIISCRNFTIY 291
L + EL N + NG LT + LK + + ++ I T++
Sbjct: 122 LFLKEMIELYKNLSFID--NNGNMNLTTIVTYTTDYLKTKGLREENVKQEIYGV---TVF 176
Query: 292 PPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSEQPYA 347
P +F P+ + + L T ++ F S+ + W F K + +G + Y
Sbjct: 177 PTDYFCPIDMK--TNELIITENTYSIHHFNSSW-ISKWGKFKKFIRKIIGGKLYYK 229
>gi|194373761|dbj|BAG56976.1| unnamed protein product [Homo sapiens]
Length = 381
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 38/94 (40%), Gaps = 6/94 (6%)
Query: 98 HADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDL 157
H D + A SA + PGV + + V N NP I R++EY + D+
Sbjct: 231 HTDSFRMLYDDAGVPISAMFITPGVTPWKTTLPGVVNGANNPAI-RVFEYDRATLSLKDM 289
Query: 158 GRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRY 191
YF N +A T W L + S L RY
Sbjct: 290 VTYFMN-----LSQANAQGTPRWELEYQSTLQRY 318
>gi|321453435|gb|EFX64671.1| hypothetical protein DAPPUDRAFT_333961 [Daphnia pulex]
Length = 277
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 66/157 (42%), Gaps = 29/157 (18%)
Query: 125 YVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRY-----------------FQNTPLH 167
Y+V++A++RN NP D+ + + +V +Q G+Y F P
Sbjct: 35 YMVLLAAMRNH--NP--DQFFFHTDVEDIQF-TGKYWGLVKQDEDLWSRIKVFFLEAPAE 89
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESS 224
F + L W H SD+ R L KYGG Y+D D VI+SL+ + + +
Sbjct: 90 IFGQK---LNEGWRFYHGSDIGRIHVLMKYGGIYVDNDAYVIRSLDKYRKFEFVINWDEN 146
Query: 225 SVVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG 261
+ VI KD L RD +++ +W N
Sbjct: 147 QFMGTQVIIAHKDARFLPLWLESYRD-YRSDKWYYNA 182
>gi|443645290|ref|ZP_21129140.1| Rhs family protein [Pseudomonas syringae pv. syringae B64]
gi|443285307|gb|ELS44312.1| Rhs family protein [Pseudomonas syringae pv. syringae B64]
Length = 926
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 4/105 (3%)
Query: 111 SIESAAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQV---DLGRYFQNTPLH 167
I++ +++NP Q+ + + +R + L +NVHI + + R FQ TP +
Sbjct: 678 GIKNTSVLNPDYQLTLHLHLDLRKGDETEIASALRGRRNVHISDLREEEFFRKFQKTPSY 737
Query: 168 GFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL 212
Y +D S SD+LRY + YGG Y D D + + +
Sbjct: 738 EIY-KDLYGDDPRHYSAASDVLRYSLINHYGGIYSDTDDMFKRGV 781
>gi|427783675|gb|JAA57289.1| Putative secreted protein [Rhipicephalus pulchellus]
Length = 352
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+ H SD++R L KYGG YLD D ++KSL+ Y A
Sbjct: 144 IQHASDIVRIKVLRKYGGIYLDSDSYIVKSLDKYRRYETA 183
>gi|321475358|gb|EFX86321.1| hypothetical protein DAPPUDRAFT_236977 [Daphnia pulex]
Length = 242
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 41/85 (48%), Gaps = 4/85 (4%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVI 232
L +W + H SDL+R TL KYGG +LD D V+ +L+ + G + + V+
Sbjct: 159 LKPVWRVHHGSDLIRIRTLMKYGGIFLDNDVYVVHNLDKYRQFEMTLGWPRNESLGTMVL 218
Query: 233 HLDKDHWLSGAALRELRDNFKTTEW 257
+K+ L RD +K +W
Sbjct: 219 CANKNARFLPLWLDNYRD-YKADQW 242
>gi|409083147|gb|EKM83504.1| hypothetical protein AGABI1DRAFT_66179 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 741
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R++ ++YGG YLD D I ++ E L + GA
Sbjct: 506 LSDMARFILCHRYGGIYLDADTIFLRDWEELWGWKGA 542
>gi|159490672|ref|XP_001703297.1| hypothetical protein CHLREDRAFT_188189 [Chlamydomonas reinhardtii]
gi|158280221|gb|EDP05979.1| predicted protein [Chlamydomonas reinhardtii]
Length = 515
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 122 VQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGF-YTQDAILT--S 178
+ V+V + +V + + E+ + + QV R TPL G + +DA L +
Sbjct: 150 MSVWVDNVTAVLEHPAGQFLHKYSEF--ISVRQVQWERALVGTPLQGHPFWRDAALVRKN 207
Query: 179 LWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSL 212
++ + +D+LR + LY++GG +LD D ++++ +
Sbjct: 208 IYTAAGFADVLRLLLLYQHGGVWLDTDVVLLQDM 241
>gi|331084401|ref|ZP_08333504.1| hypothetical protein HMPREF0992_02428 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330401497|gb|EGG81081.1| hypothetical protein HMPREF0992_02428 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 253
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 56/142 (39%), Gaps = 20/142 (14%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAE-SSS 225
YT+ A W +SD +R L KYGG Y+D D VI+ L H + +
Sbjct: 51 YTRIAYQKKKWAF--ISDYVRLYALEKYGGVYMDTDLEVIRDFSELLKKHEFVSSTLEGG 108
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKT---------TEWGANGPGVLTRLLKAECKPQ 276
++ AG I H + L+ + T E+ N P + TR+ K +
Sbjct: 109 LITAGFIATRAQH----PYIVTLKKKYDTGYFQRDDGSIEFSMN-PLLFTRIAKEMYGFK 163
Query: 277 SYAHNIISCRNFTIYPPRFFYP 298
+ +NF IY +F P
Sbjct: 164 ISNAGFMEQKNFMIYSIEYFMP 185
>gi|426201801|gb|EKV51724.1| hypothetical protein AGABI2DRAFT_182678 [Agaricus bisporus var.
bisporus H97]
Length = 743
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R++ ++YGG YLD D I ++ E L + GA
Sbjct: 506 LSDMARFILCHRYGGIYLDADTIFLRDWEELWGWKGA 542
>gi|406699429|gb|EKD02632.1| snoRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
Length = 803
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R+V +++GG YLD D + ++ E L NY G
Sbjct: 556 LSDMARFVLTHRFGGVYLDADTLFLRDWEELWNYRG 591
>gi|401887916|gb|EJT51890.1| snoRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
Length = 816
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
+SD+ R+V +++GG YLD D + ++ E L NY G
Sbjct: 569 LSDMARFVLTHRFGGVYLDADTLFLRDWEELWNYRG 604
>gi|390603041|gb|EIN12433.1| hypothetical protein PUNSTDRAFT_61548 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 683
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
MSDL R++ + +GG YLD+D ++++ E L + GA
Sbjct: 421 MSDLARFLVTHAFGGIYLDVDNVLLRDWEELWGWKGA 457
>gi|154490266|ref|ZP_02030527.1| hypothetical protein PARMER_00498 [Parabacteroides merdae ATCC
43184]
gi|423721812|ref|ZP_17695988.1| hypothetical protein HMPREF1078_00051 [Parabacteroides merdae
CL09T00C40]
gi|154089158|gb|EDN88202.1| hypothetical protein PARMER_00498 [Parabacteroides merdae ATCC
43184]
gi|409242825|gb|EKN35584.1| hypothetical protein HMPREF1078_00051 [Parabacteroides merdae
CL09T00C40]
Length = 141
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
D++RY+ LY+YGG Y DLD+ I +L+SL
Sbjct: 62 DMIRYLILYEYGGIYADLDYECIDALDSL 90
>gi|170095475|ref|XP_001878958.1| glycosyltransferase family 32 protein [Laccaria bicolor S238N-H82]
gi|164646262|gb|EDR10508.1| glycosyltransferase family 32 protein [Laccaria bicolor S238N-H82]
Length = 749
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D I+++ E L + GA
Sbjct: 493 LSDMARFVLCHRFGGIYLDADTILLRDWEELWGWKGA 529
>gi|392571187|gb|EIW64359.1| hypothetical protein TRAVEDRAFT_139072 [Trametes versicolor
FP-101664 SS1]
Length = 683
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D I+++ E L + GA
Sbjct: 446 LSDMARFVLCHRFGGIYLDADTILLRDWEELWGWKGA 482
>gi|395334685|gb|EJF67061.1| hypothetical protein DICSQDRAFT_96113 [Dichomitus squalens LYAD-421
SS1]
Length = 750
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D I+++ E L + GA
Sbjct: 509 LSDMARFVLCHRFGGIYLDADTILLRDWEELWGWKGA 545
>gi|329954572|ref|ZP_08295663.1| hypothetical protein HMPREF9445_00492 [Bacteroides clarus YIT
12056]
gi|328527540|gb|EGF54537.1| hypothetical protein HMPREF9445_00492 [Bacteroides clarus YIT
12056]
Length = 243
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 53/124 (42%), Gaps = 9/124 (7%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAE--SSSVVAAGVIHLDKD 237
+ +SD +R LY+YGG YLD D ++KS L + G E +V+A+ VI
Sbjct: 62 AFVSDYVRLKALYEYGGIYLDTDIKILKSFNPLLEDEGFIGFEDVKGNVIASCVIAAKPK 121
Query: 238 HWLSGAALRELRDNFKTTEWGANGPGV--LTRLLKAECKPQSYAHNIISCRNFTIYPPRF 295
H ++ +F N V +T+ L + II+ IYP +
Sbjct: 122 HPFIKECMQYYNQDFTMEIINKNEANVIDITQRLVQKGMQLGGKEQIIA--GMHIYPREY 179
Query: 296 FYPV 299
F P+
Sbjct: 180 FCPM 183
>gi|443923136|gb|ELU42433.1| snoRNA binding protein [Rhizoctonia solani AG-1 IA]
Length = 843
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D ++++ E L Y GA
Sbjct: 583 VSDMARFVLCHRFGGIYLDADTLLLRDWEELWGYKGA 619
>gi|423340135|ref|ZP_17317874.1| hypothetical protein HMPREF1059_03799 [Parabacteroides distasonis
CL09T03C24]
gi|409227570|gb|EKN20466.1| hypothetical protein HMPREF1059_03799 [Parabacteroides distasonis
CL09T03C24]
Length = 258
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 7/148 (4%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSV 226
Y ++A W + ++D++R + GG Y+D D V+K L+ L Y +G ESSS
Sbjct: 51 YVREAYEAKKW--AFITDVVRLYAMVTEGGIYMDTDVEVLKPLDELLQYDAVSGFESSSR 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRD-NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISC 285
+ G++ + L L E +FK + + +TR+ A K +N
Sbjct: 109 IPTGLMACREGQPLFEELLHEYDVLHFKRPDGSLDVTTNVTRITNACLKYGFVPNNTKQT 168
Query: 286 RN-FTIYPPRFFYPVHWEHWADYLNETN 312
N FT+ P + P + YL E +
Sbjct: 169 VNGFTLLPQDYLCPKSYVDGKIYLTENS 196
>gi|336389809|gb|EGO30952.1| hypothetical protein SERLADRAFT_404908 [Serpula lacrymans var.
lacrymans S7.9]
Length = 751
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R++ ++YGG YLD D I ++ E L + GA
Sbjct: 516 LSDMARFILCHRYGGIYLDADTIFLRDWEELWGWKGA 552
>gi|320170421|gb|EFW47320.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 441
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 26/34 (76%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
SHMSDL+R + +++ GG Y+DLD ++++ L L+
Sbjct: 215 SHMSDLMRAILMHQCGGIYMDLDSVLLRPLHFLN 248
>gi|317476951|ref|ZP_07936193.1| glycosyltransferase sugar-binding region containing DXD
domain-containing protein [Bacteroides eggerthii
1_2_48FAA]
gi|316906744|gb|EFV28456.1| glycosyltransferase sugar-binding region containing DXD
domain-containing protein [Bacteroides eggerthii
1_2_48FAA]
Length = 231
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDK 236
+D +R+ LY YGG YLD D V KS L N + G E+ + A V+ +K
Sbjct: 65 ADYIRFYALYYYGGIYLDADVEVFKSFTPLLNQKQFIGEEAGGDIEAAVMGAEK 118
>gi|238916223|ref|YP_002929740.1| hypothetical protein EUBELI_00257 [Eubacterium eligens ATCC 27750]
gi|238871583|gb|ACR71293.1| Hypothetical protein EUBELI_00257 [Eubacterium eligens ATCC 27750]
Length = 375
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 74/181 (40%), Gaps = 43/181 (23%)
Query: 169 FYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSS 225
Y + A W + D+ R LY YGG Y+D D VI++L+ L Y AG E
Sbjct: 188 LYMKQAYEAKKW--GFVPDIARLDILYNYGGLYIDTDVEVIRNLDDLLKYEAFAGVEKWG 245
Query: 226 VVAAGVIHLDKDHWLSGA-----ALRELRDNFKTTE--WGANGPGVLTR--------LLK 270
+ G SGA ++E+ D F+ E +G LT L+K
Sbjct: 246 NINMG--------GCSGAVPHHPVIKEMLD-FRKNEPFVMQDGSFNLTTCGYYETMPLIK 296
Query: 271 AECKPQSYAHNIISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWN 330
KP + NI TI+ +F+P DY++ + +++Y++H +N
Sbjct: 297 KGFKPNNTVQNIAG---MTIFSSDYFHPY------DYMS-----GETCITKNTYSIHHFN 342
Query: 331 S 331
Sbjct: 343 G 343
>gi|425765372|gb|EKV04069.1| hypothetical protein PDIG_91230 [Penicillium digitatum PHI26]
gi|425765758|gb|EKV04407.1| hypothetical protein PDIP_87310 [Penicillium digitatum Pd1]
Length = 405
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 26/43 (60%)
Query: 173 DAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL 215
D LT + L H SDL+R+ L +YGG Y D+ + I L++L
Sbjct: 105 DETLTGPYALQHTSDLVRWPLLLRYGGVYADVGMMQIGDLDTL 147
>gi|336376878|gb|EGO05213.1| hypothetical protein SERLA73DRAFT_68839 [Serpula lacrymans var.
lacrymans S7.3]
Length = 768
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R++ ++YGG YLD D I ++ E L + GA
Sbjct: 533 LSDMARFILCHRYGGIYLDADTIFLRDWEELWGWKGA 569
>gi|403165059|ref|XP_003325088.2| hypothetical protein PGTG_06625 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165514|gb|EFP80669.2| hypothetical protein PGTG_06625 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 525
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
+H +D++R L YGG YLDLD V+++ +SL N
Sbjct: 315 FAHKADIIRLEALRDYGGIYLDLDVFVVRNFDSLLN 350
>gi|409051754|gb|EKM61230.1| hypothetical protein PHACADRAFT_168658 [Phanerochaete carnosa
HHB-10118-sp]
Length = 669
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 27/41 (65%)
Query: 181 PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
P +SDL+R+V +++GG YLD+D ++++ E L GA
Sbjct: 443 PSVALSDLVRFVLCHRFGGVYLDVDTLLLRDWEELWGAPGA 483
>gi|328766900|gb|EGF76952.1| hypothetical protein BATDEDRAFT_37493 [Batrachochytrium
dendrobatidis JAM81]
Length = 326
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 24/36 (66%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+D++RY+ +YKYGG Y DLD IK+ L + GA
Sbjct: 127 ADVVRYMYMYKYGGFYADLDVECIKNHRPLARWGGA 162
>gi|443724645|gb|ELU12549.1| hypothetical protein CAPTEDRAFT_54872, partial [Capitella teleta]
Length = 185
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 18/115 (15%)
Query: 174 AILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL--------------HNY- 218
A+ S + H +D +RY L+ +GG Y+DLD +K L++ H Y
Sbjct: 56 ALFESYPEVIHRADAMRYFILHHFGGVYMDLDMEPLKPLDNWTSNYDVVLSQECYEHTYV 115
Query: 219 AGAESSSVVAAGVIHLDKDHWLSGAALRELRD---NFKTTEWGANGPGVLTRLLK 270
ES + GV+ H +A++ L D + A GP L+R+LK
Sbjct: 116 VRGESEPNIVNGVMFARPQHPFFLSAVKSLPDAANKYFGDFLQATGPVFLSRVLK 170
>gi|221481448|gb|EEE19834.1| glycosyltransferase sugar-binding region containing DXD
motif-containing protein, putative [Toxoplasma gondii
GT1]
Length = 1064
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 25/34 (73%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+SD+LR + LYK GG YLD+D + ++S SL ++
Sbjct: 708 VSDILRVLFLYKEGGLYLDVDVLTLRSFASLPSF 741
>gi|288560330|ref|YP_003423816.1| hypothetical protein mru_1073 [Methanobrevibacter ruminantium M1]
gi|288543040|gb|ADC46924.1| hypothetical protein mru_1073 [Methanobrevibacter ruminantium M1]
Length = 532
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 29/48 (60%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAG 230
S ++L RY LY+YGGT+LDLD ++IK L G+++ + +
Sbjct: 77 SGFANLFRYKRLYEYGGTWLDLDLLLIKRLSDEDIIIGSQTQEDIYSN 124
>gi|237838847|ref|XP_002368721.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211966385|gb|EEB01581.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 1057
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 25/34 (73%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+SD+LR + LYK GG YLD+D + ++S SL ++
Sbjct: 701 VSDILRVLFLYKEGGLYLDVDVLTLRSFASLPSF 734
>gi|301309663|ref|ZP_07215602.1| polysaccharide biosynthesis protein CpsM(V) [Bacteroides sp. 20_3]
gi|300831237|gb|EFK61868.1| polysaccharide biosynthesis protein CpsM(V) [Bacteroides sp. 20_3]
Length = 240
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 7/148 (4%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSV 226
Y ++A W + ++D++R + GG Y+D D V+K L+ L Y +G ESSS
Sbjct: 33 YVREAYEAKKW--AFITDVVRLYAMVTEGGIYMDTDVEVLKPLDELLQYDAVSGFESSSR 90
Query: 227 VAAGVIHLDKDHWLSGAALRELRD-NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISC 285
+ G++ + L L E +FK + + +TR+ A K +N
Sbjct: 91 IPTGLMACREGQPLFEELLHEYDVLHFKRPDGSLDVTTNVTRITNACLKYGFVPNNTKQT 150
Query: 286 RN-FTIYPPRFFYPVHWEHWADYLNETN 312
N FT+ P + P + YL E +
Sbjct: 151 VNGFTLLPQDYLCPKSYVDGKIYLTENS 178
>gi|221505414|gb|EEE31059.1| glycosyltransferase sugar-binding region containing DXD
motif-containing protein, putative [Toxoplasma gondii
VEG]
Length = 1061
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 25/34 (73%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+SD+LR + LYK GG YLD+D + ++S SL ++
Sbjct: 705 VSDILRVLFLYKEGGLYLDVDVLTLRSFASLPSF 738
>gi|198416466|ref|XP_002120464.1| PREDICTED: hypothetical protein, partial [Ciona intestinalis]
Length = 347
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAA---GVIHLDKDHWLSG 242
+D R L KYGG Y+DLD I +K+ + L Y SV A G++ +K+ L
Sbjct: 171 TDAARIEILIKYGGIYIDLDVITLKNFDELRKYPVTMGRSVEVAFSMGILLAEKNSLLLR 230
Query: 243 AALRELRDNF 252
+E ++F
Sbjct: 231 EFYKEYPNHF 240
>gi|321469454|gb|EFX80434.1| hypothetical protein DAPPUDRAFT_304121 [Daphnia pulex]
Length = 364
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 30/60 (50%), Gaps = 3/60 (5%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAESSSVVAAGVI 232
L+ W L H SD+ R + KYGG YLD D V+ +L + Y G + + + VI
Sbjct: 166 LSEDWGLFHGSDVARIQIMMKYGGIYLDNDVYVVHNLNAYRKYEIAMGWDEGEFIGSQVI 225
>gi|293370085|ref|ZP_06616650.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634813|gb|EFF53337.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 270
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 5/58 (8%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESS 224
Y +A+ W + +D LR +YKYGG +LD D V KS + N + GAE++
Sbjct: 51 YANEAVSVKKWAFA--ADFLRNYVVYKYGGIWLDTDIEVFKSFDPFLNNRMFIGAEAN 106
>gi|407015454|gb|EKE29327.1| glycosyltransferase family 32 protein [uncultured bacterium (gcode
4)]
Length = 375
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/115 (24%), Positives = 52/115 (45%), Gaps = 18/115 (15%)
Query: 188 LLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSGAALRE 247
+LR+V +K+GG ++D+D I++ + + E + V LD +++ AA+
Sbjct: 126 ILRFVVPFKHGGAWIDMDTILLNDFKPI---LDQEWAYVWGQN---LDFENFWPIAAIMN 179
Query: 248 LRDNFKTTEWGANGPGVLTRLLKAECKP------QSYAHNIISCRNFTIYPPRFF 296
+ +W + L +LK E +P + + R FTI+P FF
Sbjct: 180 IH------KWSEHAKICLEEILKTEVEPMTTCLDKDLLAKVYKRRKFTIFPAAFF 228
>gi|399926010|ref|ZP_10783368.1| glycosyltransferase [Myroides injenensis M09-0166]
Length = 249
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 76/168 (45%), Gaps = 25/168 (14%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL--HN-YAGAESSSVVAAGVIHLDKDHWLS 241
+SD+ R L+++GG YLD D V+ SL+ HN ++G E ++ V G++ +K
Sbjct: 73 VSDVCRLAVLHEFGGIYLDTDVEVLGSLDPFLHHNAFSGFEDNNCVPTGIMASEK----G 128
Query: 242 GAALRELRDNFKTTEW-GANGPG-------VLTRLLKAECKPQSYAHNIISCRNF-TIYP 292
G+ +++ ++ + NG ++T ++K K + + N+ T YP
Sbjct: 129 GSWAKDMLAYYEGKHFIDENGKMDMTTNVIIITEMMKD--KKLQFNNTFQEIENYITFYP 186
Query: 293 PRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKL 340
+F P H + T+ T+ F S W S KR +K+
Sbjct: 187 SEYFCPK--SHVDGKIRITSNSITIHHFAGS-----WLSTAKRREIKI 227
>gi|298480510|ref|ZP_06998707.1| polysaccharide biosynthesis protein CpsM(V) [Bacteroides sp. D22]
gi|298273331|gb|EFI14895.1| polysaccharide biosynthesis protein CpsM(V) [Bacteroides sp. D22]
Length = 247
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 34/70 (48%), Gaps = 5/70 (7%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSS--VVAAGVIHLDKDHWL 240
+D +R LY YGG YLD D V+KS L + + G E S + A V+ +K H L
Sbjct: 65 ADYIRAYALYNYGGIYLDCDVEVLKSFNDLLHLPYFIGEERSDDFAIEAAVVGFEKGHLL 124
Query: 241 SGAALRELRD 250
L +D
Sbjct: 125 FKNMLDYYQD 134
>gi|68642851|emb|CAI33187.1| putative glycosyl transferase [Streptococcus pneumoniae]
gi|451311466|gb|AGF34222.1| putative glycosyl transferase WcrL [Streptococcus pneumoniae]
Length = 240
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 63/140 (45%), Gaps = 17/140 (12%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSS 225
+ QDA W +SD R +Y+ GG YLD D VIK+L+ L + + +
Sbjct: 52 FVQDAYRDKAWAF--VSDYARLKIIYENGGIYLDTDVEVIKNLDKLLENKAFFGVHQVNH 109
Query: 226 VVAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANG------PGVLTRLLKAECKPQSYA 279
+V+ G+ + + L+EL + + E+ + P + T + K +Y+
Sbjct: 110 LVSTGLGFGSEK---GTSILKELLNLYDEIEFDLSKKDELLCPELNTPVFKR--LGYTYS 164
Query: 280 HNIISCRNFTIYPPRFFYPV 299
++ FTIYP +F P+
Sbjct: 165 DCVVKNEYFTIYPEEYFDPI 184
>gi|755724|emb|CAA88565.1| alpha-toxin [Clostridium novyi]
Length = 2178
Score = 38.9 bits (89), Expect = 3.9, Method: Composition-based stats.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 166 LHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
L +Y Q+ I T+ L+ SD+LR L KYGG Y DLDF+
Sbjct: 249 LKSYYYQELIQTN--NLAAASDILRIAILKKYGGVYCDLDFL 288
>gi|385826057|ref|YP_005862399.1| Polysaccharide biosynthesis protein [Lactobacillus johnsonii DPC
6026]
gi|329667501|gb|AEB93449.1| Polysaccharide biosynthesis protein [Lactobacillus johnsonii DPC
6026]
Length = 241
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 59/141 (41%), Gaps = 19/141 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
YTQ A + + ++D +R +Y GG Y+D D +IKS++ L N+ E
Sbjct: 52 YTQLAYRSKKYAF--LTDYVRLDVIYNEGGIYMDTDVKLIKSIDDLIEKGNFMSFEKVGR 109
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGANGPG--------VLTRLLKAECKPQSY 278
V GV + + L+E D + E+ + T+LL E Y
Sbjct: 110 VNTGVGFASEK---GTSILKENLDYYNNHEFVDKNNNFTPEICVKITTQLL--ERHGLDY 164
Query: 279 AHNII-SCRNFTIYPPRFFYP 298
N I NFTIYP +F P
Sbjct: 165 RRNAIQEVENFTIYPSDYFSP 185
>gi|346319170|gb|EGX88772.1| mannosyl phosphorylinositol ceramide synthase CSH1, putative
[Cordyceps militaris CM01]
Length = 335
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 189 LRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
+RY L +YGG Y+DLDF + SLESL Y+
Sbjct: 139 VRYFILRQYGGIYIDLDFGCVDSLESLRPYS 169
>gi|400595251|gb|EJP63058.1| glycosyl transferase [Beauveria bassiana ARSEF 2860]
Length = 387
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 6/90 (6%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-----HNYAGAESSSVVAAGVIHLDK 236
+ H SD +R + ++GG YLD D ++ + L H G + + + +GV + K
Sbjct: 156 IEHKSDFVRVQAIREHGGVYLDFDVHPLRDVRVLRESGFHAVTGRQQGAEINSGVF-MSK 214
Query: 237 DHWLSGAALRELRDNFKTTEWGANGPGVLT 266
H E + T EW A+ GVLT
Sbjct: 215 PHSRMIELWSEGMNEAFTGEWTAHSNGVLT 244
>gi|45775239|gb|AAS77217.1| truncated alpha-1,4-galactosyltransferase [Homo sapiens]
Length = 281
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 87 NSIFFLETSCTHADGVELTLRQACSIESAAMMNPGVQVYVVVIASVRNRTRNPL---IDR 143
+IFFLETS D CS+ESAA +P V V++ P I
Sbjct: 77 GNIFFLETS----DRTNPNFLFMCSVESAARTHPESHVLVLMKGLPGGNASLPRHLGISL 132
Query: 144 LYEYQNVHIVQVDLGRYFQNTPLHGFYT 171
L + NV ++ +DL F++TPL +Y
Sbjct: 133 LSCFPNVQMLPLDLRELFRDTPLADWYA 160
>gi|317478841|ref|ZP_07937992.1| glycosyltransferase sugar-binding region DXD domain-containing
protein [Bacteroides sp. 4_1_36]
gi|316905017|gb|EFV26820.1| glycosyltransferase sugar-binding region DXD domain-containing
protein [Bacteroides sp. 4_1_36]
Length = 244
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 58/140 (41%), Gaps = 22/140 (15%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y ++A W ++D +R + LY+ GG Y D D V KS +SL + G ES
Sbjct: 52 YVREAYDNKKWAF--VTDYVRLIALYENGGIYFDTDVEVFKSFDSLLSEKAFFGFESKDY 109
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEW----------GANGPGVLTRLLKAECKPQ 276
+ VI +K + +++ D+++ ++ N V LL +P
Sbjct: 110 LCTAVIACEK----GNSFIKKFIDSYENRKFILSDGSFDTATTNVVAVTRMLLSKGLRPN 165
Query: 277 SYAHNIISCRNFTIYPPRFF 296
+ + TIYP +F
Sbjct: 166 GKMQIV---DDVTIYPQYYF 182
>gi|365175752|ref|ZP_09363178.1| hypothetical protein HMPREF1006_01123 [Synergistes sp. 3_1_syn1]
gi|363612191|gb|EHL63742.1| hypothetical protein HMPREF1006_01123 [Synergistes sp. 3_1_syn1]
Length = 245
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 6/76 (7%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y ++A W + ++D +R L ++GG Y+D D V+K L+ ++G E +
Sbjct: 51 YVEEAYSAKKW--AFVTDYVRLYVLKEFGGIYMDTDVEVLKPLDRFLVHPAFSGFEQQTS 108
Query: 227 VAAGVIHLDKDH-WLS 241
+ G+I +KD+ W+S
Sbjct: 109 IPTGIIASEKDNFWVS 124
>gi|242000328|ref|XP_002434807.1| secreted protein, putative [Ixodes scapularis]
gi|215498137|gb|EEC07631.1| secreted protein, putative [Ixodes scapularis]
Length = 311
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ H SD+LR + L KYGG YLD D ++KSL++ +
Sbjct: 132 VQHASDVLRALVLMKYGGIYLDSDSYLVKSLDAYRQF 168
>gi|68642876|emb|CAI33206.1| putative glycosyl transferase [Streptococcus pneumoniae]
Length = 240
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 64/141 (45%), Gaps = 19/141 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSS 225
+ QDA W +SD R +Y+ GG YLD D VIK+L+ L + + +
Sbjct: 52 FVQDAYRDKAWAF--VSDYARLKIIYENGGIYLDTDVEVIKNLDKLLENKAFFGVHQVNH 109
Query: 226 VVAAGV-IHLDKDHWLSGAALRELRDNFKTTEWGANG------PGVLTRLLKAECKPQSY 278
+VA G+ +K ++ L+EL + + E+ P + T + K +Y
Sbjct: 110 LVATGLGFGSEKGTFI----LKELLNLYDEIEFDLRKKDELLCPELNTPVFKR--LGYTY 163
Query: 279 AHNIISCRNFTIYPPRFFYPV 299
+ ++ FTIYP +F P+
Sbjct: 164 SDCVVKNEYFTIYPEEYFDPI 184
>gi|328767437|gb|EGF77487.1| hypothetical protein BATDEDRAFT_37448 [Batrachochytrium
dendrobatidis JAM81]
Length = 347
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 187 DLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
D +RY LY+YGG Y+DLD +KSL+ L Y
Sbjct: 131 DAVRYFILYEYGGIYMDLDIGCMKSLDPLLRYPA 164
>gi|320535540|ref|ZP_08035640.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Treponema phagedenis F0421]
gi|320147601|gb|EFW39117.1| polysaccharide biosynthesis protein CpsM(V) family protein
[Treponema phagedenis F0421]
Length = 262
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 59/126 (46%), Gaps = 12/126 (9%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
+SD R+ LY+YGG Y D D VI+ ++ L +AG E + AG+ K H
Sbjct: 69 VSDYARFDILYEYGGIYFDTDVEVIRPIDELVHKGAFAGIELPGRINAGLGLAAKHH--- 125
Query: 242 GAALRELRDNFKTTEW-GANGP----GVLTRLLKAECKPQ-SYAHNIISCRNFTIYPPRF 295
E+ ++++ + + NG V+TR+ K S + I + TIYP F
Sbjct: 126 DPIYAEILESYRCSNFIKPNGEFDLTTVVTRVTDILKKYGFSDENQIQTVAGITIYPIDF 185
Query: 296 FYPVHW 301
F P ++
Sbjct: 186 FCPKNY 191
>gi|290891081|ref|ZP_06554144.1| hypothetical protein AWRIB429_1534 [Oenococcus oeni AWRIB429]
gi|419758268|ref|ZP_14284585.1| glycosyltransferase [Oenococcus oeni AWRIB304]
gi|419856473|ref|ZP_14379194.1| glycosyltransferase [Oenococcus oeni AWRIB202]
gi|421184657|ref|ZP_15642073.1| glycosyltransferase [Oenococcus oeni AWRIB318]
gi|421194980|ref|ZP_15652192.1| glycosyltransferase [Oenococcus oeni AWRIB568]
gi|421196850|ref|ZP_15654031.1| glycosyltransferase [Oenococcus oeni AWRIB576]
gi|290479298|gb|EFD87959.1| hypothetical protein AWRIB429_1534 [Oenococcus oeni AWRIB429]
gi|399904890|gb|EJN92341.1| glycosyltransferase [Oenococcus oeni AWRIB304]
gi|399966259|gb|EJO00808.1| glycosyltransferase [Oenococcus oeni AWRIB318]
gi|399976169|gb|EJO10195.1| glycosyltransferase [Oenococcus oeni AWRIB576]
gi|399976764|gb|EJO10777.1| glycosyltransferase [Oenococcus oeni AWRIB568]
gi|410499518|gb|EKP90949.1| glycosyltransferase [Oenococcus oeni AWRIB202]
Length = 243
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 164 TPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAG 220
T + Y Q A+ T +W +SD +R L K GG YLD D +I S + L + G
Sbjct: 47 TRVKSKYVQQAVATKMWAF--VSDYVRLDILNKEGGIYLDTDVELIASFDDLLSHKAFIG 104
Query: 221 AESSSVVAAGVI 232
ES + VI
Sbjct: 105 RESQYAFSTAVI 116
>gi|363539843|ref|YP_004894574.1| mg523 gene product [Megavirus chiliensis]
gi|350611008|gb|AEQ32452.1| putative glycosyltransferase [Megavirus chiliensis]
Length = 254
Score = 38.5 bits (88), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY+ LY YGG YLD+D + K+LE Y
Sbjct: 77 AQKADFARYIILYTYGGIYLDMDMVCKKNLEPFLQY 112
>gi|448825492|ref|YP_007418423.1| putative glycosyltransferase [Megavirus lba]
gi|444236677|gb|AGD92447.1| putative glycosyltransferase [Megavirus lba]
Length = 254
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY+ LY YGG YLD+D + K+LE Y
Sbjct: 77 AQKADFARYIILYTYGGIYLDMDMVCKKNLEPFLQY 112
>gi|265750598|ref|ZP_06086661.1| glycosyltransferase [Bacteroides sp. 3_1_33FAA]
gi|263237494|gb|EEZ22944.1| glycosyltransferase [Bacteroides sp. 3_1_33FAA]
Length = 259
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESS 224
SD++R +YKYGG Y+D D +V KS L H + G E+S
Sbjct: 65 SDVIRLYVIYKYGGIYMDTDVMVYKSFNPLLENHAFIGRENS 106
>gi|425701419|gb|AFX92581.1| putative glycosyltransferase [Megavirus courdo11]
Length = 245
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY+ LY YGG YLD+D + K+LE Y
Sbjct: 68 AQKADFARYIILYTYGGIYLDMDMVCKKNLEPFLQY 103
>gi|365866564|ref|ZP_09406174.1| hypothetical protein SPW_6478 [Streptomyces sp. W007]
gi|364003986|gb|EHM25116.1| hypothetical protein SPW_6478 [Streptomyces sp. W007]
Length = 5697
Score = 38.5 bits (88), Expect = 4.7, Method: Composition-based stats.
Identities = 20/56 (35%), Positives = 27/56 (48%)
Query: 151 HIVQVDLGRYFQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDF 206
H+ D G +Q PL T L + SD+ RY L+K+GG YLD+D
Sbjct: 4281 HLFAKDKGPIWQKNPLSKAQTLYEAALGLESFAMASDVARYALLHKHGGVYLDVDL 4336
>gi|423251484|ref|ZP_17232497.1| hypothetical protein HMPREF1066_03507 [Bacteroides fragilis
CL03T00C08]
gi|423254807|ref|ZP_17235737.1| hypothetical protein HMPREF1067_02381 [Bacteroides fragilis
CL03T12C07]
gi|392650802|gb|EIY44469.1| hypothetical protein HMPREF1066_03507 [Bacteroides fragilis
CL03T00C08]
gi|392653373|gb|EIY47029.1| hypothetical protein HMPREF1067_02381 [Bacteroides fragilis
CL03T12C07]
Length = 243
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSS 225
YTQ+A + + +SD R+ LYKYGG Y D D VIKS++ + ++ G E +
Sbjct: 51 YTQEAYAAKKY--AFVSDYARFWILYKYGGVYFDTDVEVIKSMDDILVRGSFMGCEKNG 107
>gi|427384619|ref|ZP_18881124.1| hypothetical protein HMPREF9447_02157 [Bacteroides oleiciplenus YIT
12058]
gi|425727880|gb|EKU90739.1| hypothetical protein HMPREF9447_02157 [Bacteroides oleiciplenus YIT
12058]
Length = 325
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 69/160 (43%), Gaps = 13/160 (8%)
Query: 106 LRQACSIES----AAMMNPGVQVYVVVIASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYF 161
L+QA SI+S + + + +++ NP + L + + IV ++ +
Sbjct: 68 LKQAFSIKSFLCTQNLESTELWLWLDEANGYSQVESNPYLHELKD--KIKIVSWNVEKEI 125
Query: 162 QNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNY 218
TP F + + L+ D R ++LYKYGG Y DLD + +K L H +
Sbjct: 126 VGTP---FLKIKEYIHAAKNLAAKGDDFRIISLYKYGGLYFDLDVMFLKDFTPLLKGHEF 182
Query: 219 AGA-ESSSVVAAGVIHLDKDHWLSGAALRELRDNFKTTEW 257
A E + +++L K+ +++ ++L+ W
Sbjct: 183 VYAWEYQPYANSAILYLRKNSYITNYLAKKLQKRKAAMPW 222
>gi|371943848|gb|AEX61676.1| uncharacterized glycosyl transferase [Megavirus courdo7]
Length = 128
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+ +D RY+ LY YGG YLD+D + K+LE Y
Sbjct: 77 AQKADFARYIILYTYGGIYLDMDMVCKKNLEPFLQY 112
>gi|431797943|ref|YP_007224847.1| mannosyltransferase OCH1-like enzyme [Echinicola vietnamensis DSM
17526]
gi|430788708|gb|AGA78837.1| mannosyltransferase OCH1-like enzyme [Echinicola vietnamensis DSM
17526]
Length = 245
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 54/118 (45%), Gaps = 16/118 (13%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
+SD+ R L++ GG YLD D ++I SL S+ + G E+ + +I + H
Sbjct: 64 VSDVARLYALHQEGGIYLDTDMLLIDSLNSVLATDFFIGKENEKSLNGAIIGAIRGH--- 120
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
+ L+ L ++ ++ G + +L EC +++ R +Y FYPV
Sbjct: 121 -SYLKALIGVYQDNKFELGGKITIPMVLN-EC--------LLTNRELRVYSSEVFYPV 168
>gi|393246011|gb|EJD53520.1| hypothetical protein AURDEDRAFT_52940 [Auricularia delicata
TFB-10046 SS5]
Length = 525
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+D +RY LY YGG YLDLD IK L+ L Y
Sbjct: 128 ADAIRYFILYHYGGIYLDLDVGCIKRLDPLLAY 160
>gi|321465529|gb|EFX76530.1| hypothetical protein DAPPUDRAFT_322297 [Daphnia pulex]
Length = 345
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES 214
L+ W + H SD+ R T+ KYGG YLD D +V+++L+
Sbjct: 160 LSRDWRVYHGSDIARIRTMMKYGGIYLDNDVLVLQNLDK 198
>gi|229162566|ref|ZP_04290526.1| hypothetical protein bcere0009_33380 [Bacillus cereus R309803]
gi|228620829|gb|EEK77695.1| hypothetical protein bcere0009_33380 [Bacillus cereus R309803]
Length = 255
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 64/154 (41%), Gaps = 27/154 (17%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
+SD R LY +GG YLD D V KS + L H++ G E +A I K + L
Sbjct: 69 VSDYARVHVLYDFGGIYLDTDVEVFKSFDDLLHHHSFWGFEQEDYIATSTIGARKGNPL- 127
Query: 242 GAALRELRDNFKTTEWGANGP--------GVLTRLLKAECKPQSYAHNIISCRNFTIYPP 293
++ D++K ++ N ++T +LK Q+ + I YP
Sbjct: 128 ---IKVFLDSYKEKKFIKNNGAYDDLTNVAIVTEILKNLGLKQNGQYQEIEDIG-VFYPQ 183
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
+F P DY+N + ++YA+H
Sbjct: 184 TYFSPY------DYIN-----CRKFITENTYAMH 206
>gi|321465416|gb|EFX76417.1| hypothetical protein DAPPUDRAFT_322294 [Daphnia pulex]
Length = 345
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLES 214
L+ W + H SD+ R T+ KYGG YLD D +V+++L+
Sbjct: 160 LSRDWRVYHGSDIARIRTMMKYGGIYLDNDVLVLQNLDK 198
>gi|227877009|ref|ZP_03995101.1| glycosyltransferase [Lactobacillus crispatus JV-V01]
gi|256844680|ref|ZP_05550165.1| glycosyltransferase [Lactobacillus crispatus 125-2-CHN]
gi|256850564|ref|ZP_05555990.1| glycosyltransferase [Lactobacillus crispatus MV-1A-US]
gi|262047036|ref|ZP_06019995.1| glycosyltransferase [Lactobacillus crispatus MV-3A-US]
gi|293381893|ref|ZP_06627861.1| conserved hypothetical protein [Lactobacillus crispatus 214-1]
gi|423318091|ref|ZP_17295988.1| hypothetical protein HMPREF9250_01696 [Lactobacillus crispatus
FB049-03]
gi|423321434|ref|ZP_17299306.1| hypothetical protein HMPREF9249_01306 [Lactobacillus crispatus
FB077-07]
gi|227863380|gb|EEJ70807.1| glycosyltransferase [Lactobacillus crispatus JV-V01]
gi|256613221|gb|EEU18425.1| glycosyltransferase [Lactobacillus crispatus 125-2-CHN]
gi|256712587|gb|EEU27582.1| glycosyltransferase [Lactobacillus crispatus MV-1A-US]
gi|260572613|gb|EEX29174.1| glycosyltransferase [Lactobacillus crispatus MV-3A-US]
gi|290921540|gb|EFD98574.1| conserved hypothetical protein [Lactobacillus crispatus 214-1]
gi|405595532|gb|EKB68920.1| hypothetical protein HMPREF9249_01306 [Lactobacillus crispatus
FB077-07]
gi|405596815|gb|EKB70144.1| hypothetical protein HMPREF9250_01696 [Lactobacillus crispatus
FB049-03]
Length = 232
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+ L+SL N + G E+
Sbjct: 51 YIEQAYAAKKWAF--VSDYVRARAIYEQGGIYLDTDVRVVDKLDSLLNASAFIGFENHDY 108
Query: 227 VAAGVIHLDKDH 238
++A + +K H
Sbjct: 109 LSAAIFGAEKGH 120
>gi|409043924|gb|EKM53406.1| hypothetical protein PHACADRAFT_259765 [Phanerochaete carnosa
HHB-10118-sp]
Length = 757
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
+SD+ R+V +++GG YLD D I+++ E L + GA
Sbjct: 519 LSDMARFVLCHRFGGIYLDADTILLRDWEELWGWRGA 555
>gi|170292408|pdb|2VK9|A Chain A, Crystal Structure Of The Catalytic Domain Of Alpha-Toxin
From Clostridium Novyi
Length = 551
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 164 TPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFI 207
L +Y Q+ I T+ L+ SD+LR L KYGG Y DLDF+
Sbjct: 247 NKLKSYYYQELIQTN--NLAAASDILRIAILKKYGGVYCDLDFL 288
>gi|403416452|emb|CCM03152.1| predicted protein [Fibroporia radiculosa]
Length = 530
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
+D++RY LY YGG YLDLD ++ L++L Y
Sbjct: 132 ADVIRYFILYHYGGIYLDLDIGCLRPLDALLTY 164
>gi|346326894|gb|EGX96490.1| glycosyl transferase [Cordyceps militaris CM01]
Length = 384
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 8/92 (8%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL-----HNYAGAE--SSSVVAAGVIHL 234
+ H SD +R + +YGG YLD D ++ + +L H AG + ++ + +GV +
Sbjct: 149 MEHKSDFVRVQAMREYGGAYLDFDVHPLRDVRALRESGFHAVAGRQQGDNAEINSGVF-M 207
Query: 235 DKDHWLSGAALRELRDNFKTTEWGANGPGVLT 266
+K H E + T EW A+ G LT
Sbjct: 208 NKPHSQMIELWSEGMNEAFTGEWSAHSNGALT 239
>gi|110629049|gb|ABE60732.1| cytochrome P450 monooxygenase [Trichoderma virens]
Length = 512
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 5/71 (7%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIH-LDKDHWLS 241
HM L R + +YK G +LDF + SL G+E+++ + AG ++ L KD +
Sbjct: 283 EHMDFLQRMIEIYKAG----NLDFPTLSEHASLLTIGGSETTATLLAGAVYFLAKDPKVY 338
Query: 242 GAALRELRDNF 252
G +E+R F
Sbjct: 339 GKLTKEIRTKF 349
>gi|295692386|ref|YP_003600996.1| glycosyltransferase [Lactobacillus crispatus ST1]
gi|295030492|emb|CBL49971.1| Glycosyltransferase [Lactobacillus crispatus ST1]
Length = 232
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+ L+SL N + G E+
Sbjct: 51 YIEQAYAAKKWAF--VSDYVRARAIYEQGGIYLDTDVRVVDKLDSLLNDSAFIGFENHDY 108
Query: 227 VAAGVIHLDKDH 238
++A + +K H
Sbjct: 109 LSAAIFGAEKGH 120
>gi|321475360|gb|EFX86323.1| hypothetical protein DAPPUDRAFT_313306 [Daphnia pulex]
Length = 307
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 25/46 (54%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA 221
L W L H D+ R T+ KYGG YLD D VI++L+ + A
Sbjct: 119 LNPQWRLYHGGDIARIRTMMKYGGIYLDNDVYVIQNLDKYRKFEIA 164
>gi|302885910|ref|XP_003041846.1| hypothetical protein NECHADRAFT_53107 [Nectria haematococca mpVI
77-13-4]
gi|256722752|gb|EEU36133.1| hypothetical protein NECHADRAFT_53107 [Nectria haematococca mpVI
77-13-4]
Length = 399
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 10/65 (15%)
Query: 161 FQNTPLHGFYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAG 220
F+N L G + + H SDL+R+ L +YGG Y D+ F+ I L+ L N
Sbjct: 107 FKNGTLDGEFAK----------QHYSDLVRFPLLVRYGGVYTDVGFMQIGDLDRLWNETV 156
Query: 221 AESSS 225
A + S
Sbjct: 157 ANADS 161
>gi|67903998|ref|XP_682255.1| hypothetical protein AN8986.2 [Aspergillus nidulans FGSC A4]
gi|40745162|gb|EAA64318.1| hypothetical protein AN8986.2 [Aspergillus nidulans FGSC A4]
gi|259486564|tpe|CBF84514.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 417
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 8/69 (11%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFI---VIKSLESLHNYAGAESSSVVAAGV-IHLDKDHWLS 241
SDLLRY+ L+ YGG Y D+D IK +L G S+ ++ V I +D+ H
Sbjct: 172 SDLLRYLVLWYYGGYYADMDVYPAKAIKECPALQPVFGDSGSANISLAVGIEIDEPH--- 228
Query: 242 GAALRELRD 250
A+L+ +RD
Sbjct: 229 -ASLQLMRD 236
>gi|384249676|gb|EIE23157.1| hypothetical protein COCSUDRAFT_66191 [Coccomyxa subellipsoidea
C-169]
Length = 280
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 51/135 (37%), Gaps = 27/135 (20%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLES--------LHNYAGAESSSVVAAGVIHLDKD 237
+D +RY+ +Y GG Y DLDF I+SLE L E + + K
Sbjct: 107 ADSVRYLYMYHVGGVYADLDFESIRSLEDVLAGRQVVLAEMTEEEWDQKIPNAWLASTKG 166
Query: 238 H--WL---------SGAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCR 286
H WL +G+ D + E GP +L ++A K +
Sbjct: 167 HPFWLYCIQQIIKAAGSCAATNTDRWDWLE-ATTGPVMLFHAVEAYKK-------VKGLS 218
Query: 287 NFTIYPPRFFYPVHW 301
TI PP YP+ W
Sbjct: 219 GLTILPPGVIYPIDW 233
>gi|392379993|ref|YP_004987151.1| protein of unknown function [Azospirillum brasilense Sp245]
gi|356882360|emb|CCD03368.1| protein of unknown function [Azospirillum brasilense Sp245]
Length = 284
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 25/35 (71%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN 217
S SD++RY+ LYK+GG + DLD + ++S+ L N
Sbjct: 115 SFYSDIVRYILLYKHGGVWFDLDCLFLRSVTPLLN 149
>gi|336054653|ref|YP_004562940.1| glycosyltransferase [Lactobacillus kefiranofaciens ZW3]
gi|333958030|gb|AEG40838.1| Glycosyltransferase [Lactobacillus kefiranofaciens ZW3]
Length = 232
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D VI L L N + G E++
Sbjct: 51 YIEQAYKAKKWAF--VSDYIRARAIYEQGGIYLDTDVRVIADLTPLLNDRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A + +K+H
Sbjct: 109 LSAAIFGAEKEH 120
>gi|332297392|ref|YP_004439314.1| glycosyltransferase family protein [Treponema brennaborense DSM
12168]
gi|332180495|gb|AEE16183.1| glycosyltransferase sugar-binding region containing DXD motif
[Treponema brennaborense DSM 12168]
Length = 253
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 56/122 (45%), Gaps = 10/122 (8%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLE---SLHNYAGAESSSVVAAGVIHLDKDHWLSG 242
+D +R+ +LY YGG YLD D V+K+ + ++ ++ G E ++ A V+ ++
Sbjct: 65 ADYIRFWSLYNYGGIYLDSDVEVVKNFDFFLNVKSFIGFEYLNIPEAAVVGAEEGTDWVK 124
Query: 243 AALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISC---RNF---TIYPPRFF 296
L E G V+ RL+K + + Y I+ R F TIYP +F
Sbjct: 125 TCLDWYDGKSFFAENGEMKKDVVPRLVKRVLE-KKYNQKIVDTGKIREFEGLTIYPYFYF 183
Query: 297 YP 298
P
Sbjct: 184 SP 185
>gi|418960824|ref|ZP_13512711.1| capsular polysaccharide synthesis protein [Lactobacillus salivarius
SMXD51]
gi|380344491|gb|EIA32837.1| capsular polysaccharide synthesis protein [Lactobacillus salivarius
SMXD51]
Length = 317
Score = 38.1 bits (87), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 50/218 (22%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 123 QVYVVVIASVRNRTRNPLIDRLYE--YQNVHIVQVDLGRYFQNTPLHGFYTQDAILTSLW 180
++ + I S++N +N + L E Y+N + Y ++ G+ T+
Sbjct: 99 EIVKLCINSLKNNIKNRKVVVLNEKNYKNY----IKFPEYIEDKVKKGYITK-------- 146
Query: 181 PLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWL 240
+H+SDLLR L +YGGT++D L+ +G + + + ++ L
Sbjct: 147 --THLSDLLRIELLTRYGGTWVDSTV--------LYTGSGQDEEVLESKLFMYQILKPGL 196
Query: 241 SGAALRELRDNFKTTEWGANGPGVLTR-LLKAECKPQSYAHNIISCRNFTIYPPRFFYPV 299
G LR ++ + W N +LT+ LL K +Y ++ +F I YP
Sbjct: 197 DGHTLR--ISSWLMSSWSNNKILLLTKELLYEYWKENNYMYDYFLLHDF-IELSIELYPE 253
Query: 300 HWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVP 337
W + A Y + T + +F D+Y ++ S K+
Sbjct: 254 IWGNVAQYSSSTPHILQLKMF-DTYNERIYKSIIKQTS 290
>gi|400601511|gb|EJP69154.1| MIPC synthase [Beauveria bassiana ARSEF 2860]
Length = 332
Score = 38.1 bits (87), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 189 LRYVTLYKYGGTYLDLDFIVIKSLESLHNYA 219
+RY L YGG Y+DLDF + SLESL Y+
Sbjct: 138 IRYFILRHYGGIYIDLDFGCVNSLESLRPYS 168
>gi|358378095|gb|EHK15778.1| hypothetical protein TRIVIDRAFT_74289 [Trichoderma virens Gv29-8]
Length = 512
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 5/71 (7%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIH-LDKDHWLS 241
HM L R + +YK G +LDF + SL G+E+++ + AG ++ L KD +
Sbjct: 283 KHMDFLQRMIEIYKAG----NLDFPTLSEHASLLTIGGSETTATLLAGAVYFLAKDPKVY 338
Query: 242 GAALRELRDNF 252
G +E+R F
Sbjct: 339 GKLTKEIRTRF 349
>gi|423558760|ref|ZP_17535062.1| hypothetical protein II3_03964 [Bacillus cereus MC67]
gi|401190529|gb|EJQ97570.1| hypothetical protein II3_03964 [Bacillus cereus MC67]
Length = 247
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 23/166 (13%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y Q+A + + +SD +R LY +GG YLD D V KS + L ++ G E +
Sbjct: 57 YVQEAYKSG--KFAFVSDYVRVHALYNFGGIYLDTDVEVFKSFDVLLYHDSFWGFEQENY 114
Query: 227 VAAGVIHLDKDHWLSGAALRELRDNFKTTEWGA----NGPGVLTRLLK-AECKPQSYAHN 281
+A I K++ L L ++ E G+ ++T +LK K
Sbjct: 115 IATSTIGAAKENQLIKMFLDLYKEKNFINEDGSYDDLTNVAMITEILKDIGLKINGEYQE 174
Query: 282 IISCRNFTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALH 327
I F YP +F P DY+N + +++YA+H
Sbjct: 175 IEGIGAF--YPQTYFSPY------DYIN-----CRKFITKNTYAIH 207
>gi|440802705|gb|ELR23634.1| glycosyltransferase sugar-binding region containing DXD motif
protein [Acanthamoeba castellanii str. Neff]
Length = 855
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 64/164 (39%), Gaps = 35/164 (21%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAGVIHLDKDHWLSG 242
+H SD++R L + GG YLD D +V++S + L + S+ G + DK +
Sbjct: 509 AHKSDIIRLQVLMREGGIYLDTDALVLRSFDPLRTR---NAVSLAKDGAVPDDKIPLIGS 565
Query: 243 AAL-------------RELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFT 289
L E R F ++W + V +L AE P
Sbjct: 566 GVLVAPPNASFLQRWWAEFR-TFDNSKWNVHSCKVSRQL--AESHPDEA----------N 612
Query: 290 IYPPRFFYPVHWE--HWADYLNETNAPATMSLFRDSYALHVWNS 331
+ P FYP W+ H A + + DSYA+H +NS
Sbjct: 613 LLPHTAFYPRSWQPPHLAIAYKADDCRSEA----DSYAVHRYNS 652
>gi|443703969|gb|ELU01262.1| hypothetical protein CAPTEDRAFT_224311 [Capitella teleta]
Length = 522
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVAAG-----VIHLDKD 237
S SD+ R +YK GG Y+D D + +K ++ L Y S ++A +L
Sbjct: 337 SLQSDIFRGEIIYKEGGIYMDSDALFVKPIDDLRGYDAVASLDIMAEKGKFPEYFNLGVT 396
Query: 238 HWLSGAA----LRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHN 281
+GA+ E F++ +G NG + +L E P S HN
Sbjct: 397 MGKAGASFWKNFNEANHKFRSDLYGYNGLALPYKLW--EQNPTSIYHN 442
>gi|308809323|ref|XP_003081971.1| unnamed protein product [Ostreococcus tauri]
gi|116060438|emb|CAL55774.1| unnamed protein product [Ostreococcus tauri]
Length = 366
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%)
Query: 65 CVLSGVEKDFAFDDITRLDVPDNSIFFLETSCTHADGVELTLRQACSIESA 115
CV S VE D + +D+ R+D D F E+ HADG + LR A ++E A
Sbjct: 63 CVESNVEPDGSSNDVFRVDKSDFRRFVRESDEEHADGEVVELRMAKTVEHA 113
>gi|42519600|ref|NP_965530.1| hypothetical protein LJ1725 [Lactobacillus johnsonii NCC 533]
gi|41583889|gb|AAS09496.1| hypothetical protein LJ_1725 [Lactobacillus johnsonii NCC 533]
Length = 233
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+K L+ L N + G E++
Sbjct: 51 YIEQAYEAKKWAF--VSDYIRARVIYEQGGIYLDTDVRVLKKLDPLLNNRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A + + +H
Sbjct: 109 LSAAIFGAEMNH 120
>gi|320168516|gb|EFW45415.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 499
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLH 216
+H SDL+R + +Y+ GG YLDLD ++++ L L+
Sbjct: 248 THTSDLMRTIFMYQCGGIYLDLDSVLLRPLHFLN 281
>gi|198275344|ref|ZP_03207875.1| hypothetical protein BACPLE_01505 [Bacteroides plebeius DSM 17135]
gi|198271927|gb|EDY96197.1| hypothetical protein BACPLE_01505 [Bacteroides plebeius DSM 17135]
Length = 241
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGV 231
+D +R +Y YGG Y+D+D VIKSL L N G E+ + AG+
Sbjct: 76 ADYIRLYAVYNYGGIYMDMDVEVIKSLNPLLNNDYLLGYENQIGIEAGI 124
>gi|441432378|ref|YP_007354420.1| glycosyltransferase [Acanthamoeba polyphaga moumouvirus]
gi|440383458|gb|AGC01984.1| glycosyltransferase [Acanthamoeba polyphaga moumouvirus]
Length = 249
Score = 37.7 bits (86), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 22/37 (59%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
++ +D RY LYKYGG YLD+D + K+L Y
Sbjct: 67 MAQKADFARYAILYKYGGIYLDMDMVCRKNLAPFLQY 103
>gi|268319981|ref|YP_003293637.1| putative glycosyltransferase [Lactobacillus johnsonii FI9785]
gi|262398356|emb|CAX67370.1| putative glycosyltransferase [Lactobacillus johnsonii FI9785]
Length = 233
Score = 37.7 bits (86), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+K L+ L N + G E++
Sbjct: 51 YIEQAYEAKKWAF--VSDYIRARVIYEQGGIYLDTDVRVLKKLDPLLNNRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A + + +H
Sbjct: 109 LSAAIFGAEMNH 120
>gi|332880633|ref|ZP_08448307.1| hypothetical protein HMPREF9074_04082 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045997|ref|ZP_09107627.1| hypothetical protein HMPREF9441_01643 [Paraprevotella clara YIT
11840]
gi|332681621|gb|EGJ54544.1| hypothetical protein HMPREF9074_04082 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531003|gb|EHH00406.1| hypothetical protein HMPREF9441_01643 [Paraprevotella clara YIT
11840]
Length = 235
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 6/73 (8%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKS----LESLHNYAGAESSS 225
+T++A + +SD+ R LY GG YLD D I++K+ L S +AG E
Sbjct: 51 FTREAYFAKKYAF--VSDVCRLYALYSEGGIYLDTDVIILKTFPDKLRSHKAFAGFEHEI 108
Query: 226 VVAAGVIHLDKDH 238
+ G++ +K++
Sbjct: 109 NIGTGIMGSEKNN 121
>gi|321471476|gb|EFX82449.1| hypothetical protein DAPPUDRAFT_302526 [Daphnia pulex]
Length = 339
Score = 37.7 bits (86), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 36/89 (40%), Gaps = 3/89 (3%)
Query: 176 LTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGA---ESSSVVAAGVI 232
L+ + H SD+ R L +YGG YLD D VI SL+ + + + V
Sbjct: 154 LSEDFRFYHGSDIARIRVLMQYGGIYLDNDCFVIHSLDKYRKFECVVNWDEGQFLGNQVF 213
Query: 233 HLDKDHWLSGAALRELRDNFKTTEWGANG 261
K+ L +DN+ W NG
Sbjct: 214 IAHKNSRFLALYLESYKDNYYPDRWYYNG 242
>gi|224024991|ref|ZP_03643357.1| hypothetical protein BACCOPRO_01723 [Bacteroides coprophilus DSM
18228]
gi|224018227|gb|EEF76225.1| hypothetical protein BACCOPRO_01723 [Bacteroides coprophilus DSM
18228]
Length = 236
Score = 37.7 bits (86), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 49/116 (42%), Gaps = 16/116 (13%)
Query: 186 SDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDH-WLS 241
+D +R LY YGG YLD D VIKS L + G E+S + A ++ +K WL
Sbjct: 65 ADYIRCYALYHYGGIYLDADVEVIKSFTPLLSEPILIGEEASGDIEAAILGAEKGATWLK 124
Query: 242 GAALRELRDNFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFY 297
E+ N P + + + KP N I + F YP + FY
Sbjct: 125 AC-----------LEYYENRPFIKSNGT-FDTKPIPLLLNQIKQKYFPNYPIKPFY 168
>gi|210622226|ref|ZP_03293016.1| hypothetical protein CLOHIR_00963 [Clostridium hiranonis DSM 13275]
gi|210154360|gb|EEA85366.1| hypothetical protein CLOHIR_00963 [Clostridium hiranonis DSM 13275]
Length = 244
Score = 37.7 bits (86), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 44/208 (21%), Positives = 86/208 (41%), Gaps = 29/208 (13%)
Query: 130 ASVRNRTRNPLIDRLYEYQNVHIVQVDLGRYFQNTPLHGF-YTQDAILTSLWPLSHMSDL 188
+V N N D+L +Y+ I++ + + + G + ++ LW ++D
Sbjct: 16 GNVENICINIWKDKLPDYE---IIEWNEKNFDIEKEIKGKKFLEECYKQKLWAF--IADY 70
Query: 189 LRYVTLYKYGGTYLDLDFIVIKSLESLHN----YAGAESSSVVAAGVIHLDKDHWLSGAA 244
+R LY+ GG Y+D D V+K L L + G ES +I + H
Sbjct: 71 IRLKVLYEQGGLYMDTDIQVLKDLTPLMEENRLFFGYESKEYANGAIIGAEPHH----PF 126
Query: 245 LRELRDNFKTTEWGAN---GPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFFYPVHW 301
++++ + + +N P ++T +L K + + N S +Y +FYP ++
Sbjct: 127 IKDMLEFYNEDVMNSNLYTIPKIMTYILNE--KYEKFDRNNFS-EGIRVYDEEYFYPFNY 183
Query: 302 EHWADYLNETNAPATMSLFRDSYALHVW 329
NE ++ ++Y +H W
Sbjct: 184 -------NEEYTDDCIT--ENTYVIHWW 202
>gi|443715965|gb|ELU07691.1| hypothetical protein CAPTEDRAFT_189062 [Capitella teleta]
Length = 339
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYA---GAES-----SSVVAAGVIH 233
+ H SD+LR T++ GG Y+D D +++KS L ++ G ES SS++ A
Sbjct: 175 VEHSSDILRIHTVFDNGGIYIDTDVLILKSFYPLRQFSMTLGRESHYGLGSSIIIAKPRA 234
Query: 234 LDKDHWLSG 242
HWLS
Sbjct: 235 PFLCHWLSA 243
>gi|371944927|gb|AEX62748.1| glycosyl transferase [Moumouvirus Monve]
Length = 249
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 22/37 (59%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
++ +D RY LYKYGG YLD+D + K+L Y
Sbjct: 67 MAQKADFARYAILYKYGGIYLDMDMVCRKNLAPFLQY 103
>gi|423670817|ref|ZP_17645846.1| hypothetical protein IKO_04514 [Bacillus cereus VDM034]
gi|423672956|ref|ZP_17647895.1| hypothetical protein IKS_00499 [Bacillus cereus VDM062]
gi|401295104|gb|EJS00729.1| hypothetical protein IKO_04514 [Bacillus cereus VDM034]
gi|401311056|gb|EJS16364.1| hypothetical protein IKS_00499 [Bacillus cereus VDM062]
Length = 246
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 56/135 (41%), Gaps = 14/135 (10%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDH 238
+ +SD +R LYK+GG YLD D V KS + + ++ G E + +A I K +
Sbjct: 66 FAFVSDYVRVYALYKFGGIYLDTDVEVFKSFDDMLHHDSFWGFEQENYIATSTIGATKGN 125
Query: 239 WLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPR 294
L L + E G +++ +LK + + + IS YP
Sbjct: 126 VLIRMFLDSYEEKSFIKEDGKYDSLTNVAIISEILKNKGLQLNGKYQEISGIG-AFYPQT 184
Query: 295 FFYPVHWEHWADYLN 309
+F P DY+N
Sbjct: 185 YFSPY------DYIN 193
>gi|385826403|ref|YP_005862745.1| hypothetical protein [Lactobacillus johnsonii DPC 6026]
gi|329667847|gb|AEB93795.1| hypothetical protein LJP_1475c [Lactobacillus johnsonii DPC 6026]
Length = 233
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+K L+ L N + G E++
Sbjct: 51 YIEQAYEAKKWAF--VSDYIRVRVIYEQGGIYLDTDVRVLKKLDPLLNNRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A + + +H
Sbjct: 109 LSAAIFGAEMNH 120
>gi|423388437|ref|ZP_17365663.1| hypothetical protein ICG_00285 [Bacillus cereus BAG1X1-3]
gi|401643138|gb|EJS60840.1| hypothetical protein ICG_00285 [Bacillus cereus BAG1X1-3]
Length = 246
Score = 37.7 bits (86), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 56/135 (41%), Gaps = 14/135 (10%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDH 238
+ +SD +R LYK+GG YLD D V KS + + ++ G E + +A I K +
Sbjct: 66 FAFVSDYVRVYALYKFGGIYLDTDVEVFKSFDDMLHHDSFWGFEQENYIATSTIGATKGN 125
Query: 239 WLSGAALRELRDNFKTTEWGA----NGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPR 294
L L + E G +++ +LK + + + IS YP
Sbjct: 126 VLIRMFLDSYEEKSFIKEDGKYDSLTNVAIISEILKNKGLQLNGKYQEISGIG-AFYPQT 184
Query: 295 FFYPVHWEHWADYLN 309
+F P DY+N
Sbjct: 185 YFSPY------DYIN 193
>gi|423332257|ref|ZP_17310041.1| hypothetical protein HMPREF1075_02054 [Parabacteroides distasonis
CL03T12C09]
gi|409229380|gb|EKN22257.1| hypothetical protein HMPREF1075_02054 [Parabacteroides distasonis
CL03T12C09]
Length = 261
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 76/179 (42%), Gaps = 14/179 (7%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY---AGAESSSV 226
Y ++A W + ++D++R + GG Y+D D V++ L+ L Y +G E+ +
Sbjct: 51 YVREAYEAKKW--AFITDVVRLYAMVTEGGIYMDTDVEVLRPLDDLLVYEAVSGFETKTR 108
Query: 227 VAAGVIHLDKDHWLSGAALRELRD-NFKTTEWGANGPGVLTRLLKAECKPQSYAHNIISC 285
+ G++ + L L E +FK + + +TR+ A K +N
Sbjct: 109 IPTGLMACREGQPLFEELLHEYDGLHFKRPDGSLDVTTNVTRITNACLKYGFVPNNTKQT 168
Query: 286 RN-FTIYPPRFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVKLGSE 343
N FT+ P + P +E YL E + T+ F S W S R +LG +
Sbjct: 169 VNGFTLLPQDYLCPKSYEDGKIYLTENS--MTIHHFAGS-----WVSDVDRYAQELGKK 220
>gi|417838086|ref|ZP_12484324.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus johnsonii pf01]
gi|338761629|gb|EGP12898.1| putative mannosyltransferase involved in polysaccharide
biosynthesis [Lactobacillus johnsonii pf01]
Length = 233
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSV 226
Y + A W +SD +R +Y+ GG YLD D V+K L+ L N + G E++
Sbjct: 51 YIEQAYEAKKWAF--VSDYIRARVIYEQGGIYLDTDVRVLKKLDPLLNNRAFIGFENNDY 108
Query: 227 VAAGVIHLDKDH 238
++A + + +H
Sbjct: 109 LSAAIFGAEMNH 120
>gi|255658803|ref|ZP_05404212.1| amylopullulanase [Mitsuokella multacida DSM 20544]
gi|260849201|gb|EEX69208.1| amylopullulanase [Mitsuokella multacida DSM 20544]
Length = 1160
Score = 37.7 bits (86), Expect = 9.1, Method: Composition-based stats.
Identities = 20/55 (36%), Positives = 25/55 (45%), Gaps = 8/55 (14%)
Query: 294 RFFYPVHWEHWADYLNETNAPATMSLFRDSYALHVWNSFTKRVPVK----LGSEQ 344
+ F P HWE+W + NE T + YA H W + VP K GSEQ
Sbjct: 798 QVFSPWHWENWFEIKNEK----TQDMMGKKYAYHDWQGYDSLVPFKDADYPGSEQ 848
>gi|374605213|ref|ZP_09678151.1| glycosyltransferase sugar-binding region containing dxd motif
[Paenibacillus dendritiformis C454]
gi|374389192|gb|EHQ60576.1| glycosyltransferase sugar-binding region containing dxd motif
[Paenibacillus dendritiformis C454]
Length = 251
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 59/151 (39%), Gaps = 20/151 (13%)
Query: 185 MSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSVVAAGVIHLDKDHWLS 241
+SD +R LY GG Y+D D VIK L ++G E + +G + K H
Sbjct: 69 VSDYVRLHALYHEGGVYMDTDVEVIKPLHRFLVHEAFSGFEDHQFLQSGTMGAVKHHPWI 128
Query: 242 GAALRELRDNFKTTEWG-----ANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
G L+ D + G +++R+ + + H + YP FF
Sbjct: 129 GELLQYYADRPFLLQGGDVFDLTTNTAIMSRISQKHGLKLNGQHQTLPG-GVVFYPRWFF 187
Query: 297 YPVHWEHWADYLNETNAPATMSLFRDSYALH 327
P + + +Y+N+ DSY +H
Sbjct: 188 SPYDYINGGNYIND-----------DSYTIH 207
>gi|13876779|gb|AAK43611.1|AF355776_11 putative glycosyltransferase CpsIVN [Streptococcus agalactiae]
gi|406718016|emb|CCG97595.1| alphA-1-4 glucosyltransferase [Streptococcus agalactiae]
gi|406718099|emb|CCG97677.1| alphA-1-4 glucosyltransferase [Streptococcus agalactiae]
Length = 245
Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 11/124 (8%)
Query: 183 SHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHN---YAGAESSSVVAAGVIHLDKDHW 239
+ +SD R +Y+YGG YLD D +IK ++ L N Y E V G L
Sbjct: 67 AFVSDYARLDIIYQYGGIYLDTDVELIKPIDDLLNCESYFACELPGEVNTG---LGFGSI 123
Query: 240 LSGAALRELRDNFKTTEW---GANGPGVLTRLLKAECKPQSYAHNIISCRNFTIYPPRFF 296
++E ++ TE+ + T LL + Y + I IYPP +F
Sbjct: 124 SRNPFIKENMKIYEDTEYYSLKKTCVAITTDLLTEQ--GLKYINKIQVINGILIYPPSYF 181
Query: 297 YPVH 300
PV+
Sbjct: 182 CPVN 185
>gi|262047476|ref|ZP_06020432.1| conserved hypothetical protein [Lactobacillus crispatus MV-3A-US]
gi|260572246|gb|EEX28810.1| conserved hypothetical protein [Lactobacillus crispatus MV-3A-US]
Length = 275
Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 66/150 (44%), Gaps = 26/150 (17%)
Query: 169 FYTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNYAGAESSSVVA 228
Y ++A W + +SD R L KYGG YLD D +IK L+ + E S +A
Sbjct: 53 LYVKEAYKAKKW--AFVSDYARLYVLNKYGGVYLDTDVKLIKGLDKI-----IEKGSFLA 105
Query: 229 AGVIHLDKDHWLSGAA-------LRELRDNFKTTEW-----GANGPGVLT----RLLKAE 272
++ K + G A ++++ + +K + N ++T RL+K
Sbjct: 106 CESLYPIKVNPGLGMATYSNNDLIKDILEEYKAESFILKNNQYNKKTIVTRFTGRLIKDG 165
Query: 273 CKPQSYAHNIISCRNFTIYPPRFFYPVHWE 302
+ + +I F IYP +F P+++E
Sbjct: 166 LQDVNKVQDI---DGFNIYPVEYFCPLNFE 192
>gi|451927417|gb|AGF85295.1| hypothetical protein glt_00486 [Moumouvirus goulette]
Length = 249
Score = 37.4 bits (85), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 22/37 (59%)
Query: 182 LSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESLHNY 218
++ +D RY LYKYGG YLD+D + K+L Y
Sbjct: 67 MAQKADFARYAILYKYGGIYLDMDMVCRKNLTPFLQY 103
>gi|397613671|gb|EJK62357.1| hypothetical protein THAOC_17032 [Thalassiosira oceanica]
Length = 333
Score = 37.4 bits (85), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 184 HMSDLLRYVTLYKYGGTYLDLDFIVIK 210
H +D LRY LYK+GG YLD+D I+
Sbjct: 54 HKADFLRYCLLYKFGGYYLDMDMFPIR 80
>gi|257893072|ref|ZP_05672725.1| glycosyltransferase [Enterococcus faecium 1,231,408]
gi|424785536|ref|ZP_18212338.1| hypothetical protein HMPREF1383_00541 [Enterococcus faecium V689]
gi|424952892|ref|ZP_18367891.1| hypothetical protein HMPREF1377_00523 [Enterococcus faecium R494]
gi|425037435|ref|ZP_18442104.1| hypothetical protein HMPREF1353_00151 [Enterococcus faecium 513]
gi|425060714|ref|ZP_18463995.1| hypothetical protein HMPREF1346_01105 [Enterococcus faecium 503]
gi|430825765|ref|ZP_19443966.1| hypothetical protein OGC_03333 [Enterococcus faecium E0164]
gi|257829451|gb|EEV56058.1| glycosyltransferase [Enterococcus faecium 1,231,408]
gi|402923533|gb|EJX43819.1| hypothetical protein HMPREF1383_00541 [Enterococcus faecium V689]
gi|402940399|gb|EJX59230.1| hypothetical protein HMPREF1377_00523 [Enterococcus faecium R494]
gi|403022125|gb|EJY34529.1| hypothetical protein HMPREF1353_00151 [Enterococcus faecium 513]
gi|403042298|gb|EJY53259.1| hypothetical protein HMPREF1346_01105 [Enterococcus faecium 503]
gi|430445829|gb|ELA55546.1| hypothetical protein OGC_03333 [Enterococcus faecium E0164]
Length = 239
Score = 37.4 bits (85), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 31/65 (47%), Gaps = 5/65 (7%)
Query: 170 YTQDAILTSLWPLSHMSDLLRYVTLYKYGGTYLDLDFIVIKSLESL---HNYAGAESSSV 226
Y DA W + D R+ +Y +GG YLD D +IKSL+ L H Y G E
Sbjct: 51 YMADAYKERKWGF--VPDYARFDIIYNHGGFYLDTDVELIKSLDILRENHAYMGFEGEIW 108
Query: 227 VAAGV 231
V G+
Sbjct: 109 VNGGI 113
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.139 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,869,530,171
Number of Sequences: 23463169
Number of extensions: 249293917
Number of successful extensions: 588456
Number of sequences better than 100.0: 662
Number of HSP's better than 100.0 without gapping: 390
Number of HSP's successfully gapped in prelim test: 272
Number of HSP's that attempted gapping in prelim test: 586953
Number of HSP's gapped (non-prelim): 704
length of query: 356
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 213
effective length of database: 9,003,962,200
effective search space: 1917843948600
effective search space used: 1917843948600
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)