BLASTP 2.2.22 [Sep-27-2009]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= gi|254780201|ref|YP_003064614.1| hypothetical protein
CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62]
(394 letters)
Database: nr
13,984,884 sequences; 4,792,584,752 total letters
Searching..................................................done
>gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter
asiaticus str. psy62]
gi|254039878|gb|ACT56674.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter
asiaticus str. psy62]
Length = 394
Score = 434 bits (1116), Expect = e-119, Method: Composition-based stats.
Identities = 394/394 (100%), Positives = 394/394 (100%)
Query: 1 MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60
MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF
Sbjct: 1 MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60
Query: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS 120
QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS
Sbjct: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS 120
Query: 121 EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF 180
EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF
Sbjct: 121 EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF 180
Query: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ
Sbjct: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300
REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR
Sbjct: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300
Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360
SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA
Sbjct: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360
Query: 361 LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH 394
LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH
Sbjct: 361 LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH 394
>gi|315122628|ref|YP_004063117.1| hypothetical protein CKC_04400 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496030|gb|ADR52629.1| hypothetical protein CKC_04400 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 399
Score = 368 bits (946), Expect = e-100, Method: Composition-based stats.
Identities = 287/390 (73%), Positives = 335/390 (85%)
Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62
K+FRLK K +E L+ RLDVE KG++ +YIPA++SGYY+LWS S +Q+ITS+DV F+E
Sbjct: 8 KIFRLKIKSETLEKLVFRLDVENKGSVNTLYIPANISGYYMLWSLSKEQKITSEDVFFEE 67
Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEK 122
++ F++ +FWLRSFL FSKYS+LSFPSCRIFFYGSRK++KAF RLNRFMSNSRMPFD +K
Sbjct: 68 VTTFKACLFWLRSFLTFSKYSQLSFPSCRIFFYGSRKDKKAFFRLNRFMSNSRMPFDGKK 127
Query: 123 FLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVT 182
FLY+KELFEGW + S K + I SKIAIVVHCYYQDTW EISH+LLRLNFDFDLF+T
Sbjct: 128 FLYIKELFEGWKNLSSLDNKGKIKINSKIAIVVHCYYQDTWDEISHLLLRLNFDFDLFIT 187
Query: 183 VVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
V+ NKDFEQDVLK FPSA+LYVMENKGRDV PFL LLELGVF YDYLCKIHGKKS R
Sbjct: 188 TVKKNKDFEQDVLKNFPSARLYVMENKGRDVLPFLCLLELGVFYDYDYLCKIHGKKSARR 247
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE 302
YHP EGI+WRRW+FFDLLGFSDIA+RIIN FEQNP +GMIGS R+RRYK++SFF KRS+
Sbjct: 248 NYHPFEGILWRRWIFFDLLGFSDIALRIINKFEQNPSIGMIGSGRFRRYKKYSFFKKRSK 307
Query: 303 VYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALE 362
VY+RV+DLA+R FP + L LDFFNGTMFWV+PKCLEPLRN+HL GEFEEE NL+DGALE
Sbjct: 308 VYKRVVDLARRIDFPVEELDLDFFNGTMFWVRPKCLEPLRNIHLTGEFEEECNLEDGALE 367
Query: 363 HAVERFFACSVRYTEFSIESVDCVAEYERL 392
HAVERFF SV+ FS+ESVDCVAEY++L
Sbjct: 368 HAVERFFPLSVQRAGFSLESVDCVAEYDQL 397
>gi|254780923|ref|YP_003065336.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter
asiaticus str. psy62]
gi|254040600|gb|ACT57396.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter
asiaticus str. psy62]
Length = 365
Score = 351 bits (902), Expect = 9e-95, Method: Composition-based stats.
Identities = 141/327 (43%), Positives = 198/327 (60%), Gaps = 4/327 (1%)
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
F FW + L + + KL + + YGSR +K F + N +M + FD ++ + +
Sbjct: 38 FFFWFWT-LFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQ 96
Query: 129 LFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188
L GW + P+ K + IK+KIAIVVH YY D WIEI+++L L+ FDL VT+V +
Sbjct: 97 LLHGW-ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155
Query: 189 DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIE 248
+ ++LK FP+A++++MEN GRDV PFL LLE YDY+CKIHGKKS+R+GY E
Sbjct: 156 SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE 215
Query: 249 GIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV- 307
G +WRRWLF+DLLG + +II TF+ + +GMIGSR YR ++ + R +
Sbjct: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275
Query: 308 IDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEE-RNLKDGALEHAVE 366
LA R G + LDFF GTMFWV+ + L+P++NL L FE + DG +EHAVE
Sbjct: 276 CTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335
Query: 367 RFFACSVRYTEFSIESVDCVAEYERLL 393
R F+ SV+ F I VDC+ Y + L
Sbjct: 336 RCFSLSVKKANFRISDVDCILGYRKSL 362
>gi|77747764|ref|NP_636021.2| hypothetical protein XCC0629 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|77761299|ref|YP_244667.2| hypothetical protein XC_3605 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 546
Score = 337 bits (865), Expect = 2e-90, Method: Composition-based stats.
Identities = 73/378 (19%), Positives = 139/378 (36%), Gaps = 35/378 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ P G W P++ + + ++
Sbjct: 187 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 237
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ + R+ F + E + A L + + + + + ++
Sbjct: 238 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 296
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
PS+ +V+H +Y D E+ ++ + +T + + +
Sbjct: 297 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 344
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR
Sbjct: 345 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 400
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LLG I+N F +P G+ + + L R G
Sbjct: 401 MLTALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 455
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
+ F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V
Sbjct: 456 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 515
Query: 375 YTEFSIESVDCVAEYERL 392
++ + +V+ +
Sbjct: 516 HSGHRVTTVEQTLGITKT 533
>gi|188993121|ref|YP_001905131.1| conserved protein involved in carbohydrate biosynthesis
[Xanthomonas campestris pv. campestris str. B100]
gi|189030067|sp|B0RVK2|WXCX_XANCB RecName: Full=Uncharacterized protein wxcX
gi|167734881|emb|CAP53093.1| conserved protein involved in carbohydrate biosynthesis
[Xanthomonas campestris pv. campestris]
Length = 695
Score = 335 bits (860), Expect = 6e-90, Method: Composition-based stats.
Identities = 73/378 (19%), Positives = 140/378 (37%), Gaps = 35/378 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ P G W P++ + + ++
Sbjct: 336 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ + R+ F + E + A L + + + + + ++
Sbjct: 387 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 445
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
PS+ +V+H +Y D E+ ++ + +T + + +
Sbjct: 446 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR
Sbjct: 494 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LLG I+N F +P +G+ + + L R G
Sbjct: 550 MLTALLG-PQRVDAIVNAFSTDPLVGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
+ F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V
Sbjct: 605 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 664
Query: 375 YTEFSIESVDCVAEYERL 392
++ + +V+ +
Sbjct: 665 HSGHRVTTVEQTLGITKT 682
>gi|122879048|ref|YP_199439.6| hypothetical protein XOO0800 [Xanthomonas oryzae pv. oryzae
KACC10331]
Length = 546
Score = 335 bits (860), Expect = 7e-90, Method: Composition-based stats.
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++ L
Sbjct: 187 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 237
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
K + P+ R+ F + E + A L + + + + + L+ E+
Sbjct: 238 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 297
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ +V+H +Y D E L L VT Q +
Sbjct: 298 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 344
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 345 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 400
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LL I+ F ++P LG++ ++ + L R G
Sbjct: 401 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 455
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V
Sbjct: 456 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 515
Query: 375 YTEFSIESVDCVAE 388
++ + +++ +
Sbjct: 516 HSGQRVATIEQLLG 529
>gi|189030068|sp|P0C7J1|WXCX_XANCP RecName: Full=Uncharacterized protein wxcX
Length = 695
Score = 334 bits (858), Expect = 1e-89, Method: Composition-based stats.
Identities = 73/378 (19%), Positives = 139/378 (36%), Gaps = 35/378 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ P G W P++ + + ++
Sbjct: 336 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ + R+ F + E + A L + + + + + ++
Sbjct: 387 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 445
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
PS+ +V+H +Y D E+ ++ + +T + + +
Sbjct: 446 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR
Sbjct: 494 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LLG I+N F +P G+ + + L R G
Sbjct: 550 MLTALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
+ F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V
Sbjct: 605 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 664
Query: 375 YTEFSIESVDCVAEYERL 392
++ + +V+ +
Sbjct: 665 HSGHRVTTVEQTLGITKT 682
>gi|295687882|ref|YP_003591575.1| rhamnan synthesis protein F [Caulobacter segnis ATCC 21756]
gi|295429785|gb|ADG08957.1| Rhamnan synthesis F [Caulobacter segnis ATCC 21756]
Length = 818
Score = 334 bits (856), Expect = 2e-89, Method: Composition-based stats.
Identities = 89/382 (23%), Positives = 146/382 (38%), Gaps = 32/382 (8%)
Query: 8 KSKLGKIENL--LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
++ GK+ + ++R + E + A Y+P + G W ++ H +
Sbjct: 454 ENFTGKVYDYPAVVRHKLSELSRVDAAYVPGVMPG----WDNQARKPWAGHAFHNADP-- 507
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
ES++ WL L + + F + E + A+L +R+ + +
Sbjct: 508 -ESYLTWLSGAL--THAVARHPKGEAMVFVNAWNEWGEGAYLEPDRWFGHGYLHATRAAL 564
Query: 124 -LYVKELFEGWNDRPSSPKKSGLTIKSKIAI-VVHCYYQDTWIEISHILLRLNFDFDLFV 181
Y L + P + +K A+ ++H +Y + + L DL +
Sbjct: 565 SAYQPRLTDA---HPLVAQAQAAFVKRADAVTLLHLFYPELIDWFAERLAATADVLDLMI 621
Query: 182 TVVEANKDF-EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
TV E + FP A L + EN+GRD+RPF+ L Y CK+H K+S
Sbjct: 622 TVPETWSEADLARARATFPMAHLAIAENRGRDIRPFVETLRRARTLGYSVFCKLHSKRSP 681
Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRR--YRRYKRWSFFA 298
H +G WR L LLG A+ + Q+ LG++ + R
Sbjct: 682 ----HRAKGDEWRAELVDGLLGGEAAALALRAF-AQDAKLGLLAAAGSRLRIGDPDVMNN 736
Query: 299 KRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLK 357
R + R LA+R G F G+MFW + + PL +L +F E
Sbjct: 737 NRQDADR----LARRMGLKLAPET-PFSAGSMFWGRTEAFAPLSDLTDAEIDFGPELGRV 791
Query: 358 DGALEHAVERFFACSVRYTEFS 379
DG HA+ER A V +
Sbjct: 792 DGTTAHAIERLTAAIVARAGYR 813
>gi|166713445|ref|ZP_02244652.1| hypothetical protein Xoryp_18900 [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 695
Score = 333 bits (855), Expect = 2e-89, Method: Composition-based stats.
Identities = 84/377 (22%), Positives = 145/377 (38%), Gaps = 35/377 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++ L
Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ILT 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
K + P+ R+ F + E + A L + + + + + L+ E+
Sbjct: 387 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 446
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ +V+H +Y D E L L VT Q +
Sbjct: 447 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 494 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LL I+ F ++P LG++ ++ + L R G
Sbjct: 550 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFIG----GNADALDYLTVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V
Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664
Query: 375 YTEFSIESVDCVAEYER 391
++ + +++ + +
Sbjct: 665 HSGQRVATIEQLLGIPK 681
>gi|84622385|ref|YP_449757.1| hypothetical protein XOO_0728 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188578640|ref|YP_001915569.1| hypothetical protein PXO_03177 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|84366325|dbj|BAE67483.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188523092|gb|ACD61037.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 695
Score = 333 bits (854), Expect = 3e-89, Method: Composition-based stats.
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++ L
Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
K + P+ R+ F + E + A L + + + + + L+ E+
Sbjct: 387 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 446
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ +V+H +Y D E L L VT Q +
Sbjct: 447 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 494 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LL I+ F ++P LG++ ++ + L R G
Sbjct: 550 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V
Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664
Query: 375 YTEFSIESVDCVAE 388
++ + +++ +
Sbjct: 665 HSGQRVATIEQLLG 678
>gi|58425017|gb|AAW74054.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
KACC10331]
Length = 727
Score = 333 bits (854), Expect = 3e-89, Method: Composition-based stats.
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++ L
Sbjct: 368 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 418
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
K + P+ R+ F + E + A L + + + + + L+ E+
Sbjct: 419 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 478
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ +V+H +Y D E L L VT Q +
Sbjct: 479 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 525
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 526 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 581
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LL I+ F ++P LG++ ++ + L R G
Sbjct: 582 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 636
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V
Sbjct: 637 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 696
Query: 375 YTEFSIESVDCVAE 388
++ + +++ +
Sbjct: 697 HSGQRVATIEQLLG 710
>gi|77748730|ref|NP_643883.2| hypothetical protein XAC3576 [Xanthomonas axonopodis pv. citri str.
306]
Length = 546
Score = 332 bits (851), Expect = 7e-89, Method: Composition-based stats.
Identities = 85/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++
Sbjct: 187 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MRT 237
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ P+ R+ F + E + A L + + + + + L+ G + R
Sbjct: 238 VRDRLTNTPPAHRLVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSDLR 297
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ +V+H +Y D E + L VT + Q +
Sbjct: 298 DA-------------CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQ 344
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 345 QRGVQAQVDGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDAWRRE 400
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+F LL A I+ F +P LG+ ++ + LA R G
Sbjct: 401 MFSALL-TPQHADAIMRGFTDDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTG 455
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + +L EFE E+ DG L HA+ERF A +V
Sbjct: 456 TDAIDEHSVFASGSMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVS 515
Query: 375 YTEFSIESVDCVAE 388
+ + ++D +
Sbjct: 516 HCGHHVATIDQLLG 529
>gi|325928558|ref|ZP_08189746.1| Lipopolysaccharide biosynthesis protein/putative glycosyl
transferase [Xanthomonas perforans 91-118]
gi|325541097|gb|EGD12651.1| Lipopolysaccharide biosynthesis protein/putative glycosyl
transferase [Xanthomonas perforans 91-118]
Length = 695
Score = 328 bits (842), Expect = 6e-88, Method: Composition-based stats.
Identities = 83/374 (22%), Positives = 145/374 (38%), Gaps = 35/374 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + ++
Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MRT 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ + P+ R+ F + E + A L + + + + + L+ G +
Sbjct: 387 VRDRLRNTPPAHRLVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSD-- 444
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
+ + +V+H +Y D E + L +T + Q +
Sbjct: 445 -----------QRDVCVVLHAWYLDVLDEALEAIAHCGLSLRLVITTDITMVEQVRQRLQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR
Sbjct: 494 QRGVQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRE 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+F LL I+ F +P LG+ ++ + LA R G
Sbjct: 550 MFSALLA-PQHVDAIMRGFADDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
H F +G+MFWVK + L PL + HL EFE+E+ DG L HA+ERF A +V
Sbjct: 605 TDAINEHSMFASGSMFWVKLEALRPLLDAHLHPSEFEDEQGQIDGTLAHAIERFLAVAVG 664
Query: 375 YTEFSIESVDCVAE 388
+ + +V+ +
Sbjct: 665 HCGHHVATVEQLLG 678
>gi|325921211|ref|ZP_08183074.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC
19865]
gi|325548310|gb|EGD19301.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC
19865]
Length = 706
Score = 321 bits (824), Expect = 1e-85, Method: Composition-based stats.
Identities = 79/369 (21%), Positives = 139/369 (37%), Gaps = 35/369 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ P G W P++ + + WL
Sbjct: 329 LASDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRD---WLSRT-- 379
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ + P+ R+ F + E + A L + + ++ + E + ++
Sbjct: 380 VQQRLANALPAHRMVFINAWNEWAEGAVLEPDARLGHAWLEATREALIGPSKVVSELAPH 439
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
++ +V+H +Y D E+ + L +T + V
Sbjct: 440 -------------RVCVVLHAWYLDVLDEMLDAVAHCAISPRLVITTDLTMVVEVRHRVQ 486
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR
Sbjct: 487 QRGMQAEVEGFENRGRDILPFLHVANRLLDEGVCLVVKLHTKKST----HRSDGDTWRHE 542
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LL + A I+N F +P LG+ + + L R G
Sbjct: 543 MLSALLA-PERADAIVNAFSSDPLLGLAAPDGHLLPVADFIG----GNTDALDYLGARTG 597
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374
T F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF S
Sbjct: 598 TETAIEQGMFASGSMFWARLEALRPLLDAHLHPSEFETEQGQIDGTLAHAIERFMGISAI 657
Query: 375 YTEFSIESV 383
+ + I ++
Sbjct: 658 QSGYRIATI 666
>gi|134297301|ref|YP_001121036.1| lipopolysaccharide biosynthesis protein-like protein [Burkholderia
vietnamiensis G4]
gi|134140458|gb|ABO56201.1| Lipopolysaccharide biosynthesis protein-like protein [Burkholderia
vietnamiensis G4]
Length = 1231
Score = 321 bits (823), Expect = 1e-85, Method: Composition-based stats.
Identities = 77/382 (20%), Positives = 149/382 (39%), Gaps = 24/382 (6%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
G + + + K + + W ++ ++S
Sbjct: 864 FSGHVYDYNEYAENATKVIADKKHT---FPCVMMNWDNEARKPGKGHIFLGASPESYKS- 919
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
WLR F + S R+ F + E + +L +R + + ++ +
Sbjct: 920 --WLRRCFDFVLSNNKQ--SERLVFINAWNEWAEGTYLEPDRRYGYAYLHATADLL---R 972
Query: 128 ELFEGWNDRPSSPKKSGLTIKS-KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA 186
+ + + S + +K + A+V H YY D E+ ++ R + D F+T+
Sbjct: 973 QYYNSEDLDESIKINNQRFVKKNENALVAHLYYFDLLPELLSLIERN-VNLDAFITIPVH 1031
Query: 187 -NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH 245
+++ ++L + + ++N+GRD+ PFL + + Y L K+H KKS +
Sbjct: 1032 FSREQVGEILASLDNVYVLRVQNRGRDILPFLNIYPIIKSYSYANLVKVHSKKSPQ---- 1087
Query: 246 PIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR 305
+G + R+ +LL I ++ +P +G+I S + +
Sbjct: 1088 RADGALLRKRALLELL-DPSIVPGVLRALNTDPKIGLIAPSNSLCSLSNSDYLIN--NRK 1144
Query: 306 RVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHA 364
++ R G L+ +F G+MFW + L L +L L +FEEE DG L HA
Sbjct: 1145 QLNYCLSRLGLVDSSLNFEFIAGSMFWARVDALRMLSDLSLREEDFEEELGQLDGTLAHA 1204
Query: 365 VERFFACSVRYTEFSIESVDCV 386
+ER F ++ + VD +
Sbjct: 1205 IERLFCFLGKHVGYRTLPVDQI 1226
>gi|16124886|ref|NP_419450.1| hypothetical protein CC_0633 [Caulobacter crescentus CB15]
gi|221233606|ref|YP_002516042.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000]
gi|13421844|gb|AAK22618.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|51039815|tpg|DAA00361.1| TPA_exp: conserved hypothetical protein [Caulobacter vibrioides]
gi|220962778|gb|ACL94134.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000]
Length = 818
Score = 321 bits (822), Expect = 2e-85, Method: Composition-based stats.
Identities = 87/380 (22%), Positives = 142/380 (37%), Gaps = 32/380 (8%)
Query: 10 KLGKIENL--LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
GK+ + + R ++E + A ++P + G W ++ H + E
Sbjct: 456 FTGKVYDYPAVARHKLDELEQVPAAFVPGVMPG----WDNQARKPWAGVAFHNADP---E 508
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF-L 124
S+ WL L + F + E + A+L +R+ + +
Sbjct: 509 SYFGWLSGAL--KHAEARHPKGEALVFVNAWNEWGEGAYLEPDRWFGHGYLHATRTALSA 566
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAI-VVHCYYQDTWIEISHILLRLNFDFDLFVTV 183
++ L P + K A+ ++H +Y + + L DL +TV
Sbjct: 567 WLPRLTNA---HPIIAEAQSQFAKRADAVTLLHLFYPELIDWFAERLAATADVLDLMITV 623
Query: 184 VEANKDF-EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
E + FP+A L + EN+GRD+RPF+ L Y CK+H K+S
Sbjct: 624 PETWSEADLARARAAFPTAHLAIAENRGRDIRPFVETLRRARALGYSVFCKLHSKRSP-- 681
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRR--YKRWSFFAKR 300
H +G WR L LLG A+ + Q+P LG++ + R R
Sbjct: 682 --HQAKGDQWRTTLVEGLLGGEAAALALRAF-AQDPKLGLLAAAGARMRIGDPDVMDNNR 738
Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDG 359
+E R L+ G + F G+MFW + + PL +L F E DG
Sbjct: 739 AEADR----LSAHMGLKPRPET-PFAAGSMFWGRTEAFAPLTDLSDDEIAFGPELGRVDG 793
Query: 360 ALEHAVERFFACSVRYTEFS 379
HA+ER A V +
Sbjct: 794 TTAHAIERLTAAIVERAGYR 813
>gi|325915787|ref|ZP_08178089.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937]
gi|325538051|gb|EGD09745.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937]
Length = 695
Score = 315 bits (807), Expect = 9e-84, Method: Composition-based stats.
Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 35/377 (9%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
L D+E++ + P G W P++ + + WL +
Sbjct: 336 LASDIEQRPLREYTLYPGVNPG----WDNEPRRSGKGRVYLHASPRRYRD---WLSTT-- 386
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136
+ R+ F + E + A L + + ++ + + +
Sbjct: 387 VHHRLAHVPTAHRLVFINAWNEWAEGAVLEPDMRLGHAWLDATRQAMTR--------SAH 438
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195
++ + +VVH +Y D EI L L VT +
Sbjct: 439 DVPAPRT-----YRACVVVHAWYLDVLDEILDALAPSVAMLRLIVTTDLTLVGQVRGRLQ 493
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
++ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WRR
Sbjct: 494 QHGIEAEVEGFENRGRDILPFLHIANRLLDEGEQLVVKLHTKKST----HRHDGDAWRRE 549
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ LLG I+N F +P LG+ ++ + LA R G
Sbjct: 550 MLAALLGG-GRVDAIVNAFVADPQLGLAAPAQHLLAVTDFIG----GNADALDYLAVRTG 604
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVR 374
T H F +G+MFW K L PL + HL +FE E+ DG L HA+ERF +V
Sbjct: 605 TGTVTEHDRFASGSMFWAKLDALRPLLDAHLQPGDFEGEQGQIDGTLAHAIERFLGHAVL 664
Query: 375 YTEFSIESVDCVAEYER 391
++ I ++D +
Sbjct: 665 HSGHRIATIDGLMGQRE 681
>gi|291520004|emb|CBK75225.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens
16/4]
Length = 984
Score = 310 bits (794), Expect = 3e-82, Method: Composition-based stats.
Identities = 67/393 (17%), Positives = 129/393 (32%), Gaps = 22/393 (5%)
Query: 1 MYKVFRLKSKLGKIENLLLRLDVEEKG---NMQAIYIPAHVSGYYVLWSFSPKQRITSKD 57
+Y+V + G + + E + + Q+ Y V L+ + + TS D
Sbjct: 169 LYRVVKFSELPGNLVEISDEEKAEYQKMENHFQSNYCFKDVKNLKELFDHAESRSKTSAD 228
Query: 58 VHFQELSIFESFIFWLRSFLAFS-KYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRM 116
+ L + + + R+ + + + + S +
Sbjct: 229 FAIASRDYQIKQLQELIAAKDVHIRNIEAVNEQLRVIYDNTVNTKGYKALESIRAFKSFL 288
Query: 117 PF------DSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHIL 170
++++ ++ + + + +A+ +H +Y D E
Sbjct: 289 TGKPSPAREAKRLEKEEKKARKAAAKEAKKAAAKGEEAPSVAVHLHLFYVDLLPEFVSYF 348
Query: 171 LRLNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFD 226
+ F FDL+++ E LK + + N+GRD+ P
Sbjct: 349 ANIPFRFDLYISCQEGADVSVIKSGVKELKMANKVVIRPLPNRGRDLAPLYVGFADE-IR 407
Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286
++DY +H KKS G E WR++ LLG + I N F +N G++
Sbjct: 408 QHDYFLHVHSKKSLYSG---AEKGGWRQFSLELLLGSPEKVNSIFNLF-KNKNAGLVYPD 463
Query: 287 RYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLH- 345
+ L ++ G+ FW + L P+ N +
Sbjct: 464 IHEEVP--MIAYSWLANAGLGRKLFDEFELGEMPTVFNYPAGSFFWARTDALMPIFNRNY 521
Query: 346 LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
+ +F EE DG L HA+ER R +
Sbjct: 522 IYEDFPEEAGQTDGTLAHALERIIPFVSRKLGY 554
>gi|194364297|ref|YP_002026907.1| hypothetical protein Smal_0519 [Stenotrophomonas maltophilia
R551-3]
gi|194347101|gb|ACF50224.1| conserved hypothetical protein [Stenotrophomonas maltophilia
R551-3]
Length = 686
Score = 301 bits (772), Expect = 1e-79, Method: Composition-based stats.
Identities = 75/369 (20%), Positives = 133/369 (36%), Gaps = 39/369 (10%)
Query: 21 LDVEEKG-NMQAIYIPAH--VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77
D E M+ +P + G W ++ + + + WL
Sbjct: 328 RDWRELAAQMRTAPLPDYPLYPGVNPGWDNEARRPGRGRVLLHASPRGYAD---WLHDT- 383
Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWND 135
+ P+ R+ F + E + A L + + ++ +
Sbjct: 384 -VHGRLRDVPPARRMVFINAWNEWAESAVLEPDARLGHAWLQATRRAMT----------- 431
Query: 136 RPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL 195
PS P S +V+H ++ D E+ + L +T + Q +
Sbjct: 432 -PSQPAPSRPC------VVIHAWHLDALPELLSAVKDSGLPARLVITTTSDRQAQVQSIT 484
Query: 196 KYFPS-AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254
+ A+++ +N GRD+ PFL+ + + + K+H K+S H G WRR
Sbjct: 485 ESHGLPAEIWAYDNHGRDILPFLHAADRLLQQNESLVLKLHTKRST----HRDNGDQWRR 540
Query: 255 WLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRA 314
+ LLG + A + + +P LG++ + +R+ L +
Sbjct: 541 EMVDALLGPAQAAAN-LAHLQADPRLGLMAPAGHLLNVADYIG----GNAQRMERLWAQL 595
Query: 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSV 373
G F +G+MFWV+ + L PL + HL+ FE E DG L HA+ER
Sbjct: 596 GLDGAPGDGQFASGSMFWVRLQALRPLLDAHLLPSMFEVEAGQIDGTLAHAIERATGAVA 655
Query: 374 RYTEFSIES 382
FS+
Sbjct: 656 TCAGFSVGD 664
>gi|315122651|ref|YP_004063140.1| hypothetical protein CKC_04515 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496053|gb|ADR52652.1| hypothetical protein CKC_04515 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 405
Score = 301 bits (770), Expect = 2e-79, Method: Composition-based stats.
Identities = 153/327 (46%), Positives = 204/327 (62%), Gaps = 7/327 (2%)
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123
S F F FW+RS F +Y L + RI YGSR +K F N+ M +PFD EK
Sbjct: 70 SFFLGFFFWIRSLFLFKRYQTLRYDENRIIAYGSRIGKKFFACSNKDMLARGVPFDGEKI 129
Query: 124 LYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV 183
L GW+ PSS K + + I+S++AIVVH YY D W EI+++L LNF FDL +T+
Sbjct: 130 HRFPRLLHGWD-SPSSEKIASVKIQSRVAIVVHIYYADLWAEIANLLSGLNFSFDLHITL 188
Query: 184 VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243
V + ++LK FP+A +YVMEN GRD+R FL LLE G D YDY+CKIHGKKS+R G
Sbjct: 189 VTEIASIKSEILKRFPNAHIYVMENYGRDIRSFLKLLEGGKLDSYDYVCKIHGKKSKRNG 248
Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303
+ +G +WRRWLFFDLLG IA+ II TFE+ P +GMIGSR YR ++ S R
Sbjct: 249 HVWWDGDLWRRWLFFDLLGAPGIALEIIKTFEKYPKIGMIGSRTYRYDQKISLGNNR--- 305
Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLK--DGAL 361
V +A + G + +DFF GTMFWV+P+ L+P++NL L F+ + ++ DG L
Sbjct: 306 -EFVCAIANKMGVSFEDTKIDFFGGTMFWVRPQALDPIKNLALTQYFKSKVDMVGLDGCL 364
Query: 362 EHAVERFFACSVRYTEFSIESVDCVAE 388
EHA+ER F+ SV F + VDC++E
Sbjct: 365 EHAIERCFSISVEKANFDLAYVDCLSE 391
>gi|285019449|ref|YP_003377160.1| hypothetical protein XALc_2689 [Xanthomonas albilineans GPE PC73]
gi|283474667|emb|CBA17166.1| conserved hypothetical protein [Xanthomonas albilineans]
Length = 686
Score = 300 bits (768), Expect = 3e-79, Method: Composition-based stats.
Identities = 82/354 (23%), Positives = 135/354 (38%), Gaps = 37/354 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
G W ++ + +E WLR+ + + + R+ F
Sbjct: 342 YPLYPGVNPGWDNEARRPGNGRVYLHASPRGYED---WLRATIHTRLQGRRA--EQRLVF 396
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIA 152
+ E + A L + + ++ + + E +
Sbjct: 397 VNAWNEWAEGAVLEPDTRLGHAYLDATRRALS-PARVREATAPHHA-------------- 441
Query: 153 IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY--FPSAQLYVMENKG 210
+VH +Y + E+ + L + L VT Q L+ FP ++ V+EN+G
Sbjct: 442 -IVHAWYPNVLPELLNPLAASALPWRLLVTTSPDQASAVQAQLRDCSFPY-EVMVLENRG 499
Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270
RD+ PFL+ E + D D + K+H K+S H G WR L L G +D A RI
Sbjct: 500 RDILPFLHAGERLLQDGVDVVLKLHTKRST----HLHNGDAWRSELLQRLAG-ADRAARI 554
Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-PTKRLHLDFFNGT 329
+ F Q+P LG++ + L +R G+ F +G+
Sbjct: 555 LEAFAQDPMLGLVAPEGHLLPLADF----WGGNRMAADYLLRRTGYTDVCLDEAHFISGS 610
Query: 330 MFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382
MFWV+ L PL + HL EFE E+ DG L HA ER A ++ + + +
Sbjct: 611 MFWVRLHALRPLLDSHLCPSEFEPEQGQIDGTLAHAAERVTALLAQHRGYRVAT 664
>gi|325928537|ref|ZP_08189725.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans
91-118]
gi|325541076|gb|EGD12630.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans
91-118]
Length = 1415
Score = 296 bits (758), Expect = 4e-78, Method: Composition-based stats.
Identities = 82/371 (22%), Positives = 148/371 (39%), Gaps = 36/371 (9%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
++ +V +K + + G + W ++ V + + + WLR
Sbjct: 1054 VVDYANVVDKALSEVKPEFDLIRGVFPSWDNDARKPGRGYTVARSTPARYRT---WLRGA 1110
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWN 134
+ +S+ + + F + E + A L +R + +
Sbjct: 1111 IDYSRKFPVR--GESLVFVNAWNEWAEGAHLEPDRKYGYAYLEATRRAL----------- 1157
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDV 194
RP P+ ++A+V+H +Y + E+ L + + L ++ V D +
Sbjct: 1158 RRPVMPRTPE-----RVAVVIHAFYPEILPEMLKELQSWDVPYFLIISTVADKADEVRGY 1212
Query: 195 LKYFPS-AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253
L A + V EN+GRD+ PFL +++ R + K+H K+S H +G WR
Sbjct: 1213 LADLSVVADVRVFENRGRDILPFLEIMKDLR-GRESLVLKLHTKRSL----HRQDGESWR 1267
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
R + LL +A I F + LG+ + S V L+K+
Sbjct: 1268 RDMLEKLLA-PKVASEIFAAFREQERLGLAAPEGHIL----SMTTYWGANADTVHRLSKQ 1322
Query: 314 AGF-PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFAC 371
P + F G+MF+V+P+ ++ + +L L +FE E DG L HA+ER F+
Sbjct: 1323 MHVDPVNPVTAMFAAGSMFYVRPEAIDSIMDLDLRREDFEPEAGQVDGTLAHAIERCFSL 1382
Query: 372 SVRYTEFSIES 382
+V T + I S
Sbjct: 1383 AVCSTGYYIAS 1393
>gi|145588508|ref|YP_001155105.1| methyltransferase type 11 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145046914|gb|ABP33541.1| Methyltransferase type 11 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
Length = 1082
Score = 291 bits (746), Expect = 9e-77, Method: Composition-based stats.
Identities = 84/360 (23%), Positives = 147/360 (40%), Gaps = 30/360 (8%)
Query: 24 EEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYS 83
E ++ Y + W + +++ S + + ++ WL + + +K S
Sbjct: 743 NEVKKLEPEY--KQYRAAMLSWDNTARRKNNSHIMANFSIRRYK---QWLSNIASCTKNS 797
Query: 84 KLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPK 141
+ + F + E + L + + + P +
Sbjct: 798 IRLNENEKFIFINAWNEWAEGTHLEPDTKYGFKYLQATYDILKNY--------INPEHAE 849
Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILL---RLNFDFDLFVTVVEANKDFEQDVLKYF 198
+ ++ IAIVVH +Y DTW +I I+ ++ D+++T+ N + Q + F
Sbjct: 850 IIRESQENSIAIVVHIHYMDTWEDIKKIIKKILSVHDS-DIYITIT--NLEQYQSIKNDF 906
Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258
PSA + ++EN+GRD+ PF+ +L+ + Y +CKIH KKS + +G + R+ L+F
Sbjct: 907 PSANIELVENRGRDILPFINVLKKIIHKNYVAICKIHSKKS----EYRSDGEVIRKELYF 962
Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT 318
L+ +I FE N LGM+ +Y + + G
Sbjct: 963 SLINNEITLEKIPKFFEVNKKLGMLVPGKYFLQHNDI---NMYFNRENISKVCSVIGVNF 1019
Query: 319 KRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
K F G+MFW +P L+ L L F+ E L DG + HAVER F + F
Sbjct: 1020 KESK--FPAGSMFWARPAALQKLLKLESGELFDVEEGLADGTVAHAVERLFGLVSESSGF 1077
>gi|190572709|ref|YP_001970554.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia
K279a]
gi|190010631|emb|CAQ44240.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia
K279a]
Length = 707
Score = 289 bits (741), Expect = 4e-76, Method: Composition-based stats.
Identities = 74/367 (20%), Positives = 130/367 (35%), Gaps = 37/367 (10%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
R + P G W ++ + + + WL
Sbjct: 352 RELATQMRRAPLADYP-LYPGVNPGWDNEARRPGRGRVLLHASPRGYSD---WLHDT--V 405
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRP 137
+ + P+ R+ F + E + A L + + ++ + + RP
Sbjct: 406 HQRLRHVAPARRLVFINAWNEWAESAVLEPDARLGHAWLQATRRAL--FPS--QAAPSRP 461
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLK 196
IV+H +Y D E+ + L +T E + +
Sbjct: 462 --------------CIVIHAWYLDALPELLQAVKDSGLQARLVITTTGERQAQVQSIIDA 507
Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256
+A+++V +N GRDV PFL+ + + + K+H K+S H G WRR +
Sbjct: 508 EGLTAEIWVYDNHGRDVLPFLHAADRLLQQNESLVLKLHTKRST----HRDNGDQWRREM 563
Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316
LLG + A + + NP +G++ + +R+ L G
Sbjct: 564 VDALLGTAQAAANLAHL-LANPSIGLMAPAGHLLKVADYIG----GNAQRMERLWALLGL 618
Query: 317 PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRY 375
+ F +G+MFWV+ L PL + HL+ F+ E DG L HA+ER V
Sbjct: 619 DSAPGDGQFASGSMFWVRLPALRPLLDAHLLPSMFDTEAGQIDGTLAHAIERATGAVVSA 678
Query: 376 TEFSIES 382
F++
Sbjct: 679 AGFTVAD 685
>gi|21111631|gb|AAM39945.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66575237|gb|AAY50647.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 296
Score = 289 bits (740), Expect = 5e-76, Method: Composition-based stats.
Identities = 65/305 (21%), Positives = 120/305 (39%), Gaps = 26/305 (8%)
Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149
+ F + E + A L + + + + + ++ PS+
Sbjct: 1 MVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICSPSA---------- 49
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208
+V+H +Y D E+ ++ + +T + + + + A++ EN
Sbjct: 50 --CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFEN 107
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PFL++ + + + K+H KKS H +G WR + LLG
Sbjct: 108 RGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGEMLTALLG-PQRVD 162
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328
I+N F +P G+ + + L R G + F +G
Sbjct: 163 AIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTGSDAPDTNSLFASG 218
Query: 329 TMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCVA 387
+MFW + + L PL + HL EFE E+ DG L HA+ERF +V ++ + +V+
Sbjct: 219 SMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTL 278
Query: 388 EYERL 392
+
Sbjct: 279 GITKT 283
>gi|312962408|ref|ZP_07776899.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas
fluorescens WH6]
gi|311283335|gb|EFQ61925.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas
fluorescens WH6]
Length = 1308
Score = 288 bits (736), Expect = 1e-75, Method: Composition-based stats.
Identities = 83/383 (21%), Positives = 154/383 (40%), Gaps = 32/383 (8%)
Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64
+ + ++ N + Y + W + +++ S H L
Sbjct: 954 ADFNGHIFSYDQVV----ANAVANKEPEY--KLFRASMLSWDNTARKQYNSHTFHGFSLL 1007
Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
++ WL S + ++ F + E + L +R + +
Sbjct: 1008 RYK---QWLSSITNNVFNNAKYSKDEKLVFVNAWNEWAEGTHLEPDRKYGYGYLQATDDV 1064
Query: 123 FLYVKELFEGWNDRPSSPKKSGLTIKSKI-AIVVHCYYQDTWIEISHILLRLNF-DFDLF 180
+ S +++ A+V+H +Y D W +I L ++DL+
Sbjct: 1065 LAEY-------DISKVSRMAFKRSVRQADYAVVLHLHYDDLWDDIKSYLDSFGQLEYDLY 1117
Query: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
VTV ++ V + +P A + ++EN+GRDV PFL +L++ Y +CKIH K+S
Sbjct: 1118 VTVTSSSAGVR--VAQEYPKAHIQLVENRGRDVLPFLKILQVIKDMGYVAVCKIHSKRSL 1175
Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300
+G R L LLG + + +++ FE+ +G+I +Y
Sbjct: 1176 Y----RDDGDKIRGELIGSLLGSKETILSVVDRFERQKDIGVIVPVKYLIPHTDHNMTYC 1231
Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360
+ V +L+ + GF +F G+MFW +PK LE L ++ FE E L DG
Sbjct: 1232 GAI---VTELSSKLGFNFSYC--EFIAGSMFWFRPKALEALLSIDESS-FEVEDGLADGT 1285
Query: 361 LEHAVERFFACSVRYTEFSIESV 383
+ H +ER V+ +++E++
Sbjct: 1286 IAHGIERVLCNVVKKANYTVETI 1308
>gi|21109952|gb|AAM38419.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 296
Score = 283 bits (724), Expect = 3e-74, Method: Composition-based stats.
Identities = 76/301 (25%), Positives = 123/301 (40%), Gaps = 26/301 (8%)
Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149
+ F + E + A L + + + + + L+ G + R +
Sbjct: 1 MVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSDLRDA----------- 49
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208
+V+H +Y D E + L VT + Q + + AQ+ EN
Sbjct: 50 --CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFEN 107
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PFL + + + + K+H KKS H +G WRR +F LL A
Sbjct: 108 RGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDAWRREMFSALL-TPQHAD 162
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328
I+ F +P LG+ ++ + LA R G H F +G
Sbjct: 163 AIMRGFTDDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTGTDAIDEHSVFASG 218
Query: 329 TMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCVA 387
+MFWVK + L PL + +L EFE E+ DG L HA+ERF A +V + + ++D +
Sbjct: 219 SMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATIDQLL 278
Query: 388 E 388
Sbjct: 279 G 279
>gi|158422520|ref|YP_001523812.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium
caulinodans ORS 571]
gi|158329409|dbj|BAF86894.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium
caulinodans ORS 571]
Length = 661
Score = 269 bits (687), Expect = 8e-70, Method: Composition-based stats.
Identities = 90/381 (23%), Positives = 160/381 (41%), Gaps = 24/381 (6%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
++ +G ++ + D+ + + +P G +P Q ++ + + +
Sbjct: 262 RAFVGPVDEFMFVADLAQ-HRARQATVP-LFPGICAGHDSTPGQGADARIMV--SPDLGD 317
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
+ WL LA ++ ++ S + F + + + L + ++ + +
Sbjct: 318 DYARWLTEVLAIARARPVAGAS--LVFINAWNDWLNGSHLLPDARYGHALLRATASTCA- 374
Query: 126 VKELFEGWNDRPSSPKKSGLTIKS-KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
RP++ + +++ +A VVH YY+D + L LFVT
Sbjct: 375 -PYAGAIGARRPAAAPVTPRPVRTGSLASVVHGYYEDLLPGLIAGL----DPAHLFVTTP 429
Query: 185 -EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243
E + + + P+A+L V+EN+GRDVRPFL LL + YD + K+H K+S +G
Sbjct: 430 PEKAEAVRAVLARAAPAARLRVVENRGRDVRPFLSLLPELEAEGYDLVLKVHTKRSPHQG 489
Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303
EG W + L LL + R+ FE +P +G++G+ + + +A +
Sbjct: 490 ---KEGSDWLQRLSGPLLKLARS-ERLAPVFEAHPQMGLLGAAGHVLDG--ALYAGSAGN 543
Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNL-HLIGEFEEERNLKDGALE 362
+ LA G L + GTMF + PLR L+ F+ + LKDG L
Sbjct: 544 AAWMRRLAAELG-TGAPLTSPYVAGTMFVARLGIFAPLRGASELLDLFDTDMGLKDGTLA 602
Query: 363 HAVERFFACSVRYTEFSIESV 383
HA ERFF S+ V
Sbjct: 603 HAFERFFGVLAAEAGLSVGEV 623
>gi|289662624|ref|ZP_06484205.1| hypothetical protein XcampvN_05932 [Xanthomonas campestris pv.
vasculorum NCPPB702]
Length = 945
Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats.
Identities = 71/275 (25%), Positives = 116/275 (42%), Gaps = 12/275 (4%)
Query: 113 NSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLR 172
+ + P ++ K+ ++VH +Y D E + L +
Sbjct: 47 RGFLERVRLAGRKQPAAHRLADQAPFGRPVPSAQLQLKVGVMVHVFYPDLIDEFAQSLQQ 106
Query: 173 LNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY 228
+ +DL V+V++ + + L+ + ++ N+GRD+ P L +
Sbjct: 107 MPVGYDLLVSVMDNAAEAQARDRFSKLQQIEKLDIRIVPNRGRDIAPLLVTFREQILA-L 165
Query: 229 DYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRY 288
D + +H KKS G E WRR+L L+G ++ + F+ P LGM+ Y
Sbjct: 166 DVVGHLHTKKSLYTG---SEQGQWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY 222
Query: 289 RRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFNGTMFWVKPKCLEPLRNLHL- 346
R W+ + LA+R GF ++DF G+MFW K L PL L+L
Sbjct: 223 ERVPLWA--HTWLSNFEVCRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLE 280
Query: 347 IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
+ +F EE DG L HA+ER F VR+ + I
Sbjct: 281 LKDFPEEHGQIDGTLHHAMERMFVAVVRHQHYRIG 315
>gi|289668432|ref|ZP_06489507.1| hypothetical protein XcampmN_08015 [Xanthomonas campestris pv.
musacearum NCPPB4381]
Length = 945
Score = 267 bits (684), Expect = 1e-69, Method: Composition-based stats.
Identities = 71/275 (25%), Positives = 116/275 (42%), Gaps = 12/275 (4%)
Query: 113 NSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLR 172
+ + P ++ K+ ++VH +Y D E + L +
Sbjct: 47 RGFLERVRLAGRKQPAAHRLADQAPFGRPVPSAQLQVKVGVMVHVFYPDLIDEFAQSLQQ 106
Query: 173 LNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY 228
+ +DL V+V++ + + L+ + ++ N+GRD+ P L +
Sbjct: 107 MPVGYDLLVSVMDNAAEAQARDRFSKLQQIEKLDIRIVPNRGRDIAPLLVTFREQILA-L 165
Query: 229 DYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRY 288
D + +H KKS G E WRR+L L+G ++ + F+ P LGM+ Y
Sbjct: 166 DVVGHLHTKKSLYTG---SEQGQWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY 222
Query: 289 RRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFNGTMFWVKPKCLEPLRNLHL- 346
R W+ + LA+R GF ++DF G+MFW K L PL L+L
Sbjct: 223 ERVPLWA--HTWLSNFEVCRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLE 280
Query: 347 IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
+ +F EE DG L HA+ER F VR+ + I
Sbjct: 281 LKDFPEEHGQIDGTLHHAMERMFVAVVRHQHYRIG 315
>gi|258591058|emb|CBE67353.1| protein of unknown function [NC10 bacterium 'Dutch sediment']
Length = 1460
Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats.
Identities = 73/331 (22%), Positives = 125/331 (37%), Gaps = 39/331 (11%)
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSR-----MPFDSEKFL 124
I WLR + + R + L++ + + + +
Sbjct: 509 IRWLRHPI------RALPGKDRFAIDFA------HLKVTLRKAYFYHRKIGLRATVRRII 556
Query: 125 YVKELFEGWNDRPSSPKKSGLTI----------KSKIAIVVHCYYQDTWIEISHILLRLN 174
P+ L I S+IA+ H YY D E++ L +
Sbjct: 557 VELRSLHTKARGPALCSSELLNIHDIYPMPGDISSRIAVHAHAYYPDLTKELASYLKNMP 616
Query: 175 FDFDLFVTVV-EANKDFEQDVLKYFPSAQ---LYVMENKGRDVRPFLYLLELGVFDRYDY 230
F FDLFV+V + +D + P A+ + V+ N+GRD+ P + G YDY
Sbjct: 617 FAFDLFVSVSNDEARDVCRQAFAGLPQARRVIVDVVANRGRDIAPMVCHFG-GRLATYDY 675
Query: 231 LCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRR 290
+C +H KKS + W +L L+G D RI + F+ +P G+I + Y
Sbjct: 676 ICHLHTKKSMYAQ---GKMDGWLEYLLRQLMGSEDQVRRIFSMFQSDPRAGIIYPQNYEY 732
Query: 291 YKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHL-IG 348
W + ++ G + D+ G+MFW + + + L + + +
Sbjct: 733 LPYW--GNTWLSNKALGAQMCRQMGITDVPEGYFDYPAGSMFWARSEAIRNLFSADIRLT 790
Query: 349 EFEEERNLKDGALEHAVERFFACSVRYTEFS 379
+F EE DG+L H +ER R+ +
Sbjct: 791 DFPEEAGQTDGSLAHCIERLLVLVARHAGYK 821
>gi|260890973|ref|ZP_05902236.1| conserved hypothetical protein [Leptotrichia hofstadii F0254]
gi|260859000|gb|EEX73500.1| conserved hypothetical protein [Leptotrichia hofstadii F0254]
Length = 319
Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats.
Identities = 61/242 (25%), Positives = 106/242 (43%), Gaps = 10/242 (4%)
Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY---FPSA 201
+ +K K+ ++ H Y++D E H + + DL +T + + F +
Sbjct: 2 IYLKYKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNI 61
Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261
++ V+EN+GRDV L + V YDY+C +H KK+ + + G +R + + L
Sbjct: 62 EVRVIENRGRDVSSLLVGAKDAVM-NYDYVCFMHDKKTAQLKPYSS-GQGFRYKCYENNL 119
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRS-EVYRRVIDLAKRAGFPTKR 320
+I TF++NP LGM+ + +++ L K+ G
Sbjct: 120 ATKKYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDF 179
Query: 321 L---HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYT 376
GTMFW +P+ L+ L + +F EE N DG + HAVER + +V+
Sbjct: 180 HWNLEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFAVQDA 239
Query: 377 EF 378
+
Sbjct: 240 GY 241
>gi|320531350|ref|ZP_08032322.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str.
F0337]
gi|320136441|gb|EFW28417.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str.
F0337]
Length = 626
Score = 258 bits (660), Expect = 8e-67, Method: Composition-based stats.
Identities = 57/239 (23%), Positives = 100/239 (41%), Gaps = 10/239 (4%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYV 205
+ K+A++ H Y+ D + DL +TV + + + + P + + V
Sbjct: 307 QQKVALIAHLYFMDLLDSTLAYARSMPEGTDLILTVGSQEKAELVERACQDLPYNVDVRV 366
Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265
+EN+GRDV L + V D YD +C +H KK + + G + R F +LL +
Sbjct: 367 IENRGRDVSALLVGCKDIV-DDYDLVCFMHDKKVTQLSPY-TVGEGFARKCFDNLLPTRE 424
Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY--RRVIDLAKRAGFPTKRL-- 321
++ TF+ P LG++ + ++ R + L K
Sbjct: 425 FVENVVATFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNLNVPLDAH 484
Query: 322 -HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
+ GTMFW +P L+PL + +F E N DG + HA+ER + + + +
Sbjct: 485 KEVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGYVAQASGY 543
>gi|13474020|ref|NP_105588.1| hypothetical protein mll4799 [Mesorhizobium loti MAFF303099]
gi|14024772|dbj|BAB51374.1| mll4799 [Mesorhizobium loti MAFF303099]
Length = 386
Score = 258 bits (659), Expect = 1e-66, Method: Composition-based stats.
Identities = 97/244 (39%), Positives = 131/244 (53%), Gaps = 4/244 (1%)
Query: 137 PSSPKKSGL-TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL 195
P + L T++ KIA+ +H +Y D W E +L F LF+T+ + Q V
Sbjct: 126 PQAEAPERLPTVEPKIAVALHLHYPDLWPEFEALLEATGRQFQLFLTLTRPDAALAQRVQ 185
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
FP A++ V EN+GRDV PF+ LL G FD +D +CK+HGKKS + G + G IWR+
Sbjct: 186 ARFPGAEITVYENRGRDVGPFIQLLREGKFDPFDLICKLHGKKSGQSGPRMVLGEIWRQV 245
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRA 314
FDL+G + RII FE++P MIGSRR+R W R + ++L +
Sbjct: 246 SAFDLIGSRGVVDRIIANFERSPDTQMIGSRRFRLPNEWKGEKSAWGENRAMALNLLETM 305
Query: 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373
G P LDFF GTMFWV+ LEPLR L L + F EE +DG L+HA+ER
Sbjct: 306 GMP-SSSRLDFFAGTMFWVRRGALEPLRRLDLPLAAFPEETGQQDGTLQHALERVLGMIC 364
Query: 374 RYTE 377
Sbjct: 365 TKIG 368
>gi|326772082|ref|ZP_08231367.1| rhamnan synthesis protein F [Actinomyces viscosus C505]
gi|326638215|gb|EGE39116.1| rhamnan synthesis protein F [Actinomyces viscosus C505]
Length = 652
Score = 257 bits (658), Expect = 2e-66, Method: Composition-based stats.
Identities = 58/246 (23%), Positives = 100/246 (40%), Gaps = 10/246 (4%)
Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP 199
+ K+A++ H YY D + D +TV + + ++ K P
Sbjct: 326 AVAREPKPQKVALIAHLYYMDLLEPTLAYARSMPEGTDFILTVGSQEKVELVEEACKDLP 385
Query: 200 -SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258
+ + ++EN+GRDV L + V YD +C IH KK + + G + R F
Sbjct: 386 YNVTVRLIENRGRDVSALLVGCKDIV-SDYDLVCFIHDKKVTQLSPY-TVGEGFARKCFD 443
Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY--RRVIDLAKRAGF 316
+LL + +I+TF+ P LG++ + ++ R + L K
Sbjct: 444 NLLPTREFVENVISTFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNL 503
Query: 317 PTKRL---HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372
+ GTMFW +P L+PL + +F E N DG + HA+ER +
Sbjct: 504 SVPLDAHKEVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGYV 563
Query: 373 VRYTEF 378
+ + +
Sbjct: 564 AQASGY 569
>gi|331086190|ref|ZP_08335272.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330406349|gb|EGG85863.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium
9_1_43BFAA]
Length = 592
Score = 257 bits (657), Expect = 2e-66, Method: Composition-based stats.
Identities = 66/244 (27%), Positives = 108/244 (44%), Gaps = 10/244 (4%)
Query: 143 SGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF-EQDVLKYFP-- 199
T ++KIA+V+H Y++D E H + + D+++T K + V P
Sbjct: 250 QKQTTENKIALVMHLYFEDLLEESYHYVSAMPEKADIYLTTDTEKKKAAIEKVFAKLPCN 309
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
++ V++N+GRDV L ++ + D YD +C H KK+ + I G + F +
Sbjct: 310 KLEVRVIKNRGRDVSSLLVGVKDVIMD-YDLVCFAHDKKTAQVKPGTI-GASFAYKCFEN 367
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRAGFPT 318
L +INTF NP +G++ ++ + DLAK+ G
Sbjct: 368 TLSNKAYVGNVINTFVNNPRMGLLCPPEPNHSTFFTTIGFEWGPNFNITRDLAKKLGLTV 427
Query: 319 K---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVR 374
GTMFW +PK ++PL N +F E N DG L HA+ER + V+
Sbjct: 428 PISVASPPVAPLGTMFWFRPKAMKPLYNKDWKYEDFPAEPNKIDGTLLHAIERIYPFIVQ 487
Query: 375 YTEF 378
+ +
Sbjct: 488 ESGY 491
>gi|260890969|ref|ZP_05902232.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia
hofstadii F0254]
gi|260859295|gb|EEX73795.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia
hofstadii F0254]
Length = 709
Score = 257 bits (656), Expect = 3e-66, Method: Composition-based stats.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 10/239 (4%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY---FPSAQLY 204
+ K+ ++ H Y++D E H + + DL +T + + F + ++
Sbjct: 150 EDKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNIEVR 209
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
V+EN+GRDV L + V YDY+C +H KK+ + + ++ + L
Sbjct: 210 VIENRGRDVSSLLVGAKDAVM-NYDYVCFMHDKKTAQLKPYSSLNDVYINYC-KGTLATK 267
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRL-- 321
+I TF++NP LGM+ + +++ L K+ G
Sbjct: 268 KYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDFHWN 327
Query: 322 -HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTMFW +P+ L+ L + +F EE N DG + HAVER + V+ +
Sbjct: 328 LEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFVVQDAGY 386
>gi|310829395|ref|YP_003961752.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612]
gi|308741129|gb|ADO38789.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612]
Length = 627
Score = 256 bits (653), Expect = 6e-66, Method: Composition-based stats.
Identities = 65/239 (27%), Positives = 104/239 (43%), Gaps = 10/239 (4%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYF--PSAQLY 204
+ +IA + H Y++D E L + + D+++T K Q+ K F + ++
Sbjct: 310 EKRIAAIFHLYFEDLIDETYRYLSSMPEEADIYITTDTEPKKKLIQEKFKDFSCRNFKVI 369
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+++N+GRDV L + + YDY+C H KK + + I G + F + L
Sbjct: 370 LIQNRGRDVSALLVATKAFIM-NYDYVCFAHDKKVTQTKPYSI-GGAFAYKCFENTLQNK 427
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRS-EVYRRVIDLAKRAGFPTKRLH- 322
+ + IIN FE+NP LGM+ + Y +L G
Sbjct: 428 NFVLNIINAFEKNPRLGMLMPAPPNNGPYYPTLGNEWMCNYEVTKNLIDELGIKVPMDPG 487
Query: 323 --LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTMFW +PK L+ L + + +F EE N DG L HA+ER + V+ F
Sbjct: 488 KEPISPLGTMFWFRPKALKVLFDKNWEYSDFPEEPNKVDGTLLHAIERAYGLIVQSEGF 546
>gi|329944276|ref|ZP_08292535.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str.
F0386]
gi|328531006|gb|EGF57862.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str.
F0386]
Length = 636
Score = 254 bits (650), Expect = 1e-65, Method: Composition-based stats.
Identities = 61/238 (25%), Positives = 96/238 (40%), Gaps = 9/238 (3%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYV 205
K KIA++ H YY D + + D+F++ E + E + ++ +
Sbjct: 307 KQKIALIAHLYYMDLVEPTLKYIRNMPEGIDIFLSTSSPEKVEQVEAACKGLPYNIEVRL 366
Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265
+EN+GRDV PFL + V YD +C H KK + + G + F +LL D
Sbjct: 367 VENRGRDVGPFLVAWKDVV-HDYDVVCYTHDKKVTQLYPYS-VGDGFAYKCFENLLPTRD 424
Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLH-- 322
+I TF+ P LG + + F + R L + G
Sbjct: 425 FVKNVIATFDAEPRLGFLAPTPPNHADYFPVFTYGWGPNFDRTKALLRELGLDVPLDPTK 484
Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
G+MFW +P+ L+PL + EF E DG L HA+ER + + +
Sbjct: 485 EPIAPLGSMFWFRPQALKPLFDHDWQWEEFPPEPCPIDGTLMHAIERSHGYVAQGSGY 542
>gi|227546966|ref|ZP_03977015.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 55813]
gi|227212567|gb|EEI80455.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 55813]
Length = 631
Score = 253 bits (647), Expect = 3e-65, Method: Composition-based stats.
Identities = 61/238 (25%), Positives = 99/238 (41%), Gaps = 10/238 (4%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206
KIA+ +H YY D H + + D+ +TV EAN + ++ K FP + + V+
Sbjct: 309 KKIALAIHVYYMDLLESTFHYIQSMPEGCDIIITVGSEANAETVREYCKQFPYNFDVRVI 368
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
EN+GRDV L +F +YDY+C H KK + I G + F ++L +
Sbjct: 369 ENRGRDVSALLVGCGEDLF-QYDYVCFAHDKKVTQLSPQSI-GDGFAYKCFENILASKEY 426
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLAKRAGFPT-KRL 321
+I+ FE+NP LG+ + + + ++ P
Sbjct: 427 VSNVIDLFERNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQTLNMHVPLNADK 486
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTMFW +P+ L + +F E N DG L H +ER + + +
Sbjct: 487 EPVAPMGTMFWFRPEAFRGLLDHGWEYTDFPPEPNKVDGTLLHFIERAYGYVPQANGY 544
>gi|90425670|ref|YP_534040.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18]
gi|90107684|gb|ABD89721.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18]
Length = 846
Score = 252 bits (645), Expect = 5e-65, Method: Composition-based stats.
Identities = 62/251 (24%), Positives = 104/251 (41%), Gaps = 16/251 (6%)
Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK- 196
P++ + +IAI H YY D ++ DLF+T E + +
Sbjct: 586 PRRESNAARPRIAIHGHFYYPDLLESFLKLIAANASSVDLFLTTSGPEQAAQIRKSLRAF 645
Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256
+A ++ + N+GRD+ PFL + YD + HGK+S+ G WR +
Sbjct: 646 GIQNADVWSVPNRGRDIGPFLKEMPD-KLGSYDIVGHFHGKRSKHVD--STVGDQWRDFA 702
Query: 257 FFDLLGFS-DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
+ L+G + + I + F ++ LG++ + E LA+R
Sbjct: 703 WQHLIGDAFPMIDVIADAFAEDAKLGLVFAEDPYL-------NGWDENRDLAERLAQRMK 755
Query: 316 FPTK-RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373
H DF GTMFW + L+PL L+L ++ E DG + HA+ER +V
Sbjct: 756 IEAPLPEHFDFPIGTMFWARVAALQPLFQLNLDWNDYPHEPLPIDGTILHALERIVPFAV 815
Query: 374 RYTEFSIESVD 384
+ + F +
Sbjct: 816 QKSGFEYATTY 826
>gi|160894491|ref|ZP_02075267.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50]
gi|156863802|gb|EDO57233.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50]
Length = 646
Score = 252 bits (644), Expect = 6e-65, Method: Composition-based stats.
Identities = 58/246 (23%), Positives = 101/246 (41%), Gaps = 10/246 (4%)
Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFP 199
K + K K+A+V+H Y+ D + + + D+++T K+ V K P
Sbjct: 305 KMDEILKKRKLALVMHLYFPDLVEDSFQWASNVPKETDVYITTDTVEKKEAILKVFKNLP 364
Query: 200 S--AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257
++ V+ N+GRDV L ++ V YDY C +H KK+ + G + +
Sbjct: 365 CNHLEVRVIVNRGRDVSSILVGVKD-VIQNYDYACFVHDKKTAQAKPGS-VGDSFGYKCW 422
Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGF 316
+ L + ++ TFE N LG++ + + + ++A + G
Sbjct: 423 NNTLYNKEFVCNVLQTFEDNERLGILSPPEPNHGPFYQTLGNEWGCNFEKSREVADKLGI 482
Query: 317 PTK---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372
GT FW +P L+ L + EF EE N DG + HA+ER +
Sbjct: 483 TIPMSEDKEALAPYGTFFWFRPTALKVLFDHDWQYEEFPEEPNNFDGTILHAIERLYPIC 542
Query: 373 VRYTEF 378
V+ +
Sbjct: 543 VQQAGY 548
>gi|325067622|ref|ZP_08126295.1| hypothetical protein AoriK_07369 [Actinomyces oris K20]
Length = 626
Score = 252 bits (644), Expect = 7e-65, Method: Composition-based stats.
Identities = 58/238 (24%), Positives = 98/238 (41%), Gaps = 10/238 (4%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206
KIA++ H YY D + + DL +TV + + ++ K P + + ++
Sbjct: 308 QKIALIAHLYYMDLLEPTLAYVKSMPEGTDLILTVGSQEKAELVEEACKDLPYNVTVRLI 367
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
EN+GRDV L + + YD +C H KK + + G + F +LL D
Sbjct: 368 ENRGRDVSALLVGCKD-IIHDYDLVCFTHDKKVTQVKPYS-VGDGFAIKCFENLLATRDF 425
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKR-AGFPTKRLH-- 322
+I TF+ P LG++ + F+ + R L ++
Sbjct: 426 VKNVIATFDAEPRLGLLAPTPPNHGDYFPVFSMGWGPNFERTKTLLEKELNLSVPIDESR 485
Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTMFW +P L+PL + +F E N DG + HA+ER + + + +
Sbjct: 486 APIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNNIDGTILHAIERAYGYVAQASGY 543
>gi|308235695|ref|ZP_07666432.1| hypothetical protein GvagA14_05663 [Gardnerella vaginalis ATCC
14018]
gi|311114292|ref|YP_003985513.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019]
gi|310945786|gb|ADP38490.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019]
Length = 637
Score = 252 bits (644), Expect = 8e-65, Method: Composition-based stats.
Identities = 64/249 (25%), Positives = 100/249 (40%), Gaps = 10/249 (4%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196
SS S T K K+A+ +H YY D + H + + D+ +TV + N+ + ++
Sbjct: 303 SSTATSESTAKPKVALCMHLYYMDLLDKSLHYIQSMPQGCDVILTVGSKENQQIVKQRVE 362
Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ P + ++EN+GRDV FL +YDY+C H KK + I G +
Sbjct: 363 HLPYDVDVRLIENRGRDVSAFLVGGGAD-LMKYDYVCFAHDKKVTQLSPRSI-GDGFAYK 420
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID--LAKR 313
F ++L + +IN FE +P LGM + F L K
Sbjct: 421 CFENILASKEYVQNVINLFETHPRLGMAMPTPPNHADYFPGFTYTWGPNFEGTKKFLEKT 480
Query: 314 AGFPTKRLH---LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369
G GTMFW + K + L + +F E DG L H +ER +
Sbjct: 481 LGISVPLDENKDAIAPLGTMFWFRTKAMRGLLDRKWTYEDFPAEPLKIDGTLLHFIERAY 540
Query: 370 ACSVRYTEF 378
+Y +
Sbjct: 541 GYVPQYNGY 549
>gi|311063512|ref|YP_003970237.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum
PRL2010]
gi|310865831|gb|ADP35200.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum
PRL2010]
Length = 631
Score = 252 bits (643), Expect = 8e-65, Method: Composition-based stats.
Identities = 61/249 (24%), Positives = 99/249 (39%), Gaps = 10/249 (4%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196
S L KIA+ +H YY D + + D+ +TV EAN + ++ K
Sbjct: 298 SQSLSVPLPEGKKIALAIHVYYMDLLESTFRYIQSMPEGCDIIITVGSEANAEIVREYCK 357
Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
FP + V+EN+GRDV L +F +YDY+C H KK + I G +
Sbjct: 358 QFPYRFDVRVIENRGRDVSSLLVGCGEDLF-QYDYVCFAHDKKVTQLSPQSI-GDGFAYK 415
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLA 311
+ ++L + +I+ FE+NP LG+ + + + ++
Sbjct: 416 CYENILASKEYVSNVIDLFEKNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQT 475
Query: 312 KRAGFPTKRLHLD-FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369
P GTMFW +P+ L + +F E N DG L H +ER +
Sbjct: 476 LNMHVPLNANKEPVAPMGTMFWFRPEAFRGLLDHGWKYEDFPPEPNKVDGTLLHFIERAY 535
Query: 370 ACSVRYTEF 378
+ +
Sbjct: 536 GYVPQANGY 544
>gi|119026520|ref|YP_910365.1| hypothetical protein BAD_1502 [Bifidobacterium adolescentis ATCC
15703]
gi|118766104|dbj|BAF40283.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703]
Length = 647
Score = 252 bits (643), Expect = 9e-65, Method: Composition-based stats.
Identities = 62/251 (24%), Positives = 100/251 (39%), Gaps = 10/251 (3%)
Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP- 199
+ + +IA+++H YY D + + D TV E N ++ K P
Sbjct: 301 TTPIPEGKRIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENAKLVRERCKGLPY 360
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
+ + V++N+GRDV L +YDY+C H KK + + I G + F +
Sbjct: 361 NVDVRVIQNRGRDVSALLIGAGKDCL-KYDYVCFAHDKKVTQLSPYSI-GDGFAYKCFEN 418
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID--LAKRAGFP 317
+LG + IIN FEQ+P G++ + FA L + G
Sbjct: 419 ILGSKALVSNIINHFEQDPHAGLLAPTSPNHADYFGNFASLWGPNFEGTKKMLEETLGVK 478
Query: 318 T---KRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373
GTMFW +PK L L ++ +F E N DG++ H +ER +
Sbjct: 479 VPLNPYKEPIAPLGTMFWFRPKALHQLFDIDWKYEDFPPEPNKIDGSMLHFIERAYGYLP 538
Query: 374 RYTEFSIESVD 384
+ + V
Sbjct: 539 QANGYYTGFVY 549
>gi|225352528|ref|ZP_03743551.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium
pseudocatenulatum DSM 20438]
gi|225156722|gb|EEG70116.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium
pseudocatenulatum DSM 20438]
Length = 648
Score = 251 bits (641), Expect = 1e-64, Method: Composition-based stats.
Identities = 59/250 (23%), Positives = 104/250 (41%), Gaps = 10/250 (4%)
Query: 143 SGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-S 200
+ + +IA+++H YY D + + D TV E N ++ K P +
Sbjct: 303 APIPTNKRIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENATIVRERCKDLPYN 362
Query: 201 AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260
+ V++N+GRDV L +YDY+C H KK + + I G + F ++
Sbjct: 363 VDVRVIQNRGRDVSALLVGAGKDCL-QYDYVCFAHDKKVTQLSPYSI-GDGFSYKCFENV 420
Query: 261 LGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLAKRAGF 316
LG + IIN FE +P G++ + FA +++++ +
Sbjct: 421 LGSKALVSNIINHFENDPHAGVLAPAPPNHADYFGNFASLWGPNYEGTKKMLEETLQVKV 480
Query: 317 PTKRLHLD-FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVR 374
P + GTMFW +PK L+ ++ +F E N DG++ H VER + +
Sbjct: 481 PLDKSKEPIAPMGTMFWFRPKALQQFFDIDWKYEDFPPEPNKIDGSMLHFVERAYGYVPQ 540
Query: 375 YTEFSIESVD 384
+ +
Sbjct: 541 ANGYYTGYIY 550
>gi|13476280|ref|NP_107850.1| hypothetical protein mlr7559 [Mesorhizobium loti MAFF303099]
gi|14027041|dbj|BAB53995.1| mlr7559 [Mesorhizobium loti MAFF303099]
Length = 644
Score = 251 bits (640), Expect = 2e-64, Method: Composition-based stats.
Identities = 58/245 (23%), Positives = 100/245 (40%), Gaps = 12/245 (4%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYF--PSAQLYV 205
KIA+ H YY D EI + + +D T E + E + + + V
Sbjct: 298 KIAVCAHIYYTDMLDEILGLTGNIPVPYDFIATTNTPEKKAEIETALANRPGVKNVIVRV 357
Query: 206 ME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
+E N+GRD+ L + DRYD +C++H KKS + G +++R + +LL
Sbjct: 358 VEQNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQV--QSSMGNLFKRHMVDNLLNS 415
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRL 321
+++ F NP +G+ + + +V + A+
Sbjct: 416 RGYVHNVLDMFHDNPSVGLAIPPIFHISYP-TMGFSWFANKPKVEETARLLNINVKFDEN 474
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
GTMFW +P+ L + EF E + DG HA+ER A +V+ ++
Sbjct: 475 TPVAAYGTMFWFRPRALRKMFEHKWKWEEFNAEPDHVDGGFAHALERLIAYAVQNAGYTT 534
Query: 381 ESVDC 385
+ + C
Sbjct: 535 QHIMC 539
>gi|310816773|ref|YP_003964737.1| lipopolysaccharide biosynthesis protein-like protein
[Ketogulonicigenium vulgare Y25]
gi|308755508|gb|ADO43437.1| lipopolysaccharide biosynthesis protein-like protein
[Ketogulonicigenium vulgare Y25]
Length = 726
Score = 250 bits (639), Expect = 3e-64, Method: Composition-based stats.
Identities = 70/250 (28%), Positives = 104/250 (41%), Gaps = 16/250 (6%)
Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYF 198
P++ I + +H YYQ+ + L ++ L+V+ A K + +
Sbjct: 455 QPRREAPAPARPIGVFLHLYYQELAPVFAKRLAQIPLPLSLYVSTDTAEKA--AQIERAL 512
Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258
P AQ+ V+ N+GRD+ P LY D +D + +HGKKS H W +
Sbjct: 513 PQAQVRVLPNRGRDIFPKLYGFGDAYAD-HDIVLHLHGKKSL----HSSMLDEWLSHILD 567
Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRAGFP 317
LLG RI++ F+ P LG++ R A R + +LA R G
Sbjct: 568 CLLGDPADVNRILSLFDSVPRLGIVMP----VVHRSVLNAAHWGFNRDIGAELAYRMGMA 623
Query: 318 TK---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373
T L F G+MFW + L+P+ +L L F E DG L HAVER
Sbjct: 624 TPLPENDALQFPAGSMFWARTAALQPILDLALEASHFPPEAGQVDGTLAHAVERMLGVVC 683
Query: 374 RYTEFSIESV 383
R + + V
Sbjct: 684 RAGGYYMLPV 693
>gi|160936495|ref|ZP_02083863.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC
BAA-613]
gi|158440580|gb|EDP18318.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC
BAA-613]
Length = 674
Score = 249 bits (637), Expect = 4e-64, Method: Composition-based stats.
Identities = 64/246 (26%), Positives = 101/246 (41%), Gaps = 13/246 (5%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP-----SAQL 203
KIA+V H +Y D E L + + DL++TV AN + + V YF + ++
Sbjct: 291 KKIAVVAHLFYPDLMDETLRYLQNIQENIDLYITV--ANIETKYKVYNYFESIRRSNVKV 348
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
+ N+GRD L +Y+YLC +H KK+ R G G + + + L
Sbjct: 349 LLSGNRGRDAGSLLVACR-EYLMQYEYLCFVHDKKTTRGGGPVTVGKAFMYHAWENTLRS 407
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWS--FFAKRSEVYRRVIDLAK--RAGFPTK 319
II FE+N LG++ + + + Y++ +LA+ P
Sbjct: 408 GGFVSSIIKLFEKNDRLGILTPPVPALGGYLTELVGNEWTCCYQKTKELAEILSLKVPMS 467
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
F T FW +P L+PL +F EE DG L HA+ER + +
Sbjct: 468 PQKQPFALATAFWCRPAALKPLFEYPWRYEDFPEEPLASDGTLNHAIERIIIYVAQSEGY 527
Query: 379 SIESVD 384
V+
Sbjct: 528 YTAMVE 533
>gi|261367011|ref|ZP_05979894.1| putative polysaccharide biosynthesis protein [Subdoligranulum
variabile DSM 15176]
gi|282571129|gb|EFB76664.1| putative polysaccharide biosynthesis protein [Subdoligranulum
variabile DSM 15176]
Length = 646
Score = 249 bits (636), Expect = 6e-64, Method: Composition-based stats.
Identities = 55/255 (21%), Positives = 98/255 (38%), Gaps = 11/255 (4%)
Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL--- 195
+ + L + +IA+ +H Y+ D + + D+FV+ K + +
Sbjct: 298 AKQAEELCAQRRIALAMHLYFMDMLEQSVAFAAKFPPQTDVFVSTNSEEKKEQIEQAFSG 357
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ S + V+EN+GRDV FL L YDY C +H KK+ + G +
Sbjct: 358 QKLHSVTVMVVENRGRDVGAFLCDL-APHLRNYDYACFMHDKKAIQTKPGS-VGASFGYV 415
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRVIDLAKR 313
++ + + ++ FE +P LG++ + + L K
Sbjct: 416 CNENVCKNAAHVLNVLCEFENDPYLGILCPPYPTHGLYFMNMCSGGWGPNFENTKKLLKE 475
Query: 314 AGFPTK---RLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFF 369
G G++FW +PK LEPL +F +E +DG + HA+ER +
Sbjct: 476 LGLDVPISGEESPIAPFGSVFWFRPKALEPLFAHGWQHTDFPQEPLPQDGTISHAIERVY 535
Query: 370 ACSVRYTEFSIESVD 384
+ + V
Sbjct: 536 PFVAQAAGYYPAVVM 550
>gi|83582737|ref|YP_425043.1| glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170]
gi|83578053|gb|ABC24603.1| Glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170]
Length = 1236
Score = 249 bits (635), Expect = 7e-64, Method: Composition-based stats.
Identities = 64/241 (26%), Positives = 105/241 (43%), Gaps = 15/241 (6%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF-EQDVLKYFPS--AQLYVM 206
K+ + H YY D + ++ +F DL +T + ++ + L+ + + ++ V+
Sbjct: 987 KVLLHGHFYYVDLIDDFLKKIIINDFSCDLIITTTDEDRAVFLRKKLEEYKNGSVEVRVV 1046
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV F L YD + IHGKKS G WR +L+ L+G
Sbjct: 1047 PNIGRDVGAFFTGLSDLKNSDYDVVGHIHGKKSIHLSD--GTGNKWRNFLWEHLIGGEKK 1104
Query: 267 AIRI-INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK-RLHLD 324
A I ++ +NP +G++ + + + DLAK+ G D
Sbjct: 1105 AAAIAVSALIRNPDIGLVFAEEPFLF-------GWDKNKELANDLAKKMGIEKSLPRFFD 1157
Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
+ GTMFW K K LEP+ +L+L ++ E G + HA+ER +V FS +
Sbjct: 1158 WPIGTMFWAKRKALEPIFDLNLRWEDYPPEPIPVYGTMLHALERLLPFAVEKAGFSFATT 1217
Query: 384 D 384
Sbjct: 1218 Y 1218
>gi|82703518|ref|YP_413084.1| glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196]
gi|82411583|gb|ABB75692.1| Glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196]
Length = 828
Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats.
Identities = 60/244 (24%), Positives = 97/244 (39%), Gaps = 13/244 (5%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLK 196
S L+ ++A+ +H YY + + EI L N DLF++V ++ +L
Sbjct: 579 SEEAARPLSSSIRVALHLHVYYSELFPEIMARLKVNNVRPDLFISVPTECTRNEVTGLLN 638
Query: 197 YFPS--AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254
+P + ++ N+GRD+ P L D YD + +H KK+ I G W
Sbjct: 639 DYPGKVVDIQIVPNRGRDIGPLLTAFGSVFLDDYDAIGHLHTKKTADLSDEMI-GKRWYT 697
Query: 255 WLFFDLLGFSDIAIRII-NTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
+L +LLG II +P +G++ + LA +
Sbjct: 698 FLLENLLGGKRNMADIILGRMTADPAIGIVFPDDPHVFD-------WGNNKAHADSLASK 750
Query: 314 AGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372
G + + F GTMFW + + L PL L L ++ E DG + HA+ER
Sbjct: 751 LGLGKLQENFVFPMGTMFWARTEALRPLFTLDLSWQDYPAEPLPYDGTILHALERLLPLI 810
Query: 373 VRYT 376
Sbjct: 811 AAKQ 814
>gi|317047360|ref|YP_004115008.1| family 2 glycosyl transferase [Pantoea sp. At-9b]
gi|316948977|gb|ADU68452.1| glycosyl transferase family 2 [Pantoea sp. At-9b]
Length = 1419
Score = 246 bits (629), Expect = 4e-63, Method: Composition-based stats.
Identities = 64/240 (26%), Positives = 97/240 (40%), Gaps = 17/240 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDV------LKYFPSAQ 202
I + +H YY D E L + FDLF+++ + E+ +K
Sbjct: 597 RTIGVHLHLYYVDLADEFIKHLNTIPTGFDLFISLPRGKHNVEECERKFRSGIKTLKKLV 656
Query: 203 LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262
+ ENKGRD+ PF+ + Y+ + IH KKS + WRR+L LG
Sbjct: 657 VRETENKGRDIYPFIVEFGAELLS-YELILHIHSKKSPQ-----ALSKGWRRFLLHYTLG 710
Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322
I +I+N+F+ +P LG++ + R V R GF +
Sbjct: 711 TESITTQILNSFDNDPKLGVLFPAYFYGVTRQP---NWGGNREIVKQQLARLGFSYDMTY 767
Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
D+ G+ FW + L PL N + +F+EE DG L H ER F +S
Sbjct: 768 CPDYPAGSFFWSRSDALRPLLNGEYRLEDFDEEAGQYDGTLAHGFERLFGTIPLLQNYST 827
>gi|312133751|ref|YP_004001090.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|311773029|gb|ADQ02517.1| Hypothetical protein BBMN68_1492 [Bifidobacterium longum subsp.
longum BBMN68]
Length = 641
Score = 246 bits (628), Expect = 5e-63, Method: Composition-based stats.
Identities = 65/248 (26%), Positives = 98/248 (39%), Gaps = 9/248 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196
S + K ++A+V+H YY D +I + D+ +TV E ++ +
Sbjct: 298 SQDNAQPIPQKFRVALVLHLYYMDILDQILRYARSMPEGCDVIITVGSEEKACIVKERCE 357
Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
P + + V+EN+GRDV L V YD +C H KK ++ I G + +
Sbjct: 358 GMPYNIDVRVIENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKVRQLRPETI-GDGFAKK 415
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRA 314
F + L IIN F NP LG+ + +A YR DL
Sbjct: 416 CFENTLASKAYVANIINLFADNPRLGVAMPSAPNHADYFYSYAFSWGPNYRGTKDLLDGL 475
Query: 315 GFPTKRLH---LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
G + GTMFW +PK L L + +F E N DG+ H VER +
Sbjct: 476 GIKVPLSPHADVIAPLGTMFWFRPKALHGLIDKSWEYSDFPPEPNPADGSFLHFVERAYC 535
Query: 371 CSVRYTEF 378
+ +
Sbjct: 536 YVAQSNGY 543
>gi|227497960|ref|ZP_03928140.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
gi|226832618|gb|EEH65001.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434]
Length = 626
Score = 245 bits (625), Expect = 1e-62, Method: Composition-based stats.
Identities = 56/249 (22%), Positives = 96/249 (38%), Gaps = 10/249 (4%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDV 194
P+ +SKIA+V+H Y+ D ++ H + DL TV K +
Sbjct: 296 PTQAVAVQPE-ESKIALVMHVYHMDLLPQLLHYAASMPAGCDLIATVDTEAKAQQVREAT 354
Query: 195 LKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254
+ + ++EN+GRDV L + D YD +C IH KK + G + +
Sbjct: 355 AGLSLNVETILIENRGRDVAALLVGARPRLLD-YDLVCFIHDKKVTQIRPGS-VGEGFAK 412
Query: 255 WLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKR 313
F ++L + +I TF+ P LG++ + A + +L
Sbjct: 413 RCFENVLATPEFVCNVIATFQAEPRLGVLTPSAPHHGDYFPISAFSWGPNDKNTKELLAS 472
Query: 314 AGFP---TKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369
G G++FW +P+ + PL +F E DG + HA+ER +
Sbjct: 473 FGLHAPIDPDKEAIAPFGSVFWFRPQAIRPLLERKWRYDDFPAEPLPIDGTISHAIERVY 532
Query: 370 ACSVRYTEF 378
+ +
Sbjct: 533 CYMAQARGY 541
>gi|116071143|ref|ZP_01468412.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
BL107]
gi|116066548|gb|EAU72305.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
BL107]
Length = 1161
Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats.
Identities = 59/287 (20%), Positives = 102/287 (35%), Gaps = 16/287 (5%)
Query: 110 FMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTI-KSKIAIVVHCYYQDTWIEISH 168
F + ++++ P+K I + K + +H +Y + I+
Sbjct: 161 KFGIQEGRFSMDDIHFMRKTANIKKVSSPHPQKLTQAIEQKKFGVFLHIFYPELAKTIAD 220
Query: 169 ILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS---AQLYVMENKGRDVRPFLYLLELGVF 225
L ++ D++++ E D + + Q+ N GRDV PF+ +
Sbjct: 221 YLAKIPVKIDIYISTTEKEVDELAKTFRRLDNSEHVQVKSFSNTGRDVAPFVVGFREEIL 280
Query: 226 DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS 285
+YD++ K+H KKS H W +L+G D+ I N +
Sbjct: 281 -KYDFILKLHSKKSP----HSDALSGWFEHCLDNLIGSKDVFYTNIFELMNNETAIIYPV 335
Query: 286 RRYRRY---KRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--HLDFFNGTMFWVKPKCLEP 340
Y K S + Y + L + F GTMFW K L+P
Sbjct: 336 ENYALSLGIKHDSCWGHEDGNYDKAKPLLDKLNLKHIDRDSKFLFPTGTMFWCKSYILQP 395
Query: 341 LRNLHL-IGEFEEERNLKDGALEHAVERFFACSV-RYTEFSIESVDC 385
+ + +L +F+ E DG L H++ER I + C
Sbjct: 396 ILDWNLGFHDFDNEGGQIDGTLAHSIERLIGLCCTEKFHKRIITSYC 442
>gi|297538440|ref|YP_003674209.1| Rhamnan synthesis F [Methylotenera sp. 301]
gi|297257787|gb|ADI29632.1| Rhamnan synthesis F [Methylotenera sp. 301]
Length = 782
Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats.
Identities = 66/261 (25%), Positives = 105/261 (40%), Gaps = 17/261 (6%)
Query: 126 VKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE 185
E P S + +A++ H +Y + L + F+FD+++T
Sbjct: 487 FARKIEYAMLVPFSYQVESPQNNPSLAVICHLFYHQMCEDYKVYLSNIPFNFDIYITTDT 546
Query: 186 ANKDFEQDVLKYFP-----SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
+K + + K F ++ + N+GRD+ P L Y+Y+ IH K S
Sbjct: 547 EDK--KAYIEKSFSGWQRGKVEVRLAVNQGRDIAPKLIACRDIY-SAYEYILHIHSKNSP 603
Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300
H WR ++ LLG I F+ N LG+I + ++ K
Sbjct: 604 YSSIHTG----WRDYILDTLLGSQKTVSSIFEAFQLNSNLGIIAPQHFKALKLDI---GW 656
Query: 301 SEVYRRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKD 358
++ LA R GF R +DF +G+MFW + L PL N L + +F E KD
Sbjct: 657 DRNFKIAKKLAGRMGFDISRKAPIDFPSGSMFWARSAALLPLLNCSLSLQDFPREDGQKD 716
Query: 359 GALEHAVERFFACSVRYTEFS 379
G H++ER + FS
Sbjct: 717 GTTAHSIERLYFFICEKAGFS 737
>gi|225350704|ref|YP_002720664.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae
WA1]
gi|225216388|gb|ACN85121.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae
WA1]
Length = 342
Score = 242 bits (619), Expect = 5e-62, Method: Composition-based stats.
Identities = 65/240 (27%), Positives = 107/240 (44%), Gaps = 12/240 (5%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFP---SAQL 203
K KI I +H YY D L +FDLF+T E NKD + P + +
Sbjct: 24 KLKIGIHIHLYYIDMMDMFIKYLKDSPIEFDLFITTSKEENKDICLNAFNKLPKLKNITI 83
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
+++EN GRD+ P+L + YD C +H KKS H W +L +L+
Sbjct: 84 FIVENIGRDIAPWLIECNNIQ-NNYDLFCHLHTKKSL----HWESINEWGEYLIENLI-S 137
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK-RAGFPTKRLH 322
+ I++ F + +G+I Y + + + +++ + L K F K +
Sbjct: 138 EEAINNILSNFILDNNIGIISPHIYYYLFPYILYIDKDDMHHIKLLLNKLNINFEPKPEN 197
Query: 323 LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
F G+M W +PK L+PL +L+L +F +E K G + HA+ER + + +
Sbjct: 198 FVFPVGSMLWYRPKVLKPLFDLNLKYSDFPQEPIPKTGTIAHAIERIIGIICEQSNYKFK 257
>gi|269219069|ref|ZP_06162923.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon
848 str. F0332]
gi|269211216|gb|EEZ77556.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon
848 str. F0332]
Length = 687
Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats.
Identities = 69/236 (29%), Positives = 104/236 (44%), Gaps = 9/236 (3%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMEN 208
S+IA+V+HC+Y D E+ L L DFDLFVT L+ + + +EN
Sbjct: 75 SRIAVVIHCFYADLMPELFDRLRNLPTDFDLFVTNASGADVAVPKDLERMRHSVVVEVEN 134
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH---PIEGIIWRRWLFFDLLGFSD 265
GRD+ P + L+ G+ D YD + K+H KKS H G W+ DL+G +
Sbjct: 135 HGRDIFPTVQLVNSGILDPYDLILKLHTKKSPWREEHADLDGSGAAWKDQFLSDLVGSRE 194
Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF 325
I+N F +P LG++ + K + R V L R L+F
Sbjct: 195 KVEEILNAFAADPTLGLVTAADSIVGKEF-----WGGDQRIVEQLMLRIEMSIDPDELEF 249
Query: 326 FNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
+G+M+W + L+ LR +L +F+EE+ D HA+ER
Sbjct: 250 ASGSMYWTRAFVLQGLRAFNLTSADFDEEKGQVDATTAHAIERIVGIVTDEAGLRT 305
Score = 71.9 bits (175), Expect = 2e-10, Method: Composition-based stats.
Identities = 10/88 (11%), Positives = 24/88 (27%), Gaps = 8/88 (9%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G V + + +++ + + F +I L + R+ F +
Sbjct: 604 YPGAMVGFDNTARRQWKADAWYGSNPYTFHRWIAGL------VRVVAPREAKDRLLFVNA 657
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + A L + +
Sbjct: 658 WNEWAESAILEPTTRFGRTYLLAVRNAV 685
>gi|190572676|ref|YP_001970521.1| putative glycosyltransferase, fusion protein [Stenotrophomonas
maltophilia K279a]
gi|190010598|emb|CAQ44207.1| putative glycosyltransferase, fusion protein [Stenotrophomonas
maltophilia K279a]
Length = 566
Score = 242 bits (617), Expect = 9e-62, Method: Composition-based stats.
Identities = 80/250 (32%), Positives = 111/250 (44%), Gaps = 14/250 (5%)
Query: 147 IKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYFPSAQLY 204
+KS+ AIV+H Y+ D I + + D DLFV+V + + + A ++
Sbjct: 313 LKSRFAIVLHLYHLDLIESIQGYMKNMIVDHDLFVSVKSVADRRVAVRFFEERKVRAFVF 372
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
V N GRDV PF+ LL G+ DRYD +CKIH KKS G WR L LLG S
Sbjct: 373 VHPNIGRDVGPFVSLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDELMKSLLGSS 428
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324
++I+ F + G++G R+ LA G R+ L
Sbjct: 429 HTVLKILRAFRHDSSCGIVGPEHAYVSN----ARFWGGNEERLRRLAAETGIDDARIRLG 484
Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF---SI 380
FF GTMFW +P L LR L + EF+ E D L H +ER F V + +
Sbjct: 485 FFAGTMFWFRPAALYALRERALALSEFDPEAGQLDATLAHVIERLFVLWVEQAGYFAATT 544
Query: 381 ESVDCVAEYE 390
+ D +E
Sbjct: 545 RTPDAALRHE 554
>gi|13476281|ref|NP_107851.1| hypothetical protein mlr7560 [Mesorhizobium loti MAFF303099]
gi|14027042|dbj|BAB53996.1| mlr7560 [Mesorhizobium loti MAFF303099]
Length = 637
Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats.
Identities = 59/245 (24%), Positives = 99/245 (40%), Gaps = 12/245 (4%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYF--PSAQLYV 205
KIA+ H YY D EI + + +D T + + E + K + + V
Sbjct: 298 KIAVCAHIYYTDMLEEILALTGNIPVPYDFIATTDTPDKKAEIEATLAKRPGVKNVIVRV 357
Query: 206 ME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
+E N+GRD+ L + DRYD +C++H KKS + +++R + +LL
Sbjct: 358 VEKNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQASRS--NLFKRHMLENLLNT 415
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH- 322
+++ F NP +G+ A +V + A+ K H
Sbjct: 416 RGYVHNVLDMFHDNPSVGLAVPPVVHISYPTMGHA-WFFNRPKVEETARLLNIKVKFDHD 474
Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
GTMFW +P+ L + +F E N DG L H +ER A + + ++
Sbjct: 475 TPVAAYGTMFWFRPRALRKMFEHKWKWEDFNAEPNHVDGGLAHVLERLIAYAAQDAGYTT 534
Query: 381 ESVDC 385
+ C
Sbjct: 535 RHIMC 539
>gi|3399709|dbj|BAA32094.1| rgpFc [Streptococcus mutans]
Length = 583
Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats.
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%)
Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192
K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+
Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
+ AQ++V N GRDV P L L YD++ H KKS+ + G W
Sbjct: 330 ILSANGQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSW 385
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310
R L L+ +D I+ +QNP +G++ + + RY + + + L
Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442
Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365
++ G K F GT W K L+PL +L+L + E L ++ HA+
Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502
Query: 366 ERFFACSV--RYTEFSIESV 383
ER + +F I
Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522
>gi|78184210|ref|YP_376645.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9902]
gi|78168504|gb|ABB25601.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9902]
Length = 1161
Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats.
Identities = 55/249 (22%), Positives = 94/249 (37%), Gaps = 17/249 (6%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP---SAQLY 204
+ K + +H +Y + I+ + ++ D+ ++ ++ K + Q+
Sbjct: 200 QKKFGVFLHIFYPELAPIIADYIRKIPVKIDIHISTTHDAISGLTEIFKGLENSLNVQVK 259
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
N GRDV PF+ + +YDY+ K+H KKS H W +L+G
Sbjct: 260 SFPNIGRDVAPFIVGFREEIP-KYDYILKLHSKKSP----HSNALSGWFEHCLDNLIGSI 314
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRR----YKRWSFFAKRSEVYRRVIDLAKRAGFPTKR 320
D+ I + + ++ K S + Y + L K+ G
Sbjct: 315 DVFYTNIQELNKED-ISIVYPVENYALSLGIKHDSCWGHEDGNYNKAKTLLKKLGLEQIN 373
Query: 321 L--HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV-RYT 376
F G MFW KP L+P+ + L +F+ E DG L H++ER Y
Sbjct: 374 RNSEFLFPTGNMFWCKPDILKPILDWDLKFEDFDNEGGQIDGTLAHSIERLIGLCCTEYF 433
Query: 377 EFSIESVDC 385
I + C
Sbjct: 434 HKKIITSYC 442
>gi|290580710|ref|YP_003485102.1| rhamnan synthesis protein F [Streptococcus mutans NN2025]
gi|254997609|dbj|BAH88210.1| RgpFc protein [Streptococcus mutans NN2025]
Length = 557
Score = 241 bits (615), Expect = 2e-61, Method: Composition-based stats.
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%)
Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192
K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+
Sbjct: 244 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 303
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
+ AQ++V N GRDV P L L YD++ H KKS+ + G W
Sbjct: 304 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSW 359
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310
R L L+ +D I+ +QNP +G++ + + RY + + + L
Sbjct: 360 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 416
Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365
++ G K F GT W K L+PL +L+L + E L ++ HA+
Sbjct: 417 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 476
Query: 366 ERFFACSV--RYTEFSIESV 383
ER + +F I
Sbjct: 477 ERLLIYIAWNEHYDFRISKN 496
>gi|30024644|dbj|BAC75698.1| rhamnosyltransferase [Streptococcus mutans]
Length = 583
Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats.
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%)
Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192
K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+
Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
+ AQ++V N GRDV P L L YD++ H KKS+ + G W
Sbjct: 330 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310
R L L+ +D I+ +QNP +G++ + + RY + + + L
Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442
Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365
++ G K F GT W K L+PL +L+L + E L ++ HA+
Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502
Query: 366 ERFFACSV--RYTEFSIESV 383
ER + +F I
Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522
>gi|30024633|dbj|BAC75688.1| rhamnosyltransferase [Streptococcus mutans]
Length = 583
Score = 240 bits (613), Expect = 3e-61, Method: Composition-based stats.
Identities = 66/260 (25%), Positives = 109/260 (41%), Gaps = 19/260 (7%)
Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD- 193
K+ + +K K+A+ +H +Y D E + +F +DLF+T +K E +
Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329
Query: 194 -VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
+ AQ++V N GRDV P L L YD++ H KKS+ + G W
Sbjct: 330 VLSANSQEAQIFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310
R L L+ +D I+ +QNP +G++ + + RY + + + L
Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442
Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365
++ G K F GT W K L+PL +L+L + E L ++ HA+
Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502
Query: 366 ERFFACSV--RYTEFSIESV 383
ER + +F I
Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522
>gi|299133415|ref|ZP_07026610.1| Rhamnan synthesis F [Afipia sp. 1NLS2]
gi|298593552|gb|EFI53752.1| Rhamnan synthesis F [Afipia sp. 1NLS2]
Length = 408
Score = 240 bits (613), Expect = 3e-61, Method: Composition-based stats.
Identities = 92/238 (38%), Positives = 130/238 (54%), Gaps = 5/238 (2%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
P +PK L + I+VH +Y D W + L L F L VT+ E+N DF V
Sbjct: 153 PGAPKPLQLNGRIATGIIVHLHYCDVWPDFEKRLRNLTCPFSLIVTLNESNPDFAARVAG 212
Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256
FP+A++ V N+GRDV PF+ LL G D ++ +CK+HGKK+ G I G IWRR L
Sbjct: 213 QFPNAKVLVYPNRGRDVGPFIQLLREGHLDDFELICKLHGKKTVSLGPRMIFGEIWRRLL 272
Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316
DL+G ++ I+ F P LG++GS + R ++ ++LAKR G
Sbjct: 273 LNDLVGSDELVRAILQRFISQPGLGLVGSSHF----RGNYLGTWPRNAALTLELAKRLGC 328
Query: 317 PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373
P +R LDFF GTMFWV+ + L+ L++L+L +F E DG L+HA+ER F
Sbjct: 329 PEERFKLDFFAGTMFWVRRELLDLLKSLNLSQDDFPVEAGQTDGTLQHALERIFGALP 386
>gi|24379285|ref|NP_721240.1| RgpFc protein [Streptococcus mutans UA159]
gi|24377204|gb|AAN58546.1|AE014924_6 RgpFc protein [Streptococcus mutans UA159]
Length = 583
Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats.
Identities = 66/260 (25%), Positives = 109/260 (41%), Gaps = 19/260 (7%)
Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192
K+ + +K K A+ +H +Y D E + +F +DLF+T +K + E+
Sbjct: 270 HKYVKKRERVDLKNQKAAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
+ AQ++V N GRDV P L L YD++ H KKS+ + G W
Sbjct: 330 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310
R L L+ +D I+ +QNP +G++ + + RY + + + L
Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442
Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365
++ G K F GT W K L+PL +L+L + E L ++ HA+
Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502
Query: 366 ERFFACSV--RYTEFSIESV 383
ER + +F I
Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522
>gi|320095829|ref|ZP_08027469.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str.
F0338]
gi|319977239|gb|EFW08942.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str.
F0338]
Length = 619
Score = 239 bits (609), Expect = 9e-61, Method: Composition-based stats.
Identities = 64/254 (25%), Positives = 104/254 (40%), Gaps = 10/254 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195
++P+ ++ + H +Y D EI L L + L T + + E+ +
Sbjct: 286 AAPEAREKAASLRVVAIAHIFYADMADEIIDRLSVLPDGWRLVATTADEERKAAIEETMA 345
Query: 196 KYFPSAQLYVM-ENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWR 253
+ Q+ V+ N+GRD+ FL + D YD + KIH KKS ++ + +++
Sbjct: 346 RRGAVGQVRVVASNRGRDISAFLVDCSDVLAGDDYDVVVKIHSKKSVQDEANAA--QLFK 403
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L+ +LL D I+ F +P LGM + A +LAKR
Sbjct: 404 DHLYENLLDSKDHVANILAEFADHPGLGMALAPMPHMGYPTMGHA-WFANRPPARELAKR 462
Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFA 370
G P G+MF +P+ L PL L +F E +DG+L H +ER A
Sbjct: 463 IGITVPFDDHQPLAPYGSMFIARPRALRPLVEAGLTHDDFPPEGGYQDGSLAHVIERLLA 522
Query: 371 CSVRYTEFSIESVD 384
+V + V
Sbjct: 523 YAVLSEGYYARPVM 536
>gi|78213552|ref|YP_382331.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9605]
gi|78198011|gb|ABB35776.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9605]
Length = 1162
Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats.
Identities = 54/241 (22%), Positives = 90/241 (37%), Gaps = 16/241 (6%)
Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS 200
I K+ I +H +Y + I+ L + D+F++ E + + + +
Sbjct: 195 AIKEGLINKKVGIFLHIFYPELGETIAAYLKNIPCSIDVFISTREDSVAALEKIFARVEN 254
Query: 201 ---AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257
++ N GRDV PF+ + YDY+ K+H KKS H W
Sbjct: 255 TQKIEVRHFSNIGRDVAPFIVGFRDQIL-NYDYILKLHSKKSP----HSNALSGWFLHCL 309
Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGS-RRYRRYK---RWSFFAKRSEVYRRVIDLAKR 313
+L+G I + + P +G++ Y S + Y + R
Sbjct: 310 DNLIGSEAITATNLKALQS-PEVGIVYPIENYALSLGIQHDSCWGHEDGNYAKARPFLNR 368
Query: 314 AGFPTKRLH--LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
+ F GTMFW KP L+ + + L F+EE DG + H++ER
Sbjct: 369 YNLRQIKRESQFQFPTGTMFWCKPAVLQSILDWGLNWNNFDEEGGQIDGTIAHSIERLIG 428
Query: 371 C 371
Sbjct: 429 I 429
>gi|220924211|ref|YP_002499513.1| Lipopolysaccharide biosynthesis protein-like protein
[Methylobacterium nodulans ORS 2060]
gi|219948818|gb|ACL59210.1| Lipopolysaccharide biosynthesis protein-like protein
[Methylobacterium nodulans ORS 2060]
Length = 1366
Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats.
Identities = 68/245 (27%), Positives = 106/245 (43%), Gaps = 14/245 (5%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK---YFPS 200
GL + ++A++ H +Y D E+S L R+ DLF++ +K +
Sbjct: 696 GLELPERVAVIAHVFYTDFCSELSAYLARIPTQADLFISTDTEDKRQQIAFALQSYNMGK 755
Query: 201 AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260
+ VM N GRD+ P L VF+ Y+Y IH KKS + WR +L +L
Sbjct: 756 LTVRVMPNIGRDIAPMLVGF-DDVFNSYEYFLHIHSKKSPHDPAF----GSWREFLLENL 810
Query: 261 LGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK- 319
LG DI I+ + G++ S+ + + F + + L R G
Sbjct: 811 LGSEDIIRSILYLLHAH-KTGIVFSQHFEPVRHLLNFGY---NFETMKGLLGRCGIKISN 866
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L L+F + + FW + L+PL +L+L +F E DG L HA+ER V + F
Sbjct: 867 DLVLEFPSSSFFWGRSSALKPLLDLNLDWSDFAAEAGQIDGTLAHAIERSVLYIVEKSGF 926
Query: 379 SIESV 383
V
Sbjct: 927 RWAKV 931
>gi|218455303|gb|AAX19606.2| WxocB [Xanthomonas oryzae pv. oryzicola]
Length = 568
Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%)
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192
+R ++ S+ AIV+H ++ D I + + D+D+FV+V + + +
Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W
Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYRDG----GGQW 418
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R L LLG S +R++ F+ +P G++G R+ LA
Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474
Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371
G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F
Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534
Query: 372 SVRYTEF 378
V F
Sbjct: 535 WVEQAGF 541
>gi|33862360|ref|NP_893920.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313]
gi|33640473|emb|CAE20262.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313]
Length = 738
Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats.
Identities = 64/252 (25%), Positives = 99/252 (39%), Gaps = 18/252 (7%)
Query: 137 PSSPKKSGLTIKSK---IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD 193
P S L + IA+ VH +Y + I + L DLF++ E
Sbjct: 485 PMITPASSLQQQDSETTIALHVHVHYPELLDTILNALNYNKIRPDLFLSCTNHENHSEIQ 544
Query: 194 VLKYFPSAQ---LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250
+ + N+GRD+ P L + + +Y+ +H KKS +G
Sbjct: 545 CKSAGANCTLKSIITTPNRGRDIGPLLTEIGKELDTKYEIYGHLHTKKSALLPG--KQGC 602
Query: 251 IWRRWLFFDLLGFSDI--AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI 308
WR +L +L+G DI A RI+ ++NP LG++ + S +
Sbjct: 603 SWRDFLISNLVGMQDIAMADRIVTALKKNPKLGLVFADDPTCV-------GWSGNRKHAD 655
Query: 309 DLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVER 367
LA + DF GTMFW K L L NL+L ++ +E DG + HA+ER
Sbjct: 656 ILANKLNLGPLPRCFDFPVGTMFWAKKGALTELYNLNLGWEDYPQEPLGYDGTILHAIER 715
Query: 368 FFACSVRYTEFS 379
F+
Sbjct: 716 LLPIIAAKQGFT 727
>gi|218455307|gb|AAX19610.2| WxocB [Xanthomonas oryzae pv. oryzicola]
gi|218455309|gb|AAX19612.2| WxocB [Xanthomonas oryzae pv. oryzicola]
Length = 568
Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%)
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192
+R ++ S+ AIV+H ++ D I + + D+D+FV+V + + +
Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W
Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R L LLG S +R++ F+ +P G++G R+ LA
Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474
Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371
G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F
Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534
Query: 372 SVRYTEF 378
V F
Sbjct: 535 WVEQAGF 541
>gi|166713474|ref|ZP_02244681.1| hypothetical protein Xoryp_19045 [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 568
Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%)
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192
+R ++ S+ AIV+H ++ D I + + D+D+FV+V + + +
Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W
Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R L LLG S +R++ F+ +P G++G R+ LA
Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474
Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371
G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F
Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534
Query: 372 SVRYTEF 378
V F
Sbjct: 535 WVEQAGF 541
>gi|218455296|gb|AAV67426.2| glycosyltransferase [Xanthomonas oryzae pv. oryzicola]
gi|218455299|gb|AAX19602.2| WxocB [Xanthomonas oryzae pv. oryzicola]
gi|218455301|gb|AAX19604.2| WxocB [Xanthomonas oryzae pv. oryzicola]
Length = 568
Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%)
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192
+R ++ S+ AIV+H ++ D I + + D+D+FV+V + + +
Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W
Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R L LLG S +R++ F+ +P G++G R+ LA
Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474
Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371
G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F
Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534
Query: 372 SVRYTEF 378
V F
Sbjct: 535 WVEQAGF 541
>gi|218455305|gb|AAX19608.2| WxocB [Xanthomonas oryzae pv. oryzicola]
Length = 568
Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats.
Identities = 76/247 (30%), Positives = 115/247 (46%), Gaps = 11/247 (4%)
Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192
+R ++ S+ AIV+H ++ D I + + D+D+FV+V + + +
Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362
Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
++ A +++ N GRDV PF+ LL G+ DRYD +CK+H KKS G W
Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKVHSKKSVYHDG----GGQW 418
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R L LLG S +R++ F+ +P G++G R+ LA
Sbjct: 419 RDDLMKALLGSSFNVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474
Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371
G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F
Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534
Query: 372 SVRYTEF 378
V F
Sbjct: 535 WVEQAGF 541
>gi|323138318|ref|ZP_08073389.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242]
gi|322396401|gb|EFX98931.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242]
Length = 754
Score = 236 bits (601), Expect = 6e-60, Method: Composition-based stats.
Identities = 62/239 (25%), Positives = 106/239 (44%), Gaps = 13/239 (5%)
Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS--A 201
+ + +A +VH +Y D I L + DL+++ + V++ +
Sbjct: 365 INMDKPVAAIVHAFYPDLLEHILGYLENIPCAVDLYISTDSAEKAEIIGKVVRNWSKGST 424
Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261
+ +MEN+GRD+ P + VF ++D +H K+S G WR +L L
Sbjct: 425 DVRIMENRGRDIAPMIVGFRD-VFAKHDIFLHVHTKRSPHAG---DLLYHWRDYLLNTLF 480
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KR 320
G DIA +++ F +P +G++ + + +R + Y +L R G K
Sbjct: 481 GTGDIARSVLSLF-NDPKIGVVFPQHFFEVRRMLNWGF---DYDLARNLLARVGVQLNKD 536
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L L+F +G+MFW + + PL +L L +F EE DG L HA+ER +
Sbjct: 537 LVLEFPSGSMFWGRTDAIRPLLDLDLQFSDFPEEAGQIDGTLAHAIERTLLMVAESKGY 595
>gi|84501312|ref|ZP_00999517.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597]
gi|84390603|gb|EAQ03091.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597]
Length = 741
Score = 235 bits (600), Expect = 1e-59, Method: Composition-based stats.
Identities = 74/252 (29%), Positives = 110/252 (43%), Gaps = 13/252 (5%)
Query: 134 NDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDF 190
R + P+ +++ AI +H YY D W E S L RL+ FDL+VT+ +
Sbjct: 113 PIRTTIPRFDPRRPRARFAIHLHLYYPDLWPEFSERLDRLDLSFDLYVTLTWRGPETEWL 172
Query: 191 EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250
+ + P AQ++ + N+GRD+ PFL LL G FD Y+ +CK+HGKKS H +G
Sbjct: 173 ADIIREAHPRAQVFPVANRGRDILPFLRLLNAGAFDGYEAICKLHGKKSP----HRDDGD 228
Query: 251 IWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDL 310
WRR L +L + + + + ++W R L
Sbjct: 229 AWRRHLVDGVLPGKALWTSLSAFLADEDAALWVADGQRYSVRKW-----WGSNRARTDAL 283
Query: 311 AKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAVERFF 369
+R DF G+M+W+KP L +R L L + FE E DG L HA ER
Sbjct: 284 LRRVELDRSDTDFDFPAGSMYWMKPLLLGMIRALDLTEDLFEPESGQTDGTLAHAFERAI 343
Query: 370 ACSVRYTEFSIE 381
+ +
Sbjct: 344 GALAKAAGQEVR 355
Score = 54.6 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 20/114 (17%), Positives = 36/114 (31%), Gaps = 9/114 (7%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
G I + +A ++G W S ++R + + F S
Sbjct: 620 FAGLIYDYPAVARRSLDKGYRAGLPEKTIAGIMPSWDNSARRRARAHIARGANPATFRS- 678
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
WLR + S+ F + E +KA L +R + + +E
Sbjct: 679 --WLRDL--QRERLAQSYRGE--LFINAWNEWGEKAMLEPSRTFGHLYLDILAE 726
>gi|312133752|ref|YP_004001091.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|311773032|gb|ADQ02520.1| Hypothetical protein BBMN68_1493 [Bifidobacterium longum subsp.
longum BBMN68]
Length = 651
Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats.
Identities = 56/238 (23%), Positives = 94/238 (39%), Gaps = 10/238 (4%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206
+A+V H YY D + + D+ +TV E ++ + P + + V+
Sbjct: 309 KHVALVFHLYYIDLLDSSLQYISSMPEGCDVIITVGSEEKACIVKERCEGMPYNIDVRVI 368
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
EN+GRDV L V YD +C H KK + G + F ++L
Sbjct: 369 ENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKVTQIKP-LSVGDGFAYKCFENILASKAY 426
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA-KRSEVYRRVIDLAKRAGFPTKRLH--- 322
II+ FE+ P LG++ + F + + + L +
Sbjct: 427 VANIIDQFEREPHLGVLMPNPPEHGNYFPVFTLSWGDNFDGTVQLLRDIHKTVPLDKKKE 486
Query: 323 LDFFNGTMFWVKPKCL-EPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
+ GTMFW +PK L + L N + +F +E N DG + H +ER + + +
Sbjct: 487 VIAPLGTMFWFRPKALSDGLLNHNWQYSDFPKEPNKIDGTILHYIERAYCYVAQANGY 544
>gi|262038042|ref|ZP_06011449.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii
F0264]
gi|261747934|gb|EEY35366.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii
F0264]
Length = 629
Score = 234 bits (598), Expect = 1e-59, Method: Composition-based stats.
Identities = 61/241 (25%), Positives = 101/241 (41%), Gaps = 13/241 (5%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLYVM 206
K+ + H Y++D E L + D+F+T + K + + K + V+
Sbjct: 303 PKVGLFFHIYFEDLIEECYRYALNMPEYADIFITTDKEEKKEKIEKIFSKMKNKIDIKVI 362
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
+N+GRDV FL +YDY C H KK+++ I+G ++ F ++LG ++
Sbjct: 363 QNRGRDVSAFLIP-NKEEILKYDYACFAHDKKTKQLQPE-IKGEDFKFRCFENILGSKEL 420
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-----VYRRVIDLAKRAGFPTK-- 319
II F +NP LG++ + + + Y +L K
Sbjct: 421 VENIIGLFIENPRLGLLSPPSPNHAEFYGNLGREWGHSGNDNYEETCNLLKELVIEVNVD 480
Query: 320 -RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTE 377
GT+FW +PK LE L +F +E N DG L HA+ER + V+
Sbjct: 481 ISKAPVAPYGTIFWFRPKSLEKLLKKGWKYEDFPKEPNKVDGTLLHAIERVYPFVVQGAG 540
Query: 378 F 378
+
Sbjct: 541 Y 541
>gi|260434430|ref|ZP_05788400.1| glycosyltransferase [Synechococcus sp. WH 8109]
gi|260412304|gb|EEX05600.1| glycosyltransferase [Synechococcus sp. WH 8109]
Length = 772
Score = 234 bits (598), Expect = 1e-59, Method: Composition-based stats.
Identities = 52/241 (21%), Positives = 95/241 (39%), Gaps = 15/241 (6%)
Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK---DFEQDVLKYFPSA 201
+ I K+ + +H +Y + EI + +++++ +
Sbjct: 530 MNIDEKVGLHIHVHYPELLDEILKAISMNKIRPEIYISCTNQAIRDLAIKNINEHGLILK 589
Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261
++ + N+GRD+ P L L + ++Y IH KKS H WR +L +L+
Sbjct: 590 KIILTPNRGRDIGPLLTCLGQELDEKYRIYGHIHTKKSIHIARHQSY--SWRTFLIENLI 647
Query: 262 GFSD--IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
G + + II+ ++ +G+ YR+ LA++ +
Sbjct: 648 GNEENHMMDCIISAMIKDKTIGLAFPSDPHCP-------GWDANYRQAKLLAEKLNIKSL 700
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
+F GTMFW + L PL +L+L ++ E DG L H++ER F
Sbjct: 701 TNEFNFPIGTMFWARKNALSPLYSLNLGWDDYPSEPIGYDGTLLHSIERLIPFVAESQGF 760
Query: 379 S 379
S
Sbjct: 761 S 761
>gi|163853098|ref|YP_001641141.1| lipopolysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens PA1]
gi|163664703|gb|ABY32070.1| Lipopolysaccharide biosynthesis protein-like protein
[Methylobacterium extorquens PA1]
Length = 916
Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats.
Identities = 66/262 (25%), Positives = 113/262 (43%), Gaps = 13/262 (4%)
Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA 186
+ P++ K A +VH +Y + EI L + NF D++V+ ++
Sbjct: 227 PRNENDYAFSIPLPERLRSHPYKKAAAIVHGFYPELMEEILIYLGKSNFPIDIYVSTDDS 286
Query: 187 NK-DFEQDVLKYFPSAQ--LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243
K + + K + + Q + ++ N+GRD+ P L VFD Y+ IH KKS G
Sbjct: 287 KKAEQIISMGKKYHNGQLDVRIISNRGRDIGPMLTGFSD-VFDNYEAFLHIHTKKSPHGG 345
Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303
WR +LF +L+G ++I ++ +G + + +
Sbjct: 346 DGLS---SWRDYLFKNLIGSAEIIDSNLHILGT-RNVGFVYPQHLYALRGIL---NWGYN 398
Query: 304 YRRVIDLAKRAGFPTK-RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGAL 361
+ V L +R G + L+F +G+MFW + L L +L L + +F+ E DG L
Sbjct: 399 FDTVSSLLRRVGVRLSKDMVLEFPSGSMFWARTAALHGLLSLDLKLEDFDNEAGQVDGTL 458
Query: 362 EHAVERFFACSVRYTEFSIESV 383
HA+ER F + +S V
Sbjct: 459 GHAIERSFLYFAETSGYSWAKV 480
>gi|171779906|ref|ZP_02920810.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp.
infantarius ATCC BAA-102]
gi|171281254|gb|EDT46689.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp.
infantarius ATCC BAA-102]
Length = 592
Score = 232 bits (593), Expect = 5e-59, Method: Composition-based stats.
Identities = 63/270 (23%), Positives = 103/270 (38%), Gaps = 19/270 (7%)
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
+ + + KIA+ +H +Y D + +F +DLF+T
Sbjct: 266 NFPDFKYLLARKYVKEVPAVSLADKKIAVHLHVFYVDLLEDFLDAFENFHFVYDLFITTD 325
Query: 185 --EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
++ E + AQ++V N GRDV P L L YDY+ H KKS+
Sbjct: 326 NATKKQEIESILRSNGKDAQIFVTGNVGRDVLPMLKL--KDYLSDYDYIGHFHTKKSKEA 383
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300
+ G WR L L+ +D I+ F+ N LG++ + + R+ +
Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFD-NDKLGIVIADIPTFFRFNKIVDAWNE 437
Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355
+ + DL ++ G +F GT W K L+PL +L L E
Sbjct: 438 HLIAPAMNDLWQQMGMTKAIDFNNFHNFVMSYGTYVWFKYDALKPLFDLGLTDEDVPAEP 497
Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESV 383
L ++ HA+ER + +F I
Sbjct: 498 LPQNSILHAIERLLIYIAWNEHYDFRISKN 527
>gi|259414984|ref|ZP_05738907.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B]
gi|259349435|gb|EEW61182.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B]
Length = 680
Score = 232 bits (591), Expect = 1e-58, Method: Composition-based stats.
Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 17/279 (6%)
Query: 109 RFMSNSRMPFDS--EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEI 166
+ + ++ + +P + ++ A+V+H YY D W E
Sbjct: 30 QRPFEHFLRAGRHEQRVTREHSATIAESGSAVAPLRGAGINQNLQAVVIHLYYTDLWDEF 89
Query: 167 SHILLRLNFDFDLFVTVVE---ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELG 223
L F FDL+VT+ E ++ + + +P A++ V+ N+GRD+ PFL+LL G
Sbjct: 90 RDRLRSARFTFDLYVTLTEQGPETEETRARIAEDWPEARVLVLPNRGRDIYPFLHLLNAG 149
Query: 224 VFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGM- 282
D Y +CK+H KKS H +G +WR L +L + A ++ F G+
Sbjct: 150 WLDHYRAVCKLHSKKSP----HRQDGDVWRTHLTEGILPEGETAE-LLERFLAAEDCGLW 204
Query: 283 IGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLR 342
+ ++ RW R +L R LDF G+++W+KP L+ LR
Sbjct: 205 VADGQHYEGARW-----WGSNLERCRNLLARLELAASADTLDFPAGSIYWLKPAILDMLR 259
Query: 343 NLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
L L +F+ E+ DG L HA+ER I
Sbjct: 260 GLALGFDDFDIEQGQTDGTLAHALERALGMICAAGGLQI 298
Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats.
Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 11/117 (9%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAH-VSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
G I + R+ + A +PAH ++G W + ++ + FE
Sbjct: 565 FGGVIYDY-DRVRARSQDPAYAGQLPAHTIAGTMPSWDNTARRGSAAHLAWGANPIRFER 623
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ LR+ S+ S + E +KA L + +
Sbjct: 624 WLRELRT-----HRLPQSYRSE--IMINAWNEWAEKAVLEPSAQHGRGYLNALRRGL 673
>gi|154509526|ref|ZP_02045168.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC
17982]
gi|153799160|gb|EDN81580.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC
17982]
Length = 620
Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats.
Identities = 66/254 (25%), Positives = 99/254 (38%), Gaps = 10/254 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195
+ KI V H +Y D EI L L + L T E
Sbjct: 286 ADQATLDAAASLKILAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEARAQ 345
Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253
+ A + V+ N+GRD+ FL + YD + KIH KKS ++ Y+ +++
Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAA--QLFK 403
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L+ +LL SD I+ F +P LGM+ + A D AK+
Sbjct: 404 EHLYDNLLASSDHVASILAEFAAHPGLGMVIAPMPHMGYPTMGHA-WFANRAPARDFAKK 462
Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
G P G+MF +P+ L L L +F EE KDG+L H +ER +
Sbjct: 463 VGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLLS 522
Query: 371 CSVRYTEFSIESVD 384
+V + + V
Sbjct: 523 YAVLSRGYYVRPVM 536
>gi|221634566|ref|YP_002523254.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter
sphaeroides KD131]
gi|221163439|gb|ACM04401.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter
sphaeroides KD131]
Length = 755
Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats.
Identities = 75/234 (32%), Positives = 109/234 (46%), Gaps = 15/234 (6%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208
A+ VH YY D W E + L RL FDL+VT+ E Q++ FP A + M N
Sbjct: 139 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAQEIRADFPGAFVTPMPN 198
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + +
Sbjct: 199 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 254
Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327
+ + F + P G + ++ +W L +R P R L F
Sbjct: 255 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 308
Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
G+++WVKP L LR+L L + +F+ E DG L HA+ER +
Sbjct: 309 GSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 362
Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
+ G I + A ++G W + ++ + + F
Sbjct: 626 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 684
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
WL L + S+ R F + E +KA L + + + +
Sbjct: 685 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 738
Query: 127 KELFEGWNDRPSSPKKS 143
+ P+ +S
Sbjct: 739 EPATHLAEP-PAHGMRS 754
>gi|298290915|ref|YP_003692854.1| Rhamnan synthesis F [Starkeya novella DSM 506]
gi|296927426|gb|ADH88235.1| Rhamnan synthesis F [Starkeya novella DSM 506]
Length = 633
Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats.
Identities = 64/257 (24%), Positives = 107/257 (41%), Gaps = 15/257 (5%)
Query: 133 WNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-DFE 191
W P + + K+ + H +Y D E+ L DLF+T K +
Sbjct: 378 WAVPVFGPPAAPVASPLKVGLHGHFFYPDLLPELLERLAANASRPDLFLTTDTPAKVEQL 437
Query: 192 QDVLKYFP-SAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEG 249
+ + +P ++ V+ N GRD+ PFL L + YD L +HGKK++ G G
Sbjct: 438 RALTAAWPAKVRIDVVPNSGRDIGPFLTALRDVLTGGEYDVLLHLHGKKTK--GRRRAIG 495
Query: 250 IIWRRWLFFDLLGFSD-IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI 308
WR +L+ +L+G + ++ +P +G++ + R V
Sbjct: 496 DPWRNFLWENLIGGDHPMLDAVLAYMAAHPQVGLVYPEDTHLLD-------WARNGRVVE 548
Query: 309 DLAKRAGFPTKR-LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVE 366
+L + G ++DF G MF V+P L P+ L L ++ E DG + H +E
Sbjct: 549 ELRRDMGLTEPMGTYVDFPVGNMFAVRPAALAPVLALDLKWSDYPVEPIPLDGTVLHGIE 608
Query: 367 RFFACSVRYTEFSIESV 383
R VR F+ +V
Sbjct: 609 RLLPTVVRKAGFTTAAV 625
>gi|291516581|emb|CBK70197.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum
subsp. longum F8]
Length = 688
Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats.
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
PS + + A + H Y+ D + H + L + DL++T E ++ ++
Sbjct: 313 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 372
Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249
A + N+GRDV L V YD + H KKS + G+H E
Sbjct: 373 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 432
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307
+ L + LG I+ F +NP LG + + + Y
Sbjct: 433 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 492
Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361
+L + R G P G+ +W + + L+PL +F E + +DG +
Sbjct: 493 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 552
Query: 362 EHAVERFFACSVRYTEF 378
HA+ER + +
Sbjct: 553 SHAIERANGYICQSRGY 569
>gi|126464825|ref|YP_001041801.1| lipopolysaccharide biosynthesis protein-like [Rhodobacter
sphaeroides ATCC 17029]
gi|126106640|gb|ABN79165.1| Lipopolysaccharide biosynthesis protein-like [Rhodobacter
sphaeroides ATCC 17029]
Length = 751
Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats.
Identities = 74/234 (31%), Positives = 109/234 (46%), Gaps = 15/234 (6%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208
A+ VH YY D W E + L RL FDL+VT+ E +++ FP A + M N
Sbjct: 135 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTPMPN 194
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + +
Sbjct: 195 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 250
Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327
+ + F + P G + ++ +W L +R P R L F
Sbjct: 251 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 304
Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
G+++WVKP L LR+L L + +F+ E DG L HA+ER +
Sbjct: 305 GSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 358
Score = 71.1 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
+ G I + A ++G W + ++ + + F
Sbjct: 622 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 680
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
WL L + S+ R F + E +KA L + + + +
Sbjct: 681 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 734
Query: 127 KELFEGWNDRPSSPKKS 143
+ P+ +S
Sbjct: 735 EPATHLAEP-PAHGMRS 750
>gi|322690050|ref|YP_004209784.1| hypothetical protein BLIF_1872 [Bifidobacterium longum subsp.
infantis 157F]
gi|320461386|dbj|BAJ72006.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis 157F]
Length = 672
Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats.
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
PS + + A + H Y+ D + H + L + DL++T E ++ ++
Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350
Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249
A + N+GRDV L V YD + H KKS + G+H E
Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307
+ L + LG I+ F +NP LG + + + Y
Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 470
Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361
+L + R G P G+ +W + + L+PL +F E + +DG +
Sbjct: 471 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 530
Query: 362 EHAVERFFACSVRYTEF 378
HA+ER + +
Sbjct: 531 SHAIERANGYICQSRGY 547
>gi|312866008|ref|ZP_07726229.1| rhamnan synthesis protein F [Streptococcus downei F0415]
gi|311098412|gb|EFQ56635.1| rhamnan synthesis protein F [Streptococcus downei F0415]
Length = 584
Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats.
Identities = 60/256 (23%), Positives = 105/256 (41%), Gaps = 18/256 (7%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVL 195
+ L +SK+A+ +H +Y D E +F +DLF+T + K + + +
Sbjct: 274 EQAEAEELPAESKVAVHLHVFYVDLLQEFLDAFKTFHFAYDLFITTDKEEKRAEIQAILE 333
Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ AQ++V N GRDV P L L YDY+ H KKS+ Y G WR+
Sbjct: 334 QNQVLAQIFVTGNIGRDVLPMLKL--KDQLKGYDYIGHFHTKKSKEADY--WAGQSWRQE 389
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L L+ + +I+ +N LG++ + + R+ + + + + +L ++
Sbjct: 390 LIAMLVKPA---NQILAQMAKNDRLGIVIADMPSFFRFNKIVVAWNENLIAPEMEELWEK 446
Query: 314 AGFPTK-----RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF 368
GT W K L PL +L L E+ L ++ HA+ER
Sbjct: 447 MSLKKSIDFKAMDTFVMSYGTYAWFKYDALSPLFDLDLTDEYVPAEPLPQNSILHAIERL 506
Query: 369 FACSV--RYTEFSIES 382
++ ++ I
Sbjct: 507 LIYIAWDKHYDYRISP 522
>gi|189440434|ref|YP_001955515.1| lipopolysaccharide biosynthesis protein [Bifidobacterium longum
DJO10A]
gi|317482688|ref|ZP_07941702.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA]
gi|189428869|gb|ACD99017.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum
DJO10A]
gi|316915934|gb|EFV37342.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA]
Length = 666
Score = 230 bits (587), Expect = 3e-58, Method: Composition-based stats.
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
PS + + A + H Y+ D + H + L + DL++T E ++ ++
Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350
Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249
A + N+GRDV L V YD + H KKS + G+H E
Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307
+ L + LG I+ F +NP LG + + + Y
Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 470
Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361
+L + R G P G+ +W + + L+PL +F E + +DG +
Sbjct: 471 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 530
Query: 362 EHAVERFFACSVRYTEF 378
HA+ER + +
Sbjct: 531 SHAIERANGYICQSRGY 547
>gi|13476282|ref|NP_107852.1| hypothetical protein mlr7561 [Mesorhizobium loti MAFF303099]
gi|14027043|dbj|BAB53997.1| mlr7561 [Mesorhizobium loti MAFF303099]
Length = 609
Score = 230 bits (587), Expect = 3e-58, Method: Composition-based stats.
Identities = 60/242 (24%), Positives = 98/242 (40%), Gaps = 11/242 (4%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFP--SAQLYV 205
+IA++ H Y+ D EI + +DL VT A+K +Q + K +A + V
Sbjct: 298 RIAVLAHVYHLDMIDEILGYAENVPKGYDLIVTTDNADKQALIQQAIAKATNASNAVVLV 357
Query: 206 MENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ N GRD L V DRYD +C++H K+S ++G G +++ F +LL
Sbjct: 358 VRNDGRDTSALLVGCRDYVLEDRYDLICRVHSKRSPQDGPR---GELFKLHTFENLLHTP 414
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRLH 322
++ F NP LG++ + + V LA++ G
Sbjct: 415 GYVSNLLELFANNPALGLVMPPLVHIGYP-TIGNSWAGNKANVAKLARQLGLIVHLDDST 473
Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382
G M+W +P L L + +DG+L HA+ER A ++
Sbjct: 474 PVAPYGGMYWFRPAALRKLFEERWNWNDFANMDYRDGSLVHAIERIIAYVAIDAGYTFRH 533
Query: 383 VD 384
V
Sbjct: 534 VM 535
>gi|148927812|ref|ZP_01811237.1| Lipopolysaccharide biosynthesis protein-like protein [candidate
division TM7 genomosp. GTL1]
gi|147886838|gb|EDK72383.1| Lipopolysaccharide biosynthesis protein-like protein [candidate
division TM7 genomosp. GTL1]
Length = 498
Score = 230 bits (586), Expect = 3e-58, Method: Composition-based stats.
Identities = 70/237 (29%), Positives = 111/237 (46%), Gaps = 9/237 (3%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVME 207
++A+VVH +Y + EI ++ + FDL +T + S + + E
Sbjct: 240 RLAVVVHIFYPELANEIYDVIKNIVEPFDLIITTPHEGAVSELIDTFAPLASSVAIALSE 299
Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267
N+GRDV PFL + G+ +RYD + K+H KKS G W++ LF L G S I
Sbjct: 300 NRGRDVGPFLAVHRSGLLERYDAVLKLHSKKSTY----SDSGQQWQQSLFRQLCGNSQIV 355
Query: 268 IRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327
R + ++ GM+G Y + A R V++ + L + + L FF
Sbjct: 356 RRSV-ALLRDGKTGMVGPHDYYLTHPHYWGANRPAVHKLLQSLTA-TPLKEEDVPLRFFA 413
Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
GTMFW PK + L ++ + FE E +DG L HA+ER F + +++ S+
Sbjct: 414 GTMFWFAPKAIVALHDIPEALLNFESENGKQDGTLAHALERLFGIVPQLGGYNVTSL 470
>gi|125654691|ref|YP_001033885.1| hypothetical protein RSP_3918 [Rhodobacter sphaeroides 2.4.1]
gi|77386351|gb|ABA81780.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
Length = 751
Score = 230 bits (586), Expect = 4e-58, Method: Composition-based stats.
Identities = 74/234 (31%), Positives = 109/234 (46%), Gaps = 15/234 (6%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208
A+ VH YY D W E + L RL FDL+VT+ E +++ FP A + M N
Sbjct: 135 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTPMPN 194
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + +
Sbjct: 195 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 250
Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327
+ + F + P G + ++ +W L +R P R L F
Sbjct: 251 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 304
Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
G+++WVKP L LR+L L + +F+ E DG L HA+ER +
Sbjct: 305 GSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTLAHAIERVLGYLTARAGQKV 358
Score = 71.1 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
+ G I + A ++G W + ++ + + F
Sbjct: 622 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 680
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
WL L + S+ R F + E +KA L + + + +
Sbjct: 681 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 734
Query: 127 KELFEGWNDRPSSPKKS 143
+ P+ +S
Sbjct: 735 EPATHLAEP-PAHGMRS 750
>gi|298346187|ref|YP_003718874.1| hypothetical protein HMPREF0573_11061 [Mobiluncus curtisii ATCC
43063]
gi|304390053|ref|ZP_07372007.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii
ATCC 35241]
gi|298236248|gb|ADI67380.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 43063]
gi|304326535|gb|EFL93779.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii
ATCC 35241]
Length = 680
Score = 230 bits (586), Expect = 4e-58, Method: Composition-based stats.
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204
++A+V+H YY D EI L + +FD+F+T + L +
Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261
+EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL
Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRENHPDLEGSGAQWKDEFLDALL 170
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321
G D +I++ F +P LG++ + ++ +L +R K
Sbjct: 171 GSKDSVEKIMSAFGADPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
L F G+M+WV+ ++ LR+L L +FE E D HA+ER +
Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285
Query: 381 ES 382
Sbjct: 286 RE 287
Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats.
Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%)
Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98
G V + + +++ + +F WL + ++ RI F +
Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651
Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122
E + A L + + + +
Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677
>gi|219670466|ref|YP_002460901.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2]
gi|219540726|gb|ACL22465.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2]
Length = 606
Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats.
Identities = 60/262 (22%), Positives = 102/262 (38%), Gaps = 10/262 (3%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDV 194
PS L + K+ + H YY+D H + + D+ +T + E+ +
Sbjct: 279 PSDYVVKPLKRQPKVVVCFHVYYEDLLDSCFHYMQSIPQFADIVITTPKKELVGIIEEKI 338
Query: 195 LKY-FPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253
Y + + V+ +GR FL + + D YDY C +H KKS G+ +
Sbjct: 339 KSYELNNTTIKVINARGRAESAFLVATKDFILD-YDYACIVHDKKSSFLRPG-CVGVEFG 396
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAK 312
LL S I++ FE NP +G + + Y+ + K
Sbjct: 397 LQNLDALLATSAYVENILSIFEDNPRIGALEPVHLLHANFRDLYGGEWGANYKGTEEFLK 456
Query: 313 RAGFPT---KRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERF 368
RAG + G MFW +P C++ + ++ +F EE DG+L H +ER
Sbjct: 457 RAGIDLLISPDVPPLAPMGAMFWFRPICMKRILDMEWEYEDFPEEPLPLDGSLIHIIERA 516
Query: 369 FACSVRYTEFSIESVDCVAEYE 390
+ V+ + V + + E
Sbjct: 517 YPFIVQDAGYLTGWVSTIEDAE 538
>gi|315654770|ref|ZP_07907675.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333]
gi|315490731|gb|EFU80351.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333]
Length = 680
Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats.
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204
++A+V+H YY D EI L + +FD+F+T + L +
Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261
+EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL
Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321
G D +I++ F +P LG++ + ++ +L +R K
Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
L F G+M+WV+ ++ LR+L L +FE E D HA+ER +
Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285
Query: 381 ES 382
Sbjct: 286 RE 287
Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats.
Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%)
Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98
G V + + +++ + +F WL + ++ RI F +
Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651
Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122
E + A L + + + +
Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677
>gi|293189412|ref|ZP_06608132.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309]
gi|292821502|gb|EFF80441.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309]
Length = 620
Score = 229 bits (583), Expect = 9e-58, Method: Composition-based stats.
Identities = 65/254 (25%), Positives = 99/254 (38%), Gaps = 10/254 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195
+ K+ V H +Y D EI L L + L T E
Sbjct: 286 ADQATLDAAASLKVLAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEAHAQ 345
Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253
+ A + V+ N+GRD+ FL + YD + KIH KKS ++ Y+ +++
Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAA--QLFK 403
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L+ +LL SD I+ F +P LGM+ + A D AK+
Sbjct: 404 EHLYDNLLASSDHVASILAKFAAHPGLGMVIAPMPHMGYPTMGHA-WFANRAPARDFAKK 462
Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
G P G+MF +P+ L L L +F EE KDG+L H +ER +
Sbjct: 463 VGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLLS 522
Query: 371 CSVRYTEFSIESVD 384
+V + + V
Sbjct: 523 YAVLSRGYYVRPVM 536
>gi|258654317|ref|YP_003203473.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233]
gi|258557542|gb|ACV80484.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233]
Length = 631
Score = 229 bits (583), Expect = 9e-58, Method: Composition-based stats.
Identities = 66/308 (21%), Positives = 107/308 (34%), Gaps = 28/308 (9%)
Query: 101 QKAFLRLNRFMSNSRMPFDSEK-----FLYVKELFEGWNDR-----------PSSPKKSG 144
+ +L N + M S ++ + P
Sbjct: 232 EPTYLERNAILGRRVMEIVSRTDYPVDLIWRNVVRSAEPRTLYTNMSMLSVVPDVDTGFR 291
Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK---YFPSA 201
+I ++ H +Y+D E+ + + FDL VT A K + S
Sbjct: 292 PDPPLRICVLAHIFYEDMTDEMMGWIGNIPVPFDLVVTTTSAAKKEAIESALEAYALKSV 351
Query: 202 QLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
++ ++E N+GR FL + YD + KIH KKS + G + G +++ +
Sbjct: 352 EVRLVESNRGRAESAFLIACRDVLTSGEYDLVLKIHSKKSPQNGANL--GQLFKHHSVDN 409
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-- 317
LL I+ F+ P LGM+ + +LA + G
Sbjct: 410 LLSSPGYVASILGMFQSQPSLGMVFPPVVNIGFP-TLGHSWFTNREAAHELADQLGIHTI 468
Query: 318 TKRLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEE-RNLKDGALEHAVERFFACSVRY 375
R NGTMFW +P+ L L +F E DG L H +ER + +V
Sbjct: 469 FDRTTPLAPNGTMFWARPESLAKLARHDFDYSQFAAEHEGWSDGMLGHVIERLYGYAVLD 528
Query: 376 TEFSIESV 383
I+ V
Sbjct: 529 AGLRIQCV 536
>gi|261868364|ref|YP_003256286.1| lipopolysaccharide biosynthesis protein [Aggregatibacter
actinomycetemcomitans D11S-1]
gi|3132260|dbj|BAA28137.1| unnamed protein product [Actinobacillus actinomycetemcomitans]
gi|261413696|gb|ACX83067.1| lipopolysaccharide biosynthesis protein [Aggregatibacter
actinomycetemcomitans D11S-1]
Length = 632
Score = 228 bits (582), Expect = 9e-58, Method: Composition-based stats.
Identities = 55/250 (22%), Positives = 99/250 (39%), Gaps = 13/250 (5%)
Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD----- 193
S K + KI +V H YY D EI + +DL +T E +
Sbjct: 284 SSKVEKVRSDIKILVVAHIYYSDMLDEIISYTQNIPCSYDLLITTANEKSKLEIESNPIL 343
Query: 194 VLKYFPSAQLYVME-NKGRDVRPFLYLLELGVFD-RYDYLCKIHGKKSQREGYHPIEGII 251
+ + V+E N+GRD+ + + RYD++C++H KKS + ++
Sbjct: 344 KMSGAKGINVKVVEQNRGRDMSSLFITCKQEIISERYDWVCRLHSKKSPQNSHNMSI--H 401
Query: 252 WRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLA 311
++ ++ ++L ++IN ++N +G A I +A
Sbjct: 402 FKEMMYLNILKDKAYISKVINYLDKNKSIGFAMPSMVHIGHPTLGHA-WFTNRDLAIKIA 460
Query: 312 KRAGFPTK-RLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERF 368
+R G F GTMFW +P+ L+ L + +F +E +D +L H +ER
Sbjct: 461 ERVGIKLPFDDISPFAAYGTMFWFRPEALKKLFEYNWKFEDFNKEPMHQDSSLAHILERL 520
Query: 369 FACSVRYTEF 378
+ +
Sbjct: 521 LVYAAHDAGY 530
>gi|254876593|ref|ZP_05249303.1| predicted protein [Francisella philomiragia subsp. philomiragia
ATCC 25015]
gi|254842614|gb|EET21028.1| predicted protein [Francisella philomiragia subsp. philomiragia
ATCC 25015]
Length = 765
Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats.
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%)
Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198
P S I K AI +H +Y D E + L +DL++T+ N +F ++
Sbjct: 520 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 579
Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR +
Sbjct: 580 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 637
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
L +L+G ++ I + +G++ + E V +L G
Sbjct: 638 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 690
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375
F G MFW + + + +L+ +EE +DG+ HA+ER V
Sbjct: 691 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 750
Query: 376 TEFSIESVD 384
+ +V
Sbjct: 751 NGYKYVTVY 759
>gi|319939379|ref|ZP_08013739.1| RgpFc protein [Streptococcus anginosus 1_2_62CV]
gi|319811365|gb|EFW07660.1| RgpFc protein [Streptococcus anginosus 1_2_62CV]
Length = 587
Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats.
Identities = 63/271 (23%), Positives = 99/271 (36%), Gaps = 21/271 (7%)
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
+ + KI + +H +Y D + +F +DLF+T
Sbjct: 266 NFPDFKYLLARKYIQTTAPTSLSNKKIGVHLHVFYVDLLEDFLKAFENFHFAYDLFITTD 325
Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
K + E + + +A ++V N GRDV P L L YDY+ H KKS+
Sbjct: 326 NDTKKLEIEAILNQNHKNAHIFVTGNIGRDVLPMLKL--KKYLSTYDYIGHFHTKKSKEA 383
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300
+ G WR L L+ +D I+ FE N LG++ S + RY +
Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFE-NDKLGLVISDIPTFFRYNKIVDAWNE 437
Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIG-EFEEER 354
+ + DL + F GT W K L+PL +L L + E
Sbjct: 438 HLIAPEMNDLWYKMKMTKPIDFNTFHTFVMSYGTFIWFKYDALKPLFDLDLTDKDVPIEP 497
Query: 355 NLKDGALEHAVERFFACSV--RYTEFSIESV 383
++ HA+ER + +F I
Sbjct: 498 LP-QNSILHAIERLIVYVAWNEHYDFRISKN 527
>gi|241668058|ref|ZP_04755636.1| glycosyl transferase, group 1 [Francisella philomiragia subsp.
philomiragia ATCC 25015]
Length = 756
Score = 227 bits (580), Expect = 2e-57, Method: Composition-based stats.
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%)
Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198
P S I K AI +H +Y D E + L +DL++T+ N +F ++
Sbjct: 511 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 570
Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR +
Sbjct: 571 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 628
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
L +L+G ++ I + +G++ + E V +L G
Sbjct: 629 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 681
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375
F G MFW + + + +L+ +EE +DG+ HA+ER V
Sbjct: 682 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 741
Query: 376 TEFSIESVD 384
+ +V
Sbjct: 742 NGYKYVTVY 750
>gi|315657309|ref|ZP_07910191.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
gi|315491781|gb|EFU81390.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
Length = 680
Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats.
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204
++A+V+H YY D EI L + +FD+F+T + L +
Sbjct: 51 RLAVVMHVYYSDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261
+EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL
Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321
G D +I++ F +P LG++ + ++ +L +R K
Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
L F G+M+WV+ ++ LR+L L +FE E D HA+ER +
Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285
Query: 381 ES 382
Sbjct: 286 RE 287
Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats.
Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%)
Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98
G V + + +++ + +F WL + ++ RI F +
Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651
Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122
E + A L + + + +
Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677
>gi|167627488|ref|YP_001677988.1| group 1 glycosyl transferase [Francisella philomiragia subsp.
philomiragia ATCC 25017]
gi|167597489|gb|ABZ87487.1| glycosyl transferase, group 1 [Francisella philomiragia subsp.
philomiragia ATCC 25017]
Length = 763
Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats.
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%)
Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198
P S I K AI +H +Y D E + L +DL++T+ N +F ++
Sbjct: 518 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 577
Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
+ ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR +
Sbjct: 578 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 635
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
L +L+G ++ I + +G++ + E V +L G
Sbjct: 636 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 688
Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375
F G MFW + + + +L+ +EE +DG+ HA+ER V
Sbjct: 689 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 748
Query: 376 TEFSIESVD 384
+ +V
Sbjct: 749 NGYKYVTVY 757
>gi|296876714|ref|ZP_06900762.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
parasanguinis ATCC 15912]
gi|296432216|gb|EFH18015.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
parasanguinis ATCC 15912]
Length = 582
Score = 225 bits (575), Expect = 6e-57, Method: Composition-based stats.
Identities = 65/266 (24%), Positives = 107/266 (40%), Gaps = 18/266 (6%)
Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-- 184
+ + + ++ K+A+ +H +Y D E +FD+DL++T
Sbjct: 262 PDFPYLLSRKYLKKQELAGDFDKKVAVHLHVFYVDLLEEFLDAFRDFHFDYDLWITTDVE 321
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E + EQ + A++ V N GRDV P L L +YDY+ H KKS+ +
Sbjct: 322 EKKQAIEQILSNRAQDARVVVTGNIGRDVLPMLLL--KEQLSKYDYVGHFHTKKSKEADF 379
Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE 302
G WR+ L L+ +D +I+ E NP +G+ + + RY R +
Sbjct: 380 --WAGESWRKELIEMLVKPAD---QILANMEANPKVGITIADIPTFFRYNRIVVAWNEAL 434
Query: 303 VYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLK 357
+ + L +R G K GT W K L+PL +L+L L
Sbjct: 435 ISPEMNKLWQRMGATKTIDFEKINTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLP 494
Query: 358 DGALEHAVERFFACSV--RYTEFSIE 381
++ HA+ER + +F I
Sbjct: 495 QNSILHAIERLLIYIAWDQKYDFRIS 520
>gi|289678438|ref|ZP_06499328.1| glycosyl transferase, group 1 [Pseudomonas syringae pv. syringae
FF5]
Length = 774
Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats.
Identities = 53/241 (21%), Positives = 90/241 (37%), Gaps = 11/241 (4%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFE-QDVLKYFPSA- 201
+ +A+ +H +Y+D + SH L D+F+T+ +A + V P
Sbjct: 260 PEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLADAKHQKKTIAVFSKHPRVK 319
Query: 202 --QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
++ + N+GR+ P L YD C +H KKS G E W +L
Sbjct: 320 NLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
LL ++I R++N F + LG+ + W +
Sbjct: 376 LLRDANIITRLLNAFADHKDLGLYYPTTFWMMPSWVNHVTM--NKSFMAAWHNEWQIDPC 433
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L + G MFW +P+ L+ + F +E DG++ HA+ER +
Sbjct: 434 DGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493
Query: 379 S 379
Sbjct: 494 K 494
>gi|262282406|ref|ZP_06060174.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA]
gi|262261697|gb|EEY80395.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA]
Length = 582
Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats.
Identities = 63/244 (25%), Positives = 100/244 (40%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
KIA+ +H +Y D E H +F +DLF+T K + + A++ V
Sbjct: 283 KKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDSEKKKNEILDILEGKQAKAEVLVT 342
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L +YDY+ H KKS+ Y G WR+ L L+ +D
Sbjct: 343 GNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEADY--WAGESWRKELINMLVHPAD- 397
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK----- 319
+I++ Q+ LG++ + + R+ R + + + L +R +
Sbjct: 398 --QIVSQLGQDDRLGLVIADIPSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEVDFKQ 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L PL +L+L E L ++ HA+ER + +
Sbjct: 456 MNTFVMSYGTFVWFKYDALSPLFDLNLTEEDVPSEPLPQNSILHAIERLLVYIAWDKQYD 515
Query: 378 FSIE 381
F I
Sbjct: 516 FKIS 519
>gi|55821450|ref|YP_139892.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG
18311]
gi|55737435|gb|AAV61077.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG
18311]
Length = 594
Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats.
Identities = 64/247 (25%), Positives = 101/247 (40%), Gaps = 18/247 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
KIA+ +H YY D + +F +DLF+T +K + + + K A++++
Sbjct: 287 KKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEDKKAEIQSILDKNGKVARIFIT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRDV P L L YDY+ H KKS Y G WR LF L+ +D
Sbjct: 347 GNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPEYPY--WVGDSWRNELFSMLIQPAD- 401
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321
II E++ LG++ + + RY + + + DL +R
Sbjct: 402 --NIIANLERDDRLGLVIADIPSFFRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFDK 459
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC--SVRYTE 377
GT W K L+PL +L L E + + H++ER R +
Sbjct: 460 MNTFIMSYGTFIWFKYDALKPLFDLDLQDEEIPAEPIPQHTILHSIERILVYLAWARRYD 519
Query: 378 FSIESVD 384
++I D
Sbjct: 520 YAIAKND 526
>gi|306831662|ref|ZP_07464819.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
gallolyticus subsp. gallolyticus TX20005]
gi|325978600|ref|YP_004288316.1| rhamnosyltransferase [Streptococcus gallolyticus subsp.
gallolyticus ATCC BAA-2069]
gi|304426087|gb|EFM29202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
gallolyticus subsp. gallolyticus TX20005]
gi|325178528|emb|CBZ48572.1| rhamnosyltransferase [Streptococcus gallolyticus subsp.
gallolyticus ATCC BAA-2069]
Length = 586
Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats.
Identities = 59/239 (24%), Positives = 96/239 (40%), Gaps = 16/239 (6%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E + +FD+DLF+T K + E + K AQ+++
Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKIAQVFLT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D
Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I+ E N LG++ + + RY + + + +L +R
Sbjct: 402 --NILANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWERMNLERQIDFNN 459
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GT W K L+PL +L L + + + H++ER +
Sbjct: 460 LSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518
>gi|55823377|ref|YP_141818.1| polysaccharide biosynthesis protein [Streptococcus thermophilus
CNRZ1066]
gi|55739362|gb|AAV63003.1| polysaccharide biosynthesis protein [Streptococcus thermophilus
CNRZ1066]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 65/244 (26%), Positives = 100/244 (40%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E +F +DL++T E ++ EQ + + A + V
Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDIEEKKQEIEQILSRRSQDATIVVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321
+I+ E NP +G+ Y RY R + + + L +R G
Sbjct: 399 --QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPEMNKLWQRMGATKNIDFKN 456
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+PL +L+L L ++ HA+ER + +
Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLPQNSILHAIERLLVYIAWDQKYD 516
Query: 378 FSIE 381
F I
Sbjct: 517 FRIS 520
>gi|94990172|ref|YP_598272.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS10270]
gi|94543680|gb|ABF33728.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS10270]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I++ FE N +G+I + + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+ L +L L L ++ HA+ER +
Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515
Query: 378 FSI 380
F I
Sbjct: 516 FRI 518
>gi|94988294|ref|YP_596395.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS9429]
gi|94992170|ref|YP_600269.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS2096]
gi|94541802|gb|ABF31851.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS9429]
gi|94545678|gb|ABF35725.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS2096]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I++ FE N +G+I + + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+ L +L L L ++ HA+ER +
Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515
Query: 378 FSI 380
F I
Sbjct: 516 FRI 518
>gi|330899783|gb|EGH31202.1| hypothetical protein PSYJA_20361 [Pseudomonas syringae pv. japonica
str. M301072PT]
Length = 626
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 53/241 (21%), Positives = 90/241 (37%), Gaps = 11/241 (4%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFE-QDVLKYFPSA- 201
+ +A+ +H +Y+D + SH L D+F+T+ +A + V P
Sbjct: 112 PEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLADAKHQKKTIAVFSKHPRVK 171
Query: 202 --QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
++ + N+GR+ P L YD C +H KKS G E W +L
Sbjct: 172 NLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 227
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
LL ++I R++N F + LG+ + W +
Sbjct: 228 LLRDANIITRLLNAFADHKDLGLYYPTTFWMMPSWVNHVTM--NKSFMAAWHNEWQIAPC 285
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L + G MFW +P+ L+ + F +E DG++ HA+ER +
Sbjct: 286 DGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 345
Query: 379 S 379
Sbjct: 346 K 346
>gi|21910063|ref|NP_664331.1| hypothetical protein SpyM3_0527 [Streptococcus pyogenes MGAS315]
gi|28896239|ref|NP_802589.1| hypothetical protein SPs1327 [Streptococcus pyogenes SSI-1]
gi|21904254|gb|AAM79134.1| putative protein [Streptococcus pyogenes MGAS315]
gi|28811490|dbj|BAC64422.1| conserved hypothetical protein [Streptococcus pyogenes SSI-1]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I++ FE N +G+I + + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+ L +L L L ++ HA+ER +
Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515
Query: 378 FSI 380
F I
Sbjct: 516 FRI 518
>gi|319946716|ref|ZP_08020950.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
australis ATCC 700641]
gi|319746764|gb|EFV99023.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
australis ATCC 700641]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 62/244 (25%), Positives = 97/244 (39%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E +F +DL++T E + E+ + A + V
Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQAFHFAYDLWITTDVEEKKQAIEEILSNRAQVATVVVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNIGRDVLPMLLL--KEQLSHYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-----K 319
+I+ E NP +G+ + + RY R + + L +R G
Sbjct: 399 --KILANMEANPKVGITIADIPTFFRYNRIVVAWNEVLISPEMNKLWQRMGATKTIDFKN 456
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+PL +L+L L ++ HA+ER + +
Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLKAADVPAEPLPQNSILHAIERLLVYIAWDQKYD 516
Query: 378 FSIE 381
F I
Sbjct: 517 FRIS 520
>gi|50913971|ref|YP_059943.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS10394]
gi|50903045|gb|AAT86760.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS10394]
Length = 581
Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats.
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I++ FE N +G+I + + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+ L +L L L ++ HA+ER +
Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515
Query: 378 FSI 380
F I
Sbjct: 516 FRI 518
>gi|322516362|ref|ZP_08069287.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124]
gi|322125095|gb|EFX96488.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124]
Length = 581
Score = 224 bits (570), Expect = 3e-56, Method: Composition-based stats.
Identities = 63/244 (25%), Positives = 99/244 (40%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E +F +DL++T E + E+ + A + V
Sbjct: 284 RKVAVHLHVFYVDLLEEFLDAFQAFHFIYDLWITTDVEEKKQAIEKILSNRVQDATVVVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D+
Sbjct: 344 GNIGRDVLPMLLL--KEQLSRYDYVGHFHTKKSKEADF--WAGESWRKELIEMLVKPADL 399
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-----K 319
I+ E NP +G+ + + RY R + + + L +R G
Sbjct: 400 ---ILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPEMNKLWQRMGATKTIDFKS 456
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+PL +L+L L ++ HA+ER + +
Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLPQNSILHAIERLLIYIAWDQKYD 516
Query: 378 FSIE 381
F I
Sbjct: 517 FRIS 520
>gi|116628171|ref|YP_820790.1| polysaccharide biosynthesis protein [Streptococcus thermophilus
LMD-9]
gi|116101448|gb|ABJ66594.1| Lipopolysaccharide biosynthesis protein [Streptococcus thermophilus
LMD-9]
Length = 581
Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats.
Identities = 65/244 (26%), Positives = 100/244 (40%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E +F +DL++T E ++ EQ + + A + V
Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDVEEKKQEIEQILSRRSQDATIVVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321
+I+ E NP +G+ Y RY R + + + L +R G
Sbjct: 399 --QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPEMNKLWQRMGATKNIDFKN 456
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+PL +L+L L ++ HA+ER + +
Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLPQNSILHAIERLLVYIAWDQKYD 516
Query: 378 FSIE 381
F I
Sbjct: 517 FRIS 520
>gi|83950907|ref|ZP_00959640.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM]
gi|83838806|gb|EAP78102.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM]
Length = 752
Score = 223 bits (568), Expect = 4e-56, Method: Composition-based stats.
Identities = 69/251 (27%), Positives = 106/251 (42%), Gaps = 13/251 (5%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLY 204
K++ A+ H YY D W E + + DL++T+ E + ++ + FP A +
Sbjct: 130 KARFALHAHIYYPDLWPEFATRFDEIGDGIDLYITLTWRGEETRWLADEITERFPRAFVT 189
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ N+GRD+ PFL L G FD YD LCKIH KKS H +G WRR L +L +
Sbjct: 190 PVPNRGRDILPFLLLANAGAFDGYDALCKIHTKKSP----HRDDGDQWRRHLIDGVLPAT 245
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324
+ R+ + + + + + W + + +R L
Sbjct: 246 GLQERLQHFLADDAAAFWVADGQAYAARDW-----WGINRDKTAAVLRRVELDPLLDALR 300
Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
F G+++W+KP L ++ L L FE E+ DG L HAVER I
Sbjct: 301 FPAGSIYWMKPLMLGMIKALDLDAPMFEPEKGQVDGTLAHAVERAIGGLALAAGQEIRET 360
Query: 384 DCVAEYERLLH 394
+ R H
Sbjct: 361 AALMRPRRAGH 371
Score = 73.8 bits (180), Expect = 4e-11, Method: Composition-based stats.
Identities = 18/110 (16%), Positives = 32/110 (29%), Gaps = 9/110 (8%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73
I + + P ++G W + ++ + H + F + WL
Sbjct: 633 IYDYRAIAARSLTPQYRDRLPPNTIAGIMPSWDNTARRGPRAHIAHGATPASFRN---WL 689
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
R LS F + E +KA L + + SE
Sbjct: 690 RGLCG----GPLSQSYRGELFINAWNEWAEKAMLEPSTRFGRLYLDVLSE 735
>gi|288905572|ref|YP_003430794.1| polysaccharide biosynthesis protein (RgpF) [Streptococcus
gallolyticus UCN34]
gi|288732298|emb|CBI13867.1| Putative polysaccharide biosynthesis protein (RgpF) [Streptococcus
gallolyticus UCN34]
Length = 586
Score = 222 bits (567), Expect = 5e-56, Method: Composition-based stats.
Identities = 58/239 (24%), Positives = 95/239 (39%), Gaps = 16/239 (6%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E + +FD+DLF+T K + E + K AQ+++
Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKIAQVFLT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D
Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
I+ E N LG++ + + RY + + + +L +
Sbjct: 402 --NILANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWECMNLERQIDFNN 459
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GT W K L+PL +L L + + + H++ER +
Sbjct: 460 LSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518
>gi|302337198|ref|YP_003802404.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293]
gi|301634383|gb|ADK79810.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293]
Length = 1808
Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats.
Identities = 56/239 (23%), Positives = 93/239 (38%), Gaps = 17/239 (7%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFP---SAQL 203
I + +H +Y D E+ L+ + F LF++ + + ++ V K P +
Sbjct: 1018 SIGVHLHLFYIDLAEELLSSLINIPVCFSLFISTSAGVKDQEYIKKIVNKKLPLCNECTV 1077
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
EN+GRD+ PF+ ++D + H KKS H RR+L +LG
Sbjct: 1078 IQTENRGRDIAPFIVEFGNS-LSQFDLILHFHSKKSL----HSDSLSDARRFLLHYILGN 1132
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR-RVIDLAKRAGFPTKRLH 322
I I+ +N F +N +GM+ + + K+ G
Sbjct: 1133 KAITIQNLNMFFENGSIGMVAPPYH----PSLRNMPNFGLQEYETKQFLKKMGINYSGKC 1188
Query: 323 LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
DF G+ FW + + L ++ F EE+ DG L H +ER + F I
Sbjct: 1189 TDFPAGSFFWCRKDAIRQLLTSNIRWNSFPEEKGQIDGTLAHVIERSLGIICKQNNFKI 1247
>gi|225868697|ref|YP_002744645.1| rhamnan synthesis protein F family protein [Streptococcus equi
subsp. zooepidemicus]
gi|225701973|emb|CAW99527.1| rhamnan synthesis protein F family protein [Streptococcus equi
subsp. zooepidemicus]
Length = 581
Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats.
Identities = 58/274 (21%), Positives = 105/274 (38%), Gaps = 19/274 (6%)
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
++ + + + + KIA+ +H +Y D E +FD+DL +T
Sbjct: 260 HLPDAKYLLAHKYLPEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTD 319
Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+
Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300
+ G WR L ++ +D +I+ + +G++ + + R+ +
Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431
Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355
+ + L + G GT W K L+PL +L L
Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAEP 491
Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387
L ++ HA+ER R+ +F I + +
Sbjct: 492 LPQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525
>gi|157151529|ref|YP_001450315.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr.
CH1]
gi|157076323|gb|ABV11006.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr.
CH1]
Length = 582
Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats.
Identities = 63/244 (25%), Positives = 102/244 (41%), Gaps = 18/244 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
KIA+ +H +Y D E H +F +DLF+T K + + A+++V
Sbjct: 283 KKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDSEKKKNEILGILEGKQAKAEVFVT 342
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N GRDV P L L +YDY+ H KKS+ Y G WR+ L L+ +D
Sbjct: 343 GNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEADY--WAGESWRKELINMLVHPAD- 397
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK----- 319
+I++ Q+ CLG++ + + R+ R + + + L +R +
Sbjct: 398 --QIVSQLGQDDCLGLVIADIPSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEVDFKQ 455
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L PL +L++ E L ++ HA+ER + +
Sbjct: 456 MNTFVMSYGTFVWFKYDALSPLFDLNMTEEDVPSEPLPQNSILHAIERLLVYIAWDKQYD 515
Query: 378 FSIE 381
F I
Sbjct: 516 FKIS 519
>gi|269978088|ref|ZP_06185038.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1]
gi|306818459|ref|ZP_07452182.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239]
gi|307700705|ref|ZP_07637730.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16]
gi|269933597|gb|EEZ90181.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1]
gi|304648632|gb|EFM45934.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239]
gi|307613700|gb|EFN92944.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16]
Length = 613
Score = 222 bits (566), Expect = 8e-56, Method: Composition-based stats.
Identities = 69/254 (27%), Positives = 106/254 (41%), Gaps = 10/254 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVL 195
+ K +IA V H +Y D EI L +F+T E EQ +
Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344
Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWR 253
A++ ++E N+GRDV FL + R+D + KIH KKS ++ Y+ +++
Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGRFDVVAKIHSKKSAQDAYNAA--ELFK 402
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
R LF +LL +++ F P LGM+ A + + L +R
Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLGYPTLGHA-WFANKKPALALCER 461
Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
G P G+MF+ +P+ L PL H +F EE DG+L H +ER F+
Sbjct: 462 LGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVIERIFS 521
Query: 371 CSVRYTEFSIESVD 384
S +SV
Sbjct: 522 YSSLSEGLICKSVM 535
>gi|195977971|ref|YP_002123215.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus equi
subsp. zooepidemicus MGCS10565]
gi|195974676|gb|ACG62202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus equi
subsp. zooepidemicus MGCS10565]
Length = 581
Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats.
Identities = 59/274 (21%), Positives = 106/274 (38%), Gaps = 19/274 (6%)
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
++ + + S + + KIA+ +H +Y D E +FD+DL +T
Sbjct: 260 HLPDAKYLLAHKYLSNQPISIAPSKKIAVHLHVFYADLLSEFLEAFSHFHFDYDLLITTD 319
Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+
Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300
+ G WR L ++ +D +I+ + +G++ + + R+ +
Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431
Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355
+ + L + G GT W K L+PL +L L
Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLGLNEADIPAEP 491
Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387
L ++ HA+ER R+ +F I + +
Sbjct: 492 LPQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525
>gi|222152862|ref|YP_002562039.1| rhamnan synthesis protein F family protein [Streptococcus uberis
0140J]
gi|222113675|emb|CAR41606.1| rhamnan synthesis protein F family protein [Streptococcus uberis
0140J]
Length = 585
Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats.
Identities = 64/247 (25%), Positives = 101/247 (40%), Gaps = 19/247 (7%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYV 205
+ IA+ +H +Y D E H F FDL++T E + + + + SA++ V
Sbjct: 285 EHSIAVHLHVFYVDLLEEFLHAFTSFKFPFDLYITTDKSEKESEIKAILDSFRVSAKIVV 344
Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265
N GRDV P L L +YDY+ H KKS+ + G WR L L+ +
Sbjct: 345 TGNIGRDVLPMLKL--KDELSQYDYIGHFHTKKSKEADF--WAGESWRNELIDMLIKPA- 399
Query: 266 IAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL 323
IIN FE +P +G+I + + R+ + + + L ++
Sbjct: 400 --NTIINQFE-DPAIGIIIADIPSFFRFNKIVTPLNEHLIAPEMNKLWEKMNLSKTIDFE 456
Query: 324 DF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYT 376
F GT W K L+PL +L+L + L ++ HAVER +
Sbjct: 457 QFDTFVMSYGTFVWFKYDALKPLFDLNLKDGDVPKEPLPQNSILHAVERLLIYIAWDSHF 516
Query: 377 EFSIESV 383
+F I
Sbjct: 517 DFRIAKN 523
>gi|225870347|ref|YP_002746294.1| rhamnan synthesis protein F family protein [Streptococcus equi
subsp. equi 4047]
gi|225699751|emb|CAW93520.1| rhamnan synthesis protein F family protein [Streptococcus equi
subsp. equi 4047]
Length = 581
Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats.
Identities = 58/274 (21%), Positives = 104/274 (37%), Gaps = 19/274 (6%)
Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184
+ + + + + KIA+ +H +Y D E +FD+DL +T
Sbjct: 260 HPPDAKYLLAHKYLPEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTD 319
Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+
Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300
+ G WR L ++ +D +I+ + +G++ + + R+ +
Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431
Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355
+ + L + G GT W K L+PL +L L
Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAEP 491
Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387
L ++ HA+ER R+ +F I + +
Sbjct: 492 LSQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525
>gi|227875198|ref|ZP_03993340.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris
ATCC 35243]
gi|227844103|gb|EEJ54270.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris
ATCC 35243]
Length = 613
Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats.
Identities = 68/254 (26%), Positives = 105/254 (41%), Gaps = 10/254 (3%)
Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVL 195
+ K +IA V H +Y D EI L +F+T E EQ +
Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344
Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253
A++ ++E N+GRDV FL + +D + KIH KKS ++ Y+ +++
Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGCFDVVAKIHSKKSAQDAYNAA--ELFK 402
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
R LF +LL +++ F P LGM+ A + + L +R
Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLGYPTLGHA-WFANKKPALALCER 461
Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
G P G+MF+ +P+ L PL H +F EE DG+L H +ER F+
Sbjct: 462 LGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVIERIFS 521
Query: 371 CSVRYTEFSIESVD 384
S +SV
Sbjct: 522 YSSLSEGLICKSVM 535
>gi|306833804|ref|ZP_07466929.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis
ATCC 700338]
gi|304423998|gb|EFM27139.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis
ATCC 700338]
Length = 586
Score = 221 bits (564), Expect = 1e-55, Method: Composition-based stats.
Identities = 58/239 (24%), Positives = 97/239 (40%), Gaps = 16/239 (6%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E + +FD+DLF+T K + E + K +AQ+++
Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKTAQVFLT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D
Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319
++ E N LG++ + + RY + + + +L +R
Sbjct: 402 --NVLANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWERMNLGRQIDFNN 459
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GT W K L+PL +L L + + + H++ER +
Sbjct: 460 LSTFIMSYGTFIWFKHDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518
>gi|296135664|ref|YP_003642906.1| glycosyl transferase family 2 [Thiomonas intermedia K12]
gi|295795786|gb|ADG30576.1| glycosyl transferase family 2 [Thiomonas intermedia K12]
Length = 1414
Score = 221 bits (564), Expect = 1e-55, Method: Composition-based stats.
Identities = 79/241 (32%), Positives = 109/241 (45%), Gaps = 19/241 (7%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGR 211
A+++H YY D W E L L D++V++ E ++ D+++ P A + NKGR
Sbjct: 281 AVLLHLYYPDLWPEFLAHLKTLPAPCDVYVSLSEGREELLTDIVRDLPDAVVMRHPNKGR 340
Query: 212 DVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY---------HPIEGIIWRRWLFFDLLG 262
D+ P L LL L Y L +HGKKS +G WRR L LL
Sbjct: 341 DIAPRLALLRLARAHNYKQLLFLHGKKSPHLKEVENIHIPFLQHKDGDRWRRELLAALL- 399
Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322
D + + I F Q P LG+IG + R + R+ A+R G
Sbjct: 400 --DASEKTIAAFAQQPKLGLIGPHGFWLGLR------GDANFPRLSAQAQRMGITPDPAR 451
Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
+F G+MFW +P+ L+PL L L +FE+E DG L H VER FA S F I
Sbjct: 452 HGYFAGSMFWCRPQALDPLLALDLKDADFEDETGQTDGTLAHVVERLFALSAEKAGFQIA 511
Query: 382 S 382
Sbjct: 512 D 512
>gi|322373386|ref|ZP_08047922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus sp.
C150]
gi|321278428|gb|EFX55497.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus sp.
C150]
Length = 594
Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats.
Identities = 65/247 (26%), Positives = 101/247 (40%), Gaps = 18/247 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206
KIA+ +H YY D + +F +DLF+T E K+ + + K+ A++++
Sbjct: 287 KKIAVHLHTYYVDLLDDFLRQFENFHFTYDLFLTTDSEEKKKEIQSILDKHGKEARIFIT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRDV P L L YDY+ H KKS Y G WR LF L+ +D
Sbjct: 347 GNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPEYPY--WVGDSWRNELFSMLIQPAD- 401
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321
II E + LG++ + + RY + + + DL +R
Sbjct: 402 --NIIANLEHDDRLGLVIADIPTFFRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFDK 459
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC--SVRYTE 377
GT W K L+PL +L L E + + H++ER R +
Sbjct: 460 MNTFIMSYGTFIWFKYDTLKPLFDLDLQDEEIPAEPIPQHTILHSIERILVYLAWARRYD 519
Query: 378 FSIESVD 384
++I D
Sbjct: 520 YAIAKND 526
>gi|312867647|ref|ZP_07727853.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405]
gi|311096710|gb|EFQ54948.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405]
Length = 582
Score = 220 bits (561), Expect = 3e-55, Method: Composition-based stats.
Identities = 63/252 (25%), Positives = 102/252 (40%), Gaps = 18/252 (7%)
Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYF 198
++ K+A+ +H +Y D E +F +DL++T E + E+ +
Sbjct: 276 QELAENFDRKVAVHLHVFYVDLLEEFLDAFQAFHFVYDLWITTDVEEKKQTIEKILSNRA 335
Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258
A + V N GRDV P L L +YDY+ H KKS+ + G WR+ L
Sbjct: 336 QDATVVVTGNIGRDVLPMLLL--KEQLSQYDYVGHFHTKKSKEADF--WAGESWRKELIE 391
Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAG- 315
L+ +D +I+ E NP +G+ + + RY R + + + L +R G
Sbjct: 392 MLVKPAD---QILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPEMNKLWERMGA 448
Query: 316 ---FPTKR-LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC 371
K GT W K L+PL +L+L L ++ HA+ER
Sbjct: 449 AKTIDFKNLNTFVMSYGTFVWFKYDALKPLFDLNLTAANVPAEPLPQNSILHAIERLLIY 508
Query: 372 SV--RYTEFSIE 381
+ +F I
Sbjct: 509 IAWDQKYDFRIS 520
>gi|304309760|ref|YP_003809358.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1]
gi|301795493|emb|CBL43691.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1]
Length = 1315
Score = 219 bits (559), Expect = 4e-55, Method: Composition-based stats.
Identities = 63/317 (19%), Positives = 111/317 (35%), Gaps = 27/317 (8%)
Query: 84 KLSFPSCRIFFYGSRKEQKAFLR----LNRFMSNSRMPFDSEKFLYVK---ELFEGWNDR 136
+ S + + F R + FL R+ + + + + L L + +
Sbjct: 363 RNSQEAAAMLFPRLRTITRTFLEKLPTPLRYRLQAFLRTLAHRLLPNAVQGRLAQTATNH 422
Query: 137 PSSPKKSGL--------TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188
P + L T + IAI +H YY D L R+ FDL++++
Sbjct: 423 PYPEQLKQLHELTLPKHTSNATIAIHIHLYYADLAPTFVQALSRMERPFDLYISIQVRAN 482
Query: 189 DFEQDVLKY----FPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E + + + N GRD+ PF+ + +YD + +H KKS Y
Sbjct: 483 PVEIEAVVRKIPCLRGLDIRATPNLGRDLYPFVCIFG-EALRKYDIIAHLHSKKSL---Y 538
Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY 304
+ W ++ L + RI+ G++ + + + +
Sbjct: 539 NQGATAGWLEYILDSLFRSPEDIARILERLSDASQTGIVYPQNFS-GLPYMAYT-WLANR 596
Query: 305 RRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALE 362
R + R G + + D+ G+MFW + + P L +FE E DG L
Sbjct: 597 SRAQQVQARFGLTSLPSGYFDYPAGSMFWARADAIAPFFEAQLNEDDFENESGQTDGTLA 656
Query: 363 HAVERFFACSVRYTEFS 379
H +ERF F
Sbjct: 657 HTLERFLVLVPESLGFR 673
>gi|329116186|ref|ZP_08244903.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020]
gi|326906591|gb|EGE53505.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020]
Length = 589
Score = 219 bits (558), Expect = 6e-55, Method: Composition-based stats.
Identities = 66/248 (26%), Positives = 106/248 (42%), Gaps = 19/248 (7%)
Query: 147 IKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLY 204
I K+AI +H +Y D E +FD+DLF+T K + + + + A+++
Sbjct: 288 INKKVAIHLHTFYVDLLQEFLSAFENFHFDYDLFITTDIEEKKTQIENVLNENNQKAEVF 347
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
V N GRDV P L L YDY+ H KKS+ + G WR+ L L+ +
Sbjct: 348 VTGNIGRDVLPML--LLKEKLSVYDYIGHFHTKKSKEADF--WAGESWRKELIKMLVLPA 403
Query: 265 DIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL- 321
D I+ T E+N +G++ + Y RY + + + + +L K+ G
Sbjct: 404 D---SILATLEKN-KVGIVIADMPTYFRYNKIVTAWNENLIAPEMNELWKKMGLTKSIDF 459
Query: 322 ----HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RY 375
GT W K L+PL +L+L E L ++ HA+ER ++
Sbjct: 460 NHLHTFVMSYGTFVWFKYDALKPLFDLNLTVEDVPAEPLPQNSILHAIERLLIYIAWNQH 519
Query: 376 TEFSIESV 383
+F I
Sbjct: 520 YDFRISKN 527
>gi|322385732|ref|ZP_08059376.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
cristatus ATCC 51100]
gi|321270470|gb|EFX53386.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
cristatus ATCC 51100]
Length = 598
Score = 219 bits (558), Expect = 7e-55, Method: Composition-based stats.
Identities = 65/247 (26%), Positives = 102/247 (41%), Gaps = 18/247 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLYVM 206
KIA+ +H YY D + +F +DLF+T K E + +LK ++Y+
Sbjct: 287 KKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEKKKLEIEAVLLKRNQLGKIYIT 346
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
NKGRD+ P L L YDY+ H KKS Y G WR LF LL +D+
Sbjct: 347 GNKGRDIIPMLKL--REELCTYDYIGHFHTKKSPEYPY--WVGDSWRNELFDMLLKPADL 402
Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321
I+ + E + LG++ + + RY + ++ + L +R
Sbjct: 403 ---IMASLENDKRLGLVIADIPTFFRYTKIVDPWNENKFADDMNILWERMDINRSIDFNK 459
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377
GT W K L+PL +L+L E L + H++ER + +
Sbjct: 460 LNTFIMSYGTFIWFKYDALKPLFDLNLQDEDIPSEPLPQHTILHSIERILVYLAWSQRFD 519
Query: 378 FSIESVD 384
++I D
Sbjct: 520 YAISKND 526
>gi|32455988|ref|NP_861990.1| rb115 [Ruegeria sp. PR1b]
gi|22726340|gb|AAN05136.1| RB115 [Ruegeria sp. PR1b]
Length = 963
Score = 217 bits (554), Expect = 2e-54, Method: Composition-based stats.
Identities = 63/264 (23%), Positives = 107/264 (40%), Gaps = 18/264 (6%)
Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS- 200
+ K ++ + +H YY D E+ +L RL F+L +++ E +++++ F +
Sbjct: 148 RQPPLPKGRLVVQLHLYYVDMAAEMIALLARLPVTFELLLSLPETAVVADEEMISLFRAG 207
Query: 201 ------AQLYVMENKGRDVRPFLYLLELGV--FDRYDYLCKIHGKKSQREGYHPIEGIIW 252
L + N+GRDV P++ + D + +H KKS YH W
Sbjct: 208 LERLGAITLRRVPNRGRDVAPWMVSFRSELRALADRDLVLHLHSKKSPHGNYHVG----W 263
Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312
R+L LLG + +A +++ F ++P LG++ + +R + K L +
Sbjct: 264 GRYLGHSLLGSTAVAAQMLGLFAEDPELGLVAPAYWPALRRAPNYGKVG---DLCAHLFR 320
Query: 313 RAGF-PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370
R G + DF G+ F + L P L L +F E G L HAVER
Sbjct: 321 RMGLGEVDPICADFPAGSFFCARAAVLRPFLTLGLEARDFPAEAGQICGTLAHAVERLLG 380
Query: 371 CSVRYTEFSIESVDCVAEYERLLH 394
+ V +E H
Sbjct: 381 QVPARLGLRFDMVAVDLPFEEAAH 404
>gi|192359986|ref|YP_001983898.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio
japonicus Ueda107]
gi|190686151|gb|ACE83829.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio
japonicus Ueda107]
Length = 872
Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats.
Identities = 77/262 (29%), Positives = 112/262 (42%), Gaps = 14/262 (5%)
Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE- 185
E + + + + + + +IA+V H YY+D EI L + FDL VT+ +
Sbjct: 579 PEEAVRRDSQFAEIRAALEHSQKRIAVVAHLYYRDLVPEILSALETIPEAFDLIVTLPDW 638
Query: 186 ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH 245
+ EQ V + +P A Y N+GRD+ PF+ LL L YD L KI K+
Sbjct: 639 GTRHIEQMVREAYPEAVFYRAVNRGRDIGPFVDLLPLITEKNYDALLKIQTKRGYYRSGR 698
Query: 246 --PIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303
P G +WR F LLG I+ +P L M+G Y + + ++
Sbjct: 699 LLPQFGQLWRSETFRALLGNKSRVTDILEALRTDPSLNMVGPSPYFLSLTKYPYHDQGDL 758
Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLH--LIGEFEEERNLKDGAL 361
+ +++ FF GTMFWV+P CL PL I FE E DGA
Sbjct: 759 AQTILN---------NPTGNGFFAGTMFWVRPSCLRPLTEPEHLSITAFEPESGANDGAT 809
Query: 362 EHAVERFFACSVRYTEFSIESV 383
H +ER F+ + I V
Sbjct: 810 AHLIERLFSQVAFANDGKIAGV 831
>gi|310286583|ref|YP_003937841.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum
S17]
gi|309250519|gb|ADO52267.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum
S17]
Length = 662
Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats.
Identities = 52/257 (20%), Positives = 89/257 (34%), Gaps = 15/257 (5%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
P+ + + A + H Y+ D + + L + DL++T E D +D +
Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351
Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249
+ N+GRDV L V YD + H KKS + G+H E
Sbjct: 352 SHGVNHPVTFISVVNRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 411
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307
+ L + L D I+ F P LG + + + +
Sbjct: 412 QGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGANFEIT 471
Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361
+L + R P G+ +W + + L+PL +F E +DG +
Sbjct: 472 KELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGEDGTV 531
Query: 362 EHAVERFFACSVRYTEF 378
HA+ER + +
Sbjct: 532 SHAIERANGYICQSQGY 548
>gi|224284010|ref|ZP_03647332.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum
NCIMB 41171]
gi|313141164|ref|ZP_07803357.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB
41171]
gi|313133674|gb|EFR51291.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB
41171]
Length = 662
Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats.
Identities = 52/257 (20%), Positives = 89/257 (34%), Gaps = 15/257 (5%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196
P+ + + A + H Y+ D + + L + DL++T E D +D +
Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351
Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249
+ N+GRDV L V YD + H KKS + G+H E
Sbjct: 352 SHGVNHPVTFISVVNRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 411
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307
+ L + L D I+ F P LG + + + +
Sbjct: 412 QGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGANFEIT 471
Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361
+L + R P G+ +W + + L+PL +F E +DG +
Sbjct: 472 KELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGEDGTV 531
Query: 362 EHAVERFFACSVRYTEF 378
HA+ER + +
Sbjct: 532 SHAIERANGYICQSQGY 548
>gi|320330331|gb|EFW86314.1| hypothetical protein PsgRace4_09215 [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 774
Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats.
Identities = 52/241 (21%), Positives = 88/241 (36%), Gaps = 11/241 (4%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199
+ +AI +H +Y+D + SH L D+F+T+ K +
Sbjct: 260 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 319
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
+ ++ + N+GR+ P L YD C +H KKS G E W +L
Sbjct: 320 NLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
LL ++I R++N F + LG+ + W +
Sbjct: 376 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 433
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L + G MFW +P+ L+ + F +E DG++ HA+ER +
Sbjct: 434 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493
Query: 379 S 379
Sbjct: 494 K 494
>gi|330882679|gb|EGH16828.1| hypothetical protein Pgy4_27710 [Pseudomonas syringae pv. glycinea
str. race 4]
Length = 608
Score = 215 bits (548), Expect = 8e-54, Method: Composition-based stats.
Identities = 52/241 (21%), Positives = 88/241 (36%), Gaps = 11/241 (4%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199
+ +AI +H +Y+D + SH L D+F+T+ K +
Sbjct: 94 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 153
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
+ ++ + N+GR+ P L YD C +H KKS G E W +L
Sbjct: 154 NLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 209
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
LL ++I R++N F + LG+ + W +
Sbjct: 210 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 267
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L + G MFW +P+ L+ + F +E DG++ HA+ER +
Sbjct: 268 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 327
Query: 379 S 379
Sbjct: 328 K 328
>gi|15674835|ref|NP_269009.1| hypothetical protein SPy_0792 [Streptococcus pyogenes M1 GAS]
gi|71910421|ref|YP_281971.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS5005]
gi|13621968|gb|AAK33730.1| conserved hypothetical protein - possibly involved in cell wall
localization and side chain formation of
rhamnose-glucose polysaccharide [Streptococcus pyogenes
M1 GAS]
gi|71853203|gb|AAZ51226.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS5005]
Length = 581
Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFENWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGDSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|71903253|ref|YP_280056.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS6180]
gi|71802348|gb|AAX71701.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes MGAS6180]
Length = 581
Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats.
Identities = 54/241 (22%), Positives = 95/241 (39%), Gaps = 15/241 (6%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKRL 321
+ + T + + + R+ + + + ++ L ++
Sbjct: 400 ILSVFETDDIGII--IADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAMD 457
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEFS 379
GT W K L+ L +L L L ++ HA+ER F +F
Sbjct: 458 TFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLFVYIAWGNSYDFR 517
Query: 380 I 380
I
Sbjct: 518 I 518
>gi|94994091|ref|YP_602189.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS10750]
gi|94547599|gb|ABF37645.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes
MGAS10750]
Length = 581
Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats.
Identities = 54/241 (22%), Positives = 95/241 (39%), Gaps = 15/241 (6%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKRL 321
+ + T + + + R+ + + + ++ L ++
Sbjct: 400 ILSVFETDDIGII--IADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAMD 457
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEFS 379
GT W K L+ L +L L L ++ HA+ER F +F
Sbjct: 458 TFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLFVYIAWGNSYDFR 517
Query: 380 I 380
I
Sbjct: 518 I 518
>gi|325276923|ref|ZP_08142610.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51]
gi|324097938|gb|EGB96097.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51]
Length = 758
Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats.
Identities = 70/325 (21%), Positives = 109/325 (33%), Gaps = 24/325 (7%)
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSR----KEQKAFLRLNRFM-----SNSRMPFD 119
F L +F + S+ S + F +++ +
Sbjct: 164 FASELDAFKDYLHKSRFSPVNPSENFDNEIYHRCNIDVFHAQISPLFHYIISGQTEGRAY 223
Query: 120 SEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179
S FE +PK S KIAI +H YY D + L + DL
Sbjct: 224 SSVMPKWTPKFEINPASELTPKAS----NQKIAICLHIYYDDYIERFAEALYTFPTEVDL 279
Query: 180 FVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIH 235
+T+ + ++ + + N+GR+ P L + YD LC +H
Sbjct: 280 LITIANESFRDRAYQTFSKIQAVKKVTIKSVPNRGRNFGPLLVEFAQELLT-YDLLCHLH 338
Query: 236 GKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWS 295
KKS G E W +L LL + R++N F NP G+ + W
Sbjct: 339 SKKSLYSG---REQTQWADYLSEYLLNDCSVVKRVLNAFSDNPQFGVYYPTTFWMMPSWV 395
Query: 296 FFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEER 354
+ +L GF L + G MFW +PK L + N +F E
Sbjct: 396 NHVTM--NKPHMRNLQTALGFGHFDDFLSYPAGGMFWARPKALVDILNKTYTYDDFPNEP 453
Query: 355 NLKDGALEHAVERFFACSVRYTEFS 379
DG++ HA+ER +
Sbjct: 454 LPNDGSMLHALERVIGPVCEKNGYQ 478
>gi|209559162|ref|YP_002285634.1| RgpFc protein [Streptococcus pyogenes NZ131]
gi|209540363|gb|ACI60939.1| RgpFc protein [Streptococcus pyogenes NZ131]
Length = 581
Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|306827605|ref|ZP_07460885.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes ATCC 10782]
gi|304430168|gb|EFM33197.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus
pyogenes ATCC 10782]
Length = 581
Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|19745874|ref|NP_607010.1| hypothetical protein spyM18_0853 [Streptococcus pyogenes MGAS8232]
gi|19748025|gb|AAL97509.1| conserved hypothetical protein [Streptococcus pyogenes MGAS8232]
Length = 581
Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|56808559|ref|ZP_00366292.1| COG3754: Lipopolysaccharide biosynthesis protein [Streptococcus
pyogenes M49 591]
Length = 581
Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|139474025|ref|YP_001128741.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes
str. Manfredo]
gi|134272272|emb|CAM30524.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes
str. Manfredo]
Length = 581
Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats.
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206
K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V
Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D
Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398
Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320
I++ FE + +I + R+ + + + ++ L ++
Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT W K L+ L +L L L ++ HA+ER +F
Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516
Query: 379 SI 380
I
Sbjct: 517 RI 518
>gi|71735705|ref|YP_273244.1| hypothetical protein PSPPH_0972 [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|71556258|gb|AAZ35469.1| conserved hypothetical protein [Pseudomonas syringae pv.
phaseolicola 1448A]
Length = 1262
Score = 212 bits (540), Expect = 8e-53, Method: Composition-based stats.
Identities = 53/237 (22%), Positives = 93/237 (39%), Gaps = 13/237 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-----NKDFEQDVLKYFPSAQLY 204
+I + +H YY D IS L + FDLF++ + D + +
Sbjct: 211 RIGVYLHLYYTDLLGAISKHLNNIPLAFDLFISTPHELDHKKLRKIVSDSVTNVKEISIK 270
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ N+GRD+ PF+ YD +C IH KKS+ W + LLG
Sbjct: 271 HVPNRGRDIAPFIIEFGNE-LQAYDAICHIHTKKSEHTKG----LSDWGDDILSSLLGSR 325
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324
+ +I+ + + + + Y + +++ E+ + ++ +
Sbjct: 326 EDVKKILTLLKGDAKIIYPEGQNYYMKDP-TGWSENHEIAKHILSDHLETDISNFP-KAE 383
Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
F G+MFW + + ++ N+ L +F EE DG L HA+ER S I
Sbjct: 384 FPEGSMFWARQEGIQSFLNIPLDWEDFPEEPIPTDGTLAHALERIILISAYAAPGRI 440
Score = 85.8 bits (211), Expect = 1e-14, Method: Composition-based stats.
Identities = 16/108 (14%), Positives = 31/108 (28%), Gaps = 7/108 (6%)
Query: 30 QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPS 89
A + W + + S VH F+ WL +AF+K
Sbjct: 727 DAPKEFEYFRSLVPTWDNTARYGSESYVVHESTPEKFQG---WLEQSIAFTK--ANLPED 781
Query: 90 CRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWND 135
+ + E + A L + + + + +K L +
Sbjct: 782 RHLVVINAWNEWAEGAHLEPDTYSGYAYLNSVGRVLSGIKYLDDKPTA 829
>gi|320325880|gb|EFW81940.1| hypothetical protein PsgB076_04646 [Pseudomonas syringae pv.
glycinea str. B076]
Length = 774
Score = 211 bits (538), Expect = 1e-52, Method: Composition-based stats.
Identities = 51/241 (21%), Positives = 87/241 (36%), Gaps = 11/241 (4%)
Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199
+ +AI +H +Y+D + SH L D+F+T+ K +
Sbjct: 260 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 319
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259
+ ++ + N+ R+ P L YD C +H KKS G E W +L
Sbjct: 320 NLKVSCVPNRERNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375
Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319
LL ++I R++N F + LG+ + W +
Sbjct: 376 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 433
Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
L + G MFW +P+ L+ + F +E DG++ HA+ER +
Sbjct: 434 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493
Query: 379 S 379
Sbjct: 494 K 494
>gi|160936497|ref|ZP_02083865.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC
BAA-613]
gi|158440582|gb|EDP18320.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC
BAA-613]
Length = 373
Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats.
Identities = 47/233 (20%), Positives = 91/233 (39%), Gaps = 9/233 (3%)
Query: 158 YYQDTWIEISHILLRLNFDFDL-FVTVVEANKDFEQDVLKYFPSA--QLYVMENKGRDVR 214
+Y+D + + ++ D+ FVT + + ++ V EN+GRD+
Sbjct: 2 FYEDLLNQCYLYIEQIPKYIDVCFVTSNPKIAFKVKKYINNTKKINYKVLVKENRGRDMA 61
Query: 215 PFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTF 274
L + + Y+YLC +H KKS + G + +G + ++ +L+G + + I+
Sbjct: 62 ALLVTCHDFIME-YEYLCFVHDKKSLQMG-NDNDGCKFMELIWKNLIGSTGLIENILRYL 119
Query: 275 EQNPCLGMIGSRRYRRYKRWSFFAK-RSEVYRRVIDLAKRAGFP--TKRLHLDFFNGTMF 331
N +G++ F + Y VI+L + G F
Sbjct: 120 GNNRDVGLMVPPIPYWGNYIGVFINPWTCNYDNVINLGNQLKLKKNVCYEKEYVTIGGAF 179
Query: 332 WVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
W + L+PL + +F +E DG + HA+ER + + +
Sbjct: 180 WCRTNALKPLFEYKWKLEDFCQEPMAVDGTISHAIERILGFVALNNGYDVLEI 232
>gi|323135560|ref|ZP_08070643.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242]
gi|322398651|gb|EFY01170.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242]
Length = 812
Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats.
Identities = 60/241 (24%), Positives = 100/241 (41%), Gaps = 13/241 (5%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS--AQLY 204
+ IA +VH +Y + + L + DLF + K +DV + +P ++
Sbjct: 144 ERPIAAIVHGFYPEIAPLVLEKLKNVTGPVDLFFSTDTQEKKHALEDVCRDWPKGRVEIR 203
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ N+GRD+ + D YD +H K+S G WR +LF +LLG
Sbjct: 204 ICPNRGRDIAAKFFGFRDVYAD-YDLFIHLHTKRSPHGG---AALARWRDYLFDNLLGSP 259
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL-HL 323
+I I++ F+ +P +G++ + + Y L KR G + L
Sbjct: 260 EIVNSILSLFD-DPKIGVVFPQHLFELRGIL---NWGYDYDHARALMKRMGVEIDKNLVL 315
Query: 324 DFFNGTMFWVKPKCLEPLRNLHLIGEF-EEERNLKDGALEHAVERFFACSVRYTEFSIES 382
+F +G+MFW + PL +L + + +E DG L HA+ER F
Sbjct: 316 EFPSGSMFWGRSAAFRPLLDLDIDFDDFPQEGGQVDGTLAHAIERSLLMIAESRGFEWLK 375
Query: 383 V 383
V
Sbjct: 376 V 376
>gi|116071634|ref|ZP_01468902.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107]
gi|116065257|gb|EAU71015.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107]
Length = 934
Score = 210 bits (534), Expect = 4e-52, Method: Composition-based stats.
Identities = 54/258 (20%), Positives = 98/258 (37%), Gaps = 15/258 (5%)
Query: 131 EGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD- 189
S K+ + + ++AI +H YY ++ E L L L +T + K
Sbjct: 26 HIDILDHSGKCKTSIFQECQVAIYLHIYYPESLHEFLEYLTVLPSQIRLVITTTTSEKKE 85
Query: 190 ------FEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243
++ + ENKGRD+ F+ + +YD +CK+H KKS G
Sbjct: 86 LIIEILERALLINRLDLCHV-YHENKGRDIGAFINIY--DELIKYDVVCKLHAKKSPHLG 142
Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303
G W R+L +G I+N + +G++ ++ +A ++
Sbjct: 143 E---FGKSWFRYLIRSTIGNQSAIENIVNILYHSKDIGILAPTSFQ-GTNNHDWASNFDI 198
Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALE 362
+ + D + + L + + T+FW KP+ L + + + F EE DG
Sbjct: 199 SQSISDHIFNSELDINKEKLRYPSATVFWFKPEALNQQQFRSIQPDFFPEEPIPIDGTTA 258
Query: 363 HAVERFFACSVRYTEFSI 380
H++ER
Sbjct: 259 HSLERLIPYISILNGLKT 276
>gi|222148479|ref|YP_002549436.1| hypothetical protein Avi_2007 [Agrobacterium vitis S4]
gi|221735467|gb|ACM36430.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 513
Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats.
Identities = 61/243 (25%), Positives = 109/243 (44%), Gaps = 14/243 (5%)
Query: 146 TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFP---SA 201
++ + I VHC+Y + + EI+ L L F L VTV E++ +++L F +
Sbjct: 252 ALQLSLCIHVHCFYVELFNEIADRLQCLTLPFYLVVTVCNESDAKVVENLLVDFNQRQNT 311
Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261
+ V+EN+GRD+ PFL ++ + D + +H KKS H G WRR+LF +
Sbjct: 312 HILVVENRGRDIAPFLIDASP-IWRKSDLVLHLHTKKSP----HITWGDNWRRYLFDQTI 366
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321
G+ + II+ F+ +GM+ + K ++ + + + +A++
Sbjct: 367 GYEPLLKGIIDQFQDRDDMGMMYPENFCMIKHFT---EEEKNKDAIRYIAQKLRLECSFE 423
Query: 322 HLD-FFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAVERFFACSVRYTEFS 379
L + G+M + + K L + + F E+ DG H +ER VR F
Sbjct: 424 ALGAYAAGSMAFYRVKALASVLEYDALENLFGPEQGQLDGTAAHVLERLLPEMVRLNGFE 483
Query: 380 IES 382
+
Sbjct: 484 TQP 486
>gi|332035169|gb|EGI71680.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis
ANT/505]
Length = 672
Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats.
Identities = 55/254 (21%), Positives = 87/254 (34%), Gaps = 11/254 (4%)
Query: 131 EGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF 190
W + + +G K+A+ H +Y + L + D+FV+V
Sbjct: 145 AKWYPKAIASSANGEPTTLKLAMCFHVFYGEFIDYYCGALAKFTQQVDVFVSVASEELAK 204
Query: 191 EQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP 246
+ + V+ N GR+ P L YD C +H KKS G
Sbjct: 205 KAIHDFKACSKVNKVVVKVVPNHGRNFGPMLVEFASD-LQNYDLFCHMHSKKSLYSGRAQ 263
Query: 247 IEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRR 306
W +L LL + +++N F NP G+ + W +
Sbjct: 264 T---QWADYLGEYLLNDPHVIKQVLNHFNDNPKSGLYYPTSFWMMPDWVNH--WLKNKPA 318
Query: 307 VIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAV 365
K+ K L + G MFW +P+ L+ L N +F E DG+ HA+
Sbjct: 319 AQKFTKKWNIELKDDFLAYPAGGMFWARPEALKQLLNKEYKYDDFPGEPLPNDGSQLHAL 378
Query: 366 ERFFACSVRYTEFS 379
ER V +
Sbjct: 379 ERMLGLLVEKNGYK 392
>gi|281490695|ref|YP_003352675.1| bifunctional alpha-L-Rha
alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis
subsp. lactis KF147]
gi|281374464|gb|ADA63985.1| Alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis
subsp. lactis KF147]
Length = 589
Score = 202 bits (515), Expect = 7e-50, Method: Composition-based stats.
Identities = 63/245 (25%), Positives = 102/245 (41%), Gaps = 19/245 (7%)
Query: 151 IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFPSAQLYVMEN 208
+A+ +H YY + E +FD+DL++T K+ ++ + A+L N
Sbjct: 289 VAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTDEKEEIIKEMLKCKDARAKLVRTPN 348
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
GRD+ PFL L +YD + H K+S + G WR L L+ + A
Sbjct: 349 HGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAFF--AGESWRTELISMLI---EPAD 401
Query: 269 RIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLHLDF 325
I+ FEQ LG++ + + R+ + ++ + + D+ KR K DF
Sbjct: 402 NIMAHFEQKQKLGIVIADIPSFFRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDF 461
Query: 326 -----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378
GT FW K + LEPL NL ++ L + HA+ER + +F
Sbjct: 462 NTFTMSYGTFFWAKTEVLEPLFNLEIMDREIPNEPLPQNTILHAIERVLIYLAWDKEMDF 521
Query: 379 SIESV 383
I
Sbjct: 522 KISPN 526
>gi|23009067|ref|ZP_00050256.1| COG3754: Lipopolysaccharide biosynthesis protein [Magnetospirillum
magnetotacticum MS-1]
Length = 486
Score = 202 bits (513), Expect = 1e-49, Method: Composition-based stats.
Identities = 56/221 (25%), Positives = 90/221 (40%), Gaps = 13/221 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-DFEQDVLKYF--PSAQLYVM 206
+I + H ++ D + + FD ++VT A+K DF + ++ +
Sbjct: 274 RIGVFAHIFHTDLCEYVLKYTNNIPFDTTVYVTTSSASKADFIRKTFGRLSKHRYEIVIA 333
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N+GRD+ P L F DY +H KKS WR +LF LG +++
Sbjct: 334 PNRGRDIAPMLVGYRNA-FQNCDYAVHVHTKKSLHYSSGF---DAWRDYLFEMNLGSAEL 389
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH-LDF 325
I+N ++ +G + Y + + + L G + LDF
Sbjct: 390 ITGIVNVLSRS-NIGAVAPDHYA---PIAKLIQWGGNIDAINGLLSFTGLSVASENVLDF 445
Query: 326 FNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAV 365
+G+MFW KP L L +HL F+ E DG L HA+
Sbjct: 446 PSGSMFWFKPDALSKLMEIHLQSYHFDPELGQVDGTLAHAI 486
>gi|116511036|ref|YP_808252.1| lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp.
cremoris SK11]
gi|116106690|gb|ABJ71830.1| Lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp.
cremoris SK11]
Length = 588
Score = 201 bits (512), Expect = 1e-49, Method: Composition-based stats.
Identities = 59/239 (24%), Positives = 106/239 (44%), Gaps = 20/239 (8%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS---AQLYVM 206
KI I +H +Y D E + + ++DL++T K +++LK +P ++ V
Sbjct: 299 KIGIHLHAFYLDLIPEYLNYFDKYVQNYDLYITTDTEEK--YEEILKNYPLPQIKKVIVT 356
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
NKGRDV P++ + + YD H KKS+ I G WRR + + LL +
Sbjct: 357 GNKGRDVLPWMQV--SELMTDYDLCGHFHTKKSKDND--WIVGESWRRDIEYSLL---EP 409
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSF--FAKRSEVYRRVIDLAKRAGFPTKRL--- 321
A I FE+NP LG+I + ++ + + +++ + ++ ++ F +
Sbjct: 410 AQAIFQEFEKNPKLGLIIADVPSFFEHFYGPTYITERDIWPDMQEIWQKIDFENSKELKQ 469
Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTM W +P+ L L N+++ + EE ++ HA ER +
Sbjct: 470 KDSYVMSYGTMIWYRPQALNNLLNVNIQADVPEEPLPY-NSILHAFERLLVYVSWANGY 527
>gi|15672189|ref|NP_266363.1| polysaccharide biosynthesis protein [Lactococcus lactis subsp.
lactis Il1403]
gi|12723062|gb|AAK04305.1|AE006258_8 polysaccharide biosynthesis protein [Lactococcus lactis subsp.
lactis Il1403]
Length = 589
Score = 200 bits (510), Expect = 2e-49, Method: Composition-based stats.
Identities = 61/233 (26%), Positives = 98/233 (42%), Gaps = 17/233 (7%)
Query: 151 IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFPSAQLYVMEN 208
+A+ +H YY + E +FD+DL++T K+ ++ + A+L N
Sbjct: 289 VAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTDEKEEIIKEMLKCKDAKAKLVRTPN 348
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
GRD+ PFL L +YD + H K+S + G WR L L+ + A
Sbjct: 349 HGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAFF--AGESWRTELISMLI---EPAD 401
Query: 269 RIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLHLDF 325
I+ FEQ LG++ + + R+ + ++ + + D+ KR K DF
Sbjct: 402 NIMAHFEQKQKLGIVIADIPSFFRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDF 461
Query: 326 -----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV 373
GT FW K + LEPL NL ++ L + HA+ER
Sbjct: 462 NTFTMSYGTFFWAKIEVLEPLFNLEIMDREIPNEPLPQNTILHAIERVLIYLA 514
>gi|88808074|ref|ZP_01123585.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805]
gi|88788113|gb|EAR19269.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805]
Length = 512
Score = 200 bits (509), Expect = 3e-49, Method: Composition-based stats.
Identities = 55/241 (22%), Positives = 93/241 (38%), Gaps = 11/241 (4%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKY----FPSAQLY 204
KI +V+H YY ++ I L + FDL VTV +K+ ++ L+ +
Sbjct: 50 KILVVIHAYYPESLATIFPSLRHMPCHFDLVVTVCSCGDKEVVKEYLEKVDLPIDVLDIK 109
Query: 205 VMENKGRDVRPFLYLLELGVFDR--YDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262
V+ N GRD+ PF+ +++ YD++ K+H K+S G W +LLG
Sbjct: 110 VLTNLGRDLLPFVQVIKGLKLQNKAYDFVLKLHTKRSVASSKGKEFGGKWLEGSLSNLLG 169
Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322
+ I+ Q ++ R+ + + L R G
Sbjct: 170 SPENVKYILLELLQTTNCALVSPLISLDVFRFCKWKNNLAP---ISHLLDRFGVRESPED 226
Query: 323 L-DFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
F G+MFWV K + + E +G+ HA ER + T+ ++
Sbjct: 227 FICFPAGSMFWVDFKAAVLIASCFEESRVPPEPLPSNGSYLHAFERLVPYILESTQKRMQ 286
Query: 382 S 382
S
Sbjct: 287 S 287
>gi|125623094|ref|YP_001031577.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus
lactis subsp. cremoris MG1363]
gi|124491902|emb|CAL96823.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus
lactis subsp. cremoris MG1363]
gi|300069842|gb|ADJ59242.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus
lactis subsp. cremoris NZ9000]
Length = 588
Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats.
Identities = 60/239 (25%), Positives = 103/239 (43%), Gaps = 20/239 (8%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLY---VM 206
KIAI +H +Y D E + ++DLF+T +K + ++K +P Q+ V
Sbjct: 299 KIAIHLHAFYLDLIPEYLDYFDKYVQNYDLFITTDTKDK--YEQIIKSYPLNQIKKVLVT 356
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
NKGRDV P++ + + YD H KKS+ I G WRR + + LL +
Sbjct: 357 GNKGRDVLPWMEI--SELMADYDLCGHFHTKKSKDND--WIVGESWRRDIEYSLLKPAQ- 411
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSF--FAKRSEVYRRVIDLAKRAGFPTKR---- 320
I FE+NP LG++ + ++ + + +++ + ++ K+ F R
Sbjct: 412 --AIFQEFEKNPKLGLMIADVPSFFEHFYGPTYITERDIWPDMEEIWKKINFENPRGLKQ 469
Query: 321 -LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378
GTM W +P+ L L + + EE ++ HA ER + +
Sbjct: 470 KDSYVMSYGTMIWYRPQALNNLLKVDIEAAVPEEPLPY-NSILHAFERLLVYTSWANGY 527
>gi|209524107|ref|ZP_03272658.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
gi|209495482|gb|EDZ95786.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
Length = 2819
Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats.
Identities = 69/240 (28%), Positives = 107/240 (44%), Gaps = 13/240 (5%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD-FEQDVLKYFPSAQLYVME 207
KIA+V+H YY + E+ L L D+DLFVT+ E D + KY + Q+ +++
Sbjct: 1737 PKIAVVLHAYYPELLPELFSKLDNL-SDYDLFVTIPENVVDSVTSALDKYTKNYQVSIVK 1795
Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267
N G D+ PFL ++ Y Y+CKIH K+ HP G +WR L +LG +I
Sbjct: 1796 NIGYDILPFLEVISELDTLGYKYVCKIHTKR-----DHPDFGSLWRECLLDAVLGDKNIT 1850
Query: 268 IRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327
+II F+ NP L ++G + + ++ + + D + FF
Sbjct: 1851 EQIITAFDNNPSLQIVGPALLYMSMLGTIYDGHEKMKKMIHDFMEPLNL---IEDWGFFG 1907
Query: 328 GTMFWVKPKCLEPLRN---LHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384
G+MFW + L+ + + L I + L G H VER E + VD
Sbjct: 1908 GSMFWSRITPLKYIADQILLKPIDWQASKSWLTTGFYYHIVERLLGLVSYINEGQVGLVD 1967
>gi|221634514|ref|YP_002523202.1| hypothetical protein RSKD131_4489 [Rhodobacter sphaeroides KD131]
gi|221163387|gb|ACM04349.1| Hypothetical Protein RSKD131_4489 [Rhodobacter sphaeroides KD131]
Length = 1042
Score = 193 bits (491), Expect = 4e-47, Method: Composition-based stats.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 11/234 (4%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYFPSAQLYVMENKG 210
A ++H ++ D +++ L L+ D FVT+ + ++ V FP A + +EN+G
Sbjct: 8 AAIIHVWHLDVLDDLTEALEHLHGSADQFVTLPSSFRQEQRDRVTAAFPKATIVEVENRG 67
Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270
+D+ L++ RYD++CKIH KK WRR L +LG I
Sbjct: 68 QDIGALFQLMQKVNLGRYDFICKIHTKKGPNMP------EEWRRALLDGVLGSKRQVTHI 121
Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTM 330
+ +F +P + + G+R+ Y +V L F + F GT
Sbjct: 122 VESFRADPKVMLAGARQLFVYGPAYLEPNADKVAEDYASLIG--DFDVRSEDWGFIAGTC 179
Query: 331 FWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384
FW++ L+ + +F + DGA HA ER F V ++ D
Sbjct: 180 FWIRTSILQEMAAC--AVDFLPADYVTDGAPAHAAERMFGLCVALRGGTVLLQD 231
>gi|302337197|ref|YP_003802403.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293]
gi|301634382|gb|ADK79809.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293]
Length = 1100
Score = 192 bits (487), Expect = 1e-46, Method: Composition-based stats.
Identities = 64/228 (28%), Positives = 94/228 (41%), Gaps = 14/228 (6%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208
I +V H Y++D + + + FDL VT E N D V +P A++ +N
Sbjct: 186 SIVVVFHIYHEDLVGSCLQYISHIPYPFDLIVTTPLEENNDAILQVKSLYPDAEIVRSKN 245
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
GRD+ PFL + + + +YD CK+H KK + IWR +L D
Sbjct: 246 AGRDIGPFLQVWDRVL--QYDLCCKVHTKK-----GNSAYSEIWRDLSLRGILETVDTVH 298
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFN 327
I+ FEQ L + G+ ++ + L K P + FF
Sbjct: 299 GILRMFEQEDSLALAGAELLYGSYQFLLG----KNKDLSNSLIKDYNIPVNSYSNNGFFM 354
Query: 328 GTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375
GTMFW++ K L NL + F E DG EHA+ER +
Sbjct: 355 GTMFWMRVKKFIFLSNLKQLQ-FPIEDGKNDGKYEHALERLLGSLSLH 401
>gi|78184217|ref|YP_376652.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9902]
gi|78168511|gb|ABB25608.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp.
CC9902]
Length = 519
Score = 190 bits (482), Expect = 5e-46, Method: Composition-based stats.
Identities = 56/244 (22%), Positives = 100/244 (40%), Gaps = 19/244 (7%)
Query: 151 IAIVVHCYYQDTWIEISHILLRL-----NFDFDLFVTVVEANKDFEQDVLKY--FPSAQL 203
+A+++H +Y D +I L DL+V+ D + L+ F +L
Sbjct: 268 LALMIHGFYPDVLDDILLKLPSFCAGMVGTQLDLYVSTSMDQIDQVEKKLRDLDFACVRL 327
Query: 204 YVMENKGRDVRPFL-YLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262
+ +EN+GRDV PFL +LL + + K+H KKS + + W R L LL
Sbjct: 328 FGVENRGRDVAPFLLHLLPAVAAAGHHFFVKLHTKKSLQ--FGIDGLDKWSRHLIESLL- 384
Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI--DLAKRAGFPTKR 320
+ I F + LG + + F ++ ++ + ++ R
Sbjct: 385 SAAGLEAIRYQFLDDEDLGCLCPSGTLLPLAIALFKNKTHLHHLLSHSEINGRWALMQT- 443
Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFS 379
F G+MF + + L + + +FE E DG HA+ER + V+ + +
Sbjct: 444 ----FVAGSMFAGRVEAFRSLLDQGFSLDDFELEGGQFDGTFAHALERLISLEVKRSGWQ 499
Query: 380 IESV 383
I+ +
Sbjct: 500 IKEM 503
>gi|14090418|gb|AAK53494.1| putative methyltransferase [Xanthomonas campestris pv. campestris]
Length = 212
Score = 189 bits (481), Expect = 6e-46, Method: Composition-based stats.
Identities = 42/235 (17%), Positives = 83/235 (35%), Gaps = 32/235 (13%)
Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149
+ F + E + A L + + + + + ++ PS+
Sbjct: 1 MVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICSPSA---------- 49
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208
+V+H +Y D E+ ++ + +T + + + + A++ EN
Sbjct: 50 --CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFEN 107
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+GRD+ PFL++ + + + K+H KKS H +G WR + LLG
Sbjct: 108 RGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGEMLTALLG-PQRVD 162
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL 323
I+N F +P +G+ + A+R G L
Sbjct: 163 AIVNAFSTDPLVGLAAPEDHLLPVTEFIGGN-----------AERTGLSYCSHRL 206
>gi|148556902|ref|YP_001264484.1| glycosyl transferase family protein [Sphingomonas wittichii RW1]
gi|148502092|gb|ABQ70346.1| glycosyl transferase, family 2 [Sphingomonas wittichii RW1]
Length = 1301
Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats.
Identities = 61/237 (25%), Positives = 102/237 (43%), Gaps = 12/237 (5%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYFPSAQLYVMEN 208
K A+V+H +Y + +E+ + + D+FVT A ++ + + A++ + N
Sbjct: 2 KAALVLHLFYPEVAVELIDRVAAIGASVDIFVTHSVALDETVLAALDRLPRKAEVVTVAN 61
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
+G D+ P LL L YD + K+H KK WRR + ++G +
Sbjct: 62 RGWDIGPLFELLPLLAERGYDLIGKLHSKK-----GGSGYAPEWRRLAYDGMIGSPALVA 116
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-TKRLHLDFFN 327
I+ F+ +P L ++G++ + F + DLA R P FF
Sbjct: 117 DIVAAFDAHPDLSLLGAKPLYKSVASHLFRNA----ELLSDLAPRLTAPAYPPADWGFFA 172
Query: 328 GTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384
GT FW + LE + L + ++ +DGAL HAVER F + I V+
Sbjct: 173 GTFFWARRTLLEKVAALADFRDAAPNQD-RDGALGHAVERLFGLAPIGLGGKIGLVE 228
>gi|146279467|ref|YP_001169625.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC
17025]
gi|145557708|gb|ABP72320.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC
17025]
Length = 823
Score = 172 bits (436), Expect = 9e-41, Method: Composition-based stats.
Identities = 61/341 (17%), Positives = 98/341 (28%), Gaps = 23/341 (6%)
Query: 49 PKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFL--- 105
P++ T+ +F L LA + R EQ AF
Sbjct: 480 PRKTGTAGAAQPAGGLLFARIRRALFDRLAAQRRFVRGASDIDAPLLFPRPEQAAFRILE 539
Query: 106 -RLNRFMSNSR----MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160
+ R + E + + A+ VH +Y
Sbjct: 540 REKMQRYGRRRVWRDLAEVEETLSASDNWVHRALRLAPYATVADSSDLPPFALHVHAFYT 599
Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218
D + +T K + + ++ ++ N+GRD+ PF+
Sbjct: 600 DDLAADVRSHRAFRLARRIVITTDNERKASEIRTRMGAEGLYPEVILVPNRGRDILPFMQ 659
Query: 219 LLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQN 277
L G D C +H KKS G +WR +L LLG + ++
Sbjct: 660 LFLPGGPAGKDEIWCHLHQKKSLATSDS---GDVWRAFLLRILLGDDAGLSDAVGHL-RD 715
Query: 278 PCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKC 337
P +G++ + A R P L F G MFWV+
Sbjct: 716 PAVGLVAPFDPYHVP-------WDASRALLPRFAPRLPGPLPDNPLLFPVGNMFWVRAGV 768
Query: 338 LEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTE 377
+ + +L E DG H VER +
Sbjct: 769 VRAMNDLFGPSYPWPNEPIANDGTEFHLVERLWPTMAARCG 809
>gi|221218294|ref|YP_002524321.1| glycosyltransferase [Rhodobacter sphaeroides KD131]
gi|221163321|gb|ACM04287.1| glycosyltransferase [Rhodobacter sphaeroides KD131]
Length = 821
Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats.
Identities = 58/284 (20%), Positives = 94/284 (33%), Gaps = 20/284 (7%)
Query: 105 LRLNRFMSNSRMPFDSEKFLYVKELFEGWN-----DRPSSPKKSGLTIKSKIAIVVHCYY 159
L + M R + + L + N R + + T + ++ VH +Y
Sbjct: 537 LEREKMMRYGRRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFY 596
Query: 160 QDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFL 217
D + + VT K + + + ++ V N+GRD+ PFL
Sbjct: 597 TDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFL 656
Query: 218 YLLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276
L G D C +H KKS G IWR +L LLG +
Sbjct: 657 QLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATHL-R 712
Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPK 336
NP +G++ + +A R P L F G MF+V+ +
Sbjct: 713 NPGVGLVAPFDPYFIP-------WDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSR 765
Query: 337 CLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTEFS 379
+ + +L G E DG H +ER + +
Sbjct: 766 VVRAMNDLFGAGYPWPNEPIPNDGTEFHLIERLWPAMAAQCGLT 809
>gi|291520449|emb|CBK75670.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens
16/4]
Length = 486
Score = 159 bits (403), Expect = 7e-37, Method: Composition-based stats.
Identities = 43/174 (24%), Positives = 71/174 (40%), Gaps = 6/174 (3%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD---FEQDVLKYFPSAQLYVM 206
K+A+V H YY + + L ++ + D+ +T +K E K ++ V
Sbjct: 291 KVAVVAHLYYVEMFELCMDYLAKVPYGIDIIITTNSDDKKQNIIEVASEKGVKLTEVIVA 350
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
EN+GR++ L + +Y Y C +H KKS H G+ +R L+ L
Sbjct: 351 ENRGRELAALLVGCGKFLL-KYKYFCFVHDKKSS-AKEHLSVGLAFRDILWDSSLYSEGY 408
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFF-AKRSEVYRRVIDLAKRAGFPTK 319
II+ FEQN C+G+ + F Y + I+L+K
Sbjct: 409 IRNIIDMFEQNECMGLAVPPTVYCGSYFYPFPDYWVGNYEKTIELSKILNINVD 462
>gi|297182567|gb|ADI18727.1| lipopolysaccharide biosynthesis protein [uncultured Rhizobiales
bacterium HF4000_32B18]
Length = 887
Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats.
Identities = 55/236 (23%), Positives = 80/236 (33%), Gaps = 21/236 (8%)
Query: 153 IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQL--YVMENKG 210
+ VH +Y D + E + T K E + V+ N+G
Sbjct: 648 VHVHAHYTDGFAEDLAGFAAWRHAARVVATTDTEAKAAEIAAAGRNGGVAIETRVVANRG 707
Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270
RDV PFL L + D C +H KKS G G +WR +L LLG +
Sbjct: 708 RDVLPFLELFDGSEDDN-ALWCHVHLKKSVGLGP-TSPGAVWRAFLMRILLGGPERLSTA 765
Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG--------FPTKRLH 322
+ + P G++G+ + R + L R P
Sbjct: 766 L-ALIRAPEAGLVGAFDPYV-------MGWTGSRRLLAPLQARLDGWEADGGRRPLPDHP 817
Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTE 377
L F G MFWVK + +R L E DG + H +ER + +
Sbjct: 818 LLFPVGDMFWVKAGVVNAMRRLFGADYPWPGEPLPGDGTVYHLIERLWPTAAALAG 873
>gi|50982351|gb|AAT91804.1| hypothetical protein [Yersinia enterocolitica]
Length = 358
Score = 152 bits (385), Expect = 8e-35, Method: Composition-based stats.
Identities = 56/247 (22%), Positives = 96/247 (38%), Gaps = 16/247 (6%)
Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSA 201
K +K I+VH +YQ EI + L+ +D+ +T N + +
Sbjct: 120 KIKPNTDNKKLIIVHAFYQREAEEIFNRLVAFTD-YDIVITSPYNNIICKAKEILGQERV 178
Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261
++M N GRD+ PFL L+L V ++Y+Y K+H K+SQ H + W L+
Sbjct: 179 IGFIMPNYGRDILPFLICLQLIVIEKYEYFVKVHTKRSQ----HLNDNGAWFNNNLDYLV 234
Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321
G + + + + + Y + + + L +
Sbjct: 235 GNKNATDGLFSIMSDDE---------PQIYGEYILPIQDHIAN-NIHWLTYLLEKEPASV 284
Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380
F GTMF L +R+L L + + E+E DG HA+ER+F
Sbjct: 285 EASFIPGTMFIGNRAFLVLIRDLQLHLFQIEKENGQLDGCCVHAIERYFGYIASVNGGKC 344
Query: 381 ESVDCVA 387
S++ +
Sbjct: 345 CSIETLI 351
>gi|301632931|ref|XP_002945533.1| PREDICTED: o-antigen export system ATP-binding protein rfbB-like,
partial [Xenopus (Silurana) tropicalis]
Length = 367
Score = 140 bits (354), Expect = 3e-31, Method: Composition-based stats.
Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 10/150 (6%)
Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286
RY + ++H K+S G WR L+ L G I++TF +P LGM+
Sbjct: 203 RYALILRLHSKRSLHIPGQ--VGEEWRALLYTSLAGSRQRVNAIVDTFNTHPKLGMLCPA 260
Query: 287 RYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL-DFFNGTMFWVKPKCLEPLRNLH 345
++ Y+R+ L + G DF G+MFW +P+ L
Sbjct: 261 ---VIDHYADCLHFGGNYKRMCALLQPHGITLPPDQPIDFPMGSMFWCRPQALSVWLEPG 317
Query: 346 L-IGEFEEERNL---KDGALEHAVERFFAC 371
+F +L +DG L HA+ER F
Sbjct: 318 FTFDDFTPTNDLDTDRDGTLAHALERLFFF 347
>gi|77404644|ref|YP_345218.1| glycosyltransferase [Rhodobacter sphaeroides 2.4.1]
gi|77390294|gb|ABA81477.1| possible glycosyltransferase [Rhodobacter sphaeroides 2.4.1]
Length = 793
Score = 140 bits (354), Expect = 3e-31, Method: Composition-based stats.
Identities = 51/249 (20%), Positives = 83/249 (33%), Gaps = 19/249 (7%)
Query: 105 LRLNRFMSNSRMPFDSEKFLYVKELFEGWN-----DRPSSPKKSGLTIKSKIAIVVHCYY 159
L + M R + + L + N R + + T + ++ VH +Y
Sbjct: 537 LEREKMMRYGRRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFY 596
Query: 160 QDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFL 217
D + + VT K + + + ++ V N+GRD+ PFL
Sbjct: 597 TDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFL 656
Query: 218 YLLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276
L G D C +H KKS G IWR +L LLG +
Sbjct: 657 QLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATNL-R 712
Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPK 336
NP +G++ + +A R P L F G MF+V+
Sbjct: 713 NPGVGLVAPFDPYFIP-------WDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSA 765
Query: 337 CLEPLRNLH 345
+ + +L
Sbjct: 766 VVRAMNDLF 774
>gi|224536718|ref|ZP_03677257.1| hypothetical protein BACCELL_01594 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521634|gb|EEF90739.1| hypothetical protein BACCELL_01594 [Bacteroides cellulosilyticus
DSM 14838]
Length = 361
Score = 123 bits (310), Expect = 4e-26, Method: Composition-based stats.
Identities = 17/127 (13%), Positives = 37/127 (29%), Gaps = 8/127 (6%)
Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61
Y+ + + K+ + ++ + Y + W SP+ + ++
Sbjct: 241 YECAKWRHKIFRTPKIVEYKKASSFFVGEEEYDKEIIPTIIPNWDHSPRSLGKALVLNHA 300
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
E FE + + CR+ F S E + +L + +
Sbjct: 301 EPRYFEK------HVKNVMIHIENKPFECRLAFVKSWNEWAEGNYLEPDLRYGKRYLEVM 354
Query: 120 SEKFLYV 126
E L
Sbjct: 355 KECILKE 361
>gi|291520444|emb|CBK75665.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens
16/4]
Length = 424
Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats.
Identities = 29/144 (20%), Positives = 52/144 (36%), Gaps = 4/144 (2%)
Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA-KRSEV 303
+ G + ++ LLG ++ +++ F LG++ + + +
Sbjct: 3 YESVGRDFNNRIWQSLLGSKELVEEVLSAFSDEKYLGLLMPSMVTHGEYFHTAIDSWTIC 62
Query: 304 YRRVIDLAKRAGFPTKR--LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGA 360
Y ++LAK+ G GT FW + K LE L + F E DG+
Sbjct: 63 YDGTVELAKKIGLNVPIYGDRNPLSLGTAFWARTKALEKLFEYNFSYDMFPGEPFPVDGS 122
Query: 361 LEHAVERFFACSVRYTEFSIESVD 384
+ H +ER F + V
Sbjct: 123 ISHYIERIFPYVALDAGYYTGIVY 146
>gi|270294908|ref|ZP_06201109.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274155|gb|EFA20016.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 358
Score = 115 bits (289), Expect = 1e-23, Method: Composition-based stats.
Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 8/122 (6%)
Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61
YK + K K+ +I ++ Y + W SP+ R S ++
Sbjct: 238 YKYAKWKHKIFRIPKVVEYKKASSFFVGDEEYEENIIPTIIPNWDHSPRSRGKSLVLNHA 297
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
E S F R K + R+ F S E + +L + +
Sbjct: 298 EPSYF------ARHLKEAIKRIENKPLDHRLAFVKSWNEWAEGNYLEPDLHYGKRYLEVI 351
Query: 120 SE 121
+
Sbjct: 352 KK 353
>gi|160888551|ref|ZP_02069554.1| hypothetical protein BACUNI_00968 [Bacteroides uniformis ATCC 8492]
gi|317477905|ref|ZP_07937089.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides
sp. 4_1_36]
gi|156861865|gb|EDO55296.1| hypothetical protein BACUNI_00968 [Bacteroides uniformis ATCC 8492]
gi|316905921|gb|EFV27691.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides
sp. 4_1_36]
Length = 358
Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats.
Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 8/122 (6%)
Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61
YK + K K+ +I ++ Y + W SP+ R S ++
Sbjct: 238 YKYAKWKHKIFRIPKVVEYKKASSFFVGDEEYEENIIPTIIPNWDHSPRSRGKSLVLNHA 297
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
E S F R K + R+ F S E + +L + +
Sbjct: 298 EPSYF------ARHMKEAIKRIENKPLDHRLAFVKSWNEWAEGNYLEPDLHYGKRYLEVI 351
Query: 120 SE 121
+
Sbjct: 352 KK 353
>gi|75674736|ref|YP_317157.1| lipopolysaccharide biosynthesis protein [Nitrobacter winogradskyi
Nb-255]
gi|74419606|gb|ABA03805.1| lipopolysaccharide biosynthesis protein [Nitrobacter winogradskyi
Nb-255]
Length = 734
Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats.
Identities = 20/122 (16%), Positives = 37/122 (30%), Gaps = 12/122 (9%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
+S GK+ + + + Y PA + W +P++ + FE
Sbjct: 429 ESFTGKVYDYVDAVRSSLGKTYDFPYFPAVMP----RWDNTPRKGSRGHVFNRSSPEAFE 484
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
WLR ++ + P I F S E + A L + + +
Sbjct: 485 ---VWLRDATGRARRGPFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASS 538
Query: 126 VK 127
Sbjct: 539 EP 540
>gi|92116633|ref|YP_576362.1| lipopolysaccharide biosynthesis protein [Nitrobacter hamburgensis
X14]
gi|91799527|gb|ABE61902.1| lipopolysaccharide biosynthesis protein [Nitrobacter hamburgensis
X14]
Length = 734
Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats.
Identities = 21/121 (17%), Positives = 37/121 (30%), Gaps = 12/121 (9%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
S GK+ + + + Y PA + W +P++ + FE
Sbjct: 430 SFTGKVYDYVDAVRSSLGKTYDFPYFPAVMP----RWDNTPRKGSRGHIFNRSSPEAFE- 484
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
WLR ++ S + P I F S E + A L + + +
Sbjct: 485 --VWLRDAANRARKSAFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASSE 539
Query: 127 K 127
Sbjct: 540 P 540
>gi|148238469|ref|YP_001223856.1| sulfotransferase [Synechococcus sp. WH 7803]
gi|147847008|emb|CAK22559.1| Possible sulfotransferase [Synechococcus sp. WH 7803]
Length = 476
Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats.
Identities = 34/160 (21%), Positives = 53/160 (33%), Gaps = 8/160 (5%)
Query: 222 LGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG 281
+D + H K++ G WR+ L D T P G
Sbjct: 4 RDRLKEFDLVVHCHTKRTPHAPD--GFGESWRQSLLQCTFPNPDRCQE-FQTLLHKPEAG 60
Query: 282 MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD-FFNGTMFWVKPKCLEP 340
+I +R + R +++L G +R L F G+ FW + L
Sbjct: 61 LIMPWPHRFVAHNVNWGSNFTQTRALMNL---MGHTIRRDTLLAFPAGSFFWARVDSLLA 117
Query: 341 LRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFS 379
L +L L +F E DG L H++ER +
Sbjct: 118 LLDLTLRWEDFAAEPLPGDGRLAHSLERCLGLLPMLNDRR 157
>gi|85713620|ref|ZP_01044610.1| lipopolysaccharide biosynthesis protein [Nitrobacter sp. Nb-311A]
gi|85699524|gb|EAQ37391.1| lipopolysaccharide biosynthesis protein [Nitrobacter sp. Nb-311A]
Length = 734
Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats.
Identities = 18/121 (14%), Positives = 33/121 (27%), Gaps = 12/121 (9%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
+ GKI + + + Y W +P++ + FE
Sbjct: 430 TFTGKIYDYVDAVRSSLGK----TYDFPCFPAVMPRWDNTPRKGSRGHIFNRSSPEAFE- 484
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
WLR ++ + P I F S E + A L + + +
Sbjct: 485 --VWLRDAAGRARREPFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASSE 539
Query: 127 K 127
Sbjct: 540 P 540
>gi|189426434|ref|YP_001953611.1| radical SAM protein [Geobacter lovleyi SZ]
gi|189422693|gb|ACD97091.1| Radical SAM domain protein [Geobacter lovleyi SZ]
Length = 843
Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats.
Identities = 18/118 (15%), Positives = 39/118 (33%), Gaps = 14/118 (11%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
K+ + E+L+L L + + + W +P+ + +F
Sbjct: 731 KVSRYEDLVLYLKQYQLSDNE-------YPLVVPNWDNTPRSGSNGFVLQGSTPELFGEM 783
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
L L + K P+ RI F + E + L + ++ + + L+
Sbjct: 784 ---LEDALRKVEQRKD--PADRIVFIKAWNEWAEGNHLEPDLLHGHAYLQALYKALLH 836
>gi|296445524|ref|ZP_06887480.1| lipopolysaccharide biosynthesis protein-like protein [Methylosinus
trichosporium OB3b]
gi|296256929|gb|EFH04000.1| lipopolysaccharide biosynthesis protein-like protein [Methylosinus
trichosporium OB3b]
Length = 431
Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats.
Identities = 16/133 (12%), Positives = 34/133 (25%), Gaps = 6/133 (4%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75
N++ + E G W ++ ++E WL
Sbjct: 273 NVVAYEAMIEASLNHRPTGYKLFPGVCPSWDNEARRPGKGSCFAGASPRLYED---WLTG 329
Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133
+ RI F + E + A+L +R + + + V+ +
Sbjct: 330 ACRAVLTDAQTRDE-RIVFINAWNEWGEGAYLEPDRHYGYAYLVATANALRRVENQRDNE 388
Query: 134 NDRPSSPKKSGLT 146
+ S
Sbjct: 389 GAIEGAKGASNRN 401
>gi|312100417|gb|ADQ27813.1| glycosyltransferase [Burkholderia pseudomallei]
gi|312100462|gb|ADQ27848.1| putative glycosyltransferase [Burkholderia pseudomallei]
Length = 1738
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%)
Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84
E+ G W ++ ++E WL + A +
Sbjct: 980 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 1035
Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+ P R+ F + E + A L +R + +
Sbjct: 1036 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 1085
>gi|312100431|gb|ADQ27825.1| glycosyltransferase [Burkholderia pseudomallei]
Length = 1706
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%)
Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84
E+ G W ++ ++E WL + A +
Sbjct: 948 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 1003
Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+ P R+ F + E + A L +R + +
Sbjct: 1004 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 1053
>gi|150010201|ref|YP_001304944.1| hypothetical protein BDI_3624 [Parabacteroides distasonis ATCC
8503]
gi|149938625|gb|ABR45322.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 370
Score = 101 bits (253), Expect = 1e-19, Method: Composition-based stats.
Identities = 11/109 (10%), Positives = 27/109 (24%), Gaps = 10/109 (9%)
Query: 24 EEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81
+ + Y W SP++ + H ++F+
Sbjct: 268 KAIEKIDTPYYEEDRVYPNIIPGWDNSPRRGPGAFIFHKATPALFKK------HVKMILN 321
Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
K ++ F S E + ++ + + E +
Sbjct: 322 RIKDKPDEDKVIFLKSWNEWAEGNYMEPDLKWGKGYIRALREALEEDAK 370
>gi|308813905|ref|XP_003084258.1| conserved domain protein (ISS) [Ostreococcus tauri]
gi|116056142|emb|CAL58323.1| conserved domain protein (ISS) [Ostreococcus tauri]
Length = 684
Score = 101 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 39/248 (15%), Positives = 76/248 (30%), Gaps = 55/248 (22%)
Query: 175 FDFDLFVTVVE------ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELG--VFD 226
L++++ F + L+ + ++ ++++G D+ FL L
Sbjct: 103 VQLQLYLSLTPTVANAPEVAYFTERFLRNEKNIRVVHVKDEGYDIGAFLKQLHRFRHELQ 162
Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS- 285
+ Y+ K+H K IW L G I+ FE L ++
Sbjct: 163 VHQYILKVHSKSDP----------IWLERAVESLCGSEHQVKSILKAFETQSTLDIVSPM 212
Query: 286 ---------RRYRRYKRWSFFAKRSEVY--------RRVIDLAKRAGFPTKRLHLDF--- 325
+ + + ++ + L + G +
Sbjct: 213 GSTFSATTSKDAVFPHLKRKYFNKVDLATAFDDKTMHTMERLCAQLGLEACPYFEKYLAS 272
Query: 326 -FNGTMFWVK---------PKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375
GTMFW + P+ E +RN L ++ + +EHA+ER R
Sbjct: 273 ITAGTMFWARNSRLYTEHLPRLFESIRN-ELSQDY-----SNNNRIEHALERLIPTLSRL 326
Query: 376 TEFSIESV 383
I +
Sbjct: 327 NGRMIGDI 334
Score = 43.8 bits (102), Expect = 0.050, Method: Composition-based stats.
Identities = 13/88 (14%), Positives = 27/88 (30%), Gaps = 2/88 (2%)
Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99
G V + P+ + + F S + + L+ ++ I +
Sbjct: 594 GSTVRFDRRPRSGDYNFPILR-TPQEFGSAYSAMIARLSTMPGREIDVGFNFICAWNEWN 652
Query: 100 EQKAFLRLNRFMSNSRMPFDSEKFLYVK 127
EQ A L + + R+ + V
Sbjct: 653 EQ-AVLEPDEWWGFQRLQEILKVVNNVP 679
>gi|322418494|ref|YP_004197717.1| group 1 glycosyl transferase [Geobacter sp. M18]
gi|320124881|gb|ADW12441.1| glycosyl transferase group 1 [Geobacter sp. M18]
Length = 708
Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats.
Identities = 11/128 (8%), Positives = 31/128 (24%), Gaps = 11/128 (8%)
Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSKDVH 59
++ +L+ + L + + W +P+ +H
Sbjct: 241 EILKLRFFSKEKPELPQVYSYKSFVANAFPDNTLRRDYYPCVVPNWDNTPRSGKNGFVLH 300
Query: 60 FQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMP 117
++E + + R+ F S E + +L + + +
Sbjct: 301 GSTPQLYEQHLEEAVDLVD------DRPEDERVIFVKSWNEWAETNYLEPDLRWGKAYLD 354
Query: 118 FDSEKFLY 125
Sbjct: 355 ATLRAVTR 362
>gi|293407666|gb|ADE44320.1| putative glycosyl transferase [Burkholderia pseudomallei]
Length = 740
Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats.
Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%)
Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84
E+ G W ++ ++E WL + A +
Sbjct: 366 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 421
Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+ P R+ F + E + A L +R + +
Sbjct: 422 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 471
>gi|30248500|ref|NP_840570.1| hypothetical protein NE0485 [Nitrosomonas europaea ATCC 19718]
gi|30138386|emb|CAD84396.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718]
Length = 445
Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats.
Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 7/113 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+L D+ E P +W S ++ ++E WL
Sbjct: 337 VLDYRDIVEHKKYFLYNHPKLHRAAMPMWDNSARRDNKGMIFEGASPDLYE---RWLTDI 393
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
L +K + F + E + A+L ++ + + + V+
Sbjct: 394 LLEAKNREDL--EDHYIFINAWNEWGEGAYLEPDKKYGYAYLNATRQAIEGVR 444
>gi|221201094|ref|ZP_03574134.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2M]
gi|221206454|ref|ZP_03579467.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2]
gi|221173763|gb|EEE06197.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2]
gi|221178944|gb|EEE11351.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2M]
Length = 1714
Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats.
Identities = 15/111 (13%), Positives = 32/111 (28%), Gaps = 6/111 (5%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+L E+ G W ++ ++E WL +
Sbjct: 948 ILDWTHYVERSRSYQDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE---WLCNA 1004
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
A +++ P R+ F + E + A L +R + + +
Sbjct: 1005 -ATDTVRRIANPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATNNALSR 1054
>gi|330826738|ref|YP_004390041.1| family 2 glycosyl transferase [Alicycliphilus denitrificans K601]
gi|329312110|gb|AEB86525.1| glycosyl transferase family 2 [Alicycliphilus denitrificans K601]
Length = 1669
Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats.
Identities = 18/133 (13%), Positives = 39/133 (29%), Gaps = 8/133 (6%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLR 74
NL + E + G W + ++R +H ++E WLR
Sbjct: 678 NLADYAQLAEFWLDRPSPAYKRFRGIVPAWDNAARRRKGGATVIHGSTPQLYEK---WLR 734
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+A + + RI F + E + +L + ++ + +
Sbjct: 735 GTVA--RTLEEREGDERIVFINAWNEWGEGCYLEPDEKFGHAYLEATQRVLRDPPQALLE 792
Query: 133 WNDRPSSPKKSGL 145
R + +
Sbjct: 793 DLRRERAAVAAPA 805
>gi|319764522|ref|YP_004128459.1| glycosyl transferase family 2 [Alicycliphilus denitrificans BC]
gi|317119083|gb|ADV01572.1| glycosyl transferase family 2 [Alicycliphilus denitrificans BC]
Length = 1669
Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats.
Identities = 18/133 (13%), Positives = 39/133 (29%), Gaps = 8/133 (6%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLR 74
NL + E + G W + ++R +H ++E WLR
Sbjct: 678 NLADYAQLAEFWLDRPSPAYKRFRGIVPAWDNAARRRKGGATVIHGSTPQLYEK---WLR 734
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+A + + RI F + E + +L + ++ + +
Sbjct: 735 GTVA--RTLEEREGDERIVFINAWNEWGEGCYLEPDEKFGHAYLEATQRVLRDPPQALLE 792
Query: 133 WNDRPSSPKKSGL 145
R + +
Sbjct: 793 DLRRERAAVAAPA 805
>gi|217420529|ref|ZP_03452034.1| glycosyltransferase, group 1 [Burkholderia pseudomallei 576]
gi|217395941|gb|EEC35958.1| glycosyltransferase, group 1 [Burkholderia pseudomallei 576]
Length = 1736
Score = 99.6 bits (247), Expect = 7e-19, Method: Composition-based stats.
Identities = 15/111 (13%), Positives = 32/111 (28%), Gaps = 6/111 (5%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+L E+ G W ++ ++E WL +
Sbjct: 969 ILDWTHYLERSRSYPDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE---WLFNA 1025
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
+ ++ P R+ F + E + A L +R + + S+
Sbjct: 1026 -SVDTVRRIENPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATSDALSR 1075
>gi|237653904|ref|YP_002890218.1| lipopolysaccharide biosynthesis protein-like protein [Thauera sp.
MZ1T]
gi|237625151|gb|ACR01841.1| lipopolysaccharide biosynthesis protein-like protein [Thauera sp.
MZ1T]
Length = 358
Score = 99.3 bits (246), Expect = 1e-18, Method: Composition-based stats.
Identities = 14/122 (11%), Positives = 37/122 (30%), Gaps = 9/122 (7%)
Query: 5 FRLKSKLGKIE-NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63
R++ GK + +L + + W +P+ + +H
Sbjct: 239 ARMRMAKGKYKLTVLDYARIMSGLTRASPPQFTEYPTVLPNWDNTPRSGLNGLVLHGSTP 298
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+F++ + + + RI F + E + +L ++ + + E
Sbjct: 299 ELFKTVLRRGVDLV------QGYPAEQRIVFIKAWNEWAEGNYLEPDQRFGHGYLRAVRE 352
Query: 122 KF 123
Sbjct: 353 VL 354
>gi|294675724|ref|YP_003576339.1| family 2 glycosyl transferase [Rhodobacter capsulatus SB 1003]
gi|294474544|gb|ADE83932.1| glycosyl transferase, family 2/group 1 [Rhodobacter capsulatus SB
1003]
Length = 1993
Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 13/135 (9%), Positives = 36/135 (26%), Gaps = 10/135 (7%)
Query: 21 LDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFS 80
E+ + W + +++ + + WL + + +
Sbjct: 1237 RSYVERSRNYPMPDYKLYRSVCPSWDNTARRKNKGAIFANSNPAEYR---VWLENAVTRT 1293
Query: 81 KYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF--LYVKELFE--GWN 134
+ R+ F + E + A L + + + + V + G +
Sbjct: 1294 LADARTPDE-RVIFVNAWNEWAEGAHLEPDTKYGYAYLEASRAALNPVEVPRMVTLVGHD 1352
Query: 135 DRPSSPKKSGLTIKS 149
P + L +
Sbjct: 1353 AHPHGAQILLLNLAR 1367
>gi|322418493|ref|YP_004197716.1| group 1 glycosyl transferase [Geobacter sp. M18]
gi|320124880|gb|ADW12440.1| glycosyl transferase group 1 [Geobacter sp. M18]
Length = 1687
Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 12/115 (10%), Positives = 33/115 (28%), Gaps = 8/115 (6%)
Query: 16 NLLLRLD-VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74
N + D + + + W S +++ + + WL
Sbjct: 1490 NYVHYYDNLANEMMAKPPVAYKRFRCATPSWDNSARRQEGANIFVGSTPEKYR---QWLE 1546
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
+++++ K +I F + E + L ++ + + V
Sbjct: 1547 HIVSYTR--KTFKGDEQIAFVNAWNEWAEGNHLEPDQKYGRAYLEATRSAIAGVP 1599
>gi|264678899|ref|YP_003278806.1| hyaluronan synthase [Comamonas testosteroni CNB-2]
gi|262209412|gb|ACY33510.1| hyaluronan synthase [Comamonas testosteroni CNB-2]
Length = 795
Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats.
Identities = 11/117 (9%), Positives = 28/117 (23%), Gaps = 6/117 (5%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+ D+ + + + W ++ +E WL+S
Sbjct: 39 YMHYDDLISRSLDEVPPSFELIKTLVPSWDNEARKPGRGMGFVGATPEKYE---RWLKSL 95
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFE 131
+ L F + E + A L + + + + +
Sbjct: 96 ARRAVERPLLGKQPY-VFVNAWNEWAEGALLEPDLHYGYAYLNATFRALTNTPRVSK 151
>gi|260174685|ref|ZP_05761097.1| hypothetical protein BacD2_22702 [Bacteroides sp. D2]
gi|315922947|ref|ZP_07919187.1| conserved hypothetical protein [Bacteroides sp. D2]
gi|313696822|gb|EFS33657.1| conserved hypothetical protein [Bacteroides sp. D2]
Length = 372
Score = 97.3 bits (241), Expect = 4e-18, Method: Composition-based stats.
Identities = 9/115 (7%), Positives = 25/115 (21%), Gaps = 8/115 (6%)
Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70
L + ++ + + W +P+ F
Sbjct: 259 LKRPPRMIDYSKYYHSLITEDDQSVDVIPSIVPQWDHTPRSGWNGSLWVNSTPYFF---- 314
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ L K + +I S E + ++ + + +
Sbjct: 315 --YKHVLEALDAIKNKPQNQQILLLKSWNEWGEGNYMEPDLKNGKGYIEALKKAL 367
>gi|46241633|gb|AAS83018.1| hypothetical protein pRhico010 [Azospirillum brasilense]
Length = 1380
Score = 96.9 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 14/122 (11%), Positives = 41/122 (33%), Gaps = 10/122 (8%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
G+I + +D + + + W +++ H +
Sbjct: 623 FSGEIRDYNAMVDAS---LNEPAPSFPLIKTVFPSWDNDARRQGRGAVYHGSTPENYR-- 677
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
W+ +A++K + F + R+ F + E + A+L + + + + +
Sbjct: 678 -RWMEGVIAYAKANP--FHNERMMFINAWNEWAEGAYLEPDLHFGAAYLNATARAIYGRR 734
Query: 128 EL 129
++
Sbjct: 735 QV 736
>gi|167903945|ref|ZP_02491150.1| glycosyl transferase, group 1 [Burkholderia pseudomallei NCTC 13177]
Length = 1741
Score = 96.9 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 6/113 (5%)
Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70
G ++L E+ G W ++ ++E
Sbjct: 968 TGYAGHILDWTHYLERSRSYPDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE-- 1025
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
WL + + ++ P R+ F + E + A L +R + + S+
Sbjct: 1026 -WLFNA-SVDTVRRIENPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATSD 1076
>gi|313202892|ref|YP_004041549.1| hypothetical protein Palpr_0404 [Paludibacter propionicigenes WB4]
gi|312442208|gb|ADQ78564.1| hypothetical protein Palpr_0404 [Paludibacter propionicigenes WB4]
Length = 381
Score = 96.6 bits (239), Expect = 6e-18, Method: Composition-based stats.
Identities = 8/116 (6%), Positives = 26/116 (22%), Gaps = 9/116 (7%)
Query: 11 LGKIENLLLRLDVEEK-GNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
L + + + + + W +P+ F
Sbjct: 260 LHRPPRITDYRKYYKFLVDKSEDACEDVLPTIVPNWDHTPRSGWNGTLFVHATPEYFRKH 319
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + + P R+ S E + ++ + + + +
Sbjct: 320 VDEVLDIVH------KKSPERRVVMLKSWNEWGEGNYMEPDLVFGKAYIRALRDAI 369
>gi|94972405|ref|YP_595623.1| hypothetical protein LIC007 [Lawsonia intracellularis PHE/MN1-00]
gi|94731942|emb|CAJ53959.1| conserved hypothetical protein [Lawsonia intracellularis
PHE/MN1-00]
Length = 789
Score = 96.2 bits (238), Expect = 8e-18, Method: Composition-based stats.
Identities = 22/151 (14%), Positives = 40/151 (26%), Gaps = 21/151 (13%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K G+I + + E + W +P++ S +
Sbjct: 298 KRFKGRIRHYSM---FAEAVVKDYTTKYTLYPCVFPGWDNTPRRLYFSSIFACSTPQAYR 354
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
WL F+ S R F + E + A L N+ + + S
Sbjct: 355 ---QWLTDACTFA--STTHEKDNRFVFINAWNEWAEGAHLEPNKAYGYAYLNATSRVVEN 409
Query: 126 VKELFEGWNDRPSSPKKSGLTIKSKIAIVVH 156
+ P + K+ +V H
Sbjct: 410 F-----------AVPPSTAENNPHKVLVVGH 429
>gi|86132907|ref|ZP_01051498.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
gi|85816613|gb|EAQ37800.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
Length = 361
Score = 95.4 bits (236), Expect = 1e-17, Method: Composition-based stats.
Identities = 18/127 (14%), Positives = 37/127 (29%), Gaps = 11/127 (8%)
Query: 2 YKVFRLKSKLGKIEN-LLLRLDVEEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDV 58
Y L+ I+N L D E+ ++Q G +W + +++ +
Sbjct: 240 YTTALLRKFKWTIDNRYELFYDYEQFVDLQINTEFKSKVYPGITPMWDNTARRKKNYFAL 299
Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116
H + WL+ Y P + F + E + L + +
Sbjct: 300 HNSTPQ---KYAKWLKHI--VLNYPWQKMPENYL-FINAWNEWAEGNHLEPCQKWGKQYL 353
Query: 117 PFDSEKF 123
+
Sbjct: 354 EETYKAL 360
>gi|294672884|ref|YP_003573500.1| hypothetical protein PRU_0097 [Prevotella ruminicola 23]
gi|294473985|gb|ADE83374.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 369
Score = 95.4 bits (236), Expect = 1e-17, Method: Composition-based stats.
Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 8/94 (8%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
Y W SP+ + +FE + K P +I F
Sbjct: 281 VYPAIYPNWDHSPRSGRNGFIIVDSTPDLFEKHVAQ------VLDEVKSKQPEHQIAFIK 334
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
S E + ++ + N + S + V+
Sbjct: 335 SWNEWGEGNYIEPDLKFGNGYLEALSRQIEKVRY 368
>gi|253565823|ref|ZP_04843278.1| conserved hypothetical protein [Bacteroides sp. 3_2_5]
gi|251946102|gb|EES86509.1| conserved hypothetical protein [Bacteroides sp. 3_2_5]
Length = 362
Score = 95.0 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 10/117 (8%), Positives = 29/117 (24%), Gaps = 12/117 (10%)
Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74
++ + E Y W +P+ + +FE
Sbjct: 256 YKKVIPTLIGELERNCDNY----FPTIIPNWDHTPRSGVNGDLFTKSTPDLFE------I 305
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129
+ + ++ F S E + ++ + + + ++ L
Sbjct: 306 HCMDVLSSVTKKNTNRQVCFLKSWNEWGEGNYMEPDLKYGKGYIYALRKVVDTLESL 362
>gi|218244934|ref|YP_002370305.1| polysaccharide biosynthesis protein [Cyanothece sp. PCC 8801]
gi|218165412|gb|ACK64149.1| polysaccharide biosynthesis protein [Cyanothece sp. PCC 8801]
Length = 383
Score = 94.6 bits (234), Expect = 2e-17, Method: Composition-based stats.
Identities = 12/91 (13%), Positives = 29/91 (31%), Gaps = 8/91 (8%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G W + ++++ + + I+E +WL++ + + P I F +
Sbjct: 298 FPGVTPSWDNTARRQVAATILKDSTPEIYE---YWLKAVIEKTISKPELPP---IIFINA 351
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
E + L + S +
Sbjct: 352 WNEWAEGNHLEPCQRWGRSYLEATQRAIKQF 382
>gi|148264392|ref|YP_001231098.1| lipopolysaccharide biosynthesis protein-like protein [Geobacter
uraniireducens Rf4]
gi|146397892|gb|ABQ26525.1| Lipopolysaccharide biosynthesis protein-like protein [Geobacter
uraniireducens Rf4]
Length = 368
Score = 94.6 bits (234), Expect = 2e-17, Method: Composition-based stats.
Identities = 13/123 (10%), Positives = 33/123 (26%), Gaps = 9/123 (7%)
Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
+ + K GK D + ++ + + W +P+ + +H
Sbjct: 238 QYQVKTGKPAIFSYEKDFADLQPIKIAHG-DNYPCLLPNWDNTPRSKSNGLVLHDSTPEA 296
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + S+ ++ F S E + L + + + +
Sbjct: 297 FRKHVKKALEI------SRDKPDERKLVFIKSWNEWAEGNHLEPDLKFGRAYLEILRNEI 350
Query: 124 LYV 126
Sbjct: 351 SNE 353
>gi|87201246|ref|YP_498503.1| polysaccharide biosynthesis protein [Novosphingobium
aromaticivorans DSM 12444]
gi|87136927|gb|ABD27669.1| polysaccharide biosynthesis protein [Novosphingobium
aromaticivorans DSM 12444]
Length = 377
Score = 93.9 bits (232), Expect = 4e-17, Method: Composition-based stats.
Identities = 11/86 (12%), Positives = 28/86 (32%), Gaps = 7/86 (8%)
Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99
W S +++ + WLR +A+++ + P R F +
Sbjct: 287 CVTPGWDNSARKKNRPLIFVGSTPERYG---RWLREMVAWTRR--NAPPERRFIFINAWN 341
Query: 100 E--QKAFLRLNRFMSNSRMPFDSEKF 123
E + L ++ ++ + +
Sbjct: 342 EWAEGNHLEPDQRNGHANLEATARAL 367
>gi|288803153|ref|ZP_06408588.1| glycosyltransferase [Prevotella melaninogenica D18]
gi|288334414|gb|EFC72854.1| glycosyltransferase [Prevotella melaninogenica D18]
Length = 381
Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats.
Identities = 18/121 (14%), Positives = 32/121 (26%), Gaps = 9/121 (7%)
Query: 7 LKSKLGKIENL-LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
L KL + +L L V W +P+ +
Sbjct: 267 LHKKLSFLPSLKLDYSKVVSNFFAPEDKWDNVYPMIIPGWDRTPRAGNSEGIYINSTPEN 326
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F+ I S + +I F S E + ++ N ++ + E
Sbjct: 327 FKKHIKQALSIVD------SKPQDHKILFLKSWNEWGEGNYVEPNLKFGHAYLDAIKENL 380
Query: 124 L 124
L
Sbjct: 381 L 381
>gi|241763180|ref|ZP_04761239.1| Methyltransferase type 12 [Acidovorax delafieldii 2AN]
gi|241367679|gb|EER61945.1| Methyltransferase type 12 [Acidovorax delafieldii 2AN]
Length = 1786
Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats.
Identities = 17/136 (12%), Positives = 40/136 (29%), Gaps = 12/136 (8%)
Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74
+ + V + Y ++V +W + + + +H F+ W+
Sbjct: 1243 YDQVRDYYVAQNDRKSFDYFRSNVP----MWDNTARYGTGALLLHGSTPQSFQ---QWME 1295
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
+A ++ R + E + A L + S + + E
Sbjct: 1296 HSIADAQ--ANLPADRRFVVVNAWNEWAEGAHLEPDTRYGYSYLNSVGRALAGLPYAHEL 1353
Query: 133 WNDRPSSPKKSGLTIK 148
P P+ L ++
Sbjct: 1354 NATAPL-PQGLCLQVQ 1368
>gi|302186464|ref|ZP_07263137.1| glycosyl transferase family 2 [Pseudomonas syringae pv. syringae 642]
Length = 1318
Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats.
Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+ D+ + + G W + ++ TS F+ WL
Sbjct: 1206 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1262
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+A +K + +F R+ F + E + A+L +R ++ +
Sbjct: 1263 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1308
>gi|257481987|ref|ZP_05636028.1| glycosyl transferase family 2 [Pseudomonas syringae pv. tabaci ATCC
11528]
Length = 1360
Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats.
Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+ D+ + + G W + ++ TS F+ WL
Sbjct: 1248 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1304
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+A +K + +F R+ F + E + A+L +R ++ +
Sbjct: 1305 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1350
>gi|86140376|ref|ZP_01058935.1| Glycosyltransferase [Leeuwenhoekiella blandensis MED217]
gi|85832318|gb|EAQ50767.1| Glycosyltransferase [Leeuwenhoekiella blandensis MED217]
Length = 380
Score = 93.5 bits (231), Expect = 6e-17, Method: Composition-based stats.
Identities = 19/125 (15%), Positives = 39/125 (31%), Gaps = 15/125 (12%)
Query: 6 RLKSKLGKIENLLLRL-----DVEEKGNMQAIYIP--AHVSGYYVLWSFSPKQRITSKDV 58
+ KS +G + R D ++ + + IP ++ + W SP+ S
Sbjct: 255 KYKSLIGHTNKIGERKRPLIFDYKKGARLLSQNIPHKKYIPCVFPNWDNSPRSGKKSLIF 314
Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116
+ W + K + +I S E + +L ++ S +
Sbjct: 315 KNATPN------AWKEHLKHTIEVLKSKPENPQIIIIKSWNEWAEGNYLEPDQEFGISML 368
Query: 117 PFDSE 121
E
Sbjct: 369 KVVKE 373
>gi|330989699|gb|EGH87802.1| glycosyl transferase family 2 [Pseudomonas syringae pv. lachrymans
str. M301315]
Length = 1301
Score = 93.1 bits (230), Expect = 6e-17, Method: Composition-based stats.
Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+ D+ + + G W + ++ TS F+ WL
Sbjct: 1189 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1245
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+A +K + +F R+ F + E + A+L +R ++ +
Sbjct: 1246 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1291
>gi|262383300|ref|ZP_06076436.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262294198|gb|EEY82130.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 387
Score = 93.1 bits (230), Expect = 7e-17, Method: Composition-based stats.
Identities = 17/128 (13%), Positives = 37/128 (28%), Gaps = 15/128 (11%)
Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP-----AHVSGYYVLWSFSPKQRITSKD 57
+V R + + +++ + IY+P W S + ++
Sbjct: 264 RVIRW--LMFNLFKYRTLSKCDQRVINKYIYVPEDKWDNVYPILLPQWDRSARAGKMARI 321
Query: 58 VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSR 115
+F S I S L + +I F S E + ++ + +
Sbjct: 322 YVGSTPDVFRSQIQSALSLL------ENKTDEHKILFLRSWNEWAEGNYVEPDLKYGHGY 375
Query: 116 MPFDSEKF 123
+ E
Sbjct: 376 LDVLRECL 383
>gi|94497762|ref|ZP_01304329.1| hypothetical protein SKA58_12300 [Sphingomonas sp. SKA58]
gi|94422811|gb|EAT07845.1| hypothetical protein SKA58_12300 [Sphingomonas sp. SKA58]
Length = 1425
Score = 93.1 bits (230), Expect = 7e-17, Method: Composition-based stats.
Identities = 22/229 (9%), Positives = 57/229 (24%), Gaps = 46/229 (20%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K G+I + +D + Y G + W ++ ++ H F
Sbjct: 596 KDFGGEIFDYGAVVDGD-VERYADGYEWPVHRGAMLGWDNMARRLTDARVFHGATPQGFR 654
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
W++ L + + F + E + +L ++ + +
Sbjct: 655 ---RWIKGILDQESRHNSAP--ETLMFINAWNEWAEGTYLEPDQRWGRTNLAAFRSAVDA 709
Query: 126 VKELFEGWND-----------------RPSSPK-------------KSGLTIKSKIAIVV 155
+ P +P + K I +
Sbjct: 710 TPGMKAVTLPAGIAAAPKQEGRLAHLGSPLAPDGTMPRGPVWYRGYREVDPTKPTILLCA 769
Query: 156 HCYYQDT------WIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYF 198
H +++ L + + + +T+ N + ++
Sbjct: 770 HISGHQLFGGERSLLDVLEALATMPVN--VIMTLPSDNNRAYIEAIQKL 816
>gi|331008848|gb|EGH88904.1| glycosyl transferase family 2 [Pseudomonas syringae pv. tabaci ATCC
11528]
Length = 846
Score = 92.7 bits (229), Expect = 8e-17, Method: Composition-based stats.
Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
+ D+ + + G W + ++ TS F+ WL
Sbjct: 734 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 790
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+A +K + +F R+ F + E + A+L +R ++ +
Sbjct: 791 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 836
>gi|330996421|ref|ZP_08320304.1| hypothetical protein HMPREF9442_01389 [Paraprevotella xylaniphila
YIT 11841]
gi|329573279|gb|EGG54893.1| hypothetical protein HMPREF9442_01389 [Paraprevotella xylaniphila
YIT 11841]
Length = 367
Score = 92.7 bits (229), Expect = 1e-16, Method: Composition-based stats.
Identities = 12/121 (9%), Positives = 24/121 (19%), Gaps = 8/121 (6%)
Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64
+ + K + + W SP+ +
Sbjct: 247 AKFQRIALKRGRHIEYSRASQYFQGPEEQANDCYPTLIPNWDHSPRSGRAGHILIRSTPE 306
Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
F+ A RI F S E + ++ + + E
Sbjct: 307 KFKKHAQ------ASFNNISHKAMEDRIVFLKSWNEWAEGNYMEPDLKFGKGYLKALKEA 360
Query: 123 F 123
Sbjct: 361 I 361
>gi|255014255|ref|ZP_05286381.1| hypothetical protein B2_10114 [Bacteroides sp. 2_1_7]
Length = 392
Score = 92.7 bits (229), Expect = 1e-16, Method: Composition-based stats.
Identities = 15/129 (11%), Positives = 37/129 (28%), Gaps = 10/129 (7%)
Query: 1 MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60
+++V + + ++ L + + + W SP+ +
Sbjct: 272 IHRVLSSRFHISSLDKY-DYLKIIKHYYVPEDKWDNVYPSLLPQWDRSPRSGVNG-IYVN 329
Query: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPF 118
F+ I+ + L RI F S E + ++ + + +
Sbjct: 330 STPVNFKKMIYEALNLLN------NKQDEHRILFLKSWNEWAEGNYVEPDLKYGHGYLDV 383
Query: 119 DSEKFLYVK 127
E + K
Sbjct: 384 LRECLVNDK 392
>gi|118580521|ref|YP_901771.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM
2379]
gi|118503231|gb|ABK99713.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM
2379]
Length = 363
Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats.
Identities = 15/119 (12%), Positives = 37/119 (31%), Gaps = 13/119 (10%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
LK ++ + +L+ + +E + SP+++ S H ++
Sbjct: 252 LKHQIYEYSSLVDAMLGKELPTYPF------YRCVCPSFDNSPRRKTDSVVFHNSTPELY 305
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
WL + ++ P R+ F + E + L + + +
Sbjct: 306 ---FRWLNEVVEWTSC--NHSPEERLVFVNAWNEWGEGNHLEPDLRWGKQYLEKTRQAI 359
>gi|254411253|ref|ZP_05025030.1| hypothetical protein MC7420_1744 [Microcoleus chthonoplastes PCC
7420]
gi|196181754|gb|EDX76741.1| hypothetical protein MC7420_1744 [Microcoleus chthonoplastes PCC
7420]
Length = 379
Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats.
Identities = 14/135 (10%), Positives = 37/135 (27%), Gaps = 23/135 (17%)
Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIY---------------IPAHVSGYYVLWSFSPK 50
++K KL + ++ + +Y + W +P+
Sbjct: 249 KVKQKLSAFSSRRFYQKYKQFSDYPLLYSYEKAIKCAFKGSHPYFVTYPCIFPNWDNTPR 308
Query: 51 QRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLN 108
I +F + ++ + K R+ F S E + +L +
Sbjct: 309 TGIYGLVFLKSTPDLFRVHLQEAIETVSERESEK------RLIFIRSWNEWAEGNYLEPD 362
Query: 109 RFMSNSRMPFDSEKF 123
+ + ++
Sbjct: 363 LKFGKAFLEVIRDEI 377
>gi|256827944|ref|YP_003156672.1| glycosyl transferase family 2 [Desulfomicrobium baculatum DSM 4028]
gi|256577120|gb|ACU88256.1| glycosyl transferase family 2 [Desulfomicrobium baculatum DSM 4028]
Length = 1077
Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats.
Identities = 12/118 (10%), Positives = 32/118 (27%), Gaps = 14/118 (11%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
+ ++ + + R + A Y W +P++ S + +
Sbjct: 800 EHRVYDYDEFVAR----QLTKPAASY--RRYPCVTPRWDNTPRRPKDSVVLLDPSPDRYR 853
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
WL + R+ F + E + L + ++ + +
Sbjct: 854 ---RWLSHAVESVT---KLPADERLVFINAWNEWGEGCALEPDLLRGDAYLKATAAAL 905
>gi|149276164|ref|ZP_01882308.1| hypothetical protein PBAL39_00552 [Pedobacter sp. BAL39]
gi|149232684|gb|EDM38059.1| hypothetical protein PBAL39_00552 [Pedobacter sp. BAL39]
Length = 399
Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats.
Identities = 21/124 (16%), Positives = 38/124 (30%), Gaps = 11/124 (8%)
Query: 5 FRLKSKLGKIENLLLRLDVEEKG--NMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62
++ ++ L L EK + + + V W SP+ S V
Sbjct: 281 YQFVHFTEVNKDYLDILTAVEKEWARIDTAFEFNYYPHISVGWDNSPRTG-KSAVVKNNT 339
Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120
FE LR A++ P + S E + ++L+ + +
Sbjct: 340 PENFEKG---LRMAKAYADAHPKQVP---LITINSWNEWTETSYLQPDNVYGYGYLDAIK 393
Query: 121 EKFL 124
FL
Sbjct: 394 RVFL 397
>gi|163814421|ref|ZP_02205810.1| hypothetical protein COPEUT_00572 [Coprococcus eutactus ATCC 27759]
gi|158450056|gb|EDP27051.1| hypothetical protein COPEUT_00572 [Coprococcus eutactus ATCC 27759]
Length = 387
Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats.
Identities = 14/109 (12%), Positives = 37/109 (33%), Gaps = 12/109 (11%)
Query: 20 RLDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
R+D ++ P +V G +V W +P+ + + FE ++
Sbjct: 276 RVDYDKAWETILNTTPESIINVPGAFVDWDNTPRHGERGRVYIGKTPEKFEKYLSE---- 331
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ +K + + F + E + +L ++ + + +
Sbjct: 332 --QIRRAKNVYHKD-MIFMYAWNEWAEGGYLEPDQTSGYAYLEAIKKAL 377
>gi|298377838|ref|ZP_06987788.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_19]
gi|298265284|gb|EFI06947.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_19]
Length = 366
Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 8/90 (8%), Positives = 19/90 (21%), Gaps = 8/90 (8%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
W SP+ + +F+ + + RI S
Sbjct: 280 YPTIIPNWDHSPRTGRYGAILKDSTPQLFQKHVEQTVHLIL------NKDDDHRIVILKS 333
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
E + ++ + +
Sbjct: 334 WNEWAEGNYVEPDLNFGRGYLEALRTALQK 363
>gi|253578786|ref|ZP_04856057.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251849729|gb|EES77688.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length = 387
Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 13/111 (11%), Positives = 34/111 (30%), Gaps = 12/111 (10%)
Query: 18 LLRLDVEEKGNMQAIYIP---AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74
+L+ D +E +IP ++ G +V W +P++ + +
Sbjct: 276 VLKTDYDEAWKAILEHIPENEKNIPGAFVGWDNTPRKGHRGQVYIGDTPEKLNKY----- 330
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ S + F + E + +L + + +
Sbjct: 331 --MSKQIQRAKSIYKKDMIFMYAWNEWAEGGYLEPDERTGYKNLEAIRDAL 379
>gi|313203439|ref|YP_004042096.1| hypothetical protein Palpr_0961 [Paludibacter propionicigenes WB4]
gi|312442755|gb|ADQ79111.1| hypothetical protein Palpr_0961 [Paludibacter propionicigenes WB4]
Length = 378
Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 10/90 (11%), Positives = 18/90 (20%), Gaps = 8/90 (8%)
Query: 36 AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95
W +P+ +FE L RI F
Sbjct: 290 NIFPTLIPNWDHTPRSGYNGYLYTKSTPELFEK------HALQVFNMINSKPEDDRICFL 343
Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + ++ + +
Sbjct: 344 KSWNEWGEGNYMEPDLKFGKKYIYALRSAL 373
>gi|146281782|ref|YP_001171935.1| hypothetical protein PST_1402 [Pseudomonas stutzeri A1501]
gi|145569987|gb|ABP79093.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 1615
Score = 91.6 bits (226), Expect = 2e-16, Method: Composition-based stats.
Identities = 21/135 (15%), Positives = 35/135 (25%), Gaps = 15/135 (11%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFE 67
LG L E+ W S ++R + +
Sbjct: 611 QFLGDYGKLADY--WSERPRPHY----KRFRCLVPSWDNSARRRKGRAGLFVNATPERYG 664
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
WL LA K + R+ F + E + L + + +
Sbjct: 665 ---QWLEHTLA--KTCEEFAGDERLVFINAWNEWGEGCHLEPDVRHGRAYLEATRNALDK 719
Query: 126 VKELFEGWNDRPSSP 140
+K E RP +P
Sbjct: 720 LKAATEI-PVRPYNP 733
>gi|256819540|ref|YP_003140819.1| hypothetical protein Coch_0700 [Capnocytophaga ochracea DSM 7271]
gi|256581123|gb|ACU92258.1| conserved hypothetical protein [Capnocytophaga ochracea DSM 7271]
Length = 366
Score = 91.6 bits (226), Expect = 2e-16, Method: Composition-based stats.
Identities = 14/124 (11%), Positives = 36/124 (29%), Gaps = 8/124 (6%)
Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
++ K+ + +++ + + W +P+
Sbjct: 249 KIYRKIFSVPDIVDYSKIYKSFITPLEAQENIFPTIIPNWDHTPRSGKGGTVFKNTNGEN 308
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F+ + + ++ +Y K RI F S E + +L + + E
Sbjct: 309 FKQHVMEVLKIISQKEYDK------RIVFIKSWNEWGEGNYLEPDLKNGYLYLDILQELL 362
Query: 124 LYVK 127
+ K
Sbjct: 363 VSQK 366
>gi|220926122|ref|YP_002501424.1| group 1 glycosyl transferase [Methylobacterium nodulans ORS 2060]
gi|219950729|gb|ACL61121.1| glycosyl transferase group 1 [Methylobacterium nodulans ORS 2060]
Length = 787
Score = 91.2 bits (225), Expect = 2e-16, Method: Composition-based stats.
Identities = 15/132 (11%), Positives = 40/132 (30%), Gaps = 10/132 (7%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
++ +G +E+ + V G + +W + ++R + +
Sbjct: 646 ENFVGYLEDYV---GVASSSINSPPTDYVRYRGCFPMWDNTARRRNAGHVFINEST---K 699
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
+ +WLR + + + + F + E + +L + + + E
Sbjct: 700 GYAYWLRFLVHEALVRRDQVEP--MVFINAWNEWAEGTYLEPDEHYGRAFLEVTREALAQ 757
Query: 126 VKELFEGWNDRP 137
F P
Sbjct: 758 GIADFVVGVRNP 769
>gi|282879758|ref|ZP_06288488.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
gi|281306427|gb|EFA98457.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
Length = 381
Score = 91.2 bits (225), Expect = 3e-16, Method: Composition-based stats.
Identities = 10/106 (9%), Positives = 27/106 (25%), Gaps = 8/106 (7%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
+ + + W +P+ + + F+ I + +
Sbjct: 281 YEKITQHFFAPEDSWQNVYPSIFPQWDRTPRAGNSEGVYVNATPTTFKKHIQNALNVI-- 338
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K RI F + E + ++ + + + E
Sbjct: 339 ----KNKDMEHRILFLRAWNEWGEGNYVEPDLKYGHGFLDAIKEAI 380
>gi|302880031|ref|YP_003848595.1| lipopolysaccharide biosynthesis protein-like protein [Gallionella
capsiferriformans ES-2]
gi|302582820|gb|ADL56831.1| lipopolysaccharide biosynthesis protein-like protein [Gallionella
capsiferriformans ES-2]
Length = 364
Score = 90.8 bits (224), Expect = 3e-16, Method: Composition-based stats.
Identities = 10/89 (11%), Positives = 23/89 (25%), Gaps = 8/89 (8%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
Y W +P++ + ++FE+ + L ++ F S
Sbjct: 282 YPCIYPNWDNTPRKGRKGLVLANSTPALFEAHLNDAVGALGERD------DEHKLVFVKS 335
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
E + L + +
Sbjct: 336 WNEWAEGNHLEPDTKWGLQYLQALKRVIE 364
>gi|312130478|ref|YP_003997818.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM
17132]
gi|311907024|gb|ADQ17465.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM
17132]
Length = 380
Score = 90.8 bits (224), Expect = 4e-16, Method: Composition-based stats.
Identities = 16/125 (12%), Positives = 41/125 (32%), Gaps = 10/125 (8%)
Query: 6 RLKSKLG--KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63
R+K+KLG + + +V ++ + + H W S +++ + +H
Sbjct: 261 RIKNKLGWGQTYRKIDYAEVVQRMKSKPSFTQKHFKALVPGWDNSARRKNDAFIMHDATP 320
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
++E WL + + F + E + L ++ + + +
Sbjct: 321 ELYED---WLDHTCKTTT---IYSEEENFLFINAWNEWAEGNHLEPDKKWGRAFLETTKK 374
Query: 122 KFLYV 126
Sbjct: 375 ILSKY 379
>gi|298384772|ref|ZP_06994332.1| glycosyl transferase, group 2 family [Bacteroides sp. 1_1_14]
gi|298263051|gb|EFI05915.1| glycosyl transferase, group 2 family [Bacteroides sp. 1_1_14]
Length = 369
Score = 90.8 bits (224), Expect = 4e-16, Method: Composition-based stats.
Identities = 13/127 (10%), Positives = 34/127 (26%), Gaps = 9/127 (7%)
Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61
+K+ + + ++ D+ + + W SP+ +
Sbjct: 250 HKLRKYFPSIAPLDKY-KYKDIIKNFYTDYDRLENSYPSIIPNWDRSPRGGRRAVIYTGS 308
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
+F+ R K + +I F S E + ++ + + +
Sbjct: 309 TPELFK------RHIEDAIKIVENKKAEHKIIFLRSWNEWAEGNYVEPDIKFGHGYLDSL 362
Query: 120 SEKFLYV 126
L
Sbjct: 363 RSVILEE 369
>gi|113476766|ref|YP_722827.1| hypothetical protein Tery_3239 [Trichodesmium erythraeum IMS101]
gi|110167814|gb|ABG52354.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 955
Score = 90.4 bits (223), Expect = 4e-16, Method: Composition-based stats.
Identities = 17/119 (14%), Positives = 33/119 (27%), Gaps = 8/119 (6%)
Query: 17 LLLRLDVEEKGNMQAIY-IPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75
+ +Q W + +++ + E +E FWLR
Sbjct: 644 FVYDYKQTAINTIQEKLPDYQVFLSVMTSWDNTARRQQNATVWLNSEPEDYE---FWLRG 700
Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
K K S I F + E + A+L ++ + + L +
Sbjct: 701 TTE--KALKNYGDSENIVFINAWNEWAEGAYLEPDKKYGCAYLEATQRVLLGQHSIQTA 757
>gi|323139972|ref|ZP_08074990.1| glycosyl transferase family 2 [Methylocystis sp. ATCC 49242]
gi|322394772|gb|EFX97355.1| glycosyl transferase family 2 [Methylocystis sp. ATCC 49242]
Length = 984
Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats.
Identities = 13/124 (10%), Positives = 43/124 (34%), Gaps = 7/124 (5%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75
+ ++ + + V W +P+ S + F++++ W
Sbjct: 866 EVHDYRELALAFMRRVEPGFPRIRSVLVGWDNTPRHPDNSLILEQSTPGAFQAWLEW--- 922
Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133
+ + + ++ RI F + E + ++L +R ++ + + + +
Sbjct: 923 --TYRRTIEQNYGDARIVFINAWNEWCEGSYLEPDRHFGHAYLQALRNAQESIASGSDSF 980
Query: 134 NDRP 137
++P
Sbjct: 981 VEKP 984
>gi|281424202|ref|ZP_06255115.1| glycosyltransferase [Prevotella oris F0302]
gi|281401471|gb|EFB32302.1| glycosyltransferase [Prevotella oris F0302]
Length = 361
Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats.
Identities = 18/124 (14%), Positives = 29/124 (23%), Gaps = 8/124 (6%)
Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64
FR K G + LD + + H Y W S + + E
Sbjct: 239 FRTLRKFGGVVFGNNYLDYCNFFIKKYTPMAKHFPCIYPNWDHSARSGKIATIFRNVEPE 298
Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
I W + F S E + +L +R + +
Sbjct: 299 I------WGDFCKRLFVKCSRQPTEENLIFIKSWNEWGEGNYLEPDRRYGRGYLEELKKA 352
Query: 123 FLYV 126
Sbjct: 353 LSSF 356
>gi|300728262|ref|ZP_07061630.1| conserved hypothetical protein [Prevotella bryantii B14]
gi|299774497|gb|EFI71121.1| conserved hypothetical protein [Prevotella bryantii B14]
Length = 371
Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats.
Identities = 14/122 (11%), Positives = 30/122 (24%), Gaps = 11/122 (9%)
Query: 7 LKSKLGKIEN-LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK--DVHFQEL 63
K I L +K + + + W SP+ T + E
Sbjct: 250 WNQKFRGIPKGALDYRKKYKKFILPKDKEIGVIPEIFPNWDHSPRSGKTGASTIYYNSEP 309
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
F + + K S ++ S E + ++ + + +
Sbjct: 310 EFFYKHVKEALDAI------KDKPESDQMLILKSWNEWGEGNYMEPDLRYGRGYIKALRK 363
Query: 122 KF 123
Sbjct: 364 AI 365
>gi|327312342|ref|YP_004327779.1| hypothetical protein HMPREF9137_0027 [Prevotella denticola F0289]
gi|326944812|gb|AEA20697.1| conserved hypothetical protein [Prevotella denticola F0289]
Length = 381
Score = 89.2 bits (220), Expect = 9e-16, Method: Composition-based stats.
Identities = 10/107 (9%), Positives = 22/107 (20%), Gaps = 8/107 (7%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
V + W +P+ F+ I +
Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVI- 338
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K +I F S E + ++ + + +
Sbjct: 339 -----KEKPKEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIKASV 380
>gi|295087225|emb|CBK68748.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 369
Score = 89.2 bits (220), Expect = 9e-16, Method: Composition-based stats.
Identities = 7/89 (7%), Positives = 23/89 (25%), Gaps = 10/89 (11%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
+ W +P+ + + F + + + + ++ F
Sbjct: 283 VIPCIVPNWDHTPRSGMKGSMFLNESPEFFRLHVEDALKTVQYKR--------NKLIFLK 334
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + ++ + + E
Sbjct: 335 SWNEWGEGNYMEPDLTFGKGYINALHEAL 363
>gi|325853275|ref|ZP_08171333.1| hypothetical protein HMPREF9303_1037 [Prevotella denticola CRIS
18C-A]
gi|325484364|gb|EGC87289.1| hypothetical protein HMPREF9303_1037 [Prevotella denticola CRIS
18C-A]
Length = 381
Score = 89.2 bits (220), Expect = 1e-15, Method: Composition-based stats.
Identities = 10/107 (9%), Positives = 22/107 (20%), Gaps = 8/107 (7%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
V + W +P+ F+ I +
Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVI- 338
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K +I F S E + ++ + + +
Sbjct: 339 -----KEKPKEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIKASV 380
>gi|255526750|ref|ZP_05393652.1| glycosyltransferase [Clostridium carboxidivorans P7]
gi|296187044|ref|ZP_06855444.1| hypothetical protein CLCAR_2519 [Clostridium carboxidivorans P7]
gi|255509585|gb|EET85923.1| glycosyltransferase [Clostridium carboxidivorans P7]
gi|296048482|gb|EFG87916.1| hypothetical protein CLCAR_2519 [Clostridium carboxidivorans P7]
Length = 374
Score = 88.9 bits (219), Expect = 1e-15, Method: Composition-based stats.
Identities = 14/125 (11%), Positives = 37/125 (29%), Gaps = 12/125 (9%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAH---VSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
G ++R+ + P + G +V W + ++ F+
Sbjct: 257 GMRPGGVIRVSYDAIWKEILKRKPQDEKCIPGAFVDWDNTSRKGEKGSIYEGATPEKFQK 316
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
++ A + ++ + + F + E + +L + + + L
Sbjct: 317 YLT------AQIRRARDVYKKD-MLFIFAWNEWAECGYLEPDEKFGYGYLEAIKQALLDN 369
Query: 127 KELFE 131
E E
Sbjct: 370 DEFSE 374
>gi|294775796|ref|ZP_06741298.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294450382|gb|EFG18880.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 364
Score = 88.5 bits (218), Expect = 2e-15, Method: Composition-based stats.
Identities = 13/94 (13%), Positives = 26/94 (27%), Gaps = 9/94 (9%)
Query: 38 VSGYYVLWSFSPKQRITS-KDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
W SP+++ +F+ WL+ L K + F
Sbjct: 276 FPCVSPGWDNSPRRKKPPYMAFVGSTPELFKK---WLKDTL---VRFKPFSKEENLVFIN 329
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
+ E + L ++ + E L +
Sbjct: 330 AWNEWAEGNHLEPDQKWGRRYLEVTKEAILETSK 363
>gi|295084063|emb|CBK65586.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 367
Score = 88.5 bits (218), Expect = 2e-15, Method: Composition-based stats.
Identities = 12/95 (12%), Positives = 26/95 (27%), Gaps = 8/95 (8%)
Query: 31 AIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSC 90
Y W SP+ + ++FE I ++ +
Sbjct: 277 YDYREDVYPSIIPNWDRSPRGGRRAVIYTDSTPALFEEHIKTALEIISKKQ------DEH 330
Query: 91 RIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+I F S E + ++ + + + E
Sbjct: 331 KILFLRSWNEWAEGNYVEPDLKFGHGYLDALKESI 365
>gi|325270047|ref|ZP_08136655.1| glycosyltransferase [Prevotella multiformis DSM 16608]
gi|324987632|gb|EGC19607.1| glycosyltransferase [Prevotella multiformis DSM 16608]
Length = 381
Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats.
Identities = 9/104 (8%), Positives = 21/104 (20%), Gaps = 8/104 (7%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
V + W +P+ F+ I +
Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVIN 339
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120
+I F S E + ++ + + +
Sbjct: 340 ------DKPNEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIK 377
>gi|190572675|ref|YP_001970520.1| putative glycosyl transferase [Stenotrophomonas maltophilia K279a]
gi|190010597|emb|CAQ44206.1| putative glycosyl transferase [Stenotrophomonas maltophilia K279a]
Length = 436
Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats.
Identities = 16/109 (14%), Positives = 38/109 (34%), Gaps = 7/109 (6%)
Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76
L+ V + + G W + +++ TS + SI++ +WLR
Sbjct: 245 LVDYRKVVAQSISRPKPDFRWYRGIVPSWDNTARRQHTSHTLVDASPSIYQ---YWLRRL 301
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ +++ + P +I F + E + L + + +
Sbjct: 302 VEYTRV--NNAPEDQILFINAWNEWGEGCHLEPDLKHGLAYLEATHAAL 348
>gi|260593223|ref|ZP_05858681.1| glycosyltransferase [Prevotella veroralis F0319]
gi|260534780|gb|EEX17397.1| glycosyltransferase [Prevotella veroralis F0319]
Length = 381
Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats.
Identities = 9/104 (8%), Positives = 21/104 (20%), Gaps = 8/104 (7%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78
V + W +P+ F+ I +
Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVIN 339
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120
+I F S E + ++ + + +
Sbjct: 340 ------DKPNEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIK 377
>gi|256830319|ref|YP_003159047.1| lipopolysaccharide biosynthesis protein-like protein
[Desulfomicrobium baculatum DSM 4028]
gi|256579495|gb|ACU90631.1| lipopolysaccharide biosynthesis protein-like protein
[Desulfomicrobium baculatum DSM 4028]
Length = 364
Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats.
Identities = 13/117 (11%), Positives = 30/117 (25%), Gaps = 8/117 (6%)
Query: 8 KSKLGKIENLLLRLD-VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
+ +LG+ ++ ++ + W +P+ + F
Sbjct: 239 RQRLGRFPRWVIDYSSLDRYFKNHLCDGITTLPTAIPNWDNTPRIGRRGLVFANSSPARF 298
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+ S + K RI F S E + +L + +
Sbjct: 299 ADHLRRSVSGFTAANDGK-----DRILFIKSWNEWAEGNYLEPDLVHDRGWLEAVRS 350
>gi|237714668|ref|ZP_04545149.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|262406534|ref|ZP_06083083.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294645683|ref|ZP_06723370.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806952|ref|ZP_06765775.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|229445437|gb|EEO51228.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|262355237|gb|EEZ04328.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292638962|gb|EFF57293.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294445839|gb|EFG14483.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 368
Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats.
Identities = 13/107 (12%), Positives = 27/107 (25%), Gaps = 8/107 (7%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
D+ Y W SP+ + ++FE I +
Sbjct: 266 YKDIISNFYTSYDYREDVYPSIIPNWDRSPRAGRRAVIYTGSTPALFEEHIKKALEVILQ 325
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ +I F S E + ++ + + + L
Sbjct: 326 KQ------DQHKILFLRSWNEWAEGNYVEPDLKFGHGYLDVLKSSIL 366
>gi|168218133|ref|ZP_02643758.1| conserved hypothetical protein [Clostridium perfringens NCTC 8239]
gi|182625720|ref|ZP_02953488.1| conserved hypothetical protein [Clostridium perfringens D str.
JGS1721]
gi|177908982|gb|EDT71464.1| conserved hypothetical protein [Clostridium perfringens D str.
JGS1721]
gi|182379836|gb|EDT77315.1| conserved hypothetical protein [Clostridium perfringens NCTC 8239]
Length = 353
Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats.
Identities = 17/119 (14%), Positives = 36/119 (30%), Gaps = 10/119 (8%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
K KLG ++ L N Y G +V W + +++ ++ S F
Sbjct: 240 FKKKLGVLDKLNYDNLWNAVINKNEDYGKKKFLGAFVSWDNTARKKNKGLVLNEDSPSKF 299
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + K F + E + +L ++ + + +E
Sbjct: 300 KKYFKKQYD--------KAIEIGSEYIFINAWNEWAEGTYLEPDKENEHGYIEALNEVL 350
>gi|55846838|gb|AAV67424.1| glycosyltransferase [Xanthomonas oryzae pv. oryzicola]
Length = 464
Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats.
Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 7/88 (7%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G W + +++ TS + SI++ +WL + +++ + P ++ F +
Sbjct: 266 YRGIVPSWDNTARRQHTSHILLNSSPSIYQ---YWLGRLVDYTRV--NNAPEDQLIFINA 320
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + L + + +
Sbjct: 321 WNEWGEGCHLEPDLKHGLAYLEATHAAV 348
>gi|166713475|ref|ZP_02244682.1| Tetratricopeptide TPR_2 [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 374
Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats.
Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 7/88 (7%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G W + +++ TS + SI++ +WL + +++ + P ++ F +
Sbjct: 203 YRGIVPSWDNTARRQHTSHILLNSSPSIYQ---YWLGRLVDYTRV--NNAPEDQLIFINA 257
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + L + + +
Sbjct: 258 WNEWGEGCHLEPDLKHGLAYLEATHAAV 285
>gi|325300544|ref|YP_004260461.1| hypothetical protein Bacsa_3463 [Bacteroides salanitronis DSM
18170]
gi|324320097|gb|ADY37988.1| hypothetical protein Bacsa_3463 [Bacteroides salanitronis DSM
18170]
Length = 385
Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats.
Identities = 10/105 (9%), Positives = 25/105 (23%), Gaps = 8/105 (7%)
Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81
V + + W +P+ + + F+ + +
Sbjct: 286 KVSKLLFAEEDKWNNVYPTLIPNWDRTPRNGKNAIVWYHNNPEFFKQEVEIALDVI---- 341
Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
K +I F S E + ++ + + E
Sbjct: 342 --KDKPMEHKILFLMSWNEWGEGNYMEPDIEFGKGYIHALREAIE 384
>gi|251771739|gb|EES52314.1| Lipopolysaccharide biosynthesis protein-like protein
[Leptospirillum ferrodiazotrophum]
Length = 360
Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats.
Identities = 7/89 (7%), Positives = 21/89 (23%), Gaps = 8/89 (8%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
+ +P+ + +F + + S + + F
Sbjct: 276 LHPCVINSFDNTPRSGVNGVVYKNATPDLFRNHLREAIS------SIENYPTERKFIFLK 329
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + L + + + +
Sbjct: 330 SWNEWAEGNHLEPDLRYGHGWLKAIQDVL 358
>gi|237725325|ref|ZP_04555806.1| conserved hypothetical protein [Bacteroides sp. D4]
gi|229436012|gb|EEO46089.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4]
Length = 383
Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats.
Identities = 17/127 (13%), Positives = 40/127 (31%), Gaps = 10/127 (7%)
Query: 1 MYKVFRLKSKLG-KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKD-V 58
+Y + R KLG +I D+ ++ + SP+ +
Sbjct: 262 IYYIKRFLMKLGIRILVKCQYKDIISNYYVEQDRWENVYPTIIPNFDRSPRSGWKTNILW 321
Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116
+ ++F+ I + L + +I F S E + ++ + ++ +
Sbjct: 322 YGSTPTLFKKHIIQALNLL------EGRSAEHKILFLQSWNEWGEGNYVEPDLKFGHAYL 375
Query: 117 PFDSEKF 123
E
Sbjct: 376 EVLREVI 382
>gi|68643200|emb|CAI33488.1| conserved hypothetical protein [Streptococcus pneumoniae]
Length = 366
Score = 86.9 bits (214), Expect = 5e-15, Method: Composition-based stats.
Identities = 12/90 (13%), Positives = 21/90 (23%), Gaps = 9/90 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
G +V W +P++ S FE + A F
Sbjct: 281 KNISPGAFVSWDNTPRRGNRSLVFDGANPKKFEKYF-------AKQVQRAKEEYHSDFIF 333
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+ E + A L + +
Sbjct: 334 INAWNEWAEGAHLEPDEQYGYGYLEAVRAV 363
>gi|168214851|ref|ZP_02640476.1| conserved hypothetical protein [Clostridium perfringens CPE str.
F4969]
gi|170713695|gb|EDT25877.1| conserved hypothetical protein [Clostridium perfringens CPE str.
F4969]
Length = 353
Score = 86.9 bits (214), Expect = 5e-15, Method: Composition-based stats.
Identities = 17/119 (14%), Positives = 36/119 (30%), Gaps = 10/119 (8%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
K KLG ++ L N Y G +V W + +++ ++ S F
Sbjct: 240 FKKKLGVLDKLNYDNLWNAVINKNEDYGKKKFLGAFVSWDNTARKKNKGLVLNEDSPSKF 299
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + K F + E + +L ++ + + +E
Sbjct: 300 KKYFKKQYD--------KAIEIGSEYIFINAWNEWAEGTYLEPDKENEHGYIKALNEVL 350
>gi|212694326|ref|ZP_03302454.1| hypothetical protein BACDOR_03852 [Bacteroides dorei DSM 17855]
gi|212662827|gb|EEB23401.1| hypothetical protein BACDOR_03852 [Bacteroides dorei DSM 17855]
Length = 370
Score = 86.5 bits (213), Expect = 6e-15, Method: Composition-based stats.
Identities = 8/111 (7%), Positives = 27/111 (24%), Gaps = 11/111 (9%)
Query: 18 LLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74
L++ D + + + +P+ + F+ +
Sbjct: 261 LMKYDYNKVVRNYDTPENKLENCYPVITPGFDRTPRAGRRAGIYVNSSPKNFKKHVAE-- 318
Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K + R+ F + E + ++ + + +
Sbjct: 319 ----VCKSIQDKDDDHRLVFLSAWNEWGEGNYMEPDLKWGHGYLEALKSVV 365
>gi|265751844|ref|ZP_06087637.1| radical SAM domain-containing protein [Bacteroides sp. 3_1_33FAA]
gi|263236636|gb|EEZ22106.1| radical SAM domain-containing protein [Bacteroides sp. 3_1_33FAA]
Length = 367
Score = 86.5 bits (213), Expect = 7e-15, Method: Composition-based stats.
Identities = 15/122 (12%), Positives = 34/122 (27%), Gaps = 10/122 (8%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYI--PAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
S L ++R + I Y W SP+ ++ +H ++
Sbjct: 243 SYLFPFPINVIRYSKAIDKMVDDILFRKSKIYPIIYPNWDHSPRAGNSASIMHGSTPQLW 302
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ + S + +I F S E + +L + + ++
Sbjct: 303 GKLLEKVISLIH------DKDEGDQIIFIKSWNEWGEGNYLEPDLKYGRGYLDVMNKMLR 356
Query: 125 YV 126
Sbjct: 357 KE 358
>gi|322433407|ref|YP_004210624.1| lipopolysaccharide biosynthesis protein-like protein
[Acidobacterium sp. MP5ACTX9]
gi|321165796|gb|ADW71497.1| lipopolysaccharide biosynthesis protein-like protein
[Acidobacterium sp. MP5ACTX9]
Length = 381
Score = 86.5 bits (213), Expect = 7e-15, Method: Composition-based stats.
Identities = 9/113 (7%), Positives = 31/113 (27%), Gaps = 8/113 (7%)
Query: 13 KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFW 72
+ + DV + + W +P+ + +F + +
Sbjct: 269 RRPTRIRYKDVVARALEDMPQEERFLPCVLPGWDNTPRSSHRGVIFEGETPELFRTLLQ- 327
Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ ++ RI F + E + ++ + ++ +
Sbjct: 328 -----KAVQHVSVNSVEQRIVFLKAWNEWAEGNYVEPDVLHGHAYLDVIRSVV 375
>gi|325105038|ref|YP_004274692.1| polysaccharide biosynthesis protein [Pedobacter saltans DSM 12145]
gi|324973886|gb|ADY52870.1| polysaccharide biosynthesis protein [Pedobacter saltans DSM 12145]
Length = 368
Score = 86.2 bits (212), Expect = 8e-15, Method: Composition-based stats.
Identities = 9/117 (7%), Positives = 25/117 (21%), Gaps = 8/117 (6%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
K K ++ E + W + +++ + F
Sbjct: 256 KKRVKQPTIIDYAKFTEFDSSLVNKPYKLYPCVSPGWDNTARKKENGIVFINSTPTNF-- 313
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
W + + + + F + E + L + +
Sbjct: 314 -YNWTKKKIKKFQ---PYSKEENLLFINAWNEWAEGNHLEPCNKNGLGYLKALKKAL 366
>gi|167745516|ref|ZP_02417643.1| hypothetical protein ANACAC_00207 [Anaerostipes caccae DSM 14662]
gi|167655237|gb|EDR99366.1| hypothetical protein ANACAC_00207 [Anaerostipes caccae DSM 14662]
Length = 382
Score = 85.4 bits (210), Expect = 1e-14, Method: Composition-based stats.
Identities = 16/122 (13%), Positives = 35/122 (28%), Gaps = 13/122 (10%)
Query: 7 LKSKLGKIENLLLRLDVEEKGN---MQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63
L+ G I +L R + N M + + SP+ + +
Sbjct: 264 LRKYFGGI--VLDRYKYDTIMNHFIMPEDFEESIYPQLIPKRDRSPRSGRKAMIYYGSTP 321
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
F+S + + R+ F + E + A++ + + + E
Sbjct: 322 EKFKSAAENAIKCV------EGRDKEHRLIFLNAWNEWGEGAYMEPDLKFGHGYLEALKE 375
Query: 122 KF 123
Sbjct: 376 IL 377
>gi|225548129|ref|ZP_03769414.1| hypothetical protein RUMHYD_00108 [Blautia hydrogenotrophica DSM
10507]
gi|225040805|gb|EEG51051.1| hypothetical protein RUMHYD_00108 [Blautia hydrogenotrophica DSM
10507]
Length = 379
Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats.
Identities = 13/120 (10%), Positives = 33/120 (27%), Gaps = 8/120 (6%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K G + + D+ + Y SP+ + + +F+
Sbjct: 266 KYFGGMVLDKYRYSDIIKHFITPEDYSERIYPQLIPRRDRSPRSGRKAMIYYDSTPELFK 325
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
+ K + + R+ F + E + A++ + + + E
Sbjct: 326 ------LAAENAVKCVEKRDKNHRLIFLNAWNEWGEGAYMEPDLRFGHKYIEALREVLTN 379
>gi|332180567|gb|AEE16255.1| hypothetical protein Trebr_0819 [Treponema brennaborense DSM 12168]
Length = 366
Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats.
Identities = 16/117 (13%), Positives = 32/117 (27%), Gaps = 8/117 (6%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
K KI L+ ++ + + G W +P+ L +F+
Sbjct: 252 KFLKIPRLVNYKEIVKYAVSEKDKRNDFYPGIVCTWDHTPRSGRNGMVFINFSLKLFKE- 310
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ K +I F S E + F+ + ++ E
Sbjct: 311 -----HICTVLELVKNKPEQEQIVFLKSWNEWGEGNFMEPDIEYGKGKVDTLKEAIH 362
>gi|237808791|ref|YP_002893231.1| polysaccharide biosynthesis protein [Tolumonas auensis DSM 9187]
gi|237501052|gb|ACQ93645.1| polysaccharide biosynthesis protein [Tolumonas auensis DSM 9187]
Length = 370
Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats.
Identities = 12/124 (9%), Positives = 35/124 (28%), Gaps = 9/124 (7%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
LK+K+ + + V + W + +++ + +
Sbjct: 254 LKTKVSAVNKVNYAALVSNMVKKSWPKTYRKFPCVFPSWDNTARRKTPTVIQNLDS---- 309
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ WL + + +I F + E + L +R + + + +
Sbjct: 310 NVYARWLEYAVDSVSS---YPENEKIVFINAWNEWAEGCHLEPDRKVGRAFLEATKQVVE 366
Query: 125 YVKE 128
+
Sbjct: 367 RPSK 370
>gi|23098585|ref|NP_692051.1| hypothetical protein OB1130 [Oceanobacillus iheyensis HTE831]
gi|22776811|dbj|BAC13086.1| hypothetical protein [Oceanobacillus iheyensis HTE831]
Length = 531
Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats.
Identities = 13/114 (11%), Positives = 33/114 (28%), Gaps = 9/114 (7%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71
GK + L E + G + W + + + + H + F+++
Sbjct: 421 GKAKYLDYDRIWESILSRNNKQHKKVFLGAFTDWDNTARMQSSGTIYHGATPAKFKNY-- 478
Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
L+ + F + E + A+L ++ + +
Sbjct: 479 -----LSRQIDRANNVYDSEFLFINAWNEWAEGAYLEPDKKFKYGYLEAVRDAL 527
>gi|58038685|ref|YP_190649.1| hypothetical protein GOX0204 [Gluconobacter oxydans 621H]
gi|58001099|gb|AAW59993.1| Hypothetical protein GOX0204 [Gluconobacter oxydans 621H]
Length = 1260
Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats.
Identities = 18/147 (12%), Positives = 48/147 (32%), Gaps = 17/147 (11%)
Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69
G++ + +D K Q + W +++ +H ++E
Sbjct: 489 FSGQVYDYGEVVD---KALAQPRTPFPLIRTAAPSWDNDARRQGKGLVLHGSTPELYE-- 543
Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
WL + ++ +F + + E + A+L ++ ++ + +
Sbjct: 544 -RWLSGLIEQAQSR--TFFGDPVVCINAWNEWAKGAYLEPDQHFGSAYLNATARACTGAG 600
Query: 128 E-------LFEGWNDRPSSPKKSGLTI 147
+ L G + P+ ++ L I
Sbjct: 601 KNRSRSGILLIGHDAFPAGAQRLLLEI 627
>gi|237712790|ref|ZP_04543271.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|237718379|ref|ZP_04548860.1| radical SAM [Bacteroides sp. 2_2_4]
gi|262408851|ref|ZP_06085396.1| radical SAM domain-containing protein [Bacteroides sp. 2_1_22]
gi|293370137|ref|ZP_06616700.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|294643855|ref|ZP_06721647.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294810735|ref|ZP_06769383.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|229447118|gb|EEO52909.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|229452312|gb|EEO58103.1| radical SAM [Bacteroides sp. 2_2_4]
gi|262353062|gb|EEZ02157.1| radical SAM domain-containing protein [Bacteroides sp. 2_1_22]
gi|292634789|gb|EFF53315.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292640797|gb|EFF59023.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294442068|gb|EFG10887.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 360
Score = 84.6 bits (208), Expect = 2e-14, Method: Composition-based stats.
Identities = 11/124 (8%), Positives = 29/124 (23%), Gaps = 8/124 (6%)
Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
++ KL + + + I + SP+ + +
Sbjct: 243 KILRKLLRKPITIEYSQYSQYLLNNYIVNENVYPSICPNYDHSPRSKFRGTIIVNSTPQ- 301
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
W + + + F + E + +L + + +
Sbjct: 302 -----KWKKLCHEMFSKVSVRSAEDNLVFIKAWNEWGEGNYLEPDLKYGTQFLDVIRDVL 356
Query: 124 LYVK 127
VK
Sbjct: 357 EKVK 360
>gi|329944274|ref|ZP_08292533.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str.
F0386]
gi|328531004|gb|EGF57860.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str.
F0386]
Length = 699
Score = 84.2 bits (207), Expect = 3e-14, Method: Composition-based stats.
Identities = 31/209 (14%), Positives = 63/209 (30%), Gaps = 33/209 (15%)
Query: 163 WIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLL 220
+++ L L + + VT D E+ + ++ ++ +G FL
Sbjct: 341 ADDLAERLASLPEHWRVVVTSPSELNAADLERVTGRRTTFRKVRDLDPRG--TIAFLTEC 398
Query: 221 ELG------------------------VFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256
+ DR D + I G + RR +
Sbjct: 399 DDLWDPAHAGDVGASDGGDGTDTTDTAEVDRVDLVLTI--SAGPLSGSSERADDVARRQV 456
Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316
LL +++ F ++P LG++ + + + + L++R G
Sbjct: 457 LDCLLASPGYVAGLLDLFGRHPSLGVVMPAACHIGQPYV-GPQWDGLVGAADALSRRLGL 515
Query: 317 PTKRLH--LDFFNGTMFWVKPKCLEPLRN 343
G+MF +P+ L L
Sbjct: 516 TAALDEIAPVAPVGSMFLARPEALRTLSE 544
>gi|313890159|ref|ZP_07823794.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN
20026]
gi|313121520|gb|EFR44624.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN
20026]
Length = 359
Score = 83.8 bits (206), Expect = 4e-14, Method: Composition-based stats.
Identities = 11/120 (9%), Positives = 30/120 (25%), Gaps = 8/120 (6%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
+K K+ + + + + + + W SP+ + + + F
Sbjct: 242 IKRKVFRRPTVFKYKEAIKYMIDDSAKDENVIPVVAPNWDHSPRSGNNAMILDNAKPKYF 301
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ K + S + S E + L + + +
Sbjct: 302 ADLLKE------TVKTVRSKPRSKQQVIIKSWNEWGEGNHLEPDLKYGLGYLEAVKKSIE 355
>gi|317476949|ref|ZP_07936191.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides
eggerthii 1_2_48FAA]
gi|316906742|gb|EFV28454.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides
eggerthii 1_2_48FAA]
Length = 360
Score = 83.8 bits (206), Expect = 4e-14, Method: Composition-based stats.
Identities = 13/114 (11%), Positives = 29/114 (25%), Gaps = 9/114 (7%)
Query: 3 KVF-RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61
K F +L S + I + V + W +P+ ++
Sbjct: 250 KCFDKLYSIVTGIPRIANYKSVSSHFIGKEEMEDNIYPTIIPNWDHTPRSGFNGYVLNNS 309
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSN 113
+F + + + I F S E + ++ +
Sbjct: 310 TPELFRFHVRKALATTLQKR------ADNMIVFLKSWNEWGEGNYMEPDLKYGK 357
>gi|326403402|ref|YP_004283483.1| putative glycosyltransferase [Acidiphilium multivorum AIU301]
gi|325050263|dbj|BAJ80601.1| putative glycosyltransferase [Acidiphilium multivorum AIU301]
Length = 1247
Score = 83.1 bits (204), Expect = 7e-14, Method: Composition-based stats.
Identities = 11/130 (8%), Positives = 41/130 (31%), Gaps = 13/130 (10%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
+ + + ++++ + Y + W P++ +H + +
Sbjct: 482 FSADVYRYDDIV----AASLADPDPAY--PLIRTAVPGWDNDPRREGAGVVLHEATPAAY 535
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ WL + + ++ + + I + E + A+L + + + +
Sbjct: 536 Q---AWLAALIERARRAPV--HGEPIVCINAWNEWAEGAYLEPDLHFGAAFLNATARAIT 590
Query: 125 YVKELFEGWN 134
+ + N
Sbjct: 591 GRADAADAQN 600
>gi|148259629|ref|YP_001233756.1| glycosyl transferase, group 1 [Acidiphilium cryptum JF-5]
gi|146401310|gb|ABQ29837.1| glycosyl transferase, group 1 [Acidiphilium cryptum JF-5]
Length = 1247
Score = 83.1 bits (204), Expect = 7e-14, Method: Composition-based stats.
Identities = 11/130 (8%), Positives = 41/130 (31%), Gaps = 13/130 (10%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
+ + + ++++ + Y + W P++ +H + +
Sbjct: 482 FSADVYRYDDIV----AASLADPDPAY--PLIRTAVPGWDNDPRREGAGVVLHEATPAAY 535
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ WL + + ++ + + I + E + A+L + + + +
Sbjct: 536 Q---AWLAALIERARRAPV--HGEPIVCINAWNEWAEGAYLEPDLHFGAAFLNATARAIT 590
Query: 125 YVKELFEGWN 134
+ + N
Sbjct: 591 GRADAADAQN 600
>gi|237727673|ref|ZP_04558154.1| polysaccharide biosynthesis protein [Bacteroides sp. D4]
gi|229434529|gb|EEO44606.1| polysaccharide biosynthesis protein [Bacteroides dorei 5_1_36/D4]
Length = 363
Score = 83.1 bits (204), Expect = 8e-14, Method: Composition-based stats.
Identities = 11/95 (11%), Positives = 27/95 (28%), Gaps = 9/95 (9%)
Query: 38 VSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
W SP+++ +++ WL+ L + + F
Sbjct: 275 FPCVSPGWDNSPRRKKPPYTAFIGSTPCLYKK---WLKDTL---IRFQPFSEEENLVFIN 328
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129
+ E + L ++ + E K++
Sbjct: 329 AWNEWAEGNHLEPDQKWGRKYLEVTKEAIDETKDI 363
>gi|306831232|ref|ZP_07464393.1| glycosyltransferase [Streptococcus gallolyticus subsp. gallolyticus
TX20005]
gi|304426798|gb|EFM29909.1| glycosyltransferase [Streptococcus gallolyticus subsp. gallolyticus
TX20005]
Length = 381
Score = 83.1 bits (204), Expect = 8e-14, Method: Composition-based stats.
Identities = 14/107 (13%), Positives = 29/107 (27%), Gaps = 8/107 (7%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
D+ N + + SP+ + + F + S +
Sbjct: 278 YKDIIRSFNTKEDFQENIYPQLIPGRDRSPRSGKKAVIYYENTPEEFRIAVKNAISCV-- 335
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ P RI F S E + A++ + + E+
Sbjct: 336 ----EKRNPEHRIIFLNSWNEWAEGAYMEPDTTYGKRYIQVLREELE 378
>gi|300728504|ref|ZP_07061863.1| conserved hypothetical protein [Prevotella bryantii B14]
gi|299774222|gb|EFI70855.1| conserved hypothetical protein [Prevotella bryantii B14]
Length = 369
Score = 82.7 bits (203), Expect = 9e-14, Method: Composition-based stats.
Identities = 8/94 (8%), Positives = 22/94 (23%), Gaps = 10/94 (10%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
+ W +P+ + + F + + +I F
Sbjct: 284 VIPQLLPQWDHTPRSGWNGTLLINCKPEYFYEHSKEALNIV--------KNKQNKIIFLK 335
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
S E + + + + + +E
Sbjct: 336 SWNEWGEGNMMEPDLTYGRGFINALRKAVDEYEE 369
>gi|68643231|emb|CAI33513.1| conserved hypothetical protein [Streptococcus pneumoniae]
Length = 381
Score = 82.7 bits (203), Expect = 1e-13, Method: Composition-based stats.
Identities = 15/112 (13%), Positives = 31/112 (27%), Gaps = 13/112 (11%)
Query: 17 LLLRLDVEEKGNMQAIY---IPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73
LL R D + ++G +V W + + + FE ++ L
Sbjct: 274 LLDRRDYDATWTNIINRPIKDNKMIAGAFVDWDNTAR-NKNGRVFDGANPEKFEGYMRQL 332
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ I F + E + A+L ++ +
Sbjct: 333 IEKI-------QKEYQSEIVFINAWNEWAEGAYLEPDKKHGYGYLEALKTVI 377
>gi|320531345|ref|ZP_08032317.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str.
F0337]
gi|320136436|gb|EFW28412.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str.
F0337]
Length = 678
Score = 82.7 bits (203), Expect = 1e-13, Method: Composition-based stats.
Identities = 39/287 (13%), Positives = 70/287 (24%), Gaps = 52/287 (18%)
Query: 102 KAFLRLNRFMSNSRMPFDSEKFLYVKE-------------LFEGWNDRPSSPKKSGLTIK 148
L S S+ + P+
Sbjct: 243 GELLEDAARAGYSEDLILSDVVHNAPARDLIVNAGLTEVVVEAAPAPDEPDPEAGSTAPT 302
Query: 149 SKIAIVVHCYYQD--------TWIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYF 198
+VVH ++ L L + + VT D E+ +
Sbjct: 303 PSGCVVVHV--PAGGEGVERAEADGLAQRLASLPAHWRVVVTSPTHLDAADLERLTGRRP 360
Query: 199 PSA------------QLYVMENKGRDVRPFLYLLELGVFDRY--------DYLCKIHGKK 238
+ ++ +G PFL D + +I
Sbjct: 361 ADEAAAPGGAAVAFRAVRDLDPRG--TIPFLTECGDLWDPGRATGSDGGGDLVLRI-TVG 417
Query: 239 SQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA 298
S + + RR + LL +I+ FE++P LG+ +
Sbjct: 418 SPSGPESKAD-DVARRQVLDCLLASPGYTAGLIDLFERHPGLGVAMPAASHIGQAH-GGP 475
Query: 299 KRSEVYRRVIDLAKRAGF--PTKRLHLDFFNGTMFWVKPKCLEPLRN 343
+ L++R G + G MF +P+ L L
Sbjct: 476 TWDGLAGAAKTLSRRLGLTVELDPVAPVVPVGAMFMARPEALRTLSE 522
>gi|24637409|gb|AAN63687.1|AF454495_12 Eps4K [Streptococcus thermophilus]
Length = 384
Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats.
Identities = 10/88 (11%), Positives = 22/88 (25%), Gaps = 9/88 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
+ G +V W + + + F+ ++ L S F +
Sbjct: 295 IPGAFVEWDNTSRHGDRGRVYDGATPQKFQKYMSALI-------KKTKSEYHKDYIFINA 347
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + A L + +
Sbjct: 348 WNEWAEGAHLEPDEKNKYGYLEALKNAL 375
>gi|288803643|ref|ZP_06409073.1| glycosyl transferase, group 2 family [Prevotella melaninogenica
D18]
gi|288333883|gb|EFC72328.1| glycosyl transferase, group 2 family [Prevotella melaninogenica
D18]
Length = 369
Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats.
Identities = 10/95 (10%), Positives = 24/95 (25%), Gaps = 9/95 (9%)
Query: 37 HVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95
+ W SP+ + + F L + K +I
Sbjct: 281 IIPQIVPQWDHSPRSEHAADLIYYNSTPESF------YLHCLDAFEVLKDKSEDEQILIL 334
Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
S E + ++ + + + + V +
Sbjct: 335 KSWNEWGEGNYMEPDISNGDGYIKALRKALNKVSK 369
>gi|256392765|ref|YP_003114329.1| lipopolysaccharide biosynthesis protein-like protein [Catenulispora
acidiphila DSM 44928]
gi|256358991|gb|ACU72488.1| lipopolysaccharide biosynthesis protein-like protein [Catenulispora
acidiphila DSM 44928]
Length = 357
Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats.
Identities = 12/94 (12%), Positives = 29/94 (30%), Gaps = 9/94 (9%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
+ +P+ +H + IFE + + + + + P R+ F S
Sbjct: 271 HPCVVPGFDNTPRSGRRGVLLHHPDPEIFE-------AAVTEAVRREQAMPDPRMLFIKS 323
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129
E + + + ++ S + L
Sbjct: 324 WNEWAEGSVMEPDQHFGRSFLRALRRGLDVRPPL 357
>gi|330996598|ref|ZP_08320478.1| hypothetical protein HMPREF9442_01565 [Paraprevotella xylaniphila
YIT 11841]
gi|329572832|gb|EGG54459.1| hypothetical protein HMPREF9442_01565 [Paraprevotella xylaniphila
YIT 11841]
Length = 386
Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats.
Identities = 11/112 (9%), Positives = 31/112 (27%), Gaps = 12/112 (10%)
Query: 16 NLLLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIF 71
+ +L +D + + + + SP+ + H +F +
Sbjct: 278 DYVLHIDYAKIIRNYYVENDKMENIYPTIIPNFDRSPRSGKKTNNIWHGSTPKLFGKMVE 337
Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+ K +I F S E + ++ + + + +
Sbjct: 338 QALDLI------KDKQDEHKILFLQSWNEWGEGNYMEPDLKFGHGYIDILGK 383
>gi|228937557|ref|ZP_04100197.1| Glycosyltransferase [Bacillus thuringiensis serovar berliner ATCC
10792]
gi|228970444|ref|ZP_04131097.1| Glycosyltransferase [Bacillus thuringiensis serovar thuringiensis
str. T01001]
gi|228789273|gb|EEM37199.1| Glycosyltransferase [Bacillus thuringiensis serovar thuringiensis
str. T01001]
gi|228822111|gb|EEM68099.1| Glycosyltransferase [Bacillus thuringiensis serovar berliner ATCC
10792]
gi|326938048|gb|AEA13944.1| glycosyltransferase [Bacillus thuringiensis serovar chinensis
CT-43]
Length = 120
Score = 81.5 bits (200), Expect = 2e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 23 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 75
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 76 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 107
>gi|291520445|emb|CBK75666.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens
16/4]
Length = 109
Score = 81.1 bits (199), Expect = 2e-13, Method: Composition-based stats.
Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 5/94 (5%)
Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYF--PSAQL 203
+++ A+ + ++ D + E L D++V K + + K + ++
Sbjct: 13 QNRYAVFAYLFFDDLFEESLRYFSNLPNYVDIYVATNTEEKVDVINGYIPKMLFRHNVKV 72
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGK 237
+ NKGRDV L LL+ YD +C +H K
Sbjct: 73 LLHNNKGRDVSALLVLLKRYY-SNYDVICFVHDK 105
>gi|319784640|ref|YP_004144116.1| hypothetical protein Mesci_4961 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
gi|317170528|gb|ADV14066.1| hypothetical protein Mesci_4961 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length = 936
Score = 81.1 bits (199), Expect = 3e-13, Method: Composition-based stats.
Identities = 13/120 (10%), Positives = 42/120 (35%), Gaps = 10/120 (8%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
+ W + + + S V ++ +E WLR + ++ ++ ++ F
Sbjct: 262 IYRTVFPDWDNTARVKNRSLIVLGSTVANYE---RWLRGSSSLTRANRA--EGDQLVFIN 316
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIV 154
+ E + +L +R + + + +P++ ++ ++A +
Sbjct: 317 AWNEWAEGCYLEPDRRHGRGFLEAT---LRVKNGMSMVDDIYDVAPERVRFELRQQLAAI 373
>gi|42779379|ref|NP_976626.1| hypothetical protein BCE_0298 [Bacillus cereus ATCC 10987]
gi|42735295|gb|AAS39234.1| conserved domain protein [Bacillus cereus ATCC 10987]
Length = 358
Score = 81.1 bits (199), Expect = 3e-13, Method: Composition-based stats.
Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
L+ S + F + E + +L ++ + + +
Sbjct: 297 ------LSKQIQRTYSLYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345
>gi|229194650|ref|ZP_04321445.1| Glycosyltransferase [Bacillus cereus m1293]
gi|228588820|gb|EEK46843.1| Glycosyltransferase [Bacillus cereus m1293]
Length = 358
Score = 80.8 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
L+ S + F + E + +L ++ + + +
Sbjct: 297 ------LSKQIQRTYSVYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345
>gi|313199878|ref|YP_004038536.1| polysaccharide biosynthesis protein [Methylovorus sp. MP688]
gi|312439194|gb|ADQ83300.1| polysaccharide biosynthesis protein [Methylovorus sp. MP688]
Length = 379
Score = 80.8 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 15/88 (17%), Positives = 27/88 (30%), Gaps = 8/88 (9%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
W S ++R + + + +FE WLR+ S RI F +
Sbjct: 293 FPCVVPSWDKSARRRAGATVIQNHDPKLFE---LWLRNA---SSRVSKYPKDERIIFINA 346
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + L + + + F
Sbjct: 347 WNEWAEGCHLEPDLRHGHQFLEAVRNVF 374
>gi|114571025|ref|YP_757705.1| glycosyl transferase family protein [Maricaulis maris MCS10]
gi|114341487|gb|ABI66767.1| glycosyl transferase, family 2 [Maricaulis maris MCS10]
Length = 882
Score = 80.8 bits (198), Expect = 4e-13, Method: Composition-based stats.
Identities = 14/117 (11%), Positives = 32/117 (27%), Gaps = 10/117 (8%)
Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K GK+ ++ E A H + W S ++ F+
Sbjct: 768 KDFYGKLYSV--DGAYEALVRRGAPAW-RHFHSAFTGWDNSARRGDRGDIFLGDCPGKFQ 824
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+ + + K L + F + E + +L + ++ +
Sbjct: 825 ALLE-----VQMRKAKALGAAGEKAIFINAWNEWAEGTYLEPDLHHGHAWLEAVRNA 876
>gi|312131802|ref|YP_003999142.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM
17132]
gi|311908348|gb|ADQ18789.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM
17132]
Length = 361
Score = 80.8 bits (198), Expect = 4e-13, Method: Composition-based stats.
Identities = 15/121 (12%), Positives = 35/121 (28%), Gaps = 9/121 (7%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
L+ + + +E+ + I V + + ++ + + Q + F
Sbjct: 245 LQGVINPTLKIYDYKQYKERAKIHKIKYKG-FPCPIVGFDNTARKGKNAVILKNQNVEDF 303
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
++ S + + K +I F S E + L + E F
Sbjct: 304 KA------SLIDAVEDVKEFPEEEQIVFINSWNEWAEGNHLEPCVKFGRQFLEAVKEVFS 357
Query: 125 Y 125
Sbjct: 358 K 358
>gi|30018522|ref|NP_830153.1| glycosyltransferase [Bacillus cereus ATCC 14579]
gi|29894062|gb|AAP07354.1| Glycosyltransferase [Bacillus cereus ATCC 14579]
Length = 358
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345
>gi|325265289|ref|ZP_08132014.1| glycosyl transferase, group 2 family [Clostridium sp. D5]
gi|324029468|gb|EGB90758.1| glycosyl transferase, group 2 family [Clostridium sp. D5]
Length = 369
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 17/126 (13%), Positives = 38/126 (30%), Gaps = 14/126 (11%)
Query: 4 VFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP---AHVSGYYVLWSFSPKQRITSKDVHF 60
V +LK K K+ ++ D ++ P + G +V W +P+ + +
Sbjct: 244 VNKLKIKQTKLSTIIF--DYDKAWKNILDMKPRDDKMIPGAFVDWDNTPRYKKLASVFRG 301
Query: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPF 118
F+ + L+ + I F + E + +L + +
Sbjct: 302 VTPEKFKYY-------LSRQIQNAKRVYRKDIIFMFAWNEWGEGGYLEPDEKNGYKMLDA 354
Query: 119 DSEKFL 124
Sbjct: 355 IKSALE 360
>gi|296501094|ref|YP_003662794.1| glycosyltransferase [Bacillus thuringiensis BMB171]
gi|296322146|gb|ADH05074.1| glycosyltransferase [Bacillus thuringiensis BMB171]
Length = 358
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345
>gi|229042160|ref|ZP_04189916.1| Glycosyltransferase [Bacillus cereus AH676]
gi|228727172|gb|EEL78373.1| Glycosyltransferase [Bacillus cereus AH676]
Length = 358
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345
>gi|47570410|ref|ZP_00241048.1| glycosyltransferase [Bacillus cereus G9241]
gi|47552914|gb|EAL11327.1| glycosyltransferase [Bacillus cereus G9241]
Length = 182
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 62 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 120
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
L+ Y S + F + E + +L ++ + + + +
Sbjct: 121 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 174
Query: 129 LFEG 132
++
Sbjct: 175 AYKK 178
>gi|218231858|ref|YP_002365107.1| hypothetical protein BCB4264_A0319 [Bacillus cereus B4264]
gi|229148661|ref|ZP_04276913.1| Glycosyltransferase [Bacillus cereus m1550]
gi|218159815|gb|ACK59807.1| conserved hypothetical protein [Bacillus cereus B4264]
gi|228634798|gb|EEK91375.1| Glycosyltransferase [Bacillus cereus m1550]
Length = 358
Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345
>gi|206972729|ref|ZP_03233664.1| conserved hypothetical protein [Bacillus cereus AH1134]
gi|206732341|gb|EDZ49528.1| conserved hypothetical protein [Bacillus cereus AH1134]
Length = 358
Score = 80.0 bits (196), Expect = 6e-13, Method: Composition-based stats.
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%)
Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93
G +V W + +++ S F ++ SF +
Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313
Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + +L ++ S +
Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345
>gi|324324277|gb|ADY19537.1| hypothetical protein YBT020_01425 [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 358
Score = 80.0 bits (196), Expect = 6e-13, Method: Composition-based stats.
Identities = 10/98 (10%), Positives = 29/98 (29%), Gaps = 10/98 (10%)
Query: 29 MQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSF 87
++ G +V W + +++ + S F + L+ S
Sbjct: 255 KRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY-------LSKQIQRTYSL 307
Query: 88 PSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ F + E + +L ++ + + +
Sbjct: 308 YNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345
>gi|253581532|ref|ZP_04858757.1| methyltransferase type 11 [Fusobacterium varium ATCC 27725]
gi|251836602|gb|EES65137.1| methyltransferase type 11 [Fusobacterium varium ATCC 27725]
Length = 356
Score = 79.6 bits (195), Expect = 8e-13, Method: Composition-based stats.
Identities = 9/117 (7%), Positives = 27/117 (23%), Gaps = 15/117 (12%)
Query: 17 LLLRLDVEEKGNM-----QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71
+ + EE G + W + + + +F+ ++
Sbjct: 247 FVQKYKYEEFLKKSIDISNEFLNKKIYPGIFTGWDNTSRHGRRGYVIERNTPKLFKKYLL 306
Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
+ + + F + E + +L + + E
Sbjct: 307 EEKKIM--------KEKNIDYIFLNAWNEWAEGMYLEPDEKFKYGYLEAIKEVMETE 355
>gi|212694719|ref|ZP_03302847.1| hypothetical protein BACDOR_04251 [Bacteroides dorei DSM 17855]
gi|237727302|ref|ZP_04557783.1| conserved hypothetical protein [Bacteroides sp. D4]
gi|212662698|gb|EEB23272.1| hypothetical protein BACDOR_04251 [Bacteroides dorei DSM 17855]
gi|229434158|gb|EEO44235.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4]
Length = 352
Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats.
Identities = 16/118 (13%), Positives = 31/118 (26%), Gaps = 11/118 (9%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66
K KLG + D Q + W SP+ S + ++F
Sbjct: 237 FKHKLGALHTY-KYEDALRYFVSQEDKAENIIPTIISGWDHSPRAGENSLILTNYTPALF 295
Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+ + + L +I F + E + L + + +
Sbjct: 296 QKHLENVFDIL--------VQKENKICFIKAWNEWGEGNHLEPDLKYGLDFLKTLKQV 345
>gi|148927813|ref|ZP_01811238.1| Lipopolysaccharide biosynthesis protein-like protein [candidate
division TM7 genomosp. GTL1]
gi|147886839|gb|EDK72384.1| Lipopolysaccharide biosynthesis protein-like protein [candidate
division TM7 genomosp. GTL1]
Length = 468
Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats.
Identities = 12/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
G W + +++ T + F S++ +LR++ ++ F
Sbjct: 307 YTLYRGIIPSWDNTARRQDTGTIIVNATPEFFGSWLKFLRAYTRETRPGASDP----FIF 362
Query: 95 YGSRKE--QKAFLRLNRFMSNSRM-PFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSK 150
+ E + L + + ++ ++L R ++ ++
Sbjct: 363 VNAWNEWGEGCHLEPDVQWGLGYLDEVARSSYISSEDLLPVDQARAAAFRRIEQIAARD 421
>gi|206978430|ref|ZP_03239298.1| conserved hypothetical protein [Bacillus cereus H3081.97]
gi|217957833|ref|YP_002336377.1| hypothetical protein BCAH187_A0334 [Bacillus cereus AH187]
gi|222094032|ref|YP_002528086.1| glycosyltransferase [Bacillus cereus Q1]
gi|206743362|gb|EDZ54801.1| conserved hypothetical protein [Bacillus cereus H3081.97]
gi|217068322|gb|ACJ82572.1| conserved hypothetical protein [Bacillus cereus AH187]
gi|221238084|gb|ACM10794.1| glycosyltransferase [Bacillus cereus Q1]
Length = 358
Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats.
Identities = 10/98 (10%), Positives = 29/98 (29%), Gaps = 10/98 (10%)
Query: 29 MQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSF 87
++ G +V W + +++ + S F + L+ S
Sbjct: 255 KRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY-------LSKQIQRTYSL 307
Query: 88 PSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ F + E + +L ++ + + +
Sbjct: 308 YNSEFLFINAWNEWAEGTYLEPDKKHGFAYLEGVKQAI 345
>gi|254724735|ref|ZP_05186518.1| hypothetical protein BantA1_20079 [Bacillus anthracis str. A1055]
Length = 358
Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
L+ Y S + F + E + +L ++ + + + +
Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 350
Query: 129 LFEG 132
++
Sbjct: 351 AYKK 354
>gi|218901466|ref|YP_002449300.1| hypothetical protein BCAH820_0304 [Bacillus cereus AH820]
gi|228925519|ref|ZP_04088610.1| Glycosyltransferase [Bacillus thuringiensis serovar pondicheriensis
BGSC 4BA1]
gi|218535510|gb|ACK87908.1| conserved hypothetical protein [Bacillus cereus AH820]
gi|228834134|gb|EEM79680.1| Glycosyltransferase [Bacillus thuringiensis serovar pondicheriensis
BGSC 4BA1]
Length = 358
Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 13/124 (10%), Positives = 35/124 (28%), Gaps = 10/124 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F ++
Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIYL 297
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
S + F + E + +L ++ + + + +
Sbjct: 298 SKQIH-------RTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAIKRGMK 350
Query: 129 LFEG 132
++
Sbjct: 351 AYKK 354
>gi|196036928|ref|ZP_03104311.1| conserved hypothetical protein [Bacillus cereus W]
gi|228944071|ref|ZP_04106452.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|195990465|gb|EDX54450.1| conserved hypothetical protein [Bacillus cereus W]
gi|228815598|gb|EEM61838.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 358
Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 13/124 (10%), Positives = 35/124 (28%), Gaps = 10/124 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F ++
Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIYL 297
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
S + F + E + +L ++ + + + +
Sbjct: 298 SKQIH-------RTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAIKRGMK 350
Query: 129 LFEG 132
++
Sbjct: 351 AYKK 354
>gi|163938265|ref|YP_001643149.1| hypothetical protein BcerKBAB4_0253 [Bacillus weihenstephanensis
KBAB4]
gi|163860462|gb|ABY41521.1| conserved hypothetical protein [Bacillus weihenstephanensis KBAB4]
Length = 358
Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPRKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
L+ S + F + E + +L ++ + + +
Sbjct: 297 ------LSKQIQRTYSLYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345
>gi|257468312|ref|ZP_05632408.1| hypothetical protein FulcA4_03172 [Fusobacterium ulcerans ATCC
49185]
gi|317062590|ref|ZP_07927075.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185]
gi|313688266|gb|EFS25101.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185]
Length = 355
Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats.
Identities = 7/91 (7%), Positives = 23/91 (25%), Gaps = 10/91 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G + W + + + +F+ ++ + + + F +
Sbjct: 272 FPGVFTGWDNTSRHGRRGYVIKGNTPKLFKEYLLEQKKIM--------KEKNIEYIFLNA 323
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
E + +L + + E
Sbjct: 324 WNEWAEGMYLEPDEKFEYGYLEAVKEIMETE 354
>gi|229154032|ref|ZP_04282159.1| Glycosyltransferase [Bacillus cereus ATCC 4342]
gi|228629429|gb|EEK86129.1| Glycosyltransferase [Bacillus cereus ATCC 4342]
Length = 358
Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats.
Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
L+ Y S + F + E + +L ++ + + + +
Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 350
Query: 129 LFEG 132
++
Sbjct: 351 AYKK 354
>gi|75758487|ref|ZP_00738608.1| Hypothetical protein RBTH_07389 [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|74494014|gb|EAO57109.1| Hypothetical protein RBTH_07389 [Bacillus thuringiensis serovar
israelensis ATCC 35646]
Length = 353
Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats.
Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82
+ N Q G +V W SP+++ ++ + F+ ++
Sbjct: 260 WKRILNRQIKECENIYKGAFVDWDNSPRKKESALIMKGANPDKFKKYLL----------- 308
Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
F + E + +L + + E
Sbjct: 309 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 347
>gi|260172490|ref|ZP_05758902.1| polysaccharide biosynthesis protein [Bacteroides sp. D2]
gi|315920784|ref|ZP_07917024.1| polysaccharide biosynthesis protein [Bacteroides sp. D2]
gi|313694659|gb|EFS31494.1| polysaccharide biosynthesis protein [Bacteroides sp. D2]
Length = 367
Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats.
Identities = 10/96 (10%), Positives = 25/96 (26%), Gaps = 8/96 (8%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
G +W + +++ + E + WL S + F
Sbjct: 275 YKMYPGVTPMWDNTSRRKQKMFILDKSTP---EKYGEWLYSVMNKFV---PYSKDENFVF 328
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
+ E + L + + + ++E
Sbjct: 329 VNAWNEWAEGNHLEPDLKWGFRYLEETEKVVKSMQE 364
>gi|228904942|ref|ZP_04068994.1| Glycosyltransferase [Bacillus thuringiensis IBL 4222]
gi|228854684|gb|EEM99290.1| Glycosyltransferase [Bacillus thuringiensis IBL 4222]
Length = 340
Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats.
Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82
+ N Q G +V W SP+++ ++ + F+ ++
Sbjct: 247 WKRILNRQIKECENIYKGAFVDWDNSPRKKESALIMKGANPDKFKKYLL----------- 295
Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
F + E + +L + + E
Sbjct: 296 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 334
>gi|187732137|ref|YP_001879843.1| WbwX [Shigella boydii CDC 3083-94]
gi|187429129|gb|ACD08403.1| WbwX [Shigella boydii CDC 3083-94]
Length = 361
Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats.
Identities = 13/92 (14%), Positives = 30/92 (32%), Gaps = 10/92 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G +V W S +++ + +H F ++ L Y + +C F +
Sbjct: 277 YPGAFVDWDNSARKKSRALVIHGGSPKKFGLYLDKL--------YKRSIENNCPFLFINA 328
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
E + +L + S + + +
Sbjct: 329 WNEWAEGTYLEPDEKNKYSYLEELKKVIEKYE 360
>gi|213962348|ref|ZP_03390611.1| conserved hypothetical protein [Capnocytophaga sputigena Capno]
gi|213955014|gb|EEB66333.1| conserved hypothetical protein [Capnocytophaga sputigena Capno]
Length = 368
Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats.
Identities = 20/120 (16%), Positives = 36/120 (30%), Gaps = 10/120 (8%)
Query: 4 VFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63
+ RLK K+ K + + E + V W +P+ + SK
Sbjct: 254 IGRLKFKMEKSQKVDYVAFGEALLTLAQQTQDKTYQSIIVDWDNTPRYKNRSKFFVNATP 313
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+ FE F+ L A F + E + A+L + + +
Sbjct: 314 ANFEHFLKELSLIEAAK--------GNEFVFINAWNEWSEGAYLEPDTTYEYQYLDVVKK 365
>gi|62955962|gb|AAY23338.1| WbwX [Shigella boydii]
Length = 327
Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats.
Identities = 13/92 (14%), Positives = 30/92 (32%), Gaps = 10/92 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G +V W S +++ + +H F ++ L Y + +C F +
Sbjct: 243 YPGAFVDWDNSARKKSRALVIHGGSPKKFGLYLDKL--------YKRSIENNCPFLFINA 294
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127
E + +L + S + + +
Sbjct: 295 WNEWAEGTYLEPDEKNKYSYLEELKKVIEKYE 326
>gi|298480506|ref|ZP_06998703.1| glycosyl transferase, group 2 family [Bacteroides sp. D22]
gi|298273327|gb|EFI14891.1| glycosyl transferase, group 2 family [Bacteroides sp. D22]
Length = 365
Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats.
Identities = 9/91 (9%), Positives = 22/91 (24%), Gaps = 8/91 (8%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
G +W + +++ + E + WL S + F
Sbjct: 275 YKMYPGVTPMWDNTSRRKQKMFILDKSTP---EKYGEWLYSVMNKFV---PYSKDENFVF 328
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ E + L + + +
Sbjct: 329 VNAWNEWAEGNHLEPDLKWGLRYLEETKKVV 359
>gi|313203616|ref|YP_004042273.1| polysaccharide biosynthesis protein [Paludibacter propionicigenes
WB4]
gi|312442932|gb|ADQ79288.1| polysaccharide biosynthesis protein [Paludibacter propionicigenes
WB4]
Length = 383
Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats.
Identities = 7/94 (7%), Positives = 26/94 (27%), Gaps = 8/94 (8%)
Query: 32 IYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCR 91
Y + W + ++ ++ +F+ ++ + F S +
Sbjct: 252 NYNYPVFRCVFPSWDNTARKNSKGTIFINNDIDVFKYYLQRIVEFTQQSTNK------EK 305
Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
F + E + + + + + +
Sbjct: 306 YIFINAWNEWGEGCHIEPDCRTNFKYLEVIKQTL 339
>gi|39996608|ref|NP_952559.1| hypothetical protein GSU1508 [Geobacter sulfurreducens PCA]
gi|39983489|gb|AAR34882.1| conserved hypothetical protein [Geobacter sulfurreducens PCA]
Length = 381
Score = 76.9 bits (188), Expect = 5e-12, Method: Composition-based stats.
Identities = 12/112 (10%), Positives = 26/112 (23%), Gaps = 9/112 (8%)
Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWL 73
+ + V+ G W S ++R T+ IF+ ++
Sbjct: 273 DVYVYSHLVDNDLKYDFQQGWPIFPGVCPGWDNSARRRDTTAIIFDKSTPEIFKLWVREK 332
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ R F + E + L +
Sbjct: 333 IRITDWNLL------PERFLFVNAWNEWAEGNHLEPCEKWGTQYLAALQAGI 378
>gi|298505623|gb|ADI84346.1| conserved hypothetical protein [Geobacter sulfurreducens KN400]
Length = 372
Score = 76.5 bits (187), Expect = 6e-12, Method: Composition-based stats.
Identities = 12/112 (10%), Positives = 26/112 (23%), Gaps = 9/112 (8%)
Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWL 73
+ + V+ G W S ++R T+ IF+ ++
Sbjct: 264 DVYVYSHLVDNDLKYDFQQGWPIFPGVCPGWDNSARRRDTTAIIFDKSTPEIFKLWVREK 323
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ R F + E + L +
Sbjct: 324 IRITDWNLL------PERFLFVNAWNEWAEGNHLEPCEKWGTQYLAALQAGI 369
>gi|228912320|ref|ZP_04076015.1| Glycosyltransferase [Bacillus thuringiensis IBL 200]
gi|228847303|gb|EEM92262.1| Glycosyltransferase [Bacillus thuringiensis IBL 200]
Length = 340
Score = 76.5 bits (187), Expect = 7e-12, Method: Composition-based stats.
Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82
+ N Q G +V W SP+++ ++ + F+ ++
Sbjct: 247 WKRILNRQIKERENIYKGAFVDWDNSPRKKESALIMEGASPDKFKKYLL----------- 295
Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
F + E + +L + + E
Sbjct: 296 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 334
>gi|212694325|ref|ZP_03302453.1| hypothetical protein BACDOR_03851 [Bacteroides dorei DSM 17855]
gi|212662826|gb|EEB23400.1| hypothetical protein BACDOR_03851 [Bacteroides dorei DSM 17855]
Length = 359
Score = 75.8 bits (185), Expect = 1e-11, Method: Composition-based stats.
Identities = 8/94 (8%), Positives = 21/94 (22%), Gaps = 9/94 (9%)
Query: 38 VSGYYVLWSFSPKQRITS-KDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
+ + ++ ++ WL S K F
Sbjct: 271 FPCVTPNFDNASRRMHKGFTAFIGSTPQLYGK---WLSSVFEKF---KPYSQEENFIFIN 324
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
+ E + L ++ + + K+
Sbjct: 325 AWNEWAEGNHLEPDQKWGRKYLEETKKNIDQYKK 358
>gi|227890975|ref|ZP_04008780.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
gi|227867384|gb|EEJ74805.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
Length = 370
Score = 75.0 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 12/113 (10%), Positives = 32/113 (28%), Gaps = 12/113 (10%)
Query: 16 NLLLRLDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFW 72
N + ++ + P G +V W +P+++ FE ++
Sbjct: 255 NTIRHYKYDDIWKIILKQQPKGDDWYPGAFVDWDNTPRRKNKGSFCDGTSPEKFEYYLTQ 314
Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K ++ + + F + E + +L + +
Sbjct: 315 ------QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVRNAL 360
>gi|114328198|ref|YP_745355.1| glycosyltransferase [Granulibacter bethesdensis CGDNIH1]
gi|114316372|gb|ABI62432.1| glycosyltransferase [Granulibacter bethesdensis CGDNIH1]
Length = 946
Score = 74.6 bits (182), Expect = 3e-11, Method: Composition-based stats.
Identities = 13/132 (9%), Positives = 36/132 (27%), Gaps = 7/132 (5%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75
++ + + + W +P+ + + + + WL
Sbjct: 696 RIVDYHKFASYHMGRPMPEYRRHRTVMLPWDNTPRYGSRAMVHVNTSNNAYRT---WLTQ 752
Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133
+ + P RI F S E + ++ + + V+++
Sbjct: 753 AMLDTHRR--HVPEERIVFLHSWNEWCEGTYVEPDGRYGRHYLNETRAAVQDVRDILSLA 810
Query: 134 NDRPSSPKKSGL 145
+ S + L
Sbjct: 811 SSGESVNALAKL 822
>gi|228946140|ref|ZP_04108475.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228813553|gb|EEM59839.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 340
Score = 74.6 bits (182), Expect = 3e-11, Method: Composition-based stats.
Identities = 8/86 (9%), Positives = 24/86 (27%), Gaps = 15/86 (17%)
Query: 36 AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95
G ++ W SP+++ ++ + F+ ++ F
Sbjct: 260 NIYKGAFIDWDNSPRKKESALILKGANPDKFKKYLL-------------QHSKDTDFLFI 306
Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFD 119
+ E + +L + +
Sbjct: 307 NAWNEWAEGTYLEPDSKYGYKYLEAL 332
>gi|295085474|emb|CBK66997.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 389
Score = 73.4 bits (179), Expect = 5e-11, Method: Composition-based stats.
Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%)
Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ W +P+ + + F +F+ + ++ ++
Sbjct: 297 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 350
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + ++L + +
Sbjct: 351 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVI 381
>gi|237717351|ref|ZP_04547832.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|262406116|ref|ZP_06082666.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|229443334|gb|EEO49125.1| conserved hypothetical protein [Bacteroides sp. D1]
gi|262356991|gb|EEZ06081.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 401
Score = 73.1 bits (178), Expect = 7e-11, Method: Composition-based stats.
Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%)
Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ W +P+ + + F +F+ + ++ ++
Sbjct: 309 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 362
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + ++L + +
Sbjct: 363 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVM 393
>gi|218257974|ref|ZP_03474434.1| hypothetical protein PRABACTJOHN_00087 [Parabacteroides johnsonii
DSM 18315]
gi|218225847|gb|EEC98497.1| hypothetical protein PRABACTJOHN_00087 [Parabacteroides johnsonii
DSM 18315]
Length = 404
Score = 73.1 bits (178), Expect = 7e-11, Method: Composition-based stats.
Identities = 10/108 (9%), Positives = 28/108 (25%), Gaps = 11/108 (10%)
Query: 21 LDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFL 77
E + + + W +P+ ++ Q F SF+ + +
Sbjct: 293 HTWEYVQKWDEAVMIPYFPNASIGWDDTPRFPHKTRKDVVHLNQSPQSFSSFLQKAKEYC 352
Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ + E + A+L + + +
Sbjct: 353 DKH------PDQPKLITVYAWNEWVEGAYLLPDMKYGFDYLNAVKDVM 394
>gi|294647019|ref|ZP_06724633.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294807810|ref|ZP_06766599.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|292637628|gb|EFF56032.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|294444986|gb|EFG13664.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
Length = 407
Score = 73.1 bits (178), Expect = 8e-11, Method: Composition-based stats.
Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%)
Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ W +P+ + + F +F+ + ++ ++
Sbjct: 315 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 368
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
S E + ++L + +
Sbjct: 369 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVM 399
>gi|260172434|ref|ZP_05758846.1| hypothetical protein BacD2_11264 [Bacteroides sp. D2]
gi|315920729|ref|ZP_07916969.1| conserved hypothetical protein [Bacteroides sp. D2]
gi|313694604|gb|EFS31439.1| conserved hypothetical protein [Bacteroides sp. D2]
Length = 403
Score = 72.7 bits (177), Expect = 1e-10, Method: Composition-based stats.
Identities = 11/110 (10%), Positives = 30/110 (27%), Gaps = 11/110 (10%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSF 76
R +E + + W +P+ +K + F +++ + +
Sbjct: 292 RESMERMEKWVEALSVPYFPNASIGWDDTPRFPHKTKKDVVHYNNSPQSFATYLQKAKEY 351
Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124
+ ++ S E + +L + + E L
Sbjct: 352 VDAR------PDLPKLITVFSWNEWIEGGYLLPDMKYGFGYLEAVKEVML 395
>gi|295103156|emb|CBL00700.1| hypothetical protein [Faecalibacterium prausnitzii SL3/3]
Length = 372
Score = 72.7 bits (177), Expect = 1e-10, Method: Composition-based stats.
Identities = 12/87 (13%), Positives = 26/87 (29%), Gaps = 10/87 (11%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G + W SP++ + F+++ L Y K + +
Sbjct: 286 FLGCFCDWDNSPRKSYNCNVMMGVTAEKFKNYFRKL--------YIKAQTIGSPMIVINA 337
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEK 122
E + A+L + + + E
Sbjct: 338 WNEWAEGAYLEPDEKNGYAFLEAIKEA 364
>gi|269839527|ref|YP_003324219.1| hypothetical protein Tter_2508 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791257|gb|ACZ43397.1| hypothetical protein Tter_2508 [Thermobaculum terrenum ATCC
BAA-798]
Length = 381
Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 17/122 (13%), Positives = 37/122 (30%), Gaps = 23/122 (18%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK----------QRITSKDVHFQEL 63
+ ++ R+ ++ G Y P G +P+ + ++ V +
Sbjct: 269 VREVVERVWPKQAGLSALPYWPCVSPGC----DDTPRHLLPRDLEHPRSWRTRPVVGETP 324
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+FE F+ FL R+ GS E + +L + + +
Sbjct: 325 EVFEGFVRAGVEFL-------QGRGGPRVLLIGSWNEWTEGHYLLPDTRLGFGMLRALQR 377
Query: 122 KF 123
Sbjct: 378 AL 379
>gi|302873795|ref|YP_003842428.1| hypothetical protein Clocel_0894 [Clostridium cellulovorans 743B]
gi|307689965|ref|ZP_07632411.1| hypothetical protein Ccel74_17519 [Clostridium cellulovorans 743B]
gi|302576652|gb|ADL50664.1| hypothetical protein Clocel_0894 [Clostridium cellulovorans 743B]
Length = 367
Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 10/114 (8%), Positives = 33/114 (28%), Gaps = 10/114 (8%)
Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70
+G +E+ + E + + G + W +P++ + +F+ ++
Sbjct: 255 IGILESSFSYKNCWENIINRTPKQDNTILGGFTDWDNTPRRSYDGMIMKGTTPELFQYYM 314
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122
+ + + E + A+L + + +
Sbjct: 315 E--------KQMERCKEYKSPFVVINAWNEWAEGAYLEPDEKYGYAFLNAIKNC 360
>gi|218257975|ref|ZP_03474435.1| hypothetical protein PRABACTJOHN_00088 [Parabacteroides johnsonii
DSM 18315]
gi|218225848|gb|EEC98498.1| hypothetical protein PRABACTJOHN_00088 [Parabacteroides johnsonii
DSM 18315]
Length = 414
Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 9/106 (8%), Positives = 27/106 (25%), Gaps = 11/106 (10%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAF 79
+E + W +P+ ++ Q F +F+ + +
Sbjct: 305 LERLQKWDEAVSIPFFPNASIGWDDTPRFPHKTQKDVVHLNQSPQSFAAFLQKAKEYCDK 364
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ + E + A+L + + +
Sbjct: 365 H------PDQPKLITVYAWNEWVEGAYLLPDMKYGFGYLDALKDVM 404
>gi|255015690|ref|ZP_05287816.1| hypothetical protein B2_17433 [Bacteroides sp. 2_1_7]
Length = 400
Score = 71.9 bits (175), Expect = 1e-10, Method: Composition-based stats.
Identities = 13/113 (11%), Positives = 31/113 (27%), Gaps = 11/113 (9%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITS--KDVH-FQELSIFESFIFWLRSFLAF 79
E + + W +P+ + VH Q F +F+ + +
Sbjct: 293 FERLEKWSEAVSIPYFPNASIGWDDTPRFPHKTQKDVVHFNQSPEAFAAFLQKAKEYCDR 352
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELF 130
++ + E + A+L + + + F+ K
Sbjct: 353 H------PEQPKLITVYAWNEWVEGAYLLPDVKYGFGYLNAVKDVFVNGKYQA 399
>gi|90961958|ref|YP_535874.1| glycosyltransferase [Lactobacillus salivarius UCC118]
gi|90821152|gb|ABD99791.1| Glycosyltransferase [Lactobacillus salivarius UCC118]
gi|300214668|gb|ADJ79084.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713]
Length = 371
Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 12/106 (11%), Positives = 29/106 (27%), Gaps = 9/106 (8%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
D+ + Q G +V W +P+++ FE ++
Sbjct: 262 YDDIWKIILKQQPKGKNWYPGAFVDWDNTPRRKHQGSFCDGTSPEKFEYYLT------KQ 315
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K + + + F + E + +L + +
Sbjct: 316 IKRVRDVYHKDYL-FMFAWNEWGESGYLEPDVKNGYKMLEGVRNAL 360
>gi|301301020|ref|ZP_07207181.1| conserved hypothetical protein [Lactobacillus salivarius
ACS-116-V-Col5a]
gi|300851377|gb|EFK79100.1| conserved hypothetical protein [Lactobacillus salivarius
ACS-116-V-Col5a]
Length = 371
Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 9/88 (10%), Positives = 26/88 (29%), Gaps = 9/88 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G + W +P+++ FE ++ K ++ + + F +
Sbjct: 280 YPGAFADWDNTPRRKNKGVFCDGTSPEKFEYYLTQ------QIKRARDIYYKDYL-FMFA 332
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + +L + + +
Sbjct: 333 WNEWGESGYLEPDTKNGYKMLEAVRKAL 360
>gi|300214669|gb|ADJ79085.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713]
Length = 371
Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 11/108 (10%), Positives = 30/108 (27%), Gaps = 12/108 (11%)
Query: 21 LDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77
++ + P G +V W +P+++ FE ++
Sbjct: 260 YSYDDIWKIILKQKPKGKDWYPGSFVDWDNTPRRKNRGSFCDGTSPEKFEYYLTQ----- 314
Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K ++ + + F + E + +L + +
Sbjct: 315 -QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVKNAL 360
>gi|90961959|ref|YP_535875.1| glycosyltransferase [Lactobacillus salivarius UCC118]
gi|90821153|gb|ABD99792.1| Glycosyltransferase [Lactobacillus salivarius UCC118]
Length = 371
Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 11/108 (10%), Positives = 30/108 (27%), Gaps = 12/108 (11%)
Query: 21 LDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77
++ + P G +V W +P+++ FE ++
Sbjct: 260 YSYDDIWKIILKQKPKGKDWYPGSFVDWDNTPRRKNRGSFCDGTSPEKFEYYLTQ----- 314
Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
K ++ + + F + E + +L + +
Sbjct: 315 -QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVKNAL 360
>gi|324991549|gb|EGC23482.1| rhamnosyltransferase [Streptococcus sanguinis SK353]
Length = 556
Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/241 (14%), Positives = 74/241 (30%), Gaps = 29/241 (12%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ + VT E K + + Q+ + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTTDQPEVLKQLQTALGHLGNKVQIVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K R L + + Y Y+ + S G + R L ++ D
Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
A I E+ +G++ R + + R+ + + AG +
Sbjct: 396 ADASIEALEKESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAVWQEAGLHKSFDFIITP 453
Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTEFSI 380
G+ W K L L + + E+ L D +E + +
Sbjct: 454 SLTRVYGSFVWFKYSALASLFQMKSLESLPSFEQELSD-----VLEHLLVYLAWDSHYDF 508
Query: 381 E 381
+
Sbjct: 509 K 509
>gi|269839540|ref|YP_003324232.1| hypothetical protein Tter_2521 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791270|gb|ACZ43410.1| hypothetical protein Tter_2521 [Thermobaculum terrenum ATCC
BAA-798]
Length = 381
Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats.
Identities = 16/122 (13%), Positives = 37/122 (30%), Gaps = 23/122 (18%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK----------QRITSKDVHFQEL 63
+ ++ R+ ++ G Y P G +P+ + ++ V +
Sbjct: 269 VREVVERVWPKQAGLSALPYWPCVSPGC----DDTPRHLLPRDLEHPRSWRTRPVVGETP 324
Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+FE F+ FL ++ GS E + +L + + +
Sbjct: 325 EVFEGFVRAGVEFL-------QGRGGPKVLLIGSWNEWTEGHYLLPDTRLGFGMLRALQR 377
Query: 122 KF 123
Sbjct: 378 AL 379
>gi|323694861|ref|ZP_08109014.1| hypothetical protein HMPREF9475_03878 [Clostridium symbiosum
WAL-14673]
gi|323501087|gb|EGB16996.1| hypothetical protein HMPREF9475_03878 [Clostridium symbiosum
WAL-14673]
Length = 374
Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats.
Identities = 13/86 (15%), Positives = 28/86 (32%), Gaps = 10/86 (11%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G +V W SP++ + + F ++ L K + +
Sbjct: 284 FLGAFVAWDNSPRKSYNATVITGATPEKFGEYMCKLM--------KKAQELHSPVIVINA 335
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSE 121
E + AFL ++ + + S+
Sbjct: 336 WNEWAEGAFLEPDKEYGTAYLEQISK 361
>gi|307340772|gb|ADN43835.1| WegG [Escherichia coli]
Length = 357
Score = 70.7 bits (172), Expect = 4e-10, Method: Composition-based stats.
Identities = 10/108 (9%), Positives = 26/108 (24%), Gaps = 8/108 (7%)
Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79
+ + N + W + + + F ++ ++ L+
Sbjct: 254 YSKLSKGFNTFVENSNRVIPVIIPRWDSTVRHGKNGWVLTGSTPKEFAKHVYDVKKILSK 313
Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
RI S E + F+ + + +F
Sbjct: 314 RDIK------YRIAIVKSWNEWAEGNFIEPDNIYGKRYLEILKSEFTN 355
>gi|227891408|ref|ZP_04009213.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
gi|227866797|gb|EEJ74218.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
Length = 357
Score = 70.7 bits (172), Expect = 4e-10, Method: Composition-based stats.
Identities = 17/121 (14%), Positives = 33/121 (27%), Gaps = 10/121 (8%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPA-HVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
KL + N + Y +SG + W S ++ S V + + F+
Sbjct: 242 KKLKMTDYQSFDKIWSYILNRKRTYDSKTIISGAFSGWDNSARKGKESMIVKGKTVPKFK 301
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
+ + S S + E + A+L + + E
Sbjct: 302 KYFEKFYT-------SDRENISEEFCVINAWNEWSEGAYLEPDDKDGFGYLEAIKEVVDK 354
Query: 126 V 126
Sbjct: 355 Y 355
>gi|91201537|emb|CAJ74597.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 369
Score = 70.4 bits (171), Expect = 5e-10, Method: Composition-based stats.
Identities = 13/98 (13%), Positives = 30/98 (30%), Gaps = 19/98 (19%)
Query: 36 AHVSGYYVLWSFSPKQRITSK----------DVHFQELSIFESFIFWLRSFLAFSKYSKL 85
A+ W SP+ + V + +F F LR + ++ +
Sbjct: 273 AYYPSVSPGWDASPRGELHGNQKPFCYPWWPIVVNEHPELFSGF---LRKAIHYTMRNNT 329
Query: 86 SFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+ + F S E + +L + + + +
Sbjct: 330 TP----LCFIASWNEWSEGHYLEPDARFGTAWLEAVRQ 363
>gi|227890976|ref|ZP_04008781.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
gi|227867385|gb|EEJ74806.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741]
Length = 371
Score = 70.0 bits (170), Expect = 5e-10, Method: Composition-based stats.
Identities = 10/88 (11%), Positives = 26/88 (29%), Gaps = 9/88 (10%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
G + W +P+++ FE ++ K ++ + + F +
Sbjct: 280 YPGAFADWDNTPRRKNKGVFCDGTSPEKFEYYLTQ------QIKRARNVYHKNYL-FMFA 332
Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123
E + +L + S +
Sbjct: 333 WNEWGESGYLEPDTKNSYKMLEAVRNAL 360
>gi|291520448|emb|CBK75669.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens
16/4]
Length = 625
Score = 70.0 bits (170), Expect = 7e-10, Method: Composition-based stats.
Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 1/56 (1%)
Query: 329 TMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
+ FW + + L+ L F +E + HA+ER F + ++
Sbjct: 4 SCFWCRTEALKKLLEYDFSYNFFPKEPMDANLTTSHAIERIFPYVACDAGYYTSTI 59
Score = 43.0 bits (100), Expect = 0.086, Method: Composition-based stats.
Identities = 24/146 (16%), Positives = 55/146 (37%), Gaps = 15/146 (10%)
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY-RRVIDLAK 312
++ F +L+ + I F++N +G++G+ + + Y +++ K
Sbjct: 421 QYTFDELIKNNGYISAICEVFKENQSVGVVGNIYGEIIFQINSNMNIYSKYEDEILEFEK 480
Query: 313 RAGFPTKRLH----LDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF 368
R F R L++ FW++ L+ + + I + + L D +
Sbjct: 481 RFNFDFNRGGKHSLLNYNG---FWLRRDALQMIADCEDI--YISAKKLCD---AEWI--V 530
Query: 369 FACSVRYTEFSIESVDCVAEYERLLH 394
+R F + +V C E + +
Sbjct: 531 LPELLRDKGFLLATVFCKREMNKAFY 556
>gi|326772087|ref|ZP_08231372.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505]
gi|326638220|gb|EGE39121.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505]
Length = 681
Score = 68.4 bits (166), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/210 (15%), Positives = 55/210 (26%), Gaps = 32/210 (15%)
Query: 163 WIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYFPSAQLYVMENKG---------R 211
++ L L + + VT E D E+ + G R
Sbjct: 323 ADGLAQRLASLPAHWRVVVTSPERLDAADLERVTGRRPSQEDTQEDSAHGEGDVSFRLVR 382
Query: 212 DVRP-----FLYLL-----------ELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255
D+ P FL D + +I + + R
Sbjct: 383 DLDPRGTIAFLTQCDDLWDPGRAAGGDEGGDSGPLVLRI-TVGPPPVPGTRAD-DVAHRQ 440
Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315
LL +I+ F ++P LG+ + + L++R G
Sbjct: 441 ALDCLLDSPGYTAGLIDLFARHPGLGVAMPAAGHIGQAH-GGPTWDGLAGAAKALSRRLG 499
Query: 316 F--PTKRLHLDFFNGTMFWVKPKCLEPLRN 343
L G MF +P+ L L
Sbjct: 500 LSAELDPLAPVAPPGAMFMARPEALRTLSE 529
>gi|320198724|gb|EFW73324.1| Hypothetical protein ECoL_04149 [Escherichia coli EC4100B]
Length = 355
Score = 66.9 bits (162), Expect = 5e-09, Method: Composition-based stats.
Identities = 14/120 (11%), Positives = 32/120 (26%), Gaps = 11/120 (9%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYI-PAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
+K+K ++ N Y + W +P+ + +
Sbjct: 240 IKNKRATYNQYKYSDYIQSMKNDVTEYKGKPVYPVVFPDWDNAPRYKENATFFCESSAYG 299
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
FE + ++ F + E + A+L + S + + F
Sbjct: 300 FEKALNIACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMHKYSSLEIIKKVF 351
>gi|168481345|gb|ACA24831.1| WbsX [Escherichia coli]
Length = 378
Score = 66.9 bits (162), Expect = 5e-09, Method: Composition-based stats.
Identities = 14/120 (11%), Positives = 32/120 (26%), Gaps = 11/120 (9%)
Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYI-PAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
+K+K ++ N Y + W +P+ + +
Sbjct: 263 IKNKRATYNQYKYSDYIQSMKNDVTEYKGKPVYPVVFPDWDNAPRYKENATFFCESSAYG 322
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
FE + ++ F + E + A+L + S + + F
Sbjct: 323 FEKALNIACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMHKYSSLEIIKKVF 374
>gi|46451858|gb|AAS98033.1| WbsX [Shigella boydii]
Length = 378
Score = 66.5 bits (161), Expect = 7e-09, Method: Composition-based stats.
Identities = 12/114 (10%), Positives = 27/114 (23%), Gaps = 12/114 (10%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71
N D + + W +P+ + + FE +
Sbjct: 269 TYNHYKYSDYIQSMKNDVTEYKGKPIYPVVFPDWDNAPRYKENATFFCESSAFDFEKALN 328
Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ F + E + A+L + S + + F
Sbjct: 329 IACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMYKYSNLEIIKKVF 374
>gi|332180195|gb|AEE15883.1| hypothetical protein Trebr_0439 [Treponema brennaborense DSM 12168]
Length = 376
Score = 65.4 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 14/112 (12%), Positives = 29/112 (25%), Gaps = 12/112 (10%)
Query: 16 NLLLRLDVEEKGNMQAIYIP--AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73
N + ++ P + W +P+ R + + L
Sbjct: 270 NKIWDYLLKNACVNDYPMFPNLKIFESAFWGWDNTPRYRNRATIFSELTRFEKRKYFSDL 329
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
Y K+S F+ + E + A+L + + E
Sbjct: 330 --------YKKVSNSDSEFIFFNAWNEWSEGAYLEPDDKYGFENLEIIYEVL 373
>gi|325685344|gb|EGD27453.1| group 2 glycosyl transferase [Lactobacillus delbrueckii subsp.
lactis DSM 20072]
Length = 359
Score = 64.2 bits (155), Expect = 3e-08, Method: Composition-based stats.
Identities = 9/94 (9%), Positives = 26/94 (27%), Gaps = 9/94 (9%)
Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96
G W + ++ V + F+ + + S +
Sbjct: 272 IFKGCTSGWDNTARKGKQGMVVKGKTPKKFKKYFNQFLT-------KPRQDASDEFYVIN 324
Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
+ E + A+L + ++ + E ++
Sbjct: 325 AWNEWSEGAYLEPDEKDGDTYLEIIKEAVEKEEK 358
>gi|324993910|gb|EGC25829.1| rhamnosyltransferase [Streptococcus sanguinis SK405]
gi|324994771|gb|EGC26684.1| rhamnosyltransferase [Streptococcus sanguinis SK678]
Length = 556
Score = 63.8 bits (154), Expect = 4e-08, Method: Composition-based stats.
Identities = 39/244 (15%), Positives = 72/244 (29%), Gaps = 35/244 (14%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ ++ +T E K + + QL + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K R L + + Y Y+ + S G + R L ++ D
Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325
A I EQ +G++ R + E + L DF
Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450
Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377
G W K L L + + E+ L D +E V +
Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIVWDSH 505
Query: 378 FSIE 381
+ +
Sbjct: 506 YDFK 509
>gi|319788852|ref|YP_004090167.1| glycosyltransferase [Ruminococcus albus 7]
gi|315450719|gb|ADU24281.1| glycosyltransferase [Ruminococcus albus 7]
Length = 360
Score = 63.4 bits (153), Expect = 6e-08, Method: Composition-based stats.
Identities = 15/117 (12%), Positives = 32/117 (27%), Gaps = 10/117 (8%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73
+ N V + + P H G + W SP+ + F+
Sbjct: 250 VTNYFNYDSVCDLIEKRIDNDPNHYLGLFAEWDNSPRHSHNCTIFKNFSIPRFKQ----- 304
Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
L +S+ K + E + A+L + ++ + +
Sbjct: 305 ---LVYSQIKKSVSVGKGFLIIDAWNEWGEGAYLEPDNISGFEKLNTIRDVLSGFMQ 358
>gi|327463172|gb|EGF09493.1| rhamnosyltransferase [Streptococcus sanguinis SK1]
gi|327474781|gb|EGF20186.1| rhamnosyltransferase [Streptococcus sanguinis SK408]
Length = 556
Score = 62.7 bits (151), Expect = 9e-08, Method: Composition-based stats.
Identities = 38/244 (15%), Positives = 71/244 (29%), Gaps = 35/244 (14%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ ++ +T E K + + QL + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K R L + + Y Y+ + S G + R L ++ D
Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325
A I EQ +G++ R + E + L DF
Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450
Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377
G W K L L + + E+ L D +E +
Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIAWDSH 505
Query: 378 FSIE 381
+ +
Sbjct: 506 YDFK 509
>gi|327489888|gb|EGF21677.1| rhamnosyltransferase [Streptococcus sanguinis SK1058]
Length = 556
Score = 62.7 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 38/244 (15%), Positives = 71/244 (29%), Gaps = 35/244 (14%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ ++ +T E K + + QL + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K R L + + Y Y+ + S G + R L ++ D
Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325
A I EQ +G++ R + E + L DF
Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450
Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377
G W K L L + + E+ L D +E +
Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIAWDSH 505
Query: 378 FSIE 381
+ +
Sbjct: 506 YDFK 509
>gi|325694904|gb|EGD36809.1| rhamnosyltransferase [Streptococcus sanguinis SK150]
Length = 556
Score = 62.3 bits (150), Expect = 1e-07, Method: Composition-based stats.
Identities = 40/289 (13%), Positives = 81/289 (28%), Gaps = 26/289 (8%)
Query: 104 FLRLNRFMSNSRMPFD-SEKFLYV--KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160
+L ++S E Y +L D+ S S + + +H
Sbjct: 236 YLLEELETNSSYPTSLIREHLFYHFGPDLPCLLQDKYLSQSTSSYRTNQSVLLHIHVTNF 295
Query: 161 DTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218
+ + L L + VT E K + + Q+ + + K + L
Sbjct: 296 PIFQQYQEKLFSLASQYQYLVTTNLPEMLKQLQTALAHLDDKVQIVLSQ-KSHALLAMLE 354
Query: 219 LLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNP 278
+ + Y Y+ + + + R L ++ D A I EQ
Sbjct: 355 --QKEILQNYVYIGHLSTHR--IMENQAVFDQAMRSDLINMMV---DYADASIEALEQES 407
Query: 279 CLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-----NGTMFWV 333
+G++ R + + + + + AG + G W
Sbjct: 408 AVGLVIPDLPRLVRD--GLFESEPPLPSLTAVWQEAGLHKSFDFMTAPSLTRVYGGFLWF 465
Query: 334 KPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381
K L L + + E+ L D +E + + +
Sbjct: 466 KYSALTSLFQMKSLESLPSSEQELSD-----VLEHLLVYIAWDSHYDFK 509
>gi|325690859|gb|EGD32860.1| rhamnosyltransferase [Streptococcus sanguinis SK115]
Length = 556
Score = 61.1 bits (147), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/239 (15%), Positives = 74/239 (30%), Gaps = 25/239 (10%)
Query: 153 IVVHCYYQDT--WIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + + L L+ + VTV E K + + QL + +
Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
K L + + Y Y+ + + + R L ++ D A
Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINLMV---DYAD 397
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326
I EQ +G++ R + + + R+ + + AG +
Sbjct: 398 ASIEALEQESAVGLVIPDLPRLVRD--GLFESEPLRPRLAAIWQEAGLHKSFDFMTPPSL 455
Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L L + + E+ L D +E + + +
Sbjct: 456 TRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDFK 509
>gi|29348315|ref|NP_811818.1| hypothetical protein BT_2906 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340219|gb|AAO78012.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 436
Score = 61.1 bits (147), Expect = 3e-07, Method: Composition-based stats.
Identities = 17/127 (13%), Positives = 35/127 (27%), Gaps = 26/127 (20%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK------QRITSK--------DVHFQ 61
++ +L E G Y+PA G W +P+ + + +
Sbjct: 312 DVAFKLWDEHHGQFDIPYVPAVAPG----WDSTPRYIAPANRPAKADRSQWPGCTIFKNE 367
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
+ F++F+ + Y RI E + +L + +
Sbjct: 368 NPASFKAFVQ------SSFVYLNKHPEVPRILTIACFNEWSEGHYLLPDNRFGYGMLDAL 421
Query: 120 SEKFLYV 126
E
Sbjct: 422 GEALGKE 428
>gi|325067617|ref|ZP_08126290.1| hypothetical protein AoriK_07344 [Actinomyces oris K20]
Length = 233
Score = 60.7 bits (146), Expect = 4e-07, Method: Composition-based stats.
Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 3/82 (3%)
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRL 321
+I+ F ++P LG+ + A + L++R G L
Sbjct: 1 PGYVAGLIDLFARHPGLGVAMPAAGHIGQAH-GGATWDGLAGAATALSRRLGLTVELDPL 59
Query: 322 HLDFFNGTMFWVKPKCLEPLRN 343
G MF +P L L
Sbjct: 60 APVVPVGAMFLARPAALRTLSE 81
>gi|253569319|ref|ZP_04846729.1| conserved hypothetical protein [Bacteroides sp. 1_1_6]
gi|251841338|gb|EES69419.1| conserved hypothetical protein [Bacteroides sp. 1_1_6]
Length = 415
Score = 60.7 bits (146), Expect = 4e-07, Method: Composition-based stats.
Identities = 17/127 (13%), Positives = 35/127 (27%), Gaps = 26/127 (20%)
Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK------QRITSK--------DVHFQ 61
++ +L E G Y+PA G W +P+ + + +
Sbjct: 291 DVAFKLWDEHHGQFDIPYVPAVAPG----WDSTPRYIAPANRPAKADRSQWPGCTIFKNE 346
Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119
+ F++F+ + Y RI E + +L + +
Sbjct: 347 NPASFKAFVQ------SSFVYLNKHPEVPRILTIACFNEWSEGHYLLPDNRFGYGMLDAL 400
Query: 120 SEKFLYV 126
E
Sbjct: 401 GEALGKE 407
>gi|13474019|ref|NP_105587.1| hypothetical protein mll4797 [Mesorhizobium loti MAFF303099]
gi|14024771|dbj|BAB51373.1| mll4797 [Mesorhizobium loti MAFF303099]
Length = 467
Score = 60.3 bits (145), Expect = 5e-07, Method: Composition-based stats.
Identities = 14/96 (14%), Positives = 28/96 (29%), Gaps = 10/96 (10%)
Query: 60 FQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMP 117
S + WL + +A + F S R+ F + E + A+L + + +
Sbjct: 2 NASPSRY---AEWLANAVADTCDRFADFDS-RLIFVNAWNEWAEGAYLEPDARYGYAYLQ 57
Query: 118 FDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAI 153
P+ L + A+
Sbjct: 58 ETRNVL----SAPSAAGKFPTGASWRVLFVSHDAAL 89
>gi|323351266|ref|ZP_08086922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
sanguinis VMC66]
gi|322122490|gb|EFX94201.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha
alpha-1,3-L-rhamnosyltransferase [Streptococcus
sanguinis VMC66]
Length = 556
Score = 60.3 bits (145), Expect = 5e-07, Method: Composition-based stats.
Identities = 34/240 (14%), Positives = 72/240 (30%), Gaps = 27/240 (11%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ ++ +T E K + + Q+ + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQIVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K R L + + Y Y+ + S + R L ++ D
Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVENQAVFDQAMRSDLINLMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
A I EQ +G++ R + F + + + + + AG +
Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRDGLFETEP--LRPSLSAVWQEAGLHKSFDFMTAS 453
Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L L + + D L +E + + +
Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509
>gi|327470704|gb|EGF16160.1| rhamnosyltransferase [Streptococcus sanguinis SK330]
Length = 556
Score = 60.0 bits (144), Expect = 6e-07, Method: Composition-based stats.
Identities = 35/239 (14%), Positives = 71/239 (29%), Gaps = 25/239 (10%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ + VTV E K + + QL + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
K L + + Y Y+ + + + R L ++ A
Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINMMVY---YAD 397
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326
I EQ +G++ R + + R+ + + AG +
Sbjct: 398 TSIEALEQESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAIWQEAGLHKSFDFMTPPSL 455
Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L L + + E+ L D +E + + +
Sbjct: 456 TRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDFK 509
>gi|328946538|gb|EGG40677.1| rhamnosyltransferase [Streptococcus sanguinis SK1087]
Length = 556
Score = 59.6 bits (143), Expect = 8e-07, Method: Composition-based stats.
Identities = 40/290 (13%), Positives = 84/290 (28%), Gaps = 28/290 (9%)
Query: 104 FLRLNRFMSNSR-MPFDSEKFLYV--KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160
+L + ++S + E Y +L D+ S S + + + +H
Sbjct: 236 YLLEDLETNSSYPILLIREHLFYHFGPDLPCLLEDKYLSQSTSNYCTEQPVLLHIHVTDF 295
Query: 161 DTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218
+ + L L+ + VT E K + + Q+ + + K L
Sbjct: 296 PIFQQYQDNLFSLSSQYQYLVTTGQPEVLKQLQTSLAHLGNKVQIVLSQ-KSHAWLAMLE 354
Query: 219 LLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276
+ + Y Y+ + S + R L ++ +D + I E+
Sbjct: 355 --QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMVDSADAS---IEALEK 405
Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-----NGTMF 331
N LG++ R + + R+ + + AG + G
Sbjct: 406 NSDLGLVIPDLPRLVRD--GLFESEPPRPRLTSVWQDAGLHKSFNFMSTPSLTRVYGGFL 463
Query: 332 WVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
W K L + + D L +E + + +
Sbjct: 464 WFKYSALASWFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509
>gi|283456866|ref|YP_003361430.1| putative glycosyltransferase [Bifidobacterium dentium Bd1]
gi|283103500|gb|ADB10606.1| Putative glycosyltransferase [Bifidobacterium dentium Bd1]
Length = 349
Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats.
Identities = 15/120 (12%), Positives = 32/120 (26%), Gaps = 15/120 (12%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K+ +++ L N Q Y V + + SP++ + + F
Sbjct: 239 KKMKRLDCLDYDYLWNRILNKQRKYGTRQIVRSAFTNFDNSPRKGTRAFITQGSSYTKFA 298
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFF--YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ L S + F + E + A L + +
Sbjct: 299 DYLNQLIH----------SNRQDYMDFTVINAWNEWGEGAILEPTESDQYGWLQAVKDAV 348
>gi|171741995|ref|ZP_02917802.1| hypothetical protein BIFDEN_01098 [Bifidobacterium dentium ATCC
27678]
gi|171277609|gb|EDT45270.1| hypothetical protein BIFDEN_01098 [Bifidobacterium dentium ATCC
27678]
Length = 356
Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats.
Identities = 15/120 (12%), Positives = 32/120 (26%), Gaps = 15/120 (12%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67
K+ +++ L N Q Y V + + SP++ + + F
Sbjct: 246 KKMKRLDCLDYDYLWNRILNKQRKYGTRQIVRSAFTNFDNSPRKGTRAFITQGSSYTKFA 305
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFF--YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++ L S + F + E + A L + +
Sbjct: 306 DYLNQLIH----------SNRQDYMDFTVINAWNEWGEGAILEPTESDQYGWLQAVKDAV 355
>gi|281355222|ref|ZP_06241716.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548]
gi|281318102|gb|EFB02122.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548]
Length = 375
Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats.
Identities = 13/117 (11%), Positives = 25/117 (21%), Gaps = 23/117 (19%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK----------DVHFQELSIFES 68
+E Y + W SP+ +T + + F
Sbjct: 267 YWRKWDEIER---QYRIPYFPNVTAGWDPSPRTLMTDRWEPVGYPYTCTLSENTPENFRR 323
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + +L R F E + + L + F
Sbjct: 324 AL--------AATRDRLLKSEIRTFSINCWNEWTEGSMLEPEARYGYGYLDALKAVF 372
>gi|327461067|gb|EGF07400.1| rhamnosyltransferase [Streptococcus sanguinis SK1057]
Length = 556
Score = 58.0 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 34/239 (14%), Positives = 72/239 (30%), Gaps = 25/239 (10%)
Query: 153 IVVHCYYQDT--WIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + + L L+ + VTV E K + + QL + +
Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268
K L + + Y Y+ + + + R L ++ D A
Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINLMV---DYAD 397
Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326
I EQ +G++ R + + + R+ + + AG +
Sbjct: 398 ASIEALEQESAVGLVIPDLPRLVRD--GLFESEPLRPRLAAIWQEAGLHKSFDFMTPPSL 455
Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L + + + E+ D +E + +
Sbjct: 456 TRVYGGFVWFKYSALASVFRMKSLESLPSSEQEFSD-----VLEHLLVYLAWDNHYDFK 509
>gi|125718317|ref|YP_001035450.1| lipopolysaccharide biosynthesis protein, putative [Streptococcus
sanguinis SK36]
gi|125498234|gb|ABN44900.1| Lipopolysaccharide biosynthesis protein, putative [Streptococcus
sanguinis SK36]
Length = 556
Score = 57.7 bits (138), Expect = 4e-06, Method: Composition-based stats.
Identities = 33/240 (13%), Positives = 69/240 (28%), Gaps = 27/240 (11%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ + +T E K + + Q+ + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K L + + Y Y+ + S + R L ++ D
Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
A I EQ+ G++ R + F + + + + AG +
Sbjct: 396 ADASIEALEQDSAEGLVIPDLPRLVRDGLFEIEP--PRPSLSAVWQEAGLHKSFDFMTAS 453
Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L L + + D L +E + + +
Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509
>gi|325687210|gb|EGD29232.1| rhamnosyltransferase [Streptococcus sanguinis SK72]
Length = 556
Score = 57.3 bits (137), Expect = 5e-06, Method: Composition-based stats.
Identities = 33/240 (13%), Positives = 69/240 (28%), Gaps = 27/240 (11%)
Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208
+++H + D + + L L+ + +T E K + + Q+ + +
Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQ- 344
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266
K L + + Y Y+ + S + R L ++ D
Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQTMRSDLINMMV---DY 395
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
A I EQ+ G++ R + F + + + + AG +
Sbjct: 396 ADASIEALEQDSAEGLVIPDLPRLVRDGLFEIEP--PRPSLSAVWQEAGLHKSFDFMTAS 453
Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381
G W K L L + + D L +E + + +
Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509
>gi|29348316|ref|NP_811819.1| hypothetical protein BT_2907 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340220|gb|AAO78013.1| glycosyltransferase-like protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 452
Score = 56.9 bits (136), Expect = 6e-06, Method: Composition-based stats.
Identities = 11/111 (9%), Positives = 29/111 (26%), Gaps = 18/111 (16%)
Query: 27 GNMQAIYIPAHVSGYYVLWSFSPK---------QRITSK-----DVHFQELSIFESFIFW 72
+ ++ W +P+ Q + + + F++ +
Sbjct: 334 PKHHDDFAIPYLPSLSPGWDSTPRYIPPVSRPDQPNRDAWPNCVILDNENPASFKALVQ- 392
Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123
S A+ K P I + + +L + + +E
Sbjct: 393 --SAFAYLNKHKDVPPILTIACFNEW-TEGHYLLPDNRFGYGMLDALAEAV 440
>gi|253569318|ref|ZP_04846728.1| conserved hypothetical protein [Bacteroides sp. 1_1_6]
gi|251841337|gb|EES69418.1| conserved hypothetical protein [Bacteroides sp. 1_1_6]
Length = 441
Score = 56.9 bits (136), Expect = 6e-06, Method: Composition-based stats.
Identities = 11/111 (9%), Positives = 29/111 (26%), Gaps = 18/111 (16%)
Query: 27 GNMQAIYIPAHVSGYYVLWSFSPK---------QRITSK-----DVHFQELSIFESFIFW 72
+ ++ W +P+ Q + + + F++ +
Sbjct: 323 PKHHDDFAIPYLPSLSPGWDSTPRYIPPVSRPDQPNRDAWPNCVILDNENPASFKALVQ- 381
Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123
S A+ K P I + + +L + + +E
Sbjct: 382 --SAFAYLNKHKDVPPILTIACFNEW-TEGHYLLPDNRFGYGMLDALAEAV 429
>gi|325696073|gb|EGD37964.1| rhamnosyltransferase [Streptococcus sanguinis SK160]
Length = 556
Score = 56.5 bits (135), Expect = 7e-06, Method: Composition-based stats.
Identities = 36/243 (14%), Positives = 72/243 (29%), Gaps = 33/243 (13%)
Query: 153 IVVHCYYQDTWIEISHI---LLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVME 207
+++H + D + H L L+ + VTV E K + + QL + +
Sbjct: 286 VLLHIHVTD-FPIFQHYQDKLFSLSSQYQYLVTVAQPEMLKQLQTALAHLGDKVQLVLSQ 344
Query: 208 NKGRDVRPFLYLL-ELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFS 264
+L +L + + Y Y+ + S + R L ++
Sbjct: 345 ----ASHAWLAMLDQKEILQDYAYIGHL----STHRLVENQAVFDQAMRSDLINMMVY-- 394
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324
A I EQ +G++ R + + R+ + + A +
Sbjct: 395 -YADTSIEALEQESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAIWQEADLHKSFDCMT 451
Query: 325 FF-----NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEF 378
G W K L L + + E+ L D +E + +
Sbjct: 452 PPSLTRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHY 506
Query: 379 SIE 381
+
Sbjct: 507 DFK 509
>gi|218282206|ref|ZP_03488505.1| hypothetical protein EUBIFOR_01087 [Eubacterium biforme DSM 3989]
gi|218216808|gb|EEC90346.1| hypothetical protein EUBIFOR_01087 [Eubacterium biforme DSM 3989]
Length = 355
Score = 55.0 bits (131), Expect = 2e-05, Method: Composition-based stats.
Identities = 11/102 (10%), Positives = 26/102 (25%), Gaps = 16/102 (15%)
Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81
+ + N + + + G W +P+ + F ++
Sbjct: 263 KMYKVANDTKLNVNNVIRGLCFEWDNTPRHGYRGYVITPPSKESFFKYM----------- 311
Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
S S F + E + L + + + E
Sbjct: 312 ---DSVQSDEYLFINAWNEWCEGMVLEPTQEKKYKYLEWIKE 350
>gi|296163856|ref|ZP_06846524.1| glycosyltransferase [Burkholderia sp. Ch1-1]
gi|295885899|gb|EFG65849.1| glycosyltransferase [Burkholderia sp. Ch1-1]
Length = 187
Score = 53.4 bits (127), Expect = 6e-05, Method: Composition-based stats.
Identities = 11/72 (15%), Positives = 23/72 (31%), Gaps = 4/72 (5%)
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
+ WL + + P +I F S E + +L + + SE
Sbjct: 11 YKQWLSQAILDTHDR--YSPDEQIVFLHSWNEWCEGTYLEPDGKSGRRFLEETSEAIKDA 68
Query: 127 KELFEGWNDRPS 138
+ + +D +
Sbjct: 69 ESVLALSDDSQA 80
>gi|315221431|ref|ZP_07863352.1| rhamnan synthesis protein F [Streptococcus anginosus F0211]
gi|315189550|gb|EFU23244.1| rhamnan synthesis protein F [Streptococcus anginosus F0211]
Length = 555
Score = 52.6 bits (125), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/178 (15%), Positives = 47/178 (26%), Gaps = 23/178 (12%)
Query: 215 PFLYLL-ELGVFDRYDYLCKI--HGKKSQREGYHPIEGIIW-RRWLFFDLLGFSDIAIRI 270
P L + + Y Y+ + H W R LF ++ +
Sbjct: 348 PLLAMFAQAERLKTYKYIGHLSTHT-----LIPEVAGLDQWMRDDLFNMMI---ENMNYS 399
Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF----- 325
IN E LG+I + F+ K + + L K D
Sbjct: 400 INALEHCSNLGLIIPDLPSVVRNGLFYQKP--LKEEMEKLWKLLSCRKSFKFTDAVTLTR 457
Query: 326 FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383
G W K + +E L F + + +E + + +
Sbjct: 458 VYGGWMWFKYEAVESLFKASFKT-FSSYSLQEQSTI---LENLLVYVAWDKNYDFQII 511
>gi|283785857|ref|YP_003365722.1| hypothetical protein ROD_21731 [Citrobacter rodentium ICC168]
gi|282949311|emb|CBG88922.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 346
Score = 52.6 bits (125), Expect = 1e-04, Method: Composition-based stats.
Identities = 14/115 (12%), Positives = 30/115 (26%), Gaps = 14/115 (12%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
LG I + + + + I + + W + + FE
Sbjct: 239 KFLGPI-RYNYKKMISSLWHNETKDI-KEIPIIFSGWDTTIRHGKQGVFYSNFSEHSFE- 295
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
+ + P I F S E + + + S+ + S+
Sbjct: 296 ---------VNVQNAINYNPQQDIVFLKSWNEWAEGNTVEPDTIFSDKLLRIISK 341
>gi|160894490|ref|ZP_02075266.1| hypothetical protein CLOL250_02042 [Clostridium sp. L2-50]
gi|156863801|gb|EDO57232.1| hypothetical protein CLOL250_02042 [Clostridium sp. L2-50]
Length = 783
Score = 51.1 bits (121), Expect = 3e-04, Method: Composition-based stats.
Identities = 33/218 (15%), Positives = 69/218 (31%), Gaps = 33/218 (15%)
Query: 139 SPKKSGLTIKSKIAIVV-HCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197
P I KIA+V+ +Y +I L DL+ E + +++ +
Sbjct: 291 IPDSECERIAEKIAVVIDEDFYLQHQPDI----DDLETHADLYYWGSEESFHQKKNWEEM 346
Query: 198 F-------PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEG- 249
A++Y V F Y+Y+C + + + G
Sbjct: 347 HLLECTTGNFAEVYY------AVGAF--------AKEYEYICFLVNEDRSYIAENLDNGH 392
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSF-FAKRSEVYRRVI 308
W +LG I++ N +G++ + +S + +R + +
Sbjct: 393 TGWIIE--NSILGKGVSLGNIVSCLNDNSGIGLVYPPNSSQSLYYSRQYKERELISCEIQ 450
Query: 309 DLAKRAGFPTKRLHLDFFNG---TMFWVKPKCLEPLRN 343
+ + + + G FW + + L+ L
Sbjct: 451 QILEDSDIHLNIAKVRGSIGQYTGCFWCRSQVLQNLTE 488
>gi|168481320|gb|ACA24808.1| WfgB [Shigella dysenteriae]
gi|168481331|gb|ACA24818.1| WfgB [Escherichia coli]
Length = 345
Score = 50.7 bits (120), Expect = 4e-04, Method: Composition-based stats.
Identities = 16/115 (13%), Positives = 29/115 (25%), Gaps = 14/115 (12%)
Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68
LG I + + Q I + + W + + FE
Sbjct: 239 KFLGPI-RYNYEKMISSLWHNQTKDI-KEIPIIFSGWDTTIRHGKQGVFYSDFSEHSFE- 295
Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121
K + P I F S E + + + S+ + S+
Sbjct: 296 ---------VNVKNAINYNPQQDIVFLKSWNEWAEGNTVEPDTIFSDKLLRIISK 341
>gi|127512343|ref|YP_001093540.1| tetratricopeptide TPR_2 [Shewanella loihica PV-4]
gi|126637638|gb|ABO23281.1| tetratricopeptide TPR_2 [Shewanella loihica PV-4]
Length = 372
Score = 50.3 bits (119), Expect = 5e-04, Method: Composition-based stats.
Identities = 10/92 (10%), Positives = 23/92 (25%), Gaps = 10/92 (10%)
Query: 18 LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77
V+ S W SP+ + + + F+ +
Sbjct: 268 NYASTVKTLEYAHQNISGTVHSTIVTGWDNSPRSNRRALVLTNFNENSFK-------YAI 320
Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRL 107
+ ++ ++ F S E + L
Sbjct: 321 DIAISNE-KNNENKLLFIKSWNEWAEGNTLEP 351
>gi|254431846|ref|ZP_05045549.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001]
gi|197626299|gb|EDY38858.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001]
Length = 205
Score = 50.3 bits (119), Expect = 5e-04, Method: Composition-based stats.
Identities = 29/178 (16%), Positives = 45/178 (25%), Gaps = 15/178 (8%)
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263
+ N G D F +L G F K+ KKS G G+ W +
Sbjct: 28 ERVTNYGEDWSSFHHLFYSGAFSSRGATFKLQTKKSSNLG--ADGGMAWVDEALQPIASS 85
Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-PTKRLH 322
+I + + + V + R G
Sbjct: 86 YRATATVIKNLKAG--------TIKLAASKLCKRTGFGANPQLVAEYIHRLGLNEQSAKR 137
Query: 323 LDFFNGTMFWVKPKCLEPLR-NLHLIGEFEEERN--LKDGAL-EHAVERFFACSVRYT 376
F G+MF ++ +L + G HA+ER F
Sbjct: 138 QSFCMGSMFAADNDLIQLFYSSLGDVDYRITSDGGSQFCGRYPGHAIERAFFYYSYQA 195
>gi|323483798|ref|ZP_08089177.1| hypothetical protein HMPREF9474_00926 [Clostridium symbiosum
WAL-14163]
gi|323402883|gb|EGA95202.1| hypothetical protein HMPREF9474_00926 [Clostridium symbiosum
WAL-14163]
Length = 358
Score = 49.2 bits (116), Expect = 0.001, Method: Composition-based stats.
Identities = 8/80 (10%), Positives = 20/80 (25%), Gaps = 16/80 (20%)
Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99
G + W +P+ I + + ++ ++ S F +
Sbjct: 278 GVFFEWDNTPRHSIRGTII---TPPDKKRYLQYMDSIKDT-----------EYLFINAWN 323
Query: 100 E--QKAFLRLNRFMSNSRMP 117
E + L +
Sbjct: 324 EWAEGMMLEPTVENKYKYLE 343
>gi|322510485|gb|ADX05799.1| putative N-acetyl glucosaminyl transferase [Organic Lake
phycodnavirus 1]
Length = 690
Score = 48.0 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 49/268 (18%), Positives = 81/268 (30%), Gaps = 63/268 (23%)
Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEIS-HILLRLNFDFDLFVTVVEANKDFEQDVLKYFP 199
+KS L KS A +HCY + I + L+ F + VT D + + +
Sbjct: 302 EKSELYSKSLFA-HLHCYDISQFTTIYKDYIYDLSKYFHIIVTYTIGYLDKKNEYITLLK 360
Query: 200 SAQLYVMENKGRDVRP--FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257
+ N G D+ + Y Y+ +H K + R ++
Sbjct: 361 ------IPNNGYDIGAKMMMVKYLKDKNIDYKYIYFMHSK-----------SDVNLRHIY 403
Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR-----SEVYRRVIDLAK 312
FD L D I+ E G + Y+ Y +++ Y +L
Sbjct: 404 FDTLY--DHVDDIVKYIEDYD--GYFPNLLYKLYNQYNIKQSNKIKQPDYNYVYTNEL-- 457
Query: 313 RAGFPTKRLHLD-FFNGTMFWVKPKC---------LEPLRNLHLIGEF---------EEE 353
+ K + F G ++ ++ L L N ++ E
Sbjct: 458 KHYLNVKDTQFNTFVEGNVYILRRNICETIFGDERLYRLLNESDENDYVHLQNIYRKPLE 517
Query: 354 RNL------------KDGALEHAVERFF 369
DG LEHA ER
Sbjct: 518 EIYHKLKYNYQTKMIHDGQLEHAFERVV 545
>gi|53803315|ref|YP_114969.1| glycosyl transferase group 2 family protein [Methylococcus
capsulatus str. Bath]
gi|53757076|gb|AAU91367.1| glycosyl transferase, group 2 family protein [Methylococcus
capsulatus str. Bath]
Length = 957
Score = 48.0 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 8/62 (12%), Positives = 19/62 (30%), Gaps = 3/62 (4%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ W S ++ + + L+ S+ WL + + + R+ F
Sbjct: 853 YKLFRSVTLAWDNSARRGKRATILRNFSLT---SYAQWLLTACKATLADHNLTENERLVF 909
Query: 95 YG 96
Sbjct: 910 IN 911
>gi|293611242|ref|ZP_06693540.1| predicted protein [Acinetobacter sp. SH024]
gi|292826493|gb|EFF84860.1| predicted protein [Acinetobacter sp. SH024]
Length = 347
Score = 48.0 bits (113), Expect = 0.003, Method: Composition-based stats.
Identities = 10/126 (7%), Positives = 24/126 (19%), Gaps = 14/126 (11%)
Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62
K+ K+ + + W S + ++ +
Sbjct: 233 KINNFTKGRFKLGPFFYSYKRMMMLEKNLKNSSGEIPVIFSGWDTSIRHATNGIVLNEFD 292
Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120
+F + S S E + L + + +
Sbjct: 293 SQVFNEHV------------SHNLNFESDFLIVKSWNEWAEGNLLEPDSIYGFTMLKVMK 340
Query: 121 EKFLYV 126
E
Sbjct: 341 EALRKY 346
>gi|302024024|ref|ZP_07249235.1| polysaccharide biosynthesis protein [Streptococcus suis 05HAS68]
Length = 587
Score = 47.3 bits (111), Expect = 0.005, Method: Composition-based stats.
Identities = 33/232 (14%), Positives = 81/232 (34%), Gaps = 25/232 (10%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-KDFEQD 193
P ++ T++S ++++H + + + E L +++ L +T+ E + +
Sbjct: 282 PIRVSQTTETVRSSTSVLLHVHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNKCSI 341
Query: 194 VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253
V +Y + +L K D F ++ + D YL + K+++ Y + I R
Sbjct: 342 VERYLSTYKLRAQIAKLTDELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-R 399
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L +I+ FE L ++ + + +L ++
Sbjct: 400 HQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQ 451
Query: 314 AGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353
+ + G + +W+K + + + +F +E
Sbjct: 452 LNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEDYKKIEEKFRNIDFSKE 503
>gi|223933250|ref|ZP_03625240.1| Rhamnan synthesis F [Streptococcus suis 89/1591]
gi|330832517|ref|YP_004401342.1| Rhamnan synthesis F [Streptococcus suis ST3]
gi|223898064|gb|EEF64435.1| Rhamnan synthesis F [Streptococcus suis 89/1591]
gi|329306740|gb|AEB81156.1| Rhamnan synthesis F [Streptococcus suis ST3]
Length = 574
Score = 46.9 bits (110), Expect = 0.005, Method: Composition-based stats.
Identities = 33/232 (14%), Positives = 81/232 (34%), Gaps = 25/232 (10%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-KDFEQD 193
P ++ T++S ++++H + + + E L +++ L +T+ E + +
Sbjct: 269 PIRVSQTTETVRSSTSVLLHVHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNKCSI 328
Query: 194 VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253
V +Y + +L K D F ++ + D YL + K+++ Y + I R
Sbjct: 329 VERYLSTYKLRAQIAKLTDELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-R 386
Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313
L +I+ FE L ++ + + +L ++
Sbjct: 387 HQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQ 438
Query: 314 AGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353
+ + G + +W+K + + + +F +E
Sbjct: 439 LNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEDYKKIEEKFRNIDFSKE 490
>gi|146318939|ref|YP_001198651.1| polysaccharide biosynthesis protein [Streptococcus suis 05ZYH33]
gi|145689745|gb|ABP90251.1| polysaccharide biosynthesis protein [Streptococcus suis 05ZYH33]
gi|319758378|gb|ADV70320.1| polysaccharide biosynthesis protein [Streptococcus suis JS14]
Length = 587
Score = 46.9 bits (110), Expect = 0.006, Method: Composition-based stats.
Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 31/221 (14%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN-----KDFEQDVLKYFPSAQLY 204
+ + VH + E L ++ L +T+ EA+ E+ + Y AQ+
Sbjct: 297 SVLLHVHIESVSIFEEYIEELCKIADRCQLLITLPEADFSNKCSIVERCLFTYQLRAQIA 356
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ D F ++ + D YL + K+++ Y + I R L
Sbjct: 357 KLT----DELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-RHQLRKMFFTS- 409
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-------- 316
+I+ FE L ++ + + +L ++
Sbjct: 410 --FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQLNILYESLVRT 462
Query: 317 -PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353
+ + G + +W+K + + + +F +E
Sbjct: 463 KKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 503
>gi|253752012|ref|YP_003025153.1| rhamnan synthesis protein F family protein [Streptococcus suis
SC84]
gi|253753837|ref|YP_003026978.1| rhamnan synthesis protein F family protein [Streptococcus suis
P1/7]
gi|251816301|emb|CAZ51929.1| rhamnan synthesis protein F family protein [Streptococcus suis
SC84]
gi|251820083|emb|CAR46353.1| rhamnan synthesis protein F family protein [Streptococcus suis
P1/7]
Length = 574
Score = 46.5 bits (109), Expect = 0.007, Method: Composition-based stats.
Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 31/221 (14%)
Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN-----KDFEQDVLKYFPSAQLY 204
+ + VH + E L ++ L +T+ EA+ E+ + Y AQ+
Sbjct: 284 SVLLHVHIESVSIFEEYIEELCKIADRCQLLITLPEADFSNKCSIVERCLFTYQLRAQIA 343
Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264
+ D F ++ + D YL + K+++ Y + I R L
Sbjct: 344 KLT----DELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-RHQLRKMFFTS- 396
Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-------- 316
+I+ FE L ++ + + +L ++
Sbjct: 397 --FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQLNILYESLVRT 449
Query: 317 -PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353
+ + G + +W+K + + + +F +E
Sbjct: 450 KKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 490
>gi|322418496|ref|YP_004197719.1| hypothetical protein GM18_0965 [Geobacter sp. M18]
gi|320124883|gb|ADW12443.1| hypothetical protein GM18_0965 [Geobacter sp. M18]
Length = 393
Score = 46.1 bits (108), Expect = 0.009, Method: Composition-based stats.
Identities = 13/107 (12%), Positives = 26/107 (24%), Gaps = 14/107 (13%)
Query: 26 KGNMQAIYIPAHVSGYYVLWSFSPKQ-------RITSKDVHFQELSIFESFIFWLRSFLA 78
V V W P++ ++ S LR +
Sbjct: 271 FWEECKALAQQTVPVVNVGWDNRPRRTSPEQALKLRGPWYVPPTPDELASH---LRMAIQ 327
Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + + + I + E + L R N+R+
Sbjct: 328 WERENPAYTEANAIL-IYAWNELDEGG-LVPTRSEGNARLQAVKTAL 372
>gi|253755287|ref|YP_003028427.1| rhamnan synthesis protein F family protein [Streptococcus suis
BM407]
gi|251817751|emb|CAZ55503.1| rhamnan synthesis protein F family protein [Streptococcus suis
BM407]
Length = 574
Score = 45.7 bits (107), Expect = 0.014, Method: Composition-based stats.
Identities = 34/236 (14%), Positives = 81/236 (34%), Gaps = 33/236 (13%)
Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-----KD 189
P ++ T++S ++++H + + + E L +++ L +T+ E +
Sbjct: 269 PIRVSQTTETVRSSTSVLLHIHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNNFSI 328
Query: 190 FEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEG 249
E+ + Y AQ+ + D F ++ + D YL I K++ + Y +
Sbjct: 329 VERYLSTYKLRAQIVKLT----DELHFFEIVNNYMGDA-KYLAHITVKQTNKTKYSVEDI 383
Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID 309
I R L +I+ FE L ++ + + +
Sbjct: 384 ID-RYQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRKSLREGNP-----E 434
Query: 310 LAKRAGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353
L ++ + + G + +W+K + + + +F +E
Sbjct: 435 LIRQLNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 490
>gi|30260448|ref|NP_842825.1| hypothetical protein BA_0273 [Bacillus anthracis str. Ames]
gi|47525533|ref|YP_016882.1| hypothetical protein GBAA_0273 [Bacillus anthracis str. 'Ames
Ancestor']
gi|49183290|ref|YP_026542.1| hypothetical protein BAS0259 [Bacillus anthracis str. Sterne]
gi|65317702|ref|ZP_00390661.1| COG1882: Pyruvate-formate lyase [Bacillus anthracis str. A2012]
gi|227812939|ref|YP_002812948.1| hypothetical protein BAMEG_0323 [Bacillus anthracis str. CDC 684]
gi|254736984|ref|ZP_05194689.1| hypothetical protein BantWNA_17640 [Bacillus anthracis str. Western
North America USA6153]
gi|254756036|ref|ZP_05208066.1| hypothetical protein BantV_26524 [Bacillus anthracis str. Vollum]
gi|254761686|ref|ZP_05213703.1| hypothetical protein BantA9_25534 [Bacillus anthracis str.
Australia 94]
gi|30253769|gb|AAP24311.1| conserved domain protein [Bacillus anthracis str. Ames]
gi|47500681|gb|AAT29357.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames
Ancestor']
gi|49177217|gb|AAT52593.1| conserved domain protein [Bacillus anthracis str. Sterne]
gi|227005477|gb|ACP15220.1| conserved hypothetical protein [Bacillus anthracis str. CDC 684]
Length = 317
Score = 45.3 bits (106), Expect = 0.016, Method: Composition-based stats.
Identities = 13/87 (14%), Positives = 25/87 (28%), Gaps = 8/87 (9%)
Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70
GK N V ++ G +V W + +++ + S F +
Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296
Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGS 97
L+ Y S + F +
Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNA 317
>gi|86144017|ref|ZP_01062355.1| hypothetical protein MED217_13656 [Leeuwenhoekiella blandensis
MED217]
gi|85829477|gb|EAQ47941.1| hypothetical protein MED217_13656 [Leeuwenhoekiella blandensis
MED217]
Length = 361
Score = 44.2 bits (103), Expect = 0.034, Method: Composition-based stats.
Identities = 10/91 (10%), Positives = 27/91 (29%), Gaps = 6/91 (6%)
Query: 36 AHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ + W P Q+ + + + R ++ K S ++
Sbjct: 269 KQIPTVTLNWDPRPMQKHSGAKIFSGFSAKSVKKAVLATRVWVDTH---KESVSKKKLIM 325
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ E + A+L + + + + E
Sbjct: 326 LYAWNEYAEGAWLTPSEVLGTTLLDGLKEGL 356
>gi|325106250|ref|YP_004275904.1| hypothetical protein Pedsa_3552 [Pedobacter saltans DSM 12145]
gi|324975098|gb|ADY54082.1| hypothetical protein Pedsa_3552 [Pedobacter saltans DSM 12145]
Length = 355
Score = 43.8 bits (102), Expect = 0.045, Method: Composition-based stats.
Identities = 10/120 (8%), Positives = 35/120 (29%), Gaps = 20/120 (16%)
Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65
R + ++ L + + ++P GY P +
Sbjct: 251 RWWAFYSFVD-LNWQNWKASLDKLNVEFVPCIFPGY-----NEP------------SAAT 292
Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
++++ ++ +K + ++ S + + L ++ + + +F
Sbjct: 293 QRIIDRTEKNYVDYANVAKRNMGVNQMVIINSWNDFSKGTALEPSKKYNKQFLELTKREF 352
>gi|254788145|ref|YP_003075574.1| glycosyltransferase family 2 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237685039|gb|ACR12303.1| glycosyltransferase family 2 domain protein [Teredinibacter
turnerae T7901]
Length = 307
Score = 43.4 bits (101), Expect = 0.064, Method: Composition-based stats.
Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 32/126 (25%)
Query: 166 ISHILLRLNFDFDLFVTVVEANKD----FEQDVLKYFPSAQLYVMENKG-RDVRP----- 215
+ DL+V V + + D D + +P Q+ EN+G R V P
Sbjct: 19 TLDSVCNQTVPPDLWVVVDDGSTDETPAILADYSERYPFIQVITRENRGHRSVGPGVIEA 78
Query: 216 FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFE 275
F Y + ++DY+CK L +I+ E
Sbjct: 79 FYYGYDKIDVSQFDYVCKFD---------------------LDLDLP-PRYFEILIDRME 116
Query: 276 QNPCLG 281
+NP LG
Sbjct: 117 KNPRLG 122
>gi|261338088|ref|ZP_05965972.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum
DSM 20093]
gi|270276707|gb|EFA22561.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum
DSM 20093]
Length = 639
Score = 43.4 bits (101), Expect = 0.070, Method: Composition-based stats.
Identities = 19/101 (18%), Positives = 28/101 (27%), Gaps = 17/101 (16%)
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG---------YHPIEGIIWRR 254
+E+ G DV + L + + IH K E W+
Sbjct: 214 RYVEH-GNDVHAMIQALREVKDTDHPVVLHIHTCKGLGLDQQDAHYGVLEGRCEANHWQN 272
Query: 255 WLFFDL--LGFSDIAIRII-----NTFEQNPCLGMIGSRRY 288
L LG R I F++ P L +I
Sbjct: 273 PLAQANAPLGSRKTYGRAIMAMLEQRFDEEPGLMVISPATP 313
>gi|257125628|ref|YP_003163742.1| 1-deoxy-D-xylulose-5-phosphate synthase [Leptotrichia buccalis
C-1013-b]
gi|257049567|gb|ACV38751.1| 1-deoxy-D-xylulose-5-phosphate synthase [Leptotrichia buccalis
C-1013-b]
Length = 582
Score = 43.0 bits (100), Expect = 0.074, Method: Composition-based stats.
Identities = 20/117 (17%), Positives = 44/117 (37%), Gaps = 13/117 (11%)
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E+N + + K +YV +KG D+ + + E + + +H +K + Y
Sbjct: 193 ESNGQAQNNYFKSLGLDYVYV--DKGNDLDALIEVFEKVKDINHPIVVHVHTQKGKGLPY 250
Query: 245 HPIEGIIWRRWL-FFDLLG----------FSDIAIRIINTFEQNPCLGMIGSRRYRR 290
+ W + F G +D A +++ E++P + ++ S
Sbjct: 251 AEKDKETWHYGMPFDPKTGESKVNYSGGLSNDTAEFLMDKMEKDPTIAVVTSGTPTV 307
>gi|255533245|ref|YP_003093617.1| hypothetical protein Phep_3361 [Pedobacter heparinus DSM 2366]
gi|255346229|gb|ACU05555.1| hypothetical protein Phep_3361 [Pedobacter heparinus DSM 2366]
Length = 355
Score = 43.0 bits (100), Expect = 0.089, Method: Composition-based stats.
Identities = 9/101 (8%), Positives = 30/101 (29%), Gaps = 19/101 (18%)
Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84
+ Y+P GY P + ++++ ++ +K
Sbjct: 269 SLDKLNVEYVPCIFPGY-----NEP------------SAATQRIIERTEKNYVDYTNVAK 311
Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
+ + ++ S + + L ++ + + +F
Sbjct: 312 RNMGTNQMVIINSWNDFSKGTALEPSKKFNKQFLGITRREF 352
>gi|149279792|ref|ZP_01885919.1| hypothetical protein PBAL39_02720 [Pedobacter sp. BAL39]
gi|149229382|gb|EDM34774.1| hypothetical protein PBAL39_02720 [Pedobacter sp. BAL39]
Length = 357
Score = 42.2 bits (98), Expect = 0.15, Method: Composition-based stats.
Identities = 18/132 (13%), Positives = 40/132 (30%), Gaps = 15/132 (11%)
Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPK---QRITSKD 57
Y K+ +I ++ + N A P + + W P+ D
Sbjct: 228 YHSSGFKAGSTEIPISNMQAAENQMWNNIAYVSPLKFIPVATLNWD--PRPWANAGNGYD 285
Query: 58 ----VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFM 111
+S + S + + +K P RI + E + A+L ++
Sbjct: 286 KAPYFVGYS---EKSVYKSVSSLIDWINKNKWETPKERIGLLYAWNENGEGAYLTPSQEG 342
Query: 112 SNSRMPFDSEKF 123
++ + +
Sbjct: 343 DDNLLRGVQKAL 354
>gi|238916219|ref|YP_002929736.1| polysaccharide biosynthesis protein [Eubacterium eligens ATCC
27750]
gi|238871579|gb|ACR71289.1| polysaccharide biosynthesis protein [Eubacterium eligens ATCC
27750]
Length = 621
Score = 42.2 bits (98), Expect = 0.16, Method: Composition-based stats.
Identities = 29/230 (12%), Positives = 67/230 (29%), Gaps = 35/230 (15%)
Query: 122 KFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTW-IEISHILLRLNFDFD-L 179
+ +LFE N + + I K AIVV C EI + R+ + +
Sbjct: 267 RLYNHADLFEKLNLQYVLQTRGEKEISLKNAIVVICGNVKLISNEIDEYIQRIKDEIKVI 326
Query: 180 FVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKS 239
F+T +K+ +++ E D +
Sbjct: 327 FIT---ESKEGCEELKNQIR-------------------EYEYVCLINCDIIL------- 357
Query: 240 QREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG-MIGSRRYRRYKRWSFFA 298
+ +L+ + ++ F++N +G + +
Sbjct: 358 -ENNTFSCVNKSALYGVLENLIKSNSYISNVMGIFKRNKKIGALTIPELIHADFLGKAWK 416
Query: 299 KRSEVYRRVIDLA--KRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL 346
+ ++ +++ + K+ + N WV+ + LE +
Sbjct: 417 RWVQIRQKISYILDSKQIHCIFSMDKMPIVNSDNLWVRRELLEQAIEYND 466
>gi|149198517|ref|ZP_01875562.1| hypothetical protein LNTAR_06784 [Lentisphaera araneosa HTCC2155]
gi|149138523|gb|EDM26931.1| hypothetical protein LNTAR_06784 [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 41.9 bits (97), Expect = 0.16, Method: Composition-based stats.
Identities = 7/68 (10%), Positives = 17/68 (25%), Gaps = 9/68 (13%)
Query: 58 VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSR 115
H ++ + R + + G E + A+L +
Sbjct: 375 YHGGTPKLYGESLKLARESVE-------KNNGRKFITVGIWNEFYEDAYLEPDVKYGYEY 427
Query: 116 MPFDSEKF 123
+ + F
Sbjct: 428 LKQIEKNF 435
>gi|87312199|ref|ZP_01094301.1| hypothetical protein DSM3645_13248 [Blastopirellula marina DSM
3645]
gi|87285075|gb|EAQ77007.1| hypothetical protein DSM3645_13248 [Blastopirellula marina DSM
3645]
Length = 349
Score = 41.9 bits (97), Expect = 0.18, Method: Composition-based stats.
Identities = 12/104 (11%), Positives = 30/104 (28%), Gaps = 21/104 (20%)
Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84
+ Q ++ P + Y F K+ +T ++ + +A +
Sbjct: 260 QTWAEQTVFCPTLMPKY---HDFRGKRTLTG------TPEQYQ-------TMIAMMQALP 303
Query: 85 LSFPSC---RIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123
I+ S E + + + + + + E F
Sbjct: 304 KQPVGHGIGSIYLITSWNEWWEGTTIEPDTTDGEAFLKANREAF 347
>gi|75674738|ref|YP_317159.1| hypothetical protein Nwi_0540 [Nitrobacter winogradskyi Nb-255]
gi|74419608|gb|ABA03807.1| hypothetical protein Nwi_0540 [Nitrobacter winogradskyi Nb-255]
Length = 381
Score = 41.5 bits (96), Expect = 0.26, Method: Composition-based stats.
Identities = 12/117 (10%), Positives = 25/117 (21%), Gaps = 19/117 (16%)
Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPK-----------QRITSKDVHFQELSIFESFIF 71
E GN V W P+ + + F + +
Sbjct: 261 WNELGNGHLP----VVPTVMTGWDRRPRIENPVPWEKKQRPGEGIENFFAAPTK-KELAD 315
Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126
L L + + + + E + +L R+ +
Sbjct: 316 HLARALDWVGARPQGEQAP-VVLIYAWNENDEGGWLMPTLPCQTDRLDALRQVLKKT 371
>gi|295425765|ref|ZP_06818450.1| conserved hypothetical protein [Lactobacillus amylolyticus DSM
11664]
gi|295064573|gb|EFG55496.1| conserved hypothetical protein [Lactobacillus amylolyticus DSM
11664]
Length = 433
Score = 41.1 bits (95), Expect = 0.32, Method: Composition-based stats.
Identities = 24/113 (21%), Positives = 41/113 (36%), Gaps = 16/113 (14%)
Query: 97 SRKEQKAFLRLNRFMSNSR----MPFDSEKFLYVKELF-EGWNDRPSSPKKSGLTIK--- 148
+ E+ FL + + +K + V E+ E W + +GL +K
Sbjct: 122 ALNEKGDFLLPAAGHEKGYTRPIIAAEYKKPIKVGEITMEIWPSDHDAYGATGLIVKTPD 181
Query: 149 SKIA----IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197
KI+ I +H Y+ D E + DLF+T + E+ K
Sbjct: 182 KKISFTGDIRLHGYHPDWVHEFLAA----SKGADLFITEATGSSWPERKNEKQ 230
>gi|328948788|ref|YP_004366125.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens
DSM 2489]
gi|328449112|gb|AEB14828.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens
DSM 2489]
Length = 589
Score = 41.1 bits (95), Expect = 0.35, Method: Composition-based stats.
Identities = 10/46 (21%), Positives = 15/46 (32%), Gaps = 1/46 (2%)
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
EN G D+ + L E + + IH K + W
Sbjct: 215 EN-GNDIGAMIALFEKVKDIDHPVVLHIHTLKGKGYAPAEKNKEAW 259
>gi|313123698|ref|YP_004033957.1| metallo-beta-lactamase superfamily hydrolase [Lactobacillus
delbrueckii subsp. bulgaricus ND02]
gi|312280261|gb|ADQ60980.1| Metallo-beta-lactamase superfamily hydrolase [Lactobacillus
delbrueckii subsp. bulgaricus ND02]
Length = 412
Score = 40.7 bits (94), Expect = 0.39, Method: Composition-based stats.
Identities = 24/113 (21%), Positives = 41/113 (36%), Gaps = 16/113 (14%)
Query: 97 SRKEQKAFLRLNRFMSNSR----MPFDSEKFLYVKELF-EGWNDRPSSPKKSGLTIK--- 148
+ E+ FL + + +K + V E+ E W + +GL +K
Sbjct: 101 ALNEKGDFLLPAAGHEKGYTRPIIAAEYKKPIKVGEITMEIWPSDHDAYGATGLIVKTPD 160
Query: 149 SKIA----IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197
KI+ I +H Y+ D E + DLF+T + E+ K
Sbjct: 161 KKISFTGDIRLHGYHPDWVHEFLAA----SKGADLFITEATGSSWPERKNEKQ 209
>gi|300727407|ref|ZP_07060816.1| 1-deoxy-d-xylulose-5-phosphate synthase 2 [Prevotella bryantii B14]
gi|299775287|gb|EFI71886.1| 1-deoxy-d-xylulose-5-phosphate synthase 2 [Prevotella bryantii B14]
Length = 584
Score = 40.3 bits (93), Expect = 0.46, Method: Composition-based stats.
Identities = 18/148 (12%), Positives = 40/148 (27%), Gaps = 14/148 (9%)
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E N ++ K F +YV E G +V + + + + IH +K
Sbjct: 194 ETNGQAANNLFKAFGLDYIYVEE--GNNVGKLVEAFQKVKDIDHPIVVHIHTEKGHGYAP 251
Query: 245 HPIEGIIWRRWL----FFDLLGFSDIAIR--------IINTFEQNPCLGMIGSRRYRRYK 292
W + L + +++P + I + +
Sbjct: 252 AVENKEGWHYHMPFNREDGSLKNPGNGENMTALLGQWMAEQLKKDPKMVCIAAGTAPAFY 311
Query: 293 RWSFFAKRSEVYRRVIDLAKRAGFPTKR 320
+ + + +A+ G
Sbjct: 312 FDKERREEAGKQFIDVGIAEEEGVAIAS 339
>gi|312373266|gb|EFR21041.1| hypothetical protein AND_17673 [Anopheles darlingi]
Length = 1344
Score = 40.3 bits (93), Expect = 0.53, Method: Composition-based stats.
Identities = 9/65 (13%), Positives = 21/65 (32%), Gaps = 4/65 (6%)
Query: 98 RKEQKAFLRLNRFMSNSRM----PFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAI 153
E+ +L + + + + + E P++ ++ IA+
Sbjct: 67 WIEEGGYLEKELAYTRKALGETDSSTHQLLIQTPKDMEASILHPTALLTHLDVVRKAIAV 126
Query: 154 VVHCY 158
VH Y
Sbjct: 127 TVHMY 131
>gi|297569722|ref|YP_003691066.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2]
gi|296925637|gb|ADH86447.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2]
Length = 318
Score = 40.3 bits (93), Expect = 0.57, Method: Composition-based stats.
Identities = 24/166 (14%), Positives = 46/166 (27%), Gaps = 35/166 (21%)
Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANKD----FEQDVLKYFPSAQLYVMENKG-RDVRP 215
D ++ DL+V V + + D + + ++ N+G R V P
Sbjct: 17 DYMRHTLDSMVAQTVRPDLWVIVDDGSTDQTPQILAEYAAKYDFIKIVPKANRGHRSVGP 76
Query: 216 -----FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270
F D ++Y+CK+ L +
Sbjct: 77 GVIEAFYAGYRAVRPDDFEYICKLD---------------------LDLELP-PRYFEIL 114
Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKR---SEVYRRVIDLAKR 313
+ E+NP +G + Y E + L ++
Sbjct: 115 LKRLEENPRIGTCSGKPYFLDNESGKLISEKCGDENSVGMTKLFRK 160
>gi|153811516|ref|ZP_01964184.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174]
gi|149832257|gb|EDM87342.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174]
Length = 589
Score = 39.9 bits (92), Expect = 0.61, Method: Composition-based stats.
Identities = 9/46 (19%), Positives = 16/46 (34%), Gaps = 1/46 (2%)
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
EN G D+ + L + + IH +K + + W
Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263
>gi|153814764|ref|ZP_01967432.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756]
gi|145847795|gb|EDK24713.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756]
Length = 589
Score = 39.9 bits (92), Expect = 0.61, Method: Composition-based stats.
Identities = 9/46 (19%), Positives = 16/46 (34%), Gaps = 1/46 (2%)
Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
EN G D+ + L + + IH +K + + W
Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263
>gi|255533249|ref|YP_003093621.1| hypothetical protein Phep_3365 [Pedobacter heparinus DSM 2366]
gi|255346233|gb|ACU05559.1| hypothetical protein Phep_3365 [Pedobacter heparinus DSM 2366]
Length = 348
Score = 39.9 bits (92), Expect = 0.65, Method: Composition-based stats.
Identities = 9/88 (10%), Positives = 21/88 (23%), Gaps = 12/88 (13%)
Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97
V ++ + + F + +K + S RI S
Sbjct: 268 VPCIAPGYNDKAMTPASKMYDIGYTPEFYTDF----------TNVAKRNMSSKRIVLINS 317
Query: 98 RK--EQKAFLRLNRFMSNSRMPFDSEKF 123
+ + N + ++F
Sbjct: 318 WNNFQLGTAIEPTETYGNIFLQMTRKQF 345
>gi|229112680|ref|ZP_04242216.1| Glycosytransferase [Bacillus cereus Rock1-15]
gi|228670812|gb|EEL26120.1| Glycosytransferase [Bacillus cereus Rock1-15]
Length = 355
Score = 39.5 bits (91), Expect = 0.80, Method: Composition-based stats.
Identities = 14/89 (15%), Positives = 37/89 (41%), Gaps = 6/89 (6%)
Query: 122 KFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVH-----CYYQDTWIEISHILLRLNFD 176
K +++ G R K G K K+ + +H +YQ++ +I + + +
Sbjct: 73 KIMHIHTASRGSFFRKRIFVKLGKLFKKKVVLHIHGAEFMVFYQESSEDIRNQIREILNQ 132
Query: 177 FDLFVTVVEANKDFEQDVLKYFPSAQLYV 205
D+ +T+ + K+ + + + ++
Sbjct: 133 VDVIITLSQKWKEDIESITNN-RNVKVIY 160
>gi|145591836|ref|YP_001153838.1| hypothetical protein Pars_1633 [Pyrobaculum arsenaticum DSM 13514]
gi|145283604|gb|ABP51186.1| hypothetical protein Pars_1633 [Pyrobaculum arsenaticum DSM 13514]
Length = 609
Score = 39.5 bits (91), Expect = 0.97, Method: Composition-based stats.
Identities = 8/103 (7%), Positives = 20/103 (19%), Gaps = 11/103 (10%)
Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81
E + + + T + F + R + +
Sbjct: 515 KYGEWSEATNALGVGFIPSAMPGFDD--RAIRTGHIPLPKSTERFRKQLIIARQYTNINT 572
Query: 82 YSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFL 124
I + E + + S + + L
Sbjct: 573 IL--------ITTFNEWHENTN-IEPSVKDGFSYLQVLKQVLL 606
>gi|260889525|ref|ZP_05900788.1| 1-deoxy-D-xylulose 5-phosphate synthase [Leptotrichia hofstadii
F0254]
gi|260860936|gb|EEX75436.1| 1-deoxy-D-xylulose 5-phosphate synthase [Leptotrichia hofstadii
F0254]
Length = 592
Score = 39.5 bits (91), Expect = 0.99, Method: Composition-based stats.
Identities = 13/72 (18%), Positives = 28/72 (38%), Gaps = 2/72 (2%)
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E+N + + K +YV +KG D+ + + E + + +H +K + Y
Sbjct: 203 ESNGQAQNNYFKSLGLDYIYV--DKGNDLEALIEVFEKVKDINHPIVVHVHTQKGKGLPY 260
Query: 245 HPIEGIIWRRWL 256
+ W +
Sbjct: 261 AEKDKETWHYGM 272
>gi|193216921|ref|YP_002000163.1| ribosome biogenesis GTP-binding protein YsxC [Mycoplasma
arthritidis 158L3-1]
gi|238692481|sp|B3PN57|ENGB_MYCA5 RecName: Full=Probable GTP-binding protein EngB
gi|193002244|gb|ACF07459.1| GTPase protein YihA (EngB) [Mycoplasma arthritidis 158L3-1]
Length = 183
Score = 39.2 bits (90), Expect = 1.2, Method: Composition-based stats.
Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 8/126 (6%)
Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132
L + LA K +K S R + Q+ + ++ + + + +
Sbjct: 35 LINALASQKIAKTSSTPGRTRLINYFETQRKKIIVDLP-GYGFASMSKKAQSKISGIIDF 93
Query: 133 WNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVT-VVEANKDFE 191
+ + K + I +KI Y D E+ L L FD+ +T + +AN+ +
Sbjct: 94 YFRNSKNSKNICILIDAKIGFS----YIDL--EMIDYLKSLGLLFDIIITKIDKANQSQK 147
Query: 192 QDVLKY 197
V +
Sbjct: 148 HRVKQQ 153
>gi|94310676|ref|YP_583886.1| glycosyl transferase family protein [Cupriavidus metallidurans
CH34]
gi|93354528|gb|ABF08617.1| Cellulose synthase (UDP-forming), putative glycosyl transferase
[Cupriavidus metallidurans CH34]
Length = 658
Score = 38.8 bits (89), Expect = 1.4, Method: Composition-based stats.
Identities = 32/182 (17%), Positives = 62/182 (34%), Gaps = 23/182 (12%)
Query: 176 DFDLFVTVVEA-----NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY-- 228
D+F+ K + +P+ +++V+++ RD +L V RY
Sbjct: 117 PVDIFIATYNEGLDVLEKTIVAALDIDYPNFRVWVLDDTRRD---WLREFCDQVGARYVT 173
Query: 229 --DYLCKIHGK-----KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG 281
D H K R G + L D +I +RI+ F+ +P +G
Sbjct: 174 RPDNA---HAKAGNLNNGLRHSAELDGGAPFIMVLDADFAPNRNILLRIVGLFD-DPQVG 229
Query: 282 MI-GSRRYRRYKRWSFFAKRSEVY-RRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLE 339
++ + Y + + +E + F GT F V+ + L+
Sbjct: 230 VVQTPQFYYNADPIQYNLRSTECWVDEQRAFFDVMQPSKDAWGTAFCIGTSFVVRREALD 289
Query: 340 PL 341
+
Sbjct: 290 RI 291
>gi|118400046|ref|XP_001032346.1| hypothetical protein TTHERM_00636850 [Tetrahymena thermophila]
gi|89286687|gb|EAR84683.1| hypothetical protein TTHERM_00636850 [Tetrahymena thermophila
SB210]
Length = 420
Score = 38.8 bits (89), Expect = 1.6, Method: Composition-based stats.
Identities = 10/78 (12%), Positives = 21/78 (26%), Gaps = 8/78 (10%)
Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLL 220
D +I L L + + + + ++ + Q+ P
Sbjct: 17 DLHKKIIKYLAPLPDQRAFIIYTSQEQNEDDTYIICNLLNGQIKY--------GPLFSKA 68
Query: 221 ELGVFDRYDYLCKIHGKK 238
E D + + KK
Sbjct: 69 EKIENINDDLIVFVDSKK 86
>gi|253682781|ref|ZP_04863576.1| putative formyl-CoA transferase [Clostridium botulinum D str. 1873]
gi|253560980|gb|EES90434.1| putative formyl-CoA transferase [Clostridium botulinum D str. 1873]
Length = 391
Score = 38.8 bits (89), Expect = 1.7, Method: Composition-based stats.
Identities = 22/108 (20%), Positives = 34/108 (31%), Gaps = 21/108 (19%)
Query: 236 GKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA-----IRIINTF--------EQNPCL-- 280
KKS YH EG + L+ +D+ + +F E NP L
Sbjct: 63 SKKSITINYHKSEGA----EIIKRLVKNTDMIIFNEPEEKLKSFGLGFPELKEVNPKLVY 118
Query: 281 GMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328
G++ + W + L ++ G P K F G
Sbjct: 119 GILTP--FGEEGPWKDMPDYDLIIMARTGLLEKTGMPEKPTKFGFPLG 164
>gi|321451847|gb|EFX63374.1| hypothetical protein DAPPUDRAFT_335541 [Daphnia pulex]
Length = 337
Score = 38.4 bits (88), Expect = 1.9, Method: Composition-based stats.
Identities = 35/221 (15%), Positives = 65/221 (29%), Gaps = 20/221 (9%)
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVK 127
F +R+ L L + + R F G+ E+ +L R ++ F
Sbjct: 31 QFYSGVRTALGLQSNQLLIYYTER-VFAGTANEEPNYLERGRCRKRRKLRFVVVNRRLPS 89
Query: 128 ELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-- 185
+ P + S + + Y++ RL FD+ +
Sbjct: 90 TKLWCDFVNTNYPHRLMSLNSSDSLVQMQIYFR----TCMQPFGRLLASFDIHLNGPLKL 145
Query: 186 --ANKDFEQDVLKYFPSAQLYVMENKGRDVRP--FLYLLELGVFDRYDYLCKIHGKKSQR 241
+ + A V + P F L E + C+ H KK +
Sbjct: 146 LLDRVAKREYASQEIKEAYFLVFTS-----YPSTFKPLFEKTALVEDELYCEFHDKK-RD 199
Query: 242 EGYHPIEGIIWRRWLF---FDLLGFSDIAIRIINTFEQNPC 279
+ ++ + +R LL + II F +N
Sbjct: 200 KVFNSRQAKSYRLKNLAKTERLLSTKNGVNSIIYHFAENEK 240
>gi|239616887|ref|YP_002940209.1| hypothetical protein Kole_0482 [Kosmotoga olearia TBF 19.5.1]
gi|239505718|gb|ACR79205.1| hypothetical protein Kole_0482 [Kosmotoga olearia TBF 19.5.1]
Length = 715
Score = 38.4 bits (88), Expect = 1.9, Method: Composition-based stats.
Identities = 7/88 (7%), Positives = 18/88 (20%), Gaps = 17/88 (19%)
Query: 37 HVSGYYVLWSFS-PKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95
HV + + + S D L + +
Sbjct: 384 HVMNIMPGYDDTHVRVPGFSVDREN------GKLYEELWKLV--------LNLDPDMVII 429
Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSE 121
S E + + + + + +
Sbjct: 430 TSWNEWHEGSEIEPSLEYGRKYLEITKK 457
>gi|310831260|ref|YP_003969903.1| hypothetical protein crov271 [Cafeteria roenbergensis virus BV-PW1]
gi|309386444|gb|ADO67304.1| hypothetical protein crov271 [Cafeteria roenbergensis virus BV-PW1]
Length = 821
Score = 38.4 bits (88), Expect = 2.2, Method: Composition-based stats.
Identities = 27/203 (13%), Positives = 57/203 (28%), Gaps = 47/203 (23%)
Query: 197 YFPSAQLYVMEN-KGRDVRPFLYLLE-LGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254
F + + N G D+ P + F ++Y+ K+ K I WR
Sbjct: 210 NFNNKYFVIETNEYGNDIIPTIIGFNFANTFLNFNYILKLQTK----------SDIKWRN 259
Query: 255 WLFFDLL-GFSDIAIRIIN--TFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLA 311
L L I +++ F +P + + L
Sbjct: 260 PLINFFLNKSKTDLINLLDNQEFICHPKF----------------------ISKITSTLI 297
Query: 312 KRAGFPTKR-LHLDFFNGTMFWVKPKCLEPL---------RNLHLIGEFEEERNLKDGAL 361
+ F G++++ K + + + ++ L+ +
Sbjct: 298 NKLFLQNLNWNDKSFPAGSIYFCKKHKFDNMIKFINYSSPHKYFIQTMYDTHYVLRGNSS 357
Query: 362 EHAVERFFACSVRYTEFSIESVD 384
H +ER ++ F+ S
Sbjct: 358 VHFLERLVGINLDKHIFTTSSNY 380
>gi|281492254|ref|YP_003354234.1| 1-deoxy-D-xylulose 5-phosphate synthase [Lactococcus lactis subsp.
lactis KF147]
gi|281375925|gb|ADA65419.1| 1-deoxy-D-xylulose 5-phosphate synthase [Lactococcus lactis subsp.
lactis KF147]
Length = 580
Score = 38.0 bits (87), Expect = 2.4, Method: Composition-based stats.
Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 1/57 (1%)
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260
+EN G D+ ++L E + + IH +K + + + FDL
Sbjct: 211 KYLEN-GNDIESLIHLFEEVKDINHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266
>gi|310831259|ref|YP_003969902.1| hypothetical protein crov270 [Cafeteria roenbergensis virus BV-PW1]
gi|309386443|gb|ADO67303.1| hypothetical protein crov270 [Cafeteria roenbergensis virus BV-PW1]
Length = 781
Score = 38.0 bits (87), Expect = 2.6, Method: Composition-based stats.
Identities = 28/176 (15%), Positives = 48/176 (27%), Gaps = 43/176 (24%)
Query: 208 NKGRDVRPFLYLLELGVFD-RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266
N G D+ P L + + Y+ KIH K ++ I + +L
Sbjct: 327 NIGNDLIPSLKIFNDNYSKFNFKYVLKIHTK------HNQIFNELTDFFLINY------- 373
Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
+IN E N + I +Y + K+ + +L F
Sbjct: 374 -DNLINVMEDNHQIDFITKHKYCYNIEKDCYNKKITNKIIINK------------NLFFC 420
Query: 327 NGTMFWVKPKCLEP------------LRNLHLIGEFEEERNLKDGALEHAVERFFA 370
+ F K L N + + H +ER +
Sbjct: 421 AISFFIGKKDIFIKNLNKVAFLFKPSLLNCFYYDNI----MFINNSPVHTIERVIS 472
>gi|85713618|ref|ZP_01044608.1| hypothetical protein NB311A_03739 [Nitrobacter sp. Nb-311A]
gi|85699522|gb|EAQ37389.1| hypothetical protein NB311A_03739 [Nitrobacter sp. Nb-311A]
Length = 387
Score = 38.0 bits (87), Expect = 2.9, Method: Composition-based stats.
Identities = 12/120 (10%), Positives = 24/120 (20%), Gaps = 15/120 (12%)
Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK-----------QRITSKDVHFQELSIFE 67
L E N + V W P+ + + F + E
Sbjct: 259 LARFAERGWNALSHGRLPVVPTVMTGWDRRPRIEHPVPWETKQRPGEGMENFFTAPTKKE 318
Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125
R+ + + E + +L R+ +
Sbjct: 319 LADHLARALGWVAARPPD--EQAPAVLIYAWNENDEGGWLMPTLPCQTDRLDALRQVLKK 376
>gi|85105803|ref|XP_962039.1| peroxisomal hydratase-dehydrogenase-epimerase [Neurospora crassa
OR74A]
gi|3929350|sp|Q01373|FOX2_NEUCR RecName: Full=Peroxisomal hydratase-dehydrogenase-epimerase;
Short=HDE; AltName: Full=Multifunctional beta-oxidation
protein; Short=MFP; Includes: RecName: Full=2-enoyl-CoA
hydratase; Includes: RecName:
Full=(3R)-3-hydroxyacyl-CoA dehydrogenase
gi|510867|emb|CAA56355.1| multifunctional beta-oxidation protein [Neurospora crassa]
gi|28923632|gb|EAA32803.1| peroxisomal hydratase-dehydrogenase-epimerase [Neurospora crassa
OR74A]
Length = 894
Score = 37.6 bits (86), Expect = 3.1, Method: Composition-based stats.
Identities = 21/106 (19%), Positives = 40/106 (37%), Gaps = 15/106 (14%)
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKG--RDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
E + +K F + ++ N G RD+ + + +D + K+H K S +
Sbjct: 77 ENGDKIIETAIKEFGRIDI-LINNAGILRDIS-----FKNMKDEDWDLIFKVHVKGSYKT 130
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ----NPCLGMIG 284
+R+ F ++ + A + F Q LGM+G
Sbjct: 131 ARAAWP--YFRKQKFGRVI-NTASAAGLFGNFGQANYSAAKLGMVG 173
>gi|312863645|ref|ZP_07723883.1| conserved hypothetical protein [Streptococcus vestibularis F0396]
gi|311101181|gb|EFQ59386.1| conserved hypothetical protein [Streptococcus vestibularis F0396]
Length = 262
Score = 37.6 bits (86), Expect = 3.2, Method: Composition-based stats.
Identities = 28/195 (14%), Positives = 60/195 (30%), Gaps = 23/195 (11%)
Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVM----E 207
AI VH + +L+ + ++ E ++L F + ++ +
Sbjct: 2 AIHVHISDLERLKVFFD--SKLSAFYYFTLSGHLDKNQVENNLLNSFDKDRFQIVSQKFD 59
Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267
N + YD++ H + + R L LL
Sbjct: 60 NHYHALVSL-----ASQLSEYDFIGHFHT--ADFGNEGKLVDEATRLALIDMLL-DEKKV 111
Query: 268 IRIINTFEQNPCLGMIGSRRYR-RYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326
I F P +G++ + + Y + ++ + ++ + L F
Sbjct: 112 SSIFADF---PEVGLVFADLSKELYWTDAIGTLNQNQAAKLDNECQKT----IKNSLHVF 164
Query: 327 NGTMFWVKPKCLEPL 341
G+M W+ LE +
Sbjct: 165 QGSM-WLSKDFLEKI 178
>gi|213406643|ref|XP_002174093.1| DNA polymerase epsilon catalytic subunit A [Schizosaccharomyces
japonicus yFS275]
gi|212002140|gb|EEB07800.1| DNA polymerase epsilon catalytic subunit A [Schizosaccharomyces
japonicus yFS275]
Length = 2185
Score = 37.6 bits (86), Expect = 3.6, Method: Composition-based stats.
Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 14/117 (11%)
Query: 86 SFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRP-SSPKKSG 144
+ + + E ++ ++ M + + N P S P
Sbjct: 28 AKANEEVVLENIWNEIQSKNEIDTKMGFDNIEA------GPPRIGWLLNVHPTSVPSDDN 81
Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS 200
KS IA Y+ E + + + L+V+ E + E + K FP+
Sbjct: 82 ANGKSAIA----LYFIQEDGETFR--VTVPYRPYLYVSTKEGKEAEVEDYLKKAFPN 132
>gi|73539059|ref|YP_299426.1| cellulose synthase (UDP-forming) [Ralstonia eutropha JMP134]
gi|72122396|gb|AAZ64582.1| Cellulose synthase (UDP-forming) [Ralstonia eutropha JMP134]
Length = 659
Score = 37.6 bits (86), Expect = 3.6, Method: Composition-based stats.
Identities = 30/181 (16%), Positives = 59/181 (32%), Gaps = 24/181 (13%)
Query: 178 DLFVTVVEA-----NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY---- 228
D+F+ K + +P+ +++V+++ RD +L V Y
Sbjct: 119 DIFIATYNEGLDVLEKTIVSALAIDYPNFRVWVLDDTRRD---WLKAYCARVGACYVTRP 175
Query: 229 DYLCKIHGKKS------QREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGM 282
D H K G + L D +I +RI+ F+ +P +G+
Sbjct: 176 DNA---HAKAGNLNNGLMHSAAQRGGGAPFIMVLDADFAPNRNILLRIVGLFD-DPAVGV 231
Query: 283 I-GSRRYRRYKRWSFFAKRSEVY-RRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEP 340
+ + Y + + +E + F GT F V+ + L
Sbjct: 232 VQTPQFYYNADPIQYNLRSTECWVDEQRAFFDIMQPAKDAWGTAFCIGTSFVVRREALAR 291
Query: 341 L 341
+
Sbjct: 292 I 292
>gi|228477260|ref|ZP_04061898.1| rhamnosyltransferase [Streptococcus salivarius SK126]
gi|228251279|gb|EEK10450.1| rhamnosyltransferase [Streptococcus salivarius SK126]
Length = 547
Score = 37.6 bits (86), Expect = 3.7, Method: Composition-based stats.
Identities = 31/222 (13%), Positives = 73/222 (32%), Gaps = 26/222 (11%)
Query: 78 AFSKYSKLSFPSCRIFFYG----SRKEQKAFLRLN-RFMSNSRMPFDSEKFL---YVKEL 129
FS Y +S ++ F + E+K L L+ ++ + L + +
Sbjct: 204 DFSYYRPISTLEHKVPFIKLKAFTDNEKKGRLLLDYLANLSTYPVALIKSHLNRYHSPDS 263
Query: 130 FEGWNDRPSSPKKSGLTI-KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188
+++ P L+ + ++ I VH + +L+ + ++
Sbjct: 264 LVISDEKIIGPSFITLSKHEYRMVIHVHISDLERLKVFFD--SKLSAFYYFTLSSHLDKN 321
Query: 189 DFEQDVLKYFPSAQLYVM----ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E +L F + ++ EN + + YD++ H + G
Sbjct: 322 KVENTLLNSFDKDRFQLVSKTFENHYHAL-----VFLASHLSEYDFVGHFHT---EAFGN 373
Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286
R ++L + + I + F P +G++ +
Sbjct: 374 EGKLVDEDTRHALVNMLSDEEKVVSIFDHF---PEVGLVFAD 412
>gi|294499860|ref|YP_003563560.1| hypothetical protein BMQ_3104 [Bacillus megaterium QM B1551]
gi|294349797|gb|ADE70126.1| hypothetical protein BMQ_3104 [Bacillus megaterium QM B1551]
Length = 123
Score = 37.6 bits (86), Expect = 3.8, Method: Composition-based stats.
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179
KIA ++H YY E + +L + DL
Sbjct: 43 KKIAQLIHLYYPGMLFEFATLLTNPTYRIDL 73
>gi|116512536|ref|YP_811443.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp.
cremoris SK11]
gi|116108190|gb|ABJ73330.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp.
cremoris SK11]
Length = 580
Score = 37.2 bits (85), Expect = 4.2, Method: Composition-based stats.
Identities = 17/76 (22%), Positives = 27/76 (35%), Gaps = 2/76 (2%)
Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244
E N E + K F +EN G D+ + L E + + IH +K +
Sbjct: 193 ETNGQVENNFFKTF-GLDYKYLEN-GNDIESLVNLFEEVKDIDHPIVLHIHTEKGRGYQP 250
Query: 245 HPIEGIIWRRWLFFDL 260
+ + FDL
Sbjct: 251 ALENKEAFHWHMPFDL 266
>gi|306823338|ref|ZP_07456713.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium
ATCC 27679]
gi|309802562|ref|ZP_07696666.1| putative 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium
dentium JCVIHMP022]
gi|304553045|gb|EFM40957.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium
ATCC 27679]
gi|308220626|gb|EFO76934.1| putative 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium
dentium JCVIHMP022]
Length = 648
Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats.
Identities = 7/42 (16%), Positives = 13/42 (30%)
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250
+G DV + L + + +H K +G
Sbjct: 224 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 265
>gi|283455635|ref|YP_003360199.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium
Bd1]
gi|283102269|gb|ADB09375.1| dxs 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium
dentium Bd1]
Length = 651
Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats.
Identities = 7/42 (16%), Positives = 13/42 (30%)
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250
+G DV + L + + +H K +G
Sbjct: 227 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 268
>gi|171740975|ref|ZP_02916782.1| hypothetical protein BIFDEN_00037 [Bifidobacterium dentium ATCC
27678]
gi|171276589|gb|EDT44250.1| hypothetical protein BIFDEN_00037 [Bifidobacterium dentium ATCC
27678]
Length = 648
Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats.
Identities = 7/42 (16%), Positives = 13/42 (30%)
Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250
+G DV + L + + +H K +G
Sbjct: 224 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 265
>gi|295705244|ref|YP_003598319.1| hypothetical protein BMD_3129 [Bacillus megaterium DSM 319]
gi|294802903|gb|ADF39969.1| hypothetical protein BMD_3129 [Bacillus megaterium DSM 319]
Length = 123
Score = 37.2 bits (85), Expect = 4.4, Method: Composition-based stats.
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179
KIA ++H YY E S +L + DL
Sbjct: 43 KKIAQLIHLYYPGMLFEFSTLLTNPTYRIDL 73
>gi|325062498|gb|ADY66188.1| two component sensor kinase [Agrobacterium sp. H13-3]
Length = 345
Score = 37.2 bits (85), Expect = 4.4, Method: Composition-based stats.
Identities = 19/116 (16%), Positives = 36/116 (31%), Gaps = 4/116 (3%)
Query: 116 MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNF 175
+ S + L + +S K + I ++ H + T +E + L
Sbjct: 15 LARLSSRILGRHGMEVVHAASVASGLKMFQDEQFDIVVLDHYFQTSTGMEFLAAIQSLPG 74
Query: 176 DFD-LFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDY 230
L+VT + + YV++N G D P L + +
Sbjct: 75 RVPVLYVTGSNEAQIAIDALKAGAAD---YVIKNVGDDFFPLLLTAIDQSLENHRL 127
>gi|15673655|ref|NP_267829.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp.
lactis Il1403]
gi|12724687|gb|AAK05771.1|AE006398_2 1-deoxyxylulose-5-phosphate synthase [Lactococcus lactis subsp.
lactis Il1403]
Length = 580
Score = 37.2 bits (85), Expect = 4.5, Method: Composition-based stats.
Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 1/57 (1%)
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260
+EN G D+ ++L E + + IH +K + + + FDL
Sbjct: 211 KYLEN-GNDIESLIHLFEEVKDIDHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266
>gi|312278781|gb|ADQ63438.1| Rhamnosyltransferase [Streptococcus thermophilus ND03]
Length = 547
Score = 37.2 bits (85), Expect = 4.7, Method: Composition-based stats.
Identities = 43/280 (15%), Positives = 83/280 (29%), Gaps = 36/280 (12%)
Query: 78 AFSKYSKLSFPSCRIFFYG----SRKEQKAFLRLN--RFMSNSRMPFDSEKFLYVKELFE 131
FS Y +S ++ F + E+K L L+ +S +
Sbjct: 204 DFSYYRPISTLEHKVPFIKLKAFTDNEKKGRLLLDYITKLSAYPLALIKSHLNSYHSPDS 263
Query: 132 GWNDRPSSPKKSGLTIKSK---IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE--A 186
+ S ++ K AI VH D E + + T+
Sbjct: 264 LVILDEKIIEPSFHSVSGKGYHSAIHVHI--SDL--ERLKVFSDKKLSAFYYFTLSSHLD 319
Query: 187 NKDFEQDVLKYFPSAQLYVM----ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242
E +L F + ++ +N + L YD++ H +
Sbjct: 320 KNIVENTLLNSFDKDRFQLVSQKFDNH---YYALVSLASQF--SEYDFVGHFHTE--DFG 372
Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYR-RYKRWSFFAKRS 301
R L LL + I + F P +G++ + + Y +
Sbjct: 373 NEGKFVDEATRLALVNMLL-DEERVASIFDHF---PEVGLVFADLSKELYWTDAIGTLNQ 428
Query: 302 EVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPL 341
++ + ++ + L F G+M W+ LE +
Sbjct: 429 NQAAKLDNECQKT----IKNSLHVFQGSM-WLSKDFLEKI 463
>gi|77920184|ref|YP_357999.1| hypothetical protein Pcar_2591 [Pelobacter carbinolicus DSM 2380]
gi|77546267|gb|ABA89829.1| hypothetical protein Pcar_2591 [Pelobacter carbinolicus DSM 2380]
Length = 262
Score = 37.2 bits (85), Expect = 4.9, Method: Composition-based stats.
Identities = 18/105 (17%), Positives = 35/105 (33%), Gaps = 13/105 (12%)
Query: 139 SPKKSGLTIKSKIAIVV--HCYYQDTWIEISHILLRLNFD-----FDLFVTVVEANKDFE 191
SP + L +IA+V+ D + ++ +N +DLF+
Sbjct: 5 SPSTNCLPANGRIAVVISTWIGNPD--DYLLRLMDSMNTHSAGMDYDLFLCANGETYKLP 62
Query: 192 QDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHG 236
++ F + EN G ++ + Y Y Y +
Sbjct: 63 ANLQASFKKIFIR--ENSGFNLGAWDYAWRR--LSNYRYFLFLQD 103
>gi|146099084|ref|XP_001468551.1| ATP-dependent RNA helicase [Leishmania infantum]
gi|134072919|emb|CAM71636.1| putative ATP-dependent RNA helicase [Leishmania infantum JPCM5]
Length = 803
Score = 36.9 bits (84), Expect = 5.9, Method: Composition-based stats.
Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%)
Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203
YY D I L DL T + E + E D LK +
Sbjct: 381 YYIDLLQFIGRPLQSAPVPGDLLFTPDNGCYGRLPEEDIQLELDFLKRLHENDVEVRSMA 440
Query: 204 YVMEN 208
V+EN
Sbjct: 441 RVVEN 445
>gi|322502575|emb|CBZ37658.1| unnamed protein product [Leishmania donovani BPK282A1]
Length = 803
Score = 36.9 bits (84), Expect = 6.0, Method: Composition-based stats.
Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%)
Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203
YY D I L DL T + E + E D LK +
Sbjct: 381 YYIDLLQFIGRPLQSAPVPGDLLFTPDNGCYGRLPEEDIQLELDFLKRLHENDVEVRSMA 440
Query: 204 YVMEN 208
V+EN
Sbjct: 441 RVVEN 445
>gi|1764094|gb|AAB39865.1| ATP-dependent RNA helicase [Leishmania amazonensis]
Length = 855
Score = 36.9 bits (84), Expect = 6.0, Method: Composition-based stats.
Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%)
Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203
YY D I L DL T + E + E D LK +
Sbjct: 384 YYVDLMQFIGRPLQSSPVPGDLLFTADDGCYGRLPEEDIQLELDFLKRLHENDVEVRNMA 443
Query: 204 YVMEN 208
V+EN
Sbjct: 444 RVVEN 448
>gi|313905641|ref|ZP_07839002.1| 1-deoxy-D-xylulose-5-phosphate synthase [Eubacterium cellulosolvens
6]
gi|313469465|gb|EFR64806.1| 1-deoxy-D-xylulose-5-phosphate synthase [Eubacterium cellulosolvens
6]
Length = 588
Score = 36.5 bits (83), Expect = 8.1, Method: Composition-based stats.
Identities = 6/43 (13%), Positives = 15/43 (34%)
Query: 210 GRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252
G D+ ++ L + + +H +K + + W
Sbjct: 219 GNDIPMLIHALREVKDVDHPIVLHVHTQKGKGYKPAEEDRESW 261
>gi|18976927|ref|NP_578284.1| hypothetical protein PF0555 [Pyrococcus furiosus DSM 3638]
gi|18892545|gb|AAL80679.1| hypothetical protein PF0555 [Pyrococcus furiosus DSM 3638]
Length = 257
Score = 36.5 bits (83), Expect = 8.4, Method: Composition-based stats.
Identities = 7/96 (7%), Positives = 25/96 (26%), Gaps = 12/96 (12%)
Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94
+ + + + + ++ F + L +C+
Sbjct: 171 KCFIPTVSPGFDRTFDKSFNQQFPIPRDPKRFAEMLKIALDSLG----------NCKEIR 220
Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128
+ + + F+ + + + E +KE
Sbjct: 221 IDTWNDFYEGTFIEPSVSDGFTYLEVLEEFIKELKE 256
>gi|16519726|ref|NP_443846.1| probable methyl-accepting membrane chemoreceptor [Sinorhizobium
fredii NGR234]
gi|2497833|sp|P55439|Y4FA_RHISN RecName: Full=Probable chemoreceptor y4fA; AltName:
Full=Methyl-accepting chemotaxis protein
gi|2182384|gb|AAB91658.1| probable methyl-accepting membrane chemoreceptor [Sinorhizobium
fredii NGR234]
Length = 845
Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats.
Identities = 31/235 (13%), Positives = 62/235 (26%), Gaps = 39/235 (16%)
Query: 61 QELSIFESFIFWLRSFLAFSKYSKL-----SFPSCRIFFYGSRKEQKAFLRLNRFMSNSR 115
L F+ + FL + R + A L +
Sbjct: 54 ASLRGFKDVYAAMIGFLDQTTEENRALVFSKLDEQRAALDAA----GARLTPKAE-GWAE 108
Query: 116 MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNF 175
+ S + + + + IK + ++ ++
Sbjct: 109 LESASAALSAINGRMDDLWALHADEARLEAGIKEALGVIS----TSQADLLTAATA---- 160
Query: 176 DFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYL---- 231
FD ++ E + + + SA +V+ RD F + Y +
Sbjct: 161 -FDKSISTQEDDAKEKLRDAQRILSATSFVV--ALRD--AF--AARKDDGEGYRAIAAAM 213
Query: 232 --CKIHGK--------KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276
KIH K KS+ G + L D + +I++ F +
Sbjct: 214 GDLKIHQKLLPIALPKKSKPLGKAFSANVRALSALVDDKARPPENIEKILDVFAE 268
>gi|308479828|ref|XP_003102122.1| hypothetical protein CRE_06803 [Caenorhabditis remanei]
gi|308262277|gb|EFP06230.1| hypothetical protein CRE_06803 [Caenorhabditis remanei]
Length = 1266
Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats.
Identities = 17/93 (18%), Positives = 33/93 (35%), Gaps = 4/93 (4%)
Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP 199
K A V H +Y +T EI +L ++ DL+ ++ ++ V P
Sbjct: 908 GMAQEFMAKETKAYVCHVHYAETLDEIYSMLK-MSCPEDLYNCTLDQMENVLITVTALRP 966
Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLC 232
L +++ + F + +YD
Sbjct: 967 DITLDQLKSI---LFKFAQRYKHLKETKYDLFG 996
>gi|301067040|ref|YP_003789063.1| glycosyl transferase, family 2 [Lactobacillus casei str. Zhang]
gi|300439447|gb|ADK19213.1| Glycosyl transferase, family 2 [Lactobacillus casei str. Zhang]
Length = 313
Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats.
Identities = 21/93 (22%), Positives = 37/93 (39%), Gaps = 14/93 (15%)
Query: 150 KIAIVVHCY----YQDTWIEISHILLRLNFDFDLFV---TVVEANKDFEQDVLKYFPSAQ 202
KIAI++ + Y ++ IL + + DLF+ + +++ + + P
Sbjct: 2 KIAILLSVFNGELY--LGKQVKSILEQKDVKLDLFIRDDGSTDGSRELVESIAATDPRVH 59
Query: 203 LYVMENKG--RDVRPFLYLLELGVFDRYDYLCK 233
L + N G R FL L+ YDY
Sbjct: 60 LIIGHNVGYKR---SFLELVNEPSMSDYDYFAF 89
>gi|291518896|emb|CBK74117.1| Domain of unknown function (DUF1975) [Butyrivibrio fibrisolvens
16/4]
Length = 320
Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats.
Identities = 7/68 (10%), Positives = 25/68 (36%), Gaps = 13/68 (19%)
Query: 150 KIAIVVHC-YYQD--------TWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS 200
++ +V+H ++ + W ++ D ++T + ++ + + +
Sbjct: 234 RVGVVIHADHFSEGATDDDNILWNNFYEYTFSMHRHIDFYITATDDQRNLLIEQFEKY-- 291
Query: 201 AQLYVMEN 208
+ V N
Sbjct: 292 --VGVTPN 297
>gi|125623603|ref|YP_001032086.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp.
cremoris MG1363]
gi|124492411|emb|CAL97353.1| 1-deoxyxylulose-5-phosphate synthase [Lactococcus lactis subsp.
cremoris MG1363]
gi|300070369|gb|ADJ59769.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp.
cremoris NZ9000]
Length = 580
Score = 36.1 bits (82), Expect = 9.1, Method: Composition-based stats.
Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 1/57 (1%)
Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260
+EN G D+ + L E + + IH +K + + + FDL
Sbjct: 211 KYLEN-GNDIESLVNLFEEVKDIDHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266
>gi|89073969|ref|ZP_01160475.1| hypothetical protein SKA34_15405 [Photobacterium sp. SKA34]
gi|89050297|gb|EAR55801.1| hypothetical protein SKA34_15405 [Photobacterium sp. SKA34]
Length = 579
Score = 36.1 bits (82), Expect = 10.0, Method: Composition-based stats.
Identities = 14/82 (17%), Positives = 27/82 (32%)
Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73
+ N +++ D+ K Y+ H W + K I S H + F +
Sbjct: 392 VPNSIMKKDIMPKVRRNRNYLANHNYEVIKGWDNAQKIAINSIPYHLLSPNRFPYRLRQK 451
Query: 74 RSFLAFSKYSKLSFPSCRIFFY 95
K +FP+ + +
Sbjct: 452 PGKRNALGLYKFNFPNNQAIYL 473
Database: nr
Posted date: May 13, 2011 4:10 AM
Number of letters in database: 999,999,932
Number of sequences in database: 2,987,209
Database: /data/usr2/db/fasta/nr.01
Posted date: May 13, 2011 4:17 AM
Number of letters in database: 999,998,956
Number of sequences in database: 2,896,973
Database: /data/usr2/db/fasta/nr.02
Posted date: May 13, 2011 4:23 AM
Number of letters in database: 999,999,979
Number of sequences in database: 2,907,862
Database: /data/usr2/db/fasta/nr.03
Posted date: May 13, 2011 4:29 AM
Number of letters in database: 999,999,513
Number of sequences in database: 2,932,190
Database: /data/usr2/db/fasta/nr.04
Posted date: May 13, 2011 4:33 AM
Number of letters in database: 792,586,372
Number of sequences in database: 2,260,650
Lambda K H
0.312 0.145 0.443
Lambda K H
0.267 0.0443 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,672,879,989
Number of Sequences: 13984884
Number of extensions: 148137242
Number of successful extensions: 589719
Number of sequences better than 10.0: 468
Number of HSP's better than 10.0 without gapping: 340
Number of HSP's successfully gapped in prelim test: 128
Number of HSP's that attempted gapping in prelim test: 587837
Number of HSP's gapped (non-prelim): 604
length of query: 394
length of database: 4,792,584,752
effective HSP length: 141
effective length of query: 253
effective length of database: 2,820,716,108
effective search space: 713641175324
effective search space used: 713641175324
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (20.8 bits)
S2: 82 (36.1 bits)