BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62]
         (394 letters)

Database: nr
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done

>gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62]
 gi|254039878|gb|ACT56674.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62]
 Length = 394

Score = 434 bits (1116), Expect = e-119, Method: Composition-based stats.
Identities = 394/394 (100%), Positives = 394/394 (100%)

Query: 1   MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60
           MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF
Sbjct: 1   MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60

Query: 61  QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS 120
           QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS
Sbjct: 61  QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDS 120

Query: 121 EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF 180
           EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF
Sbjct: 121 EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLF 180

Query: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240
           VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ
Sbjct: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240

Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300
           REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR
Sbjct: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300

Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360
           SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA
Sbjct: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360

Query: 361 LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH 394
           LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH
Sbjct: 361 LEHAVERFFACSVRYTEFSIESVDCVAEYERLLH 394

>gi|315122628|ref|YP_004063117.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1]
 gi|313496030|gb|ADR52629.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1]
 Length = 399

Score = 368 bits (946), Expect = e-100, Method: Composition-based stats.
Identities = 287/390 (73%), Positives = 335/390 (85%) Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62 K+FRLK K +E L+ RLDVE KG++ +YIPA++SGYY+LWS S +Q+ITS+DV F+E Sbjct: 8 KIFRLKIKSETLEKLVFRLDVENKGSVNTLYIPANISGYYMLWSLSKEQKITSEDVFFEE 67 Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEK 122 ++ F++ +FWLRSFL FSKYS+LSFPSCRIFFYGSRK++KAF RLNRFMSNSRMPFD +K Sbjct: 68 VTTFKACLFWLRSFLTFSKYSQLSFPSCRIFFYGSRKDKKAFFRLNRFMSNSRMPFDGKK 127 Query: 123 FLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVT 182 FLY+KELFEGW + S K + I SKIAIVVHCYYQDTW EISH+LLRLNFDFDLF+T Sbjct: 128 FLYIKELFEGWKNLSSLDNKGKIKINSKIAIVVHCYYQDTWDEISHLLLRLNFDFDLFIT 187 Query: 183 VVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 V+ NKDFEQDVLK FPSA+LYVMENKGRDV PFL LLELGVF YDYLCKIHGKKS R Sbjct: 188 TVKKNKDFEQDVLKNFPSARLYVMENKGRDVLPFLCLLELGVFYDYDYLCKIHGKKSARR 247 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE 302 YHP EGI+WRRW+FFDLLGFSDIA+RIIN FEQNP +GMIGS R+RRYK++SFF KRS+ Sbjct: 248 NYHPFEGILWRRWIFFDLLGFSDIALRIINKFEQNPSIGMIGSGRFRRYKKYSFFKKRSK 307 Query: 303 VYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALE 362 VY+RV+DLA+R FP + L LDFFNGTMFWV+PKCLEPLRN+HL GEFEEE NL+DGALE Sbjct: 308 VYKRVVDLARRIDFPVEELDLDFFNGTMFWVRPKCLEPLRNIHLTGEFEEECNLEDGALE 367 Query: 363 HAVERFFACSVRYTEFSIESVDCVAEYERL 392 HAVERFF SV+ FS+ESVDCVAEY++L Sbjct: 368 HAVERFFPLSVQRAGFSLESVDCVAEYDQL 397 >gi|254780923|ref|YP_003065336.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] gi|254040600|gb|ACT57396.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] Length = 365 Score = 351 bits (902), Expect = 9e-95, Method: Composition-based stats. 
Identities = 141/327 (43%), Positives = 198/327 (60%), Gaps = 4/327 (1%) Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 F FW + L + + KL + + YGSR +K F + N +M + FD ++ + + Sbjct: 38 FFFWFWT-LFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQ 96 Query: 129 LFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188 L GW + P+ K + IK+KIAIVVH YY D WIEI+++L L+ FDL VT+V + Sbjct: 97 LLHGW-ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 Query: 189 DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIE 248 + ++LK FP+A++++MEN GRDV PFL LLE YDY+CKIHGKKS+R+GY E Sbjct: 156 SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE 215 Query: 249 GIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV- 307 G +WRRWLF+DLLG + +II TF+ + +GMIGSR YR ++ + R + Sbjct: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 Query: 308 IDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEE-RNLKDGALEHAVE 366 LA R G + LDFF GTMFWV+ + L+P++NL L FE + DG +EHAVE Sbjct: 276 CTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335 Query: 367 RFFACSVRYTEFSIESVDCVAEYERLL 393 R F+ SV+ F I VDC+ Y + L Sbjct: 336 RCFSLSVKKANFRISDVDCILGYRKSL 362 >gi|77747764|ref|NP_636021.2| hypothetical protein XCC0629 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761299|ref|YP_244667.2| hypothetical protein XC_3605 [Xanthomonas campestris pv. campestris str. 8004] Length = 546 Score = 337 bits (865), Expect = 2e-90, Method: Composition-based stats. 
Identities = 73/378 (19%), Positives = 139/378 (36%), Gaps = 35/378 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ P G W P++ + + ++ Sbjct: 187 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 237 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + + R+ F + E + A L + + + + + ++ Sbjct: 238 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 296 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 PS+ +V+H +Y D E+ ++ + +T + + + Sbjct: 297 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 344 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR Sbjct: 345 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 400 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LLG I+N F +P G+ + + L R G Sbjct: 401 MLTALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 455 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 + F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V Sbjct: 456 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 515 Query: 375 YTEFSIESVDCVAEYERL 392 ++ + +V+ + Sbjct: 516 HSGHRVTTVEQTLGITKT 533 >gi|188993121|ref|YP_001905131.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris str. B100] gi|189030067|sp|B0RVK2|WXCX_XANCB RecName: Full=Uncharacterized protein wxcX gi|167734881|emb|CAP53093.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris] Length = 695 Score = 335 bits (860), Expect = 6e-90, Method: Composition-based stats. 
Identities = 73/378 (19%), Positives = 140/378 (37%), Gaps = 35/378 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ P G W P++ + + ++ Sbjct: 336 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + + R+ F + E + A L + + + + + ++ Sbjct: 387 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 445 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 PS+ +V+H +Y D E+ ++ + +T + + + Sbjct: 446 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR Sbjct: 494 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LLG I+N F +P +G+ + + L R G Sbjct: 550 MLTALLG-PQRVDAIVNAFSTDPLVGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 + F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V Sbjct: 605 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 664 Query: 375 YTEFSIESVDCVAEYERL 392 ++ + +V+ + Sbjct: 665 HSGHRVTTVEQTLGITKT 682 >gi|122879048|ref|YP_199439.6| hypothetical protein XOO0800 [Xanthomonas oryzae pv. oryzae KACC10331] Length = 546 Score = 335 bits (860), Expect = 7e-90, Method: Composition-based stats. 
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ L Sbjct: 187 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 237 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 K + P+ R+ F + E + A L + + + + + L+ E+ Sbjct: 238 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 297 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + +V+H +Y D E L L VT Q + Sbjct: 298 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 344 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 345 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 400 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LL I+ F ++P LG++ ++ + L R G Sbjct: 401 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 455 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V Sbjct: 456 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 515 Query: 375 YTEFSIESVDCVAE 388 ++ + +++ + Sbjct: 516 HSGQRVATIEQLLG 529 >gi|189030068|sp|P0C7J1|WXCX_XANCP RecName: Full=Uncharacterized protein wxcX Length = 695 Score = 334 bits (858), Expect = 1e-89, Method: Composition-based stats. 
Identities = 73/378 (19%), Positives = 139/378 (36%), Gaps = 35/378 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ P G W P++ + + ++ Sbjct: 336 LARDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ART 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + + R+ F + E + A L + + + + + ++ Sbjct: 387 VQHRLANAPSAHRMVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICS 445 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 PS+ +V+H +Y D E+ ++ + +T + + + Sbjct: 446 PSA------------CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR Sbjct: 494 RRGIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGE 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LLG I+N F +P G+ + + L R G Sbjct: 550 MLTALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 + F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF +V Sbjct: 605 SDAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVT 664 Query: 375 YTEFSIESVDCVAEYERL 392 ++ + +V+ + Sbjct: 665 HSGHRVTTVEQTLGITKT 682 >gi|295687882|ref|YP_003591575.1| rhamnan synthesis protein F [Caulobacter segnis ATCC 21756] gi|295429785|gb|ADG08957.1| Rhamnan synthesis F [Caulobacter segnis ATCC 21756] Length = 818 Score = 334 bits (856), Expect = 2e-89, Method: Composition-based stats. 
Identities = 89/382 (23%), Positives = 146/382 (38%), Gaps = 32/382 (8%) Query: 8 KSKLGKIENL--LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 ++ GK+ + ++R + E + A Y+P + G W ++ H + Sbjct: 454 ENFTGKVYDYPAVVRHKLSELSRVDAAYVPGVMPG----WDNQARKPWAGHAFHNADP-- 507 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ES++ WL L + + F + E + A+L +R+ + + Sbjct: 508 -ESYLTWLSGAL--THAVARHPKGEAMVFVNAWNEWGEGAYLEPDRWFGHGYLHATRAAL 564 Query: 124 -LYVKELFEGWNDRPSSPKKSGLTIKSKIAI-VVHCYYQDTWIEISHILLRLNFDFDLFV 181 Y L + P + +K A+ ++H +Y + + L DL + Sbjct: 565 SAYQPRLTDA---HPLVAQAQAAFVKRADAVTLLHLFYPELIDWFAERLAATADVLDLMI 621 Query: 182 TVVEANKDF-EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240 TV E + FP A L + EN+GRD+RPF+ L Y CK+H K+S Sbjct: 622 TVPETWSEADLARARATFPMAHLAIAENRGRDIRPFVETLRRARTLGYSVFCKLHSKRSP 681 Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRR--YRRYKRWSFFA 298 H +G WR L LLG A+ + Q+ LG++ + R Sbjct: 682 ----HRAKGDEWRAELVDGLLGGEAAALALRAF-AQDAKLGLLAAAGSRLRIGDPDVMNN 736 Query: 299 KRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLK 357 R + R LA+R G F G+MFW + + PL +L +F E Sbjct: 737 NRQDADR----LARRMGLKLAPET-PFSAGSMFWGRTEAFAPLSDLTDAEIDFGPELGRV 791 Query: 358 DGALEHAVERFFACSVRYTEFS 379 DG HA+ER A V + Sbjct: 792 DGTTAHAIERLTAAIVARAGYR 813 >gi|166713445|ref|ZP_02244652.1| hypothetical protein Xoryp_18900 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 695 Score = 333 bits (855), Expect = 2e-89, Method: Composition-based stats. 
Identities = 84/377 (22%), Positives = 145/377 (38%), Gaps = 35/377 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ L Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----ILT 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 K + P+ R+ F + E + A L + + + + + L+ E+ Sbjct: 387 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 446 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + +V+H +Y D E L L VT Q + Sbjct: 447 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 494 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LL I+ F ++P LG++ ++ + L R G Sbjct: 550 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFIG----GNADALDYLTVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664 Query: 375 YTEFSIESVDCVAEYER 391 ++ + +++ + + Sbjct: 665 HSGQRVATIEQLLGIPK 681 >gi|84622385|ref|YP_449757.1| hypothetical protein XOO_0728 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188578640|ref|YP_001915569.1| hypothetical protein PXO_03177 [Xanthomonas oryzae pv. oryzae PXO99A] gi|84366325|dbj|BAE67483.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188523092|gb|ACD61037.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 695 Score = 333 bits (854), Expect = 3e-89, Method: Composition-based stats. 
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ L Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 K + P+ R+ F + E + A L + + + + + L+ E+ Sbjct: 387 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 446 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + +V+H +Y D E L L VT Q + Sbjct: 447 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 494 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LL I+ F ++P LG++ ++ + L R G Sbjct: 550 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664 Query: 375 YTEFSIESVDCVAE 388 ++ + +++ + Sbjct: 665 HSGQRVATIEQLLG 678 >gi|58425017|gb|AAW74054.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 727 Score = 333 bits (854), Expect = 3e-89, Method: Composition-based stats. 
Identities = 84/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ L Sbjct: 368 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MLT 418 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 K + P+ R+ F + E + A L + + + + + L+ E+ Sbjct: 419 VRNRLKNTTPAHRLVFINAWNEWAEGAVLEPDTRVGYAWLDATRQALLHTAEVVTRSGQH 478 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + +V+H +Y D E L L VT Q + Sbjct: 479 DA-------------CVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQ 525 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 526 QRGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRD 581 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LL I+ F ++P LG++ ++ + L R G Sbjct: 582 MLSGLLA-PQHVAAIVRGFAEDPLLGLVAPAQHLLPVTDFMG----GNADALDYLTVRTG 636 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + HL EFE E+ DG L HA+ERF A +V Sbjct: 637 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 696 Query: 375 YTEFSIESVDCVAE 388 ++ + +++ + Sbjct: 697 HSGQRVATIEQLLG 710 >gi|77748730|ref|NP_643883.2| hypothetical protein XAC3576 [Xanthomonas axonopodis pv. citri str. 306] Length = 546 Score = 332 bits (851), Expect = 7e-89, Method: Composition-based stats. 
Identities = 85/374 (22%), Positives = 144/374 (38%), Gaps = 35/374 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ Sbjct: 187 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MRT 237 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + P+ R+ F + E + A L + + + + + L+ G + R Sbjct: 238 VRDRLTNTPPAHRLVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSDLR 297 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + +V+H +Y D E + L VT + Q + Sbjct: 298 DA-------------CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQ 344 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 345 QRGVQAQVDGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDAWRRE 400 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 +F LL A I+ F +P LG+ ++ + LA R G Sbjct: 401 MFSALL-TPQHADAIMRGFTDDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTG 455 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + +L EFE E+ DG L HA+ERF A +V Sbjct: 456 TDAIDEHSVFASGSMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVS 515 Query: 375 YTEFSIESVDCVAE 388 + + ++D + Sbjct: 516 HCGHHVATIDQLLG 529 >gi|325928558|ref|ZP_08189746.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] gi|325541097|gb|EGD12651.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] Length = 695 Score = 328 bits (842), Expect = 6e-88, Method: Composition-based stats. 
Identities = 83/374 (22%), Positives = 145/374 (38%), Gaps = 35/374 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + ++ Sbjct: 336 LARDMEQRPLREYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRDWL-----MRT 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + + P+ R+ F + E + A L + + + + + L+ G + Sbjct: 387 VRDRLRNTPPAHRLVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSD-- 444 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 + + +V+H +Y D E + L +T + Q + Sbjct: 445 -----------QRDVCVVLHAWYLDVLDEALEAIAHCGLSLRLVITTDITMVEQVRQRLQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ+ EN+GRD+ PFL + + + + K+H KKS H +G WRR Sbjct: 494 QRGVQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDTWRRE 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 +F LL I+ F +P LG+ ++ + LA R G Sbjct: 550 MFSALLA-PQHVDAIMRGFADDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 H F +G+MFWVK + L PL + HL EFE+E+ DG L HA+ERF A +V Sbjct: 605 TDAINEHSMFASGSMFWVKLEALRPLLDAHLHPSEFEDEQGQIDGTLAHAIERFLAVAVG 664 Query: 375 YTEFSIESVDCVAE 388 + + +V+ + Sbjct: 665 HCGHHVATVEQLLG 678 >gi|325921211|ref|ZP_08183074.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] gi|325548310|gb|EGD19301.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] Length = 706 Score = 321 bits (824), Expect = 1e-85, Method: Composition-based stats. 
Identities = 79/369 (21%), Positives = 139/369 (37%), Gaps = 35/369 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ P G W P++ + + WL Sbjct: 329 LASDMEQRPLRDYTLYPGVNPG----WDNEPRRSGKGRIYLHASPRRYRD---WLSRT-- 379 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + + P+ R+ F + E + A L + + ++ + E + ++ Sbjct: 380 VQQRLANALPAHRMVFINAWNEWAEGAVLEPDARLGHAWLEATREALIGPSKVVSELAPH 439 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 ++ +V+H +Y D E+ + L +T + V Sbjct: 440 -------------RVCVVLHAWYLDVLDEMLDAVAHCAISPRLVITTDLTMVVEVRHRVQ 486 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + A++ EN+GRD+ PFL++ + + + K+H KKS H +G WR Sbjct: 487 QRGMQAEVEGFENRGRDILPFLHVANRLLDEGVCLVVKLHTKKST----HRSDGDTWRHE 542 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LL + A I+N F +P LG+ + + L R G Sbjct: 543 MLSALLA-PERADAIVNAFSSDPLLGLAAPDGHLLPVADFIG----GNTDALDYLGARTG 597 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVR 374 T F +G+MFW + + L PL + HL EFE E+ DG L HA+ERF S Sbjct: 598 TETAIEQGMFASGSMFWARLEALRPLLDAHLHPSEFETEQGQIDGTLAHAIERFMGISAI 657 Query: 375 YTEFSIESV 383 + + I ++ Sbjct: 658 QSGYRIATI 666 >gi|134297301|ref|YP_001121036.1| lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] gi|134140458|gb|ABO56201.1| Lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] Length = 1231 Score = 321 bits (823), Expect = 1e-85, Method: Composition-based stats. 
Identities = 77/382 (20%), Positives = 149/382 (39%), Gaps = 24/382 (6%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 G + + + K + + W ++ ++S Sbjct: 864 FSGHVYDYNEYAENATKVIADKKHT---FPCVMMNWDNEARKPGKGHIFLGASPESYKS- 919 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 WLR F + S R+ F + E + +L +R + + ++ + Sbjct: 920 --WLRRCFDFVLSNNKQ--SERLVFINAWNEWAEGTYLEPDRRYGYAYLHATADLL---R 972 Query: 128 ELFEGWNDRPSSPKKSGLTIKS-KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA 186 + + + S + +K + A+V H YY D E+ ++ R + D F+T+ Sbjct: 973 QYYNSEDLDESIKINNQRFVKKNENALVAHLYYFDLLPELLSLIERN-VNLDAFITIPVH 1031 Query: 187 -NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH 245 +++ ++L + + ++N+GRD+ PFL + + Y L K+H KKS + Sbjct: 1032 FSREQVGEILASLDNVYVLRVQNRGRDILPFLNIYPIIKSYSYANLVKVHSKKSPQ---- 1087 Query: 246 PIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR 305 +G + R+ +LL I ++ +P +G+I S + + Sbjct: 1088 RADGALLRKRALLELL-DPSIVPGVLRALNTDPKIGLIAPSNSLCSLSNSDYLIN--NRK 1144 Query: 306 RVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHA 364 ++ R G L+ +F G+MFW + L L +L L +FEEE DG L HA Sbjct: 1145 QLNYCLSRLGLVDSSLNFEFIAGSMFWARVDALRMLSDLSLREEDFEEELGQLDGTLAHA 1204 Query: 365 VERFFACSVRYTEFSIESVDCV 386 +ER F ++ + VD + Sbjct: 1205 IERLFCFLGKHVGYRTLPVDQI 1226 >gi|16124886|ref|NP_419450.1| hypothetical protein CC_0633 [Caulobacter crescentus CB15] gi|221233606|ref|YP_002516042.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] gi|13421844|gb|AAK22618.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|51039815|tpg|DAA00361.1| TPA_exp: conserved hypothetical protein [Caulobacter vibrioides] gi|220962778|gb|ACL94134.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] Length = 818 Score = 321 bits (822), Expect = 2e-85, Method: Composition-based stats. 
Identities = 87/380 (22%), Positives = 142/380 (37%), Gaps = 32/380 (8%) Query: 10 KLGKIENL--LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 GK+ + + R ++E + A ++P + G W ++ H + E Sbjct: 456 FTGKVYDYPAVARHKLDELEQVPAAFVPGVMPG----WDNQARKPWAGVAFHNADP---E 508 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF-L 124 S+ WL L + F + E + A+L +R+ + + Sbjct: 509 SYFGWLSGAL--KHAEARHPKGEALVFVNAWNEWGEGAYLEPDRWFGHGYLHATRTALSA 566 Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAI-VVHCYYQDTWIEISHILLRLNFDFDLFVTV 183 ++ L P + K A+ ++H +Y + + L DL +TV Sbjct: 567 WLPRLTNA---HPIIAEAQSQFAKRADAVTLLHLFYPELIDWFAERLAATADVLDLMITV 623 Query: 184 VEANKDF-EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 E + FP+A L + EN+GRD+RPF+ L Y CK+H K+S Sbjct: 624 PETWSEADLARARAAFPTAHLAIAENRGRDIRPFVETLRRARALGYSVFCKLHSKRSP-- 681 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRR--YKRWSFFAKR 300 H +G WR L LLG A+ + Q+P LG++ + R R Sbjct: 682 --HQAKGDQWRTTLVEGLLGGEAAALALRAF-AQDPKLGLLAAAGARMRIGDPDVMDNNR 738 Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDG 359 +E R L+ G + F G+MFW + + PL +L F E DG Sbjct: 739 AEADR----LSAHMGLKPRPET-PFAAGSMFWGRTEAFAPLTDLSDDEIAFGPELGRVDG 793 Query: 360 ALEHAVERFFACSVRYTEFS 379 HA+ER A V + Sbjct: 794 TTAHAIERLTAAIVERAGYR 813 >gi|325915787|ref|ZP_08178089.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] gi|325538051|gb|EGD09745.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] Length = 695 Score = 315 bits (807), Expect = 9e-84, Method: Composition-based stats. 
Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 35/377 (9%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 L D+E++ + P G W P++ + + WL + Sbjct: 336 LASDIEQRPLREYTLYPGVNPG----WDNEPRRSGKGRVYLHASPRRYRD---WLSTT-- 386 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDR 136 + R+ F + E + A L + + ++ + + + Sbjct: 387 VHHRLAHVPTAHRLVFINAWNEWAEGAVLEPDMRLGHAWLDATRQAMTR--------SAH 438 Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVL 195 ++ + +VVH +Y D EI L L VT + Sbjct: 439 DVPAPRT-----YRACVVVHAWYLDVLDEILDALAPSVAMLRLIVTTDLTLVGQVRGRLQ 493 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 ++ A++ EN+GRD+ PFL++ + + + K+H KKS H +G WRR Sbjct: 494 QHGIEAEVEGFENRGRDILPFLHIANRLLDEGEQLVVKLHTKKST----HRHDGDAWRRE 549 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + LLG I+N F +P LG+ ++ + LA R G Sbjct: 550 MLAALLGG-GRVDAIVNAFVADPQLGLAAPAQHLLAVTDFIG----GNADALDYLAVRTG 604 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVR 374 T H F +G+MFW K L PL + HL +FE E+ DG L HA+ERF +V Sbjct: 605 TGTVTEHDRFASGSMFWAKLDALRPLLDAHLQPGDFEGEQGQIDGTLAHAIERFLGHAVL 664 Query: 375 YTEFSIESVDCVAEYER 391 ++ I ++D + Sbjct: 665 HSGHRIATIDGLMGQRE 681 >gi|291520004|emb|CBK75225.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 984 Score = 310 bits (794), Expect = 3e-82, Method: Composition-based stats. 
Identities = 67/393 (17%), Positives = 129/393 (32%), Gaps = 22/393 (5%) Query: 1 MYKVFRLKSKLGKIENLLLRLDVEEKG---NMQAIYIPAHVSGYYVLWSFSPKQRITSKD 57 +Y+V + G + + E + + Q+ Y V L+ + + TS D Sbjct: 169 LYRVVKFSELPGNLVEISDEEKAEYQKMENHFQSNYCFKDVKNLKELFDHAESRSKTSAD 228 Query: 58 VHFQELSIFESFIFWLRSFLAFS-KYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRM 116 + L + + + R+ + + + + S + Sbjct: 229 FAIASRDYQIKQLQELIAAKDVHIRNIEAVNEQLRVIYDNTVNTKGYKALESIRAFKSFL 288 Query: 117 PF------DSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHIL 170 ++++ ++ + + + +A+ +H +Y D E Sbjct: 289 TGKPSPAREAKRLEKEEKKARKAAAKEAKKAAAKGEEAPSVAVHLHLFYVDLLPEFVSYF 348 Query: 171 LRLNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFD 226 + F FDL+++ E LK + + N+GRD+ P Sbjct: 349 ANIPFRFDLYISCQEGADVSVIKSGVKELKMANKVVIRPLPNRGRDLAPLYVGFADE-IR 407 Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286 ++DY +H KKS G E WR++ LLG + I N F +N G++ Sbjct: 408 QHDYFLHVHSKKSLYSG---AEKGGWRQFSLELLLGSPEKVNSIFNLF-KNKNAGLVYPD 463 Query: 287 RYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLH- 345 + L ++ G+ FW + L P+ N + Sbjct: 464 IHEEVP--MIAYSWLANAGLGRKLFDEFELGEMPTVFNYPAGSFFWARTDALMPIFNRNY 521 Query: 346 LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 + +F EE DG L HA+ER R + Sbjct: 522 IYEDFPEEAGQTDGTLAHALERIIPFVSRKLGY 554 >gi|194364297|ref|YP_002026907.1| hypothetical protein Smal_0519 [Stenotrophomonas maltophilia R551-3] gi|194347101|gb|ACF50224.1| conserved hypothetical protein [Stenotrophomonas maltophilia R551-3] Length = 686 Score = 301 bits (772), Expect = 1e-79, Method: Composition-based stats. 
Identities = 75/369 (20%), Positives = 133/369 (36%), Gaps = 39/369 (10%) Query: 21 LDVEEKG-NMQAIYIPAH--VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77 D E M+ +P + G W ++ + + + WL Sbjct: 328 RDWRELAAQMRTAPLPDYPLYPGVNPGWDNEARRPGRGRVLLHASPRGYAD---WLHDT- 383 Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWND 135 + P+ R+ F + E + A L + + ++ + Sbjct: 384 -VHGRLRDVPPARRMVFINAWNEWAESAVLEPDARLGHAWLQATRRAMT----------- 431 Query: 136 RPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL 195 PS P S +V+H ++ D E+ + L +T + Q + Sbjct: 432 -PSQPAPSRPC------VVIHAWHLDALPELLSAVKDSGLPARLVITTTSDRQAQVQSIT 484 Query: 196 KYFPS-AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254 + A+++ +N GRD+ PFL+ + + + K+H K+S H G WRR Sbjct: 485 ESHGLPAEIWAYDNHGRDILPFLHAADRLLQQNESLVLKLHTKRST----HRDNGDQWRR 540 Query: 255 WLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRA 314 + LLG + A + + +P LG++ + +R+ L + Sbjct: 541 EMVDALLGPAQAAAN-LAHLQADPRLGLMAPAGHLLNVADYIG----GNAQRMERLWAQL 595 Query: 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSV 373 G F +G+MFWV+ + L PL + HL+ FE E DG L HA+ER Sbjct: 596 GLDGAPGDGQFASGSMFWVRLQALRPLLDAHLLPSMFEVEAGQIDGTLAHAIERATGAVA 655 Query: 374 RYTEFSIES 382 FS+ Sbjct: 656 TCAGFSVGD 664 >gi|315122651|ref|YP_004063140.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496053|gb|ADR52652.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 405 Score = 301 bits (770), Expect = 2e-79, Method: Composition-based stats. 
Identities = 153/327 (46%), Positives = 204/327 (62%), Gaps = 7/327 (2%) Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123 S F F FW+RS F +Y L + RI YGSR +K F N+ M +PFD EK Sbjct: 70 SFFLGFFFWIRSLFLFKRYQTLRYDENRIIAYGSRIGKKFFACSNKDMLARGVPFDGEKI 129 Query: 124 LYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV 183 L GW+ PSS K + + I+S++AIVVH YY D W EI+++L LNF FDL +T+ Sbjct: 130 HRFPRLLHGWD-SPSSEKIASVKIQSRVAIVVHIYYADLWAEIANLLSGLNFSFDLHITL 188 Query: 184 VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243 V + ++LK FP+A +YVMEN GRD+R FL LLE G D YDY+CKIHGKKS+R G Sbjct: 189 VTEIASIKSEILKRFPNAHIYVMENYGRDIRSFLKLLEGGKLDSYDYVCKIHGKKSKRNG 248 Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303 + +G +WRRWLFFDLLG IA+ II TFE+ P +GMIGSR YR ++ S R Sbjct: 249 HVWWDGDLWRRWLFFDLLGAPGIALEIIKTFEKYPKIGMIGSRTYRYDQKISLGNNR--- 305 Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLK--DGAL 361 V +A + G + +DFF GTMFWV+P+ L+P++NL L F+ + ++ DG L Sbjct: 306 -EFVCAIANKMGVSFEDTKIDFFGGTMFWVRPQALDPIKNLALTQYFKSKVDMVGLDGCL 364 Query: 362 EHAVERFFACSVRYTEFSIESVDCVAE 388 EHA+ER F+ SV F + VDC++E Sbjct: 365 EHAIERCFSISVEKANFDLAYVDCLSE 391 >gi|285019449|ref|YP_003377160.1| hypothetical protein XALc_2689 [Xanthomonas albilineans GPE PC73] gi|283474667|emb|CBA17166.1| conserved hypothetical protein [Xanthomonas albilineans] Length = 686 Score = 300 bits (768), Expect = 3e-79, Method: Composition-based stats. 
Identities = 82/354 (23%), Positives = 135/354 (38%), Gaps = 37/354 (10%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 G W ++ + +E WLR+ + + + R+ F Sbjct: 342 YPLYPGVNPGWDNEARRPGNGRVYLHASPRGYED---WLRATIHTRLQGRRA--EQRLVF 396 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIA 152 + E + A L + + ++ + + E + Sbjct: 397 VNAWNEWAEGAVLEPDTRLGHAYLDATRRALS-PARVREATAPHHA-------------- 441 Query: 153 IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY--FPSAQLYVMENKG 210 +VH +Y + E+ + L + L VT Q L+ FP ++ V+EN+G Sbjct: 442 -IVHAWYPNVLPELLNPLAASALPWRLLVTTSPDQASAVQAQLRDCSFPY-EVMVLENRG 499 Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270 RD+ PFL+ E + D D + K+H K+S H G WR L L G +D A RI Sbjct: 500 RDILPFLHAGERLLQDGVDVVLKLHTKRST----HLHNGDAWRSELLQRLAG-ADRAARI 554 Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-PTKRLHLDFFNGT 329 + F Q+P LG++ + L +R G+ F +G+ Sbjct: 555 LEAFAQDPMLGLVAPEGHLLPLADF----WGGNRMAADYLLRRTGYTDVCLDEAHFISGS 610 Query: 330 MFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382 MFWV+ L PL + HL EFE E+ DG L HA ER A ++ + + + Sbjct: 611 MFWVRLHALRPLLDSHLCPSEFEPEQGQIDGTLAHAAERVTALLAQHRGYRVAT 664 >gi|325928537|ref|ZP_08189725.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] gi|325541076|gb|EGD12630.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] Length = 1415 Score = 296 bits (758), Expect = 4e-78, Method: Composition-based stats. 
Identities = 82/371 (22%), Positives = 148/371 (39%), Gaps = 36/371 (9%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 ++ +V +K + + G + W ++ V + + + WLR Sbjct: 1054 VVDYANVVDKALSEVKPEFDLIRGVFPSWDNDARKPGRGYTVARSTPARYRT---WLRGA 1110 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWN 134 + +S+ + + F + E + A L +R + + Sbjct: 1111 IDYSRKFPVR--GESLVFVNAWNEWAEGAHLEPDRKYGYAYLEATRRAL----------- 1157 Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDV 194 RP P+ ++A+V+H +Y + E+ L + + L ++ V D + Sbjct: 1158 RRPVMPRTPE-----RVAVVIHAFYPEILPEMLKELQSWDVPYFLIISTVADKADEVRGY 1212 Query: 195 LKYFPS-AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253 L A + V EN+GRD+ PFL +++ R + K+H K+S H +G WR Sbjct: 1213 LADLSVVADVRVFENRGRDILPFLEIMKDLR-GRESLVLKLHTKRSL----HRQDGESWR 1267 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 R + LL +A I F + LG+ + S V L+K+ Sbjct: 1268 RDMLEKLLA-PKVASEIFAAFREQERLGLAAPEGHIL----SMTTYWGANADTVHRLSKQ 1322 Query: 314 AGF-PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFAC 371 P + F G+MF+V+P+ ++ + +L L +FE E DG L HA+ER F+ Sbjct: 1323 MHVDPVNPVTAMFAAGSMFYVRPEAIDSIMDLDLRREDFEPEAGQVDGTLAHAIERCFSL 1382 Query: 372 SVRYTEFSIES 382 +V T + I S Sbjct: 1383 AVCSTGYYIAS 1393 >gi|145588508|ref|YP_001155105.1| methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145046914|gb|ABP33541.1| Methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 1082 Score = 291 bits (746), Expect = 9e-77, Method: Composition-based stats. 
Identities = 84/360 (23%), Positives = 147/360 (40%), Gaps = 30/360 (8%) Query: 24 EEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYS 83 E ++ Y + W + +++ S + + ++ WL + + +K S Sbjct: 743 NEVKKLEPEY--KQYRAAMLSWDNTARRKNNSHIMANFSIRRYK---QWLSNIASCTKNS 797 Query: 84 KLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPK 141 + + F + E + L + + + P + Sbjct: 798 IRLNENEKFIFINAWNEWAEGTHLEPDTKYGFKYLQATYDILKNY--------INPEHAE 849 Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILL---RLNFDFDLFVTVVEANKDFEQDVLKYF 198 + ++ IAIVVH +Y DTW +I I+ ++ D+++T+ N + Q + F Sbjct: 850 IIRESQENSIAIVVHIHYMDTWEDIKKIIKKILSVHDS-DIYITIT--NLEQYQSIKNDF 906 Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258 PSA + ++EN+GRD+ PF+ +L+ + Y +CKIH KKS + +G + R+ L+F Sbjct: 907 PSANIELVENRGRDILPFINVLKKIIHKNYVAICKIHSKKS----EYRSDGEVIRKELYF 962 Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT 318 L+ +I FE N LGM+ +Y + + G Sbjct: 963 SLINNEITLEKIPKFFEVNKKLGMLVPGKYFLQHNDI---NMYFNRENISKVCSVIGVNF 1019 Query: 319 KRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 K F G+MFW +P L+ L L F+ E L DG + HAVER F + F Sbjct: 1020 KESK--FPAGSMFWARPAALQKLLKLESGELFDVEEGLADGTVAHAVERLFGLVSESSGF 1077 >gi|190572709|ref|YP_001970554.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] gi|190010631|emb|CAQ44240.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] Length = 707 Score = 289 bits (741), Expect = 4e-76, Method: Composition-based stats. 
Identities = 74/367 (20%), Positives = 130/367 (35%), Gaps = 37/367 (10%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 R + P G W ++ + + + WL Sbjct: 352 RELATQMRRAPLADYP-LYPGVNPGWDNEARRPGRGRVLLHASPRGYSD---WLHDT--V 405 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRP 137 + + P+ R+ F + E + A L + + ++ + + RP Sbjct: 406 HQRLRHVAPARRLVFINAWNEWAESAVLEPDARLGHAWLQATRRAL--FPS--QAAPSRP 461 Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLK 196 IV+H +Y D E+ + L +T E + + Sbjct: 462 --------------CIVIHAWYLDALPELLQAVKDSGLQARLVITTTGERQAQVQSIIDA 507 Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256 +A+++V +N GRDV PFL+ + + + K+H K+S H G WRR + Sbjct: 508 EGLTAEIWVYDNHGRDVLPFLHAADRLLQQNESLVLKLHTKRST----HRDNGDQWRREM 563 Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316 LLG + A + + NP +G++ + +R+ L G Sbjct: 564 VDALLGTAQAAANLAHL-LANPSIGLMAPAGHLLKVADYIG----GNAQRMERLWALLGL 618 Query: 317 PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRY 375 + F +G+MFWV+ L PL + HL+ F+ E DG L HA+ER V Sbjct: 619 DSAPGDGQFASGSMFWVRLPALRPLLDAHLLPSMFDTEAGQIDGTLAHAIERATGAVVSA 678 Query: 376 TEFSIES 382 F++ Sbjct: 679 AGFTVAD 685 >gi|21111631|gb|AAM39945.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66575237|gb|AAY50647.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 296 Score = 289 bits (740), Expect = 5e-76, Method: Composition-based stats. 
Identities = 65/305 (21%), Positives = 120/305 (39%), Gaps = 26/305 (8%) Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149 + F + E + A L + + + + + ++ PS+ Sbjct: 1 MVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICSPSA---------- 49 Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208 +V+H +Y D E+ ++ + +T + + + + A++ EN Sbjct: 50 --CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFEN 107 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PFL++ + + + K+H KKS H +G WR + LLG Sbjct: 108 RGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGEMLTALLG-PQRVD 162 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328 I+N F +P G+ + + L R G + F +G Sbjct: 163 AIVNAFSTDPLAGLAAPEDHLLPVTEFIG----GNADALDYLTVRTGSDAPDTNSLFASG 218 Query: 329 TMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCVA 387 +MFW + + L PL + HL EFE E+ DG L HA+ERF +V ++ + +V+ Sbjct: 219 SMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTL 278 Query: 388 EYERL 392 + Sbjct: 279 GITKT 283 >gi|312962408|ref|ZP_07776899.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] gi|311283335|gb|EFQ61925.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] Length = 1308 Score = 288 bits (736), Expect = 1e-75, Method: Composition-based stats. 
Identities = 83/383 (21%), Positives = 154/383 (40%), Gaps = 32/383 (8%) Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64 + + ++ N + Y + W + +++ S H L Sbjct: 954 ADFNGHIFSYDQVV----ANAVANKEPEY--KLFRASMLSWDNTARKQYNSHTFHGFSLL 1007 Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 ++ WL S + ++ F + E + L +R + + Sbjct: 1008 RYK---QWLSSITNNVFNNAKYSKDEKLVFVNAWNEWAEGTHLEPDRKYGYGYLQATDDV 1064 Query: 123 FLYVKELFEGWNDRPSSPKKSGLTIKSKI-AIVVHCYYQDTWIEISHILLRLNF-DFDLF 180 + S +++ A+V+H +Y D W +I L ++DL+ Sbjct: 1065 LAEY-------DISKVSRMAFKRSVRQADYAVVLHLHYDDLWDDIKSYLDSFGQLEYDLY 1117 Query: 181 VTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240 VTV ++ V + +P A + ++EN+GRDV PFL +L++ Y +CKIH K+S Sbjct: 1118 VTVTSSSAGVR--VAQEYPKAHIQLVENRGRDVLPFLKILQVIKDMGYVAVCKIHSKRSL 1175 Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300 +G R L LLG + + +++ FE+ +G+I +Y Sbjct: 1176 Y----RDDGDKIRGELIGSLLGSKETILSVVDRFERQKDIGVIVPVKYLIPHTDHNMTYC 1231 Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGA 360 + V +L+ + GF +F G+MFW +PK LE L ++ FE E L DG Sbjct: 1232 GAI---VTELSSKLGFNFSYC--EFIAGSMFWFRPKALEALLSIDESS-FEVEDGLADGT 1285 Query: 361 LEHAVERFFACSVRYTEFSIESV 383 + H +ER V+ +++E++ Sbjct: 1286 IAHGIERVLCNVVKKANYTVETI 1308 >gi|21109952|gb|AAM38419.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 296 Score = 283 bits (724), Expect = 3e-74, Method: Composition-based stats. 
Identities = 76/301 (25%), Positives = 123/301 (40%), Gaps = 26/301 (8%) Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149 + F + E + A L + + + + + L+ G + R + Sbjct: 1 MVFINAWNEWAEGAVLEPDTRLGYAWLHATRQALLHTAGAATGSDLRDA----------- 49 Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208 +V+H +Y D E + L VT + Q + + AQ+ EN Sbjct: 50 --CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFEN 107 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PFL + + + + K+H KKS H +G WRR +F LL A Sbjct: 108 RGRDILPFLRVANRLLDEGEQVVLKLHTKKST----HREDGDAWRREMFSALL-TPQHAD 162 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328 I+ F +P LG+ ++ + LA R G H F +G Sbjct: 163 AIMRGFTDDPLLGLAAPAQHLLPVTDFIG----GNADALDYLAVRTGTDAIDEHSVFASG 218 Query: 329 TMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVDCVA 387 +MFWVK + L PL + +L EFE E+ DG L HA+ERF A +V + + ++D + Sbjct: 219 SMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATIDQLL 278 Query: 388 E 388 Sbjct: 279 G 279 >gi|158422520|ref|YP_001523812.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] gi|158329409|dbj|BAF86894.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] Length = 661 Score = 269 bits (687), Expect = 8e-70, Method: Composition-based stats. 
Identities = 90/381 (23%), Positives = 160/381 (41%), Gaps = 24/381 (6%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 ++ +G ++ + D+ + + +P G +P Q ++ + + + Sbjct: 262 RAFVGPVDEFMFVADLAQ-HRARQATVP-LFPGICAGHDSTPGQGADARIMV--SPDLGD 317 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 + WL LA ++ ++ S + F + + + L + ++ + + Sbjct: 318 DYARWLTEVLAIARARPVAGAS--LVFINAWNDWLNGSHLLPDARYGHALLRATASTCA- 374 Query: 126 VKELFEGWNDRPSSPKKSGLTIKS-KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 RP++ + +++ +A VVH YY+D + L LFVT Sbjct: 375 -PYAGAIGARRPAAAPVTPRPVRTGSLASVVHGYYEDLLPGLIAGL----DPAHLFVTTP 429 Query: 185 -EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243 E + + + P+A+L V+EN+GRDVRPFL LL + YD + K+H K+S +G Sbjct: 430 PEKAEAVRAVLARAAPAARLRVVENRGRDVRPFLSLLPELEAEGYDLVLKVHTKRSPHQG 489 Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303 EG W + L LL + R+ FE +P +G++G+ + + +A + Sbjct: 490 ---KEGSDWLQRLSGPLLKLARS-ERLAPVFEAHPQMGLLGAAGHVLDG--ALYAGSAGN 543 Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNL-HLIGEFEEERNLKDGALE 362 + LA G L + GTMF + PLR L+ F+ + LKDG L Sbjct: 544 AAWMRRLAAELG-TGAPLTSPYVAGTMFVARLGIFAPLRGASELLDLFDTDMGLKDGTLA 602 Query: 363 HAVERFFACSVRYTEFSIESV 383 HA ERFF S+ V Sbjct: 603 HAFERFFGVLAAEAGLSVGEV 623 >gi|289662624|ref|ZP_06484205.1| hypothetical protein XcampvN_05932 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 945 Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats. 
Identities = 71/275 (25%), Positives = 116/275 (42%), Gaps = 12/275 (4%) Query: 113 NSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLR 172 + + P ++ K+ ++VH +Y D E + L + Sbjct: 47 RGFLERVRLAGRKQPAAHRLADQAPFGRPVPSAQLQLKVGVMVHVFYPDLIDEFAQSLQQ 106 Query: 173 LNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY 228 + +DL V+V++ + + L+ + ++ N+GRD+ P L + Sbjct: 107 MPVGYDLLVSVMDNAAEAQARDRFSKLQQIEKLDIRIVPNRGRDIAPLLVTFREQILA-L 165 Query: 229 DYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRY 288 D + +H KKS G E WRR+L L+G ++ + F+ P LGM+ Y Sbjct: 166 DVVGHLHTKKSLYTG---SEQGQWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY 222 Query: 289 RRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFNGTMFWVKPKCLEPLRNLHL- 346 R W+ + LA+R GF ++DF G+MFW K L PL L+L Sbjct: 223 ERVPLWA--HTWLSNFEVCRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLE 280 Query: 347 IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 + +F EE DG L HA+ER F VR+ + I Sbjct: 281 LKDFPEEHGQIDGTLHHAMERMFVAVVRHQHYRIG 315 >gi|289668432|ref|ZP_06489507.1| hypothetical protein XcampmN_08015 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 945 Score = 267 bits (684), Expect = 1e-69, Method: Composition-based stats. 
Identities = 71/275 (25%), Positives = 116/275 (42%), Gaps = 12/275 (4%) Query: 113 NSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLR 172 + + P ++ K+ ++VH +Y D E + L + Sbjct: 47 RGFLERVRLAGRKQPAAHRLADQAPFGRPVPSAQLQVKVGVMVHVFYPDLIDEFAQSLQQ 106 Query: 173 LNFDFDLFVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY 228 + +DL V+V++ + + L+ + ++ N+GRD+ P L + Sbjct: 107 MPVGYDLLVSVMDNAAEAQARDRFSKLQQIEKLDIRIVPNRGRDIAPLLVTFREQILA-L 165 Query: 229 DYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRY 288 D + +H KKS G E WRR+L L+G ++ + F+ P LGM+ Y Sbjct: 166 DVVGHLHTKKSLYTG---SEQGQWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY 222 Query: 289 RRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFNGTMFWVKPKCLEPLRNLHL- 346 R W+ + LA+R GF ++DF G+MFW K L PL L+L Sbjct: 223 ERVPLWA--HTWLSNFEVCRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLE 280 Query: 347 IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 + +F EE DG L HA+ER F VR+ + I Sbjct: 281 LKDFPEEHGQIDGTLHHAMERMFVAVVRHQHYRIG 315 >gi|258591058|emb|CBE67353.1| protein of unknown function [NC10 bacterium 'Dutch sediment'] Length = 1460 Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats. 
Identities = 73/331 (22%), Positives = 125/331 (37%), Gaps = 39/331 (11%) Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSR-----MPFDSEKFL 124 I WLR + + R + L++ + + + + Sbjct: 509 IRWLRHPI------RALPGKDRFAIDFA------HLKVTLRKAYFYHRKIGLRATVRRII 556 Query: 125 YVKELFEGWNDRPSSPKKSGLTI----------KSKIAIVVHCYYQDTWIEISHILLRLN 174 P+ L I S+IA+ H YY D E++ L + Sbjct: 557 VELRSLHTKARGPALCSSELLNIHDIYPMPGDISSRIAVHAHAYYPDLTKELASYLKNMP 616 Query: 175 FDFDLFVTVV-EANKDFEQDVLKYFPSAQ---LYVMENKGRDVRPFLYLLELGVFDRYDY 230 F FDLFV+V + +D + P A+ + V+ N+GRD+ P + G YDY Sbjct: 617 FAFDLFVSVSNDEARDVCRQAFAGLPQARRVIVDVVANRGRDIAPMVCHFG-GRLATYDY 675 Query: 231 LCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRR 290 +C +H KKS + W +L L+G D RI + F+ +P G+I + Y Sbjct: 676 ICHLHTKKSMYAQ---GKMDGWLEYLLRQLMGSEDQVRRIFSMFQSDPRAGIIYPQNYEY 732 Query: 291 YKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHL-IG 348 W + ++ G + D+ G+MFW + + + L + + + Sbjct: 733 LPYW--GNTWLSNKALGAQMCRQMGITDVPEGYFDYPAGSMFWARSEAIRNLFSADIRLT 790 Query: 349 EFEEERNLKDGALEHAVERFFACSVRYTEFS 379 +F EE DG+L H +ER R+ + Sbjct: 791 DFPEEAGQTDGSLAHCIERLLVLVARHAGYK 821 >gi|260890973|ref|ZP_05902236.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] gi|260859000|gb|EEX73500.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] Length = 319 Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats. 
Identities = 61/242 (25%), Positives = 106/242 (43%), Gaps = 10/242 (4%) Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY---FPSA 201 + +K K+ ++ H Y++D E H + + DL +T + + F + Sbjct: 2 IYLKYKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNI 61 Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 ++ V+EN+GRDV L + V YDY+C +H KK+ + + G +R + + L Sbjct: 62 EVRVIENRGRDVSSLLVGAKDAVM-NYDYVCFMHDKKTAQLKPYSS-GQGFRYKCYENNL 119 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRS-EVYRRVIDLAKRAGFPTKR 320 +I TF++NP LGM+ + +++ L K+ G Sbjct: 120 ATKKYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDF 179 Query: 321 L---HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYT 376 GTMFW +P+ L+ L + +F EE N DG + HAVER + +V+ Sbjct: 180 HWNLEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFAVQDA 239 Query: 377 EF 378 + Sbjct: 240 GY 241 >gi|320531350|ref|ZP_08032322.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] gi|320136441|gb|EFW28417.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] Length = 626 Score = 258 bits (660), Expect = 8e-67, Method: Composition-based stats. 
Identities = 57/239 (23%), Positives = 100/239 (41%), Gaps = 10/239 (4%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYV 205 + K+A++ H Y+ D + DL +TV + + + + P + + V Sbjct: 307 QQKVALIAHLYFMDLLDSTLAYARSMPEGTDLILTVGSQEKAELVERACQDLPYNVDVRV 366 Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265 +EN+GRDV L + V D YD +C +H KK + + G + R F +LL + Sbjct: 367 IENRGRDVSALLVGCKDIV-DDYDLVCFMHDKKVTQLSPY-TVGEGFARKCFDNLLPTRE 424 Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY--RRVIDLAKRAGFPTKRL-- 321 ++ TF+ P LG++ + ++ R + L K Sbjct: 425 FVENVVATFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNLNVPLDAH 484 Query: 322 -HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 + GTMFW +P L+PL + +F E N DG + HA+ER + + + + Sbjct: 485 KEVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGYVAQASGY 543 >gi|13474020|ref|NP_105588.1| hypothetical protein mll4799 [Mesorhizobium loti MAFF303099] gi|14024772|dbj|BAB51374.1| mll4799 [Mesorhizobium loti MAFF303099] Length = 386 Score = 258 bits (659), Expect = 1e-66, Method: Composition-based stats. 
Identities = 97/244 (39%), Positives = 131/244 (53%), Gaps = 4/244 (1%) Query: 137 PSSPKKSGL-TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL 195 P + L T++ KIA+ +H +Y D W E +L F LF+T+ + Q V Sbjct: 126 PQAEAPERLPTVEPKIAVALHLHYPDLWPEFEALLEATGRQFQLFLTLTRPDAALAQRVQ 185 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 FP A++ V EN+GRDV PF+ LL G FD +D +CK+HGKKS + G + G IWR+ Sbjct: 186 ARFPGAEITVYENRGRDVGPFIQLLREGKFDPFDLICKLHGKKSGQSGPRMVLGEIWRQV 245 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRA 314 FDL+G + RII FE++P MIGSRR+R W R + ++L + Sbjct: 246 SAFDLIGSRGVVDRIIANFERSPDTQMIGSRRFRLPNEWKGEKSAWGENRAMALNLLETM 305 Query: 315 GFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373 G P LDFF GTMFWV+ LEPLR L L + F EE +DG L+HA+ER Sbjct: 306 GMP-SSSRLDFFAGTMFWVRRGALEPLRRLDLPLAAFPEETGQQDGTLQHALERVLGMIC 364 Query: 374 RYTE 377 Sbjct: 365 TKIG 368 >gi|326772082|ref|ZP_08231367.1| rhamnan synthesis protein F [Actinomyces viscosus C505] gi|326638215|gb|EGE39116.1| rhamnan synthesis protein F [Actinomyces viscosus C505] Length = 652 Score = 257 bits (658), Expect = 2e-66, Method: Composition-based stats. 
Identities = 58/246 (23%), Positives = 100/246 (40%), Gaps = 10/246 (4%) Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP 199 + K+A++ H YY D + D +TV + + ++ K P Sbjct: 326 AVAREPKPQKVALIAHLYYMDLLEPTLAYARSMPEGTDFILTVGSQEKVELVEEACKDLP 385 Query: 200 -SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258 + + ++EN+GRDV L + V YD +C IH KK + + G + R F Sbjct: 386 YNVTVRLIENRGRDVSALLVGCKDIV-SDYDLVCFIHDKKVTQLSPY-TVGEGFARKCFD 443 Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY--RRVIDLAKRAGF 316 +LL + +I+TF+ P LG++ + ++ R + L K Sbjct: 444 NLLPTREFVENVISTFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNL 503 Query: 317 PTKRL---HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372 + GTMFW +P L+PL + +F E N DG + HA+ER + Sbjct: 504 SVPLDAHKEVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGYV 563 Query: 373 VRYTEF 378 + + + Sbjct: 564 AQASGY 569 >gi|331086190|ref|ZP_08335272.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406349|gb|EGG85863.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 592 Score = 257 bits (657), Expect = 2e-66, Method: Composition-based stats. 
Identities = 66/244 (27%), Positives = 108/244 (44%), Gaps = 10/244 (4%) Query: 143 SGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF-EQDVLKYFP-- 199 T ++KIA+V+H Y++D E H + + D+++T K + V P Sbjct: 250 QKQTTENKIALVMHLYFEDLLEESYHYVSAMPEKADIYLTTDTEKKKAAIEKVFAKLPCN 309 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 ++ V++N+GRDV L ++ + D YD +C H KK+ + I G + F + Sbjct: 310 KLEVRVIKNRGRDVSSLLVGVKDVIMD-YDLVCFAHDKKTAQVKPGTI-GASFAYKCFEN 367 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRAGFPT 318 L +INTF NP +G++ ++ + DLAK+ G Sbjct: 368 TLSNKAYVGNVINTFVNNPRMGLLCPPEPNHSTFFTTIGFEWGPNFNITRDLAKKLGLTV 427 Query: 319 K---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVR 374 GTMFW +PK ++PL N +F E N DG L HA+ER + V+ Sbjct: 428 PISVASPPVAPLGTMFWFRPKAMKPLYNKDWKYEDFPAEPNKIDGTLLHAIERIYPFIVQ 487 Query: 375 YTEF 378 + + Sbjct: 488 ESGY 491 >gi|260890969|ref|ZP_05902232.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] gi|260859295|gb|EEX73795.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] Length = 709 Score = 257 bits (656), Expect = 3e-66, Method: Composition-based stats. 
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 10/239 (4%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY---FPSAQLY 204 + K+ ++ H Y++D E H + + DL +T + + F + ++ Sbjct: 150 EDKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNIEVR 209 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 V+EN+GRDV L + V YDY+C +H KK+ + + ++ + L Sbjct: 210 VIENRGRDVSSLLVGAKDAVM-NYDYVCFMHDKKTAQLKPYSSLNDVYINYC-KGTLATK 267 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRL-- 321 +I TF++NP LGM+ + +++ L K+ G Sbjct: 268 KYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDFHWN 327 Query: 322 -HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTMFW +P+ L+ L + +F EE N DG + HAVER + V+ + Sbjct: 328 LEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFVVQDAGY 386 >gi|310829395|ref|YP_003961752.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] gi|308741129|gb|ADO38789.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] Length = 627 Score = 256 bits (653), Expect = 6e-66, Method: Composition-based stats. 
Identities = 65/239 (27%), Positives = 104/239 (43%), Gaps = 10/239 (4%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYF--PSAQLY 204 + +IA + H Y++D E L + + D+++T K Q+ K F + ++ Sbjct: 310 EKRIAAIFHLYFEDLIDETYRYLSSMPEEADIYITTDTEPKKKLIQEKFKDFSCRNFKVI 369 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 +++N+GRDV L + + YDY+C H KK + + I G + F + L Sbjct: 370 LIQNRGRDVSALLVATKAFIM-NYDYVCFAHDKKVTQTKPYSI-GGAFAYKCFENTLQNK 427 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRS-EVYRRVIDLAKRAGFPTKRLH- 322 + + IIN FE+NP LGM+ + Y +L G Sbjct: 428 NFVLNIINAFEKNPRLGMLMPAPPNNGPYYPTLGNEWMCNYEVTKNLIDELGIKVPMDPG 487 Query: 323 --LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTMFW +PK L+ L + + +F EE N DG L HA+ER + V+ F Sbjct: 488 KEPISPLGTMFWFRPKALKVLFDKNWEYSDFPEEPNKVDGTLLHAIERAYGLIVQSEGF 546 >gi|329944276|ref|ZP_08292535.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] gi|328531006|gb|EGF57862.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] Length = 636 Score = 254 bits (650), Expect = 1e-65, Method: Composition-based stats. 
Identities = 61/238 (25%), Positives = 96/238 (40%), Gaps = 9/238 (3%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYV 205 K KIA++ H YY D + + D+F++ E + E + ++ + Sbjct: 307 KQKIALIAHLYYMDLVEPTLKYIRNMPEGIDIFLSTSSPEKVEQVEAACKGLPYNIEVRL 366 Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265 +EN+GRDV PFL + V YD +C H KK + + G + F +LL D Sbjct: 367 VENRGRDVGPFLVAWKDVV-HDYDVVCYTHDKKVTQLYPYS-VGDGFAYKCFENLLPTRD 424 Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLH-- 322 +I TF+ P LG + + F + R L + G Sbjct: 425 FVKNVIATFDAEPRLGFLAPTPPNHADYFPVFTYGWGPNFDRTKALLRELGLDVPLDPTK 484 Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 G+MFW +P+ L+PL + EF E DG L HA+ER + + + Sbjct: 485 EPIAPLGSMFWFRPQALKPLFDHDWQWEEFPPEPCPIDGTLMHAIERSHGYVAQGSGY 542 >gi|227546966|ref|ZP_03977015.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212567|gb|EEI80455.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 631 Score = 253 bits (647), Expect = 3e-65, Method: Composition-based stats. 
Identities = 61/238 (25%), Positives = 99/238 (41%), Gaps = 10/238 (4%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206 KIA+ +H YY D H + + D+ +TV EAN + ++ K FP + + V+ Sbjct: 309 KKIALAIHVYYMDLLESTFHYIQSMPEGCDIIITVGSEANAETVREYCKQFPYNFDVRVI 368 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 EN+GRDV L +F +YDY+C H KK + I G + F ++L + Sbjct: 369 ENRGRDVSALLVGCGEDLF-QYDYVCFAHDKKVTQLSPQSI-GDGFAYKCFENILASKEY 426 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLAKRAGFPT-KRL 321 +I+ FE+NP LG+ + + + ++ P Sbjct: 427 VSNVIDLFERNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQTLNMHVPLNADK 486 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTMFW +P+ L + +F E N DG L H +ER + + + Sbjct: 487 EPVAPMGTMFWFRPEAFRGLLDHGWEYTDFPPEPNKVDGTLLHFIERAYGYVPQANGY 544 >gi|90425670|ref|YP_534040.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] gi|90107684|gb|ABD89721.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] Length = 846 Score = 252 bits (645), Expect = 5e-65, Method: Composition-based stats. 
Identities = 62/251 (24%), Positives = 104/251 (41%), Gaps = 16/251 (6%) Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK- 196 P++ + +IAI H YY D ++ DLF+T E + + Sbjct: 586 PRRESNAARPRIAIHGHFYYPDLLESFLKLIAANASSVDLFLTTSGPEQAAQIRKSLRAF 645 Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256 +A ++ + N+GRD+ PFL + YD + HGK+S+ G WR + Sbjct: 646 GIQNADVWSVPNRGRDIGPFLKEMPD-KLGSYDIVGHFHGKRSKHVD--STVGDQWRDFA 702 Query: 257 FFDLLGFS-DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 + L+G + + I + F ++ LG++ + E LA+R Sbjct: 703 WQHLIGDAFPMIDVIADAFAEDAKLGLVFAEDPYL-------NGWDENRDLAERLAQRMK 755 Query: 316 FPTK-RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373 H DF GTMFW + L+PL L+L ++ E DG + HA+ER +V Sbjct: 756 IEAPLPEHFDFPIGTMFWARVAALQPLFQLNLDWNDYPHEPLPIDGTILHALERIVPFAV 815 Query: 374 RYTEFSIESVD 384 + + F + Sbjct: 816 QKSGFEYATTY 826 >gi|160894491|ref|ZP_02075267.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] gi|156863802|gb|EDO57233.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] Length = 646 Score = 252 bits (644), Expect = 6e-65, Method: Composition-based stats. 
Identities = 58/246 (23%), Positives = 101/246 (41%), Gaps = 10/246 (4%) Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFP 199 K + K K+A+V+H Y+ D + + + D+++T K+ V K P Sbjct: 305 KMDEILKKRKLALVMHLYFPDLVEDSFQWASNVPKETDVYITTDTVEKKEAILKVFKNLP 364 Query: 200 S--AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257 ++ V+ N+GRDV L ++ V YDY C +H KK+ + G + + Sbjct: 365 CNHLEVRVIVNRGRDVSSILVGVKD-VIQNYDYACFVHDKKTAQAKPGS-VGDSFGYKCW 422 Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGF 316 + L + ++ TFE N LG++ + + + ++A + G Sbjct: 423 NNTLYNKEFVCNVLQTFEDNERLGILSPPEPNHGPFYQTLGNEWGCNFEKSREVADKLGI 482 Query: 317 PTK---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372 GT FW +P L+ L + EF EE N DG + HA+ER + Sbjct: 483 TIPMSEDKEALAPYGTFFWFRPTALKVLFDHDWQYEEFPEEPNNFDGTILHAIERLYPIC 542 Query: 373 VRYTEF 378 V+ + Sbjct: 543 VQQAGY 548 >gi|325067622|ref|ZP_08126295.1| hypothetical protein AoriK_07369 [Actinomyces oris K20] Length = 626 Score = 252 bits (644), Expect = 7e-65, Method: Composition-based stats. 
Identities = 58/238 (24%), Positives = 98/238 (41%), Gaps = 10/238 (4%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206 KIA++ H YY D + + DL +TV + + ++ K P + + ++ Sbjct: 308 QKIALIAHLYYMDLLEPTLAYVKSMPEGTDLILTVGSQEKAELVEEACKDLPYNVTVRLI 367 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 EN+GRDV L + + YD +C H KK + + G + F +LL D Sbjct: 368 ENRGRDVSALLVGCKD-IIHDYDLVCFTHDKKVTQVKPYS-VGDGFAIKCFENLLATRDF 425 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKR-AGFPTKRLH-- 322 +I TF+ P LG++ + F+ + R L ++ Sbjct: 426 VKNVIATFDAEPRLGLLAPTPPNHGDYFPVFSMGWGPNFERTKTLLEKELNLSVPIDESR 485 Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTMFW +P L+PL + +F E N DG + HA+ER + + + + Sbjct: 486 APIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNNIDGTILHAIERAYGYVAQASGY 543 >gi|308235695|ref|ZP_07666432.1| hypothetical protein GvagA14_05663 [Gardnerella vaginalis ATCC 14018] gi|311114292|ref|YP_003985513.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] gi|310945786|gb|ADP38490.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] Length = 637 Score = 252 bits (644), Expect = 8e-65, Method: Composition-based stats. 
Identities = 64/249 (25%), Positives = 100/249 (40%), Gaps = 10/249 (4%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196 SS S T K K+A+ +H YY D + H + + D+ +TV + N+ + ++ Sbjct: 303 SSTATSESTAKPKVALCMHLYYMDLLDKSLHYIQSMPQGCDVILTVGSKENQQIVKQRVE 362 Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + P + ++EN+GRDV FL +YDY+C H KK + I G + Sbjct: 363 HLPYDVDVRLIENRGRDVSAFLVGGGAD-LMKYDYVCFAHDKKVTQLSPRSI-GDGFAYK 420 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID--LAKR 313 F ++L + +IN FE +P LGM + F L K Sbjct: 421 CFENILASKEYVQNVINLFETHPRLGMAMPTPPNHADYFPGFTYTWGPNFEGTKKFLEKT 480 Query: 314 AGFPTKRLH---LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369 G GTMFW + K + L + +F E DG L H +ER + Sbjct: 481 LGISVPLDENKDAIAPLGTMFWFRTKAMRGLLDRKWTYEDFPAEPLKIDGTLLHFIERAY 540 Query: 370 ACSVRYTEF 378 +Y + Sbjct: 541 GYVPQYNGY 549 >gi|311063512|ref|YP_003970237.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] gi|310865831|gb|ADP35200.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] Length = 631 Score = 252 bits (643), Expect = 8e-65, Method: Composition-based stats. 
Identities = 61/249 (24%), Positives = 99/249 (39%), Gaps = 10/249 (4%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196 S L KIA+ +H YY D + + D+ +TV EAN + ++ K Sbjct: 298 SQSLSVPLPEGKKIALAIHVYYMDLLESTFRYIQSMPEGCDIIITVGSEANAEIVREYCK 357 Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 FP + V+EN+GRDV L +F +YDY+C H KK + I G + Sbjct: 358 QFPYRFDVRVIENRGRDVSSLLVGCGEDLF-QYDYVCFAHDKKVTQLSPQSI-GDGFAYK 415 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLA 311 + ++L + +I+ FE+NP LG+ + + + ++ Sbjct: 416 CYENILASKEYVSNVIDLFEKNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQT 475 Query: 312 KRAGFPTKRLHLD-FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369 P GTMFW +P+ L + +F E N DG L H +ER + Sbjct: 476 LNMHVPLNANKEPVAPMGTMFWFRPEAFRGLLDHGWKYEDFPPEPNKVDGTLLHFIERAY 535 Query: 370 ACSVRYTEF 378 + + Sbjct: 536 GYVPQANGY 544 >gi|119026520|ref|YP_910365.1| hypothetical protein BAD_1502 [Bifidobacterium adolescentis ATCC 15703] gi|118766104|dbj|BAF40283.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703] Length = 647 Score = 252 bits (643), Expect = 9e-65, Method: Composition-based stats. 
Identities = 62/251 (24%), Positives = 100/251 (39%), Gaps = 10/251 (3%) Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP- 199 + + +IA+++H YY D + + D TV E N ++ K P Sbjct: 301 TTPIPEGKRIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENAKLVRERCKGLPY 360 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 + + V++N+GRDV L +YDY+C H KK + + I G + F + Sbjct: 361 NVDVRVIQNRGRDVSALLIGAGKDCL-KYDYVCFAHDKKVTQLSPYSI-GDGFAYKCFEN 418 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID--LAKRAGFP 317 +LG + IIN FEQ+P G++ + FA L + G Sbjct: 419 ILGSKALVSNIINHFEQDPHAGLLAPTSPNHADYFGNFASLWGPNFEGTKKMLEETLGVK 478 Query: 318 T---KRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373 GTMFW +PK L L ++ +F E N DG++ H +ER + Sbjct: 479 VPLNPYKEPIAPLGTMFWFRPKALHQLFDIDWKYEDFPPEPNKIDGSMLHFIERAYGYLP 538 Query: 374 RYTEFSIESVD 384 + + V Sbjct: 539 QANGYYTGFVY 549 >gi|225352528|ref|ZP_03743551.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156722|gb|EEG70116.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 648 Score = 251 bits (641), Expect = 1e-64, Method: Composition-based stats. 
Identities = 59/250 (23%), Positives = 104/250 (41%), Gaps = 10/250 (4%) Query: 143 SGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-S 200 + + +IA+++H YY D + + D TV E N ++ K P + Sbjct: 303 APIPTNKRIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENATIVRERCKDLPYN 362 Query: 201 AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260 + V++N+GRDV L +YDY+C H KK + + I G + F ++ Sbjct: 363 VDVRVIQNRGRDVSALLVGAGKDCL-QYDYVCFAHDKKVTQLSPYSI-GDGFSYKCFENV 420 Query: 261 LGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY----RRVIDLAKRAGF 316 LG + IIN FE +P G++ + FA +++++ + Sbjct: 421 LGSKALVSNIINHFENDPHAGVLAPAPPNHADYFGNFASLWGPNYEGTKKMLEETLQVKV 480 Query: 317 PTKRLHLD-FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVR 374 P + GTMFW +PK L+ ++ +F E N DG++ H VER + + Sbjct: 481 PLDKSKEPIAPMGTMFWFRPKALQQFFDIDWKYEDFPPEPNKIDGSMLHFVERAYGYVPQ 540 Query: 375 YTEFSIESVD 384 + + Sbjct: 541 ANGYYTGYIY 550 >gi|13476280|ref|NP_107850.1| hypothetical protein mlr7559 [Mesorhizobium loti MAFF303099] gi|14027041|dbj|BAB53995.1| mlr7559 [Mesorhizobium loti MAFF303099] Length = 644 Score = 251 bits (640), Expect = 2e-64, Method: Composition-based stats. 
Identities = 58/245 (23%), Positives = 100/245 (40%), Gaps = 12/245 (4%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYF--PSAQLYV 205 KIA+ H YY D EI + + +D T E + E + + + V Sbjct: 298 KIAVCAHIYYTDMLDEILGLTGNIPVPYDFIATTNTPEKKAEIETALANRPGVKNVIVRV 357 Query: 206 ME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 +E N+GRD+ L + DRYD +C++H KKS + G +++R + +LL Sbjct: 358 VEQNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQV--QSSMGNLFKRHMVDNLLNS 415 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRL 321 +++ F NP +G+ + + +V + A+ Sbjct: 416 RGYVHNVLDMFHDNPSVGLAIPPIFHISYP-TMGFSWFANKPKVEETARLLNINVKFDEN 474 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 GTMFW +P+ L + EF E + DG HA+ER A +V+ ++ Sbjct: 475 TPVAAYGTMFWFRPRALRKMFEHKWKWEEFNAEPDHVDGGFAHALERLIAYAVQNAGYTT 534 Query: 381 ESVDC 385 + + C Sbjct: 535 QHIMC 539 >gi|310816773|ref|YP_003964737.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] gi|308755508|gb|ADO43437.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] Length = 726 Score = 250 bits (639), Expect = 3e-64, Method: Composition-based stats. 
Identities = 70/250 (28%), Positives = 104/250 (41%), Gaps = 16/250 (6%) Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYF 198 P++ I + +H YYQ+ + L ++ L+V+ A K + + Sbjct: 455 QPRREAPAPARPIGVFLHLYYQELAPVFAKRLAQIPLPLSLYVSTDTAEKA--AQIERAL 512 Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258 P AQ+ V+ N+GRD+ P LY D +D + +HGKKS H W + Sbjct: 513 PQAQVRVLPNRGRDIFPKLYGFGDAYAD-HDIVLHLHGKKSL----HSSMLDEWLSHILD 567 Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRV-IDLAKRAGFP 317 LLG RI++ F+ P LG++ R A R + +LA R G Sbjct: 568 CLLGDPADVNRILSLFDSVPRLGIVMP----VVHRSVLNAAHWGFNRDIGAELAYRMGMA 623 Query: 318 TK---RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373 T L F G+MFW + L+P+ +L L F E DG L HAVER Sbjct: 624 TPLPENDALQFPAGSMFWARTAALQPILDLALEASHFPPEAGQVDGTLAHAVERMLGVVC 683 Query: 374 RYTEFSIESV 383 R + + V Sbjct: 684 RAGGYYMLPV 693 >gi|160936495|ref|ZP_02083863.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] gi|158440580|gb|EDP18318.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] Length = 674 Score = 249 bits (637), Expect = 4e-64, Method: Composition-based stats. 
Identities = 64/246 (26%), Positives = 101/246 (41%), Gaps = 13/246 (5%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP-----SAQL 203 KIA+V H +Y D E L + + DL++TV AN + + V YF + ++ Sbjct: 291 KKIAVVAHLFYPDLMDETLRYLQNIQENIDLYITV--ANIETKYKVYNYFESIRRSNVKV 348 Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 + N+GRD L +Y+YLC +H KK+ R G G + + + L Sbjct: 349 LLSGNRGRDAGSLLVACR-EYLMQYEYLCFVHDKKTTRGGGPVTVGKAFMYHAWENTLRS 407 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWS--FFAKRSEVYRRVIDLAK--RAGFPTK 319 II FE+N LG++ + + + Y++ +LA+ P Sbjct: 408 GGFVSSIIKLFEKNDRLGILTPPVPALGGYLTELVGNEWTCCYQKTKELAEILSLKVPMS 467 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 F T FW +P L+PL +F EE DG L HA+ER + + Sbjct: 468 PQKQPFALATAFWCRPAALKPLFEYPWRYEDFPEEPLASDGTLNHAIERIIIYVAQSEGY 527 Query: 379 SIESVD 384 V+ Sbjct: 528 YTAMVE 533 >gi|261367011|ref|ZP_05979894.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] gi|282571129|gb|EFB76664.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] Length = 646 Score = 249 bits (636), Expect = 6e-64, Method: Composition-based stats. 
Identities = 55/255 (21%), Positives = 98/255 (38%), Gaps = 11/255 (4%) Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVL--- 195 + + L + +IA+ +H Y+ D + + D+FV+ K + + Sbjct: 298 AKQAEELCAQRRIALAMHLYFMDMLEQSVAFAAKFPPQTDVFVSTNSEEKKEQIEQAFSG 357 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + S + V+EN+GRDV FL L YDY C +H KK+ + G + Sbjct: 358 QKLHSVTVMVVENRGRDVGAFLCDL-APHLRNYDYACFMHDKKAIQTKPGS-VGASFGYV 415 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRVIDLAKR 313 ++ + + ++ FE +P LG++ + + L K Sbjct: 416 CNENVCKNAAHVLNVLCEFENDPYLGILCPPYPTHGLYFMNMCSGGWGPNFENTKKLLKE 475 Query: 314 AGFPTK---RLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFF 369 G G++FW +PK LEPL +F +E +DG + HA+ER + Sbjct: 476 LGLDVPISGEESPIAPFGSVFWFRPKALEPLFAHGWQHTDFPQEPLPQDGTISHAIERVY 535 Query: 370 ACSVRYTEFSIESVD 384 + + V Sbjct: 536 PFVAQAAGYYPAVVM 550 >gi|83582737|ref|YP_425043.1| glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] gi|83578053|gb|ABC24603.1| Glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] Length = 1236 Score = 249 bits (635), Expect = 7e-64, Method: Composition-based stats. 
Identities = 64/241 (26%), Positives = 105/241 (43%), Gaps = 15/241 (6%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF-EQDVLKYFPS--AQLYVM 206 K+ + H YY D + ++ +F DL +T + ++ + L+ + + ++ V+ Sbjct: 987 KVLLHGHFYYVDLIDDFLKKIIINDFSCDLIITTTDEDRAVFLRKKLEEYKNGSVEVRVV 1046 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV F L YD + IHGKKS G WR +L+ L+G Sbjct: 1047 PNIGRDVGAFFTGLSDLKNSDYDVVGHIHGKKSIHLSD--GTGNKWRNFLWEHLIGGEKK 1104 Query: 267 AIRI-INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK-RLHLD 324 A I ++ +NP +G++ + + + DLAK+ G D Sbjct: 1105 AAAIAVSALIRNPDIGLVFAEEPFLF-------GWDKNKELANDLAKKMGIEKSLPRFFD 1157 Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 + GTMFW K K LEP+ +L+L ++ E G + HA+ER +V FS + Sbjct: 1158 WPIGTMFWAKRKALEPIFDLNLRWEDYPPEPIPVYGTMLHALERLLPFAVEKAGFSFATT 1217 Query: 384 D 384 Sbjct: 1218 Y 1218 >gi|82703518|ref|YP_413084.1| glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] gi|82411583|gb|ABB75692.1| Glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] Length = 828 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. 
Identities = 60/244 (24%), Positives = 97/244 (39%), Gaps = 13/244 (5%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLK 196 S L+ ++A+ +H YY + + EI L N DLF++V ++ +L Sbjct: 579 SEEAARPLSSSIRVALHLHVYYSELFPEIMARLKVNNVRPDLFISVPTECTRNEVTGLLN 638 Query: 197 YFPS--AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254 +P + ++ N+GRD+ P L D YD + +H KK+ I G W Sbjct: 639 DYPGKVVDIQIVPNRGRDIGPLLTAFGSVFLDDYDAIGHLHTKKTADLSDEMI-GKRWYT 697 Query: 255 WLFFDLLGFSDIAIRII-NTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 +L +LLG II +P +G++ + LA + Sbjct: 698 FLLENLLGGKRNMADIILGRMTADPAIGIVFPDDPHVFD-------WGNNKAHADSLASK 750 Query: 314 AGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACS 372 G + + F GTMFW + + L PL L L ++ E DG + HA+ER Sbjct: 751 LGLGKLQENFVFPMGTMFWARTEALRPLFTLDLSWQDYPAEPLPYDGTILHALERLLPLI 810 Query: 373 VRYT 376 Sbjct: 811 AAKQ 814 >gi|317047360|ref|YP_004115008.1| family 2 glycosyl transferase [Pantoea sp. At-9b] gi|316948977|gb|ADU68452.1| glycosyl transferase family 2 [Pantoea sp. At-9b] Length = 1419 Score = 246 bits (629), Expect = 4e-63, Method: Composition-based stats. 
Identities = 64/240 (26%), Positives = 97/240 (40%), Gaps = 17/240 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDV------LKYFPSAQ 202 I + +H YY D E L + FDLF+++ + E+ +K Sbjct: 597 RTIGVHLHLYYVDLADEFIKHLNTIPTGFDLFISLPRGKHNVEECERKFRSGIKTLKKLV 656 Query: 203 LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262 + ENKGRD+ PF+ + Y+ + IH KKS + WRR+L LG Sbjct: 657 VRETENKGRDIYPFIVEFGAELLS-YELILHIHSKKSPQ-----ALSKGWRRFLLHYTLG 710 Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322 I +I+N+F+ +P LG++ + R V R GF + Sbjct: 711 TESITTQILNSFDNDPKLGVLFPAYFYGVTRQP---NWGGNREIVKQQLARLGFSYDMTY 767 Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 D+ G+ FW + L PL N + +F+EE DG L H ER F +S Sbjct: 768 CPDYPAGSFFWSRSDALRPLLNGEYRLEDFDEEAGQYDGTLAHGFERLFGTIPLLQNYST 827 >gi|312133751|ref|YP_004001090.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773029|gb|ADQ02517.1| Hypothetical protein BBMN68_1492 [Bifidobacterium longum subsp. longum BBMN68] Length = 641 Score = 246 bits (628), Expect = 5e-63, Method: Composition-based stats. 
Identities = 65/248 (26%), Positives = 98/248 (39%), Gaps = 9/248 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLK 196 S + K ++A+V+H YY D +I + D+ +TV E ++ + Sbjct: 298 SQDNAQPIPQKFRVALVLHLYYMDILDQILRYARSMPEGCDVIITVGSEEKACIVKERCE 357 Query: 197 YFP-SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 P + + V+EN+GRDV L V YD +C H KK ++ I G + + Sbjct: 358 GMPYNIDVRVIENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKVRQLRPETI-GDGFAKK 415 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKRA 314 F + L IIN F NP LG+ + +A YR DL Sbjct: 416 CFENTLASKAYVANIINLFADNPRLGVAMPSAPNHADYFYSYAFSWGPNYRGTKDLLDGL 475 Query: 315 GFPTKRLH---LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 G + GTMFW +PK L L + +F E N DG+ H VER + Sbjct: 476 GIKVPLSPHADVIAPLGTMFWFRPKALHGLIDKSWEYSDFPPEPNPADGSFLHFVERAYC 535 Query: 371 CSVRYTEF 378 + + Sbjct: 536 YVAQSNGY 543 >gi|227497960|ref|ZP_03928140.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] gi|226832618|gb|EEH65001.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] Length = 626 Score = 245 bits (625), Expect = 1e-62, Method: Composition-based stats. 
Identities = 56/249 (22%), Positives = 96/249 (38%), Gaps = 10/249 (4%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDV 194 P+ +SKIA+V+H Y+ D ++ H + DL TV K + Sbjct: 296 PTQAVAVQPE-ESKIALVMHVYHMDLLPQLLHYAASMPAGCDLIATVDTEAKAQQVREAT 354 Query: 195 LKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254 + + ++EN+GRDV L + D YD +C IH KK + G + + Sbjct: 355 AGLSLNVETILIENRGRDVAALLVGARPRLLD-YDLVCFIHDKKVTQIRPGS-VGEGFAK 412 Query: 255 WLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAKR 313 F ++L + +I TF+ P LG++ + A + +L Sbjct: 413 RCFENVLATPEFVCNVIATFQAEPRLGVLTPSAPHHGDYFPISAFSWGPNDKNTKELLAS 472 Query: 314 AGFP---TKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFF 369 G G++FW +P+ + PL +F E DG + HA+ER + Sbjct: 473 FGLHAPIDPDKEAIAPFGSVFWFRPQAIRPLLERKWRYDDFPAEPLPIDGTISHAIERVY 532 Query: 370 ACSVRYTEF 378 + + Sbjct: 533 CYMAQARGY 541 >gi|116071143|ref|ZP_01468412.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] gi|116066548|gb|EAU72305.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] Length = 1161 Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats. 
Identities = 59/287 (20%), Positives = 102/287 (35%), Gaps = 16/287 (5%) Query: 110 FMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTI-KSKIAIVVHCYYQDTWIEISH 168 F + ++++ P+K I + K + +H +Y + I+ Sbjct: 161 KFGIQEGRFSMDDIHFMRKTANIKKVSSPHPQKLTQAIEQKKFGVFLHIFYPELAKTIAD 220 Query: 169 ILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS---AQLYVMENKGRDVRPFLYLLELGVF 225 L ++ D++++ E D + + Q+ N GRDV PF+ + Sbjct: 221 YLAKIPVKIDIYISTTEKEVDELAKTFRRLDNSEHVQVKSFSNTGRDVAPFVVGFREEIL 280 Query: 226 DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS 285 +YD++ K+H KKS H W +L+G D+ I N + Sbjct: 281 -KYDFILKLHSKKSP----HSDALSGWFEHCLDNLIGSKDVFYTNIFELMNNETAIIYPV 335 Query: 286 RRYRRY---KRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--HLDFFNGTMFWVKPKCLEP 340 Y K S + Y + L + F GTMFW K L+P Sbjct: 336 ENYALSLGIKHDSCWGHEDGNYDKAKPLLDKLNLKHIDRDSKFLFPTGTMFWCKSYILQP 395 Query: 341 LRNLHL-IGEFEEERNLKDGALEHAVERFFACSV-RYTEFSIESVDC 385 + + +L +F+ E DG L H++ER I + C Sbjct: 396 ILDWNLGFHDFDNEGGQIDGTLAHSIERLIGLCCTEKFHKRIITSYC 442 >gi|297538440|ref|YP_003674209.1| Rhamnan synthesis F [Methylotenera sp. 301] gi|297257787|gb|ADI29632.1| Rhamnan synthesis F [Methylotenera sp. 301] Length = 782 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. 
Identities = 66/261 (25%), Positives = 105/261 (40%), Gaps = 17/261 (6%) Query: 126 VKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE 185 E P S + +A++ H +Y + L + F+FD+++T Sbjct: 487 FARKIEYAMLVPFSYQVESPQNNPSLAVICHLFYHQMCEDYKVYLSNIPFNFDIYITTDT 546 Query: 186 ANKDFEQDVLKYFP-----SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQ 240 +K + + K F ++ + N+GRD+ P L Y+Y+ IH K S Sbjct: 547 EDK--KAYIEKSFSGWQRGKVEVRLAVNQGRDIAPKLIACRDIY-SAYEYILHIHSKNSP 603 Query: 241 REGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR 300 H WR ++ LLG I F+ N LG+I + ++ K Sbjct: 604 YSSIHTG----WRDYILDTLLGSQKTVSSIFEAFQLNSNLGIIAPQHFKALKLDI---GW 656 Query: 301 SEVYRRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKD 358 ++ LA R GF R +DF +G+MFW + L PL N L + +F E KD Sbjct: 657 DRNFKIAKKLAGRMGFDISRKAPIDFPSGSMFWARSAALLPLLNCSLSLQDFPREDGQKD 716 Query: 359 GALEHAVERFFACSVRYTEFS 379 G H++ER + FS Sbjct: 717 GTTAHSIERLYFFICEKAGFS 737 >gi|225350704|ref|YP_002720664.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] gi|225216388|gb|ACN85121.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] Length = 342 Score = 242 bits (619), Expect = 5e-62, Method: Composition-based stats. 
Identities = 65/240 (27%), Positives = 107/240 (44%), Gaps = 12/240 (5%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFP---SAQL 203 K KI I +H YY D L +FDLF+T E NKD + P + + Sbjct: 24 KLKIGIHIHLYYIDMMDMFIKYLKDSPIEFDLFITTSKEENKDICLNAFNKLPKLKNITI 83 Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 +++EN GRD+ P+L + YD C +H KKS H W +L +L+ Sbjct: 84 FIVENIGRDIAPWLIECNNIQ-NNYDLFCHLHTKKSL----HWESINEWGEYLIENLI-S 137 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK-RAGFPTKRLH 322 + I++ F + +G+I Y + + + +++ + L K F K + Sbjct: 138 EEAINNILSNFILDNNIGIISPHIYYYLFPYILYIDKDDMHHIKLLLNKLNINFEPKPEN 197 Query: 323 LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 F G+M W +PK L+PL +L+L +F +E K G + HA+ER + + + Sbjct: 198 FVFPVGSMLWYRPKVLKPLFDLNLKYSDFPQEPIPKTGTIAHAIERIIGIICEQSNYKFK 257 >gi|269219069|ref|ZP_06162923.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] gi|269211216|gb|EEZ77556.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] Length = 687 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. 
Identities = 69/236 (29%), Positives = 104/236 (44%), Gaps = 9/236 (3%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMEN 208 S+IA+V+HC+Y D E+ L L DFDLFVT L+ + + +EN Sbjct: 75 SRIAVVIHCFYADLMPELFDRLRNLPTDFDLFVTNASGADVAVPKDLERMRHSVVVEVEN 134 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH---PIEGIIWRRWLFFDLLGFSD 265 GRD+ P + L+ G+ D YD + K+H KKS H G W+ DL+G + Sbjct: 135 HGRDIFPTVQLVNSGILDPYDLILKLHTKKSPWREEHADLDGSGAAWKDQFLSDLVGSRE 194 Query: 266 IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF 325 I+N F +P LG++ + K + R V L R L+F Sbjct: 195 KVEEILNAFAADPTLGLVTAADSIVGKEF-----WGGDQRIVEQLMLRIEMSIDPDELEF 249 Query: 326 FNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 +G+M+W + L+ LR +L +F+EE+ D HA+ER Sbjct: 250 ASGSMYWTRAFVLQGLRAFNLTSADFDEEKGQVDATTAHAIERIVGIVTDEAGLRT 305 Score = 71.9 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 24/88 (27%), Gaps = 8/88 (9%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G V + + +++ + + F +I L + R+ F + Sbjct: 604 YPGAMVGFDNTARRQWKADAWYGSNPYTFHRWIAGL------VRVVAPREAKDRLLFVNA 657 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + A L + + Sbjct: 658 WNEWAESAILEPTTRFGRTYLLAVRNAV 685 >gi|190572676|ref|YP_001970521.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] gi|190010598|emb|CAQ44207.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] Length = 566 Score = 242 bits (617), Expect = 9e-62, Method: Composition-based stats. 
Identities = 80/250 (32%), Positives = 111/250 (44%), Gaps = 14/250 (5%) Query: 147 IKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYFPSAQLY 204 +KS+ AIV+H Y+ D I + + D DLFV+V + + + A ++ Sbjct: 313 LKSRFAIVLHLYHLDLIESIQGYMKNMIVDHDLFVSVKSVADRRVAVRFFEERKVRAFVF 372 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 V N GRDV PF+ LL G+ DRYD +CKIH KKS G WR L LLG S Sbjct: 373 VHPNIGRDVGPFVSLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDELMKSLLGSS 428 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324 ++I+ F + G++G R+ LA G R+ L Sbjct: 429 HTVLKILRAFRHDSSCGIVGPEHAYVSN----ARFWGGNEERLRRLAAETGIDDARIRLG 484 Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF---SI 380 FF GTMFW +P L LR L + EF+ E D L H +ER F V + + Sbjct: 485 FFAGTMFWFRPAALYALRERALALSEFDPEAGQLDATLAHVIERLFVLWVEQAGYFAATT 544 Query: 381 ESVDCVAEYE 390 + D +E Sbjct: 545 RTPDAALRHE 554 >gi|13476281|ref|NP_107851.1| hypothetical protein mlr7560 [Mesorhizobium loti MAFF303099] gi|14027042|dbj|BAB53996.1| mlr7560 [Mesorhizobium loti MAFF303099] Length = 637 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. 
Identities = 59/245 (24%), Positives = 99/245 (40%), Gaps = 12/245 (4%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYF--PSAQLYV 205 KIA+ H YY D EI + + +D T + + E + K + + V Sbjct: 298 KIAVCAHIYYTDMLEEILALTGNIPVPYDFIATTDTPDKKAEIEATLAKRPGVKNVIVRV 357 Query: 206 ME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 +E N+GRD+ L + DRYD +C++H KKS + +++R + +LL Sbjct: 358 VEKNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQASRS--NLFKRHMLENLLNT 415 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH- 322 +++ F NP +G+ A +V + A+ K H Sbjct: 416 RGYVHNVLDMFHDNPSVGLAVPPVVHISYPTMGHA-WFFNRPKVEETARLLNIKVKFDHD 474 Query: 323 -LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 GTMFW +P+ L + +F E N DG L H +ER A + + ++ Sbjct: 475 TPVAAYGTMFWFRPRALRKMFEHKWKWEDFNAEPNHVDGGLAHVLERLIAYAAQDAGYTT 534 Query: 381 ESVDC 385 + C Sbjct: 535 RHIMC 539 >gi|3399709|dbj|BAA32094.1| rgpFc [Streptococcus mutans] Length = 583 Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats. 
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%) Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192 K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+ Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 + AQ++V N GRDV P L L YD++ H KKS+ + G W Sbjct: 330 ILSANGQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSW 385 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310 R L L+ +D I+ +QNP +G++ + + RY + + + L Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442 Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 ++ G K F GT W K L+PL +L+L + E L ++ HA+ Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502 Query: 366 ERFFACSV--RYTEFSIESV 383 ER + +F I Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522 >gi|78184210|ref|YP_376645.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] gi|78168504|gb|ABB25601.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 1161 Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats. 
Identities = 55/249 (22%), Positives = 94/249 (37%), Gaps = 17/249 (6%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP---SAQLY 204 + K + +H +Y + I+ + ++ D+ ++ ++ K + Q+ Sbjct: 200 QKKFGVFLHIFYPELAPIIADYIRKIPVKIDIHISTTHDAISGLTEIFKGLENSLNVQVK 259 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 N GRDV PF+ + +YDY+ K+H KKS H W +L+G Sbjct: 260 SFPNIGRDVAPFIVGFREEIP-KYDYILKLHSKKSP----HSNALSGWFEHCLDNLIGSI 314 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRR----YKRWSFFAKRSEVYRRVIDLAKRAGFPTKR 320 D+ I + + ++ K S + Y + L K+ G Sbjct: 315 DVFYTNIQELNKED-ISIVYPVENYALSLGIKHDSCWGHEDGNYNKAKTLLKKLGLEQIN 373 Query: 321 L--HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV-RYT 376 F G MFW KP L+P+ + L +F+ E DG L H++ER Y Sbjct: 374 RNSEFLFPTGNMFWCKPDILKPILDWDLKFEDFDNEGGQIDGTLAHSIERLIGLCCTEYF 433 Query: 377 EFSIESVDC 385 I + C Sbjct: 434 HKKIITSYC 442 >gi|290580710|ref|YP_003485102.1| rhamnan synthesis protein F [Streptococcus mutans NN2025] gi|254997609|dbj|BAH88210.1| RgpFc protein [Streptococcus mutans NN2025] Length = 557 Score = 241 bits (615), Expect = 2e-61, Method: Composition-based stats. 
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%) Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192 K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+ Sbjct: 244 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 303 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 + AQ++V N GRDV P L L YD++ H KKS+ + G W Sbjct: 304 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSW 359 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310 R L L+ +D I+ +QNP +G++ + + RY + + + L Sbjct: 360 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 416 Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 ++ G K F GT W K L+PL +L+L + E L ++ HA+ Sbjct: 417 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 476 Query: 366 ERFFACSV--RYTEFSIESV 383 ER + +F I Sbjct: 477 ERLLIYIAWNEHYDFRISKN 496 >gi|30024644|dbj|BAC75698.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. 
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 19/260 (7%) Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192 K+ + +K K+A+ +H +Y D E + +F +DLF+T +K + E+ Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 + AQ++V N GRDV P L L YD++ H KKS+ + G W Sbjct: 330 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310 R L L+ +D I+ +QNP +G++ + + RY + + + L Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442 Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 ++ G K F GT W K L+PL +L+L + E L ++ HA+ Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502 Query: 366 ERFFACSV--RYTEFSIESV 383 ER + +F I Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522 >gi|30024633|dbj|BAC75688.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 240 bits (613), Expect = 3e-61, Method: Composition-based stats. 
Identities = 66/260 (25%), Positives = 109/260 (41%), Gaps = 19/260 (7%) Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD- 193 K+ + +K K+A+ +H +Y D E + +F +DLF+T +K E + Sbjct: 270 HKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329 Query: 194 -VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 + AQ++V N GRDV P L L YD++ H KKS+ + G W Sbjct: 330 VLSANSQEAQIFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310 R L L+ +D I+ +QNP +G++ + + RY + + + L Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442 Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 ++ G K F GT W K L+PL +L+L + E L ++ HA+ Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502 Query: 366 ERFFACSV--RYTEFSIESV 383 ER + +F I Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522 >gi|299133415|ref|ZP_07026610.1| Rhamnan synthesis F [Afipia sp. 1NLS2] gi|298593552|gb|EFI53752.1| Rhamnan synthesis F [Afipia sp. 1NLS2] Length = 408 Score = 240 bits (613), Expect = 3e-61, Method: Composition-based stats. 
Identities = 92/238 (38%), Positives = 130/238 (54%), Gaps = 5/238 (2%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 P +PK L + I+VH +Y D W + L L F L VT+ E+N DF V Sbjct: 153 PGAPKPLQLNGRIATGIIVHLHYCDVWPDFEKRLRNLTCPFSLIVTLNESNPDFAARVAG 212 Query: 197 YFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256 FP+A++ V N+GRDV PF+ LL G D ++ +CK+HGKK+ G I G IWRR L Sbjct: 213 QFPNAKVLVYPNRGRDVGPFIQLLREGHLDDFELICKLHGKKTVSLGPRMIFGEIWRRLL 272 Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316 DL+G ++ I+ F P LG++GS + R ++ ++LAKR G Sbjct: 273 LNDLVGSDELVRAILQRFISQPGLGLVGSSHF----RGNYLGTWPRNAALTLELAKRLGC 328 Query: 317 PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSV 373 P +R LDFF GTMFWV+ + L+ L++L+L +F E DG L+HA+ER F Sbjct: 329 PEERFKLDFFAGTMFWVRRELLDLLKSLNLSQDDFPVEAGQTDGTLQHALERIFGALP 386 >gi|24379285|ref|NP_721240.1| RgpFc protein [Streptococcus mutans UA159] gi|24377204|gb|AAN58546.1|AE014924_6 RgpFc protein [Streptococcus mutans UA159] Length = 583 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. 
Identities = 66/260 (25%), Positives = 109/260 (41%), Gaps = 19/260 (7%) Query: 136 RPSSPKKSGLTIK-SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQ 192 K+ + +K K A+ +H +Y D E + +F +DLF+T +K + E+ Sbjct: 270 HKYVKKRERVDLKNQKAAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTDSDDKKAEIEE 329 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 + AQ++V N GRDV P L L YD++ H KKS+ + G W Sbjct: 330 ILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSW 385 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDL 310 R L L+ +D I+ +QNP +G++ + + RY + + + L Sbjct: 386 REELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTFFRYNKIVDAWNEHLIAPEMNTL 442 Query: 311 AKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAV 365 ++ G K F GT W K L+PL +L+L + E L ++ HA+ Sbjct: 443 WQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEEPLPQNSILHAI 502 Query: 366 ERFFACSV--RYTEFSIESV 383 ER + +F I Sbjct: 503 ERLLIYIAWNEHYDFRISKN 522 >gi|320095829|ref|ZP_08027469.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] gi|319977239|gb|EFW08942.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] Length = 619 Score = 239 bits (609), Expect = 9e-61, Method: Composition-based stats. 
Identities = 64/254 (25%), Positives = 104/254 (40%), Gaps = 10/254 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195 ++P+ ++ + H +Y D EI L L + L T + + E+ + Sbjct: 286 AAPEAREKAASLRVVAIAHIFYADMADEIIDRLSVLPDGWRLVATTADEERKAAIEETMA 345 Query: 196 KYFPSAQLYVM-ENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWR 253 + Q+ V+ N+GRD+ FL + D YD + KIH KKS ++ + +++ Sbjct: 346 RRGAVGQVRVVASNRGRDISAFLVDCSDVLAGDDYDVVVKIHSKKSVQDEANAA--QLFK 403 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L+ +LL D I+ F +P LGM + A +LAKR Sbjct: 404 DHLYENLLDSKDHVANILAEFADHPGLGMALAPMPHMGYPTMGHA-WFANRPPARELAKR 462 Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFA 370 G P G+MF +P+ L PL L +F E +DG+L H +ER A Sbjct: 463 IGITVPFDDHQPLAPYGSMFIARPRALRPLVEAGLTHDDFPPEGGYQDGSLAHVIERLLA 522 Query: 371 CSVRYTEFSIESVD 384 +V + V Sbjct: 523 YAVLSEGYYARPVM 536 >gi|78213552|ref|YP_382331.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] gi|78198011|gb|ABB35776.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] Length = 1162 Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats. 
Identities = 54/241 (22%), Positives = 90/241 (37%), Gaps = 16/241 (6%) Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS 200 I K+ I +H +Y + I+ L + D+F++ E + + + + Sbjct: 195 AIKEGLINKKVGIFLHIFYPELGETIAAYLKNIPCSIDVFISTREDSVAALEKIFARVEN 254 Query: 201 ---AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257 ++ N GRDV PF+ + YDY+ K+H KKS H W Sbjct: 255 TQKIEVRHFSNIGRDVAPFIVGFRDQIL-NYDYILKLHSKKSP----HSNALSGWFLHCL 309 Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGS-RRYRRYK---RWSFFAKRSEVYRRVIDLAKR 313 +L+G I + + P +G++ Y S + Y + R Sbjct: 310 DNLIGSEAITATNLKALQS-PEVGIVYPIENYALSLGIQHDSCWGHEDGNYAKARPFLNR 368 Query: 314 AGFPTKRLH--LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 + F GTMFW KP L+ + + L F+EE DG + H++ER Sbjct: 369 YNLRQIKRESQFQFPTGTMFWCKPAVLQSILDWGLNWNNFDEEGGQIDGTIAHSIERLIG 428 Query: 371 C 371 Sbjct: 429 I 429 >gi|220924211|ref|YP_002499513.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] gi|219948818|gb|ACL59210.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] Length = 1366 Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats. 
Identities = 68/245 (27%), Positives = 106/245 (43%), Gaps = 14/245 (5%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK---YFPS 200 GL + ++A++ H +Y D E+S L R+ DLF++ +K + Sbjct: 696 GLELPERVAVIAHVFYTDFCSELSAYLARIPTQADLFISTDTEDKRQQIAFALQSYNMGK 755 Query: 201 AQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260 + VM N GRD+ P L VF+ Y+Y IH KKS + WR +L +L Sbjct: 756 LTVRVMPNIGRDIAPMLVGF-DDVFNSYEYFLHIHSKKSPHDPAF----GSWREFLLENL 810 Query: 261 LGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK- 319 LG DI I+ + G++ S+ + + F + + L R G Sbjct: 811 LGSEDIIRSILYLLHAH-KTGIVFSQHFEPVRHLLNFGY---NFETMKGLLGRCGIKISN 866 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L L+F + + FW + L+PL +L+L +F E DG L HA+ER V + F Sbjct: 867 DLVLEFPSSSFFWGRSSALKPLLDLNLDWSDFAAEAGQIDGTLAHAIERSVLYIVEKSGF 926 Query: 379 SIESV 383 V Sbjct: 927 RWAKV 931 >gi|218455303|gb|AAX19606.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. 
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%) Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192 +R ++ S+ AIV+H ++ D I + + D+D+FV+V + + + Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 ++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYRDG----GGQW 418 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R L LLG S +R++ F+ +P G++G R+ LA Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474 Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371 G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534 Query: 372 SVRYTEF 378 V F Sbjct: 535 WVEQAGF 541 >gi|33862360|ref|NP_893920.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] gi|33640473|emb|CAE20262.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] Length = 738 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. 
Identities = 64/252 (25%), Positives = 99/252 (39%), Gaps = 18/252 (7%) Query: 137 PSSPKKSGLTIKSK---IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD 193 P S L + IA+ VH +Y + I + L DLF++ E Sbjct: 485 PMITPASSLQQQDSETTIALHVHVHYPELLDTILNALNYNKIRPDLFLSCTNHENHSEIQ 544 Query: 194 VLKYFPSAQ---LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250 + + N+GRD+ P L + + +Y+ +H KKS +G Sbjct: 545 CKSAGANCTLKSIITTPNRGRDIGPLLTEIGKELDTKYEIYGHLHTKKSALLPG--KQGC 602 Query: 251 IWRRWLFFDLLGFSDI--AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI 308 WR +L +L+G DI A RI+ ++NP LG++ + S + Sbjct: 603 SWRDFLISNLVGMQDIAMADRIVTALKKNPKLGLVFADDPTCV-------GWSGNRKHAD 655 Query: 309 DLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVER 367 LA + DF GTMFW K L L NL+L ++ +E DG + HA+ER Sbjct: 656 ILANKLNLGPLPRCFDFPVGTMFWAKKGALTELYNLNLGWEDYPQEPLGYDGTILHAIER 715 Query: 368 FFACSVRYTEFS 379 F+ Sbjct: 716 LLPIIAAKQGFT 727 >gi|218455307|gb|AAX19610.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455309|gb|AAX19612.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. 
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%) Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192 +R ++ S+ AIV+H ++ D I + + D+D+FV+V + + + Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 ++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R L LLG S +R++ F+ +P G++G R+ LA Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474 Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371 G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534 Query: 372 SVRYTEF 378 V F Sbjct: 535 WVEQAGF 541 >gi|166713474|ref|ZP_02244681.1| hypothetical protein Xoryp_19045 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 568 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. 
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%) Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192 +R ++ S+ AIV+H ++ D I + + D+D+FV+V + + + Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 ++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R L LLG S +R++ F+ +P G++G R+ LA Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474 Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371 G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534 Query: 372 SVRYTEF 378 V F Sbjct: 535 WVEQAGF 541 >gi|218455296|gb|AAV67426.2| glycosyltransferase [Xanthomonas oryzae pv. oryzicola] gi|218455299|gb|AAX19602.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455301|gb|AAX19604.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. 
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 11/247 (4%) Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192 +R ++ S+ AIV+H ++ D I + + D+D+FV+V + + + Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 ++ A +++ N GRDV PF+ LL G+ DRYD +CKIH KKS G W Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKIHSKKSVYHDG----GGQW 418 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R L LLG S +R++ F+ +P G++G R+ LA Sbjct: 419 RDDLMKALLGSSFDVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474 Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371 G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534 Query: 372 SVRYTEF 378 V F Sbjct: 535 WVEQAGF 541 >gi|218455305|gb|AAX19608.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. 
Identities = 76/247 (30%), Positives = 115/247 (46%), Gaps = 11/247 (4%) Query: 135 DRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQ 192 +R ++ S+ AIV+H ++ D I + + D+D+FV+V + + + Sbjct: 303 ERYGVGAIDAESLSSRFAIVLHLFHIDLIDAICAYMRNVIVDYDVFVSVKSISDRRMAVR 362 Query: 193 DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 ++ A +++ N GRDV PF+ LL G+ DRYD +CK+H KKS G W Sbjct: 363 YFQEHKIRASVFIHPNIGRDVGPFISLLNTGLLDRYDAVCKVHSKKSVYHDG----GGQW 418 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R L LLG S +R++ F+ +P G++G R+ LA Sbjct: 419 RDDLMKALLGSSFNVLRVLRAFDDHPACGIVGPESAYLSN----ARFWGGNEERLRVLAA 474 Query: 313 RAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFAC 371 G KR+ L FF GTMFW +P L LR + + EF+ E +D L H +ER F Sbjct: 475 ETGIEEKRIRLGFFAGTMFWFRPAALSALRARSIGLSEFDPEAGQRDATLAHVIERLFVL 534 Query: 372 SVRYTEF 378 V F Sbjct: 535 WVEQAGF 541 >gi|323138318|ref|ZP_08073389.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322396401|gb|EFX98931.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 754 Score = 236 bits (601), Expect = 6e-60, Method: Composition-based stats. 
Identities = 62/239 (25%), Positives = 106/239 (44%), Gaps = 13/239 (5%) Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS--A 201 + + +A +VH +Y D I L + DL+++ + V++ + Sbjct: 365 INMDKPVAAIVHAFYPDLLEHILGYLENIPCAVDLYISTDSAEKAEIIGKVVRNWSKGST 424 Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 + +MEN+GRD+ P + VF ++D +H K+S G WR +L L Sbjct: 425 DVRIMENRGRDIAPMIVGFRD-VFAKHDIFLHVHTKRSPHAG---DLLYHWRDYLLNTLF 480 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KR 320 G DIA +++ F +P +G++ + + +R + Y +L R G K Sbjct: 481 GTGDIARSVLSLF-NDPKIGVVFPQHFFEVRRMLNWGF---DYDLARNLLARVGVQLNKD 536 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L L+F +G+MFW + + PL +L L +F EE DG L HA+ER + Sbjct: 537 LVLEFPSGSMFWGRTDAIRPLLDLDLQFSDFPEEAGQIDGTLAHAIERTLLMVAESKGY 595 >gi|84501312|ref|ZP_00999517.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] gi|84390603|gb|EAQ03091.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] Length = 741 Score = 235 bits (600), Expect = 1e-59, Method: Composition-based stats. 
Identities = 74/252 (29%), Positives = 110/252 (43%), Gaps = 13/252 (5%) Query: 134 NDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDF 190 R + P+ +++ AI +H YY D W E S L RL+ FDL+VT+ + Sbjct: 113 PIRTTIPRFDPRRPRARFAIHLHLYYPDLWPEFSERLDRLDLSFDLYVTLTWRGPETEWL 172 Query: 191 EQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250 + + P AQ++ + N+GRD+ PFL LL G FD Y+ +CK+HGKKS H +G Sbjct: 173 ADIIREAHPRAQVFPVANRGRDILPFLRLLNAGAFDGYEAICKLHGKKSP----HRDDGD 228 Query: 251 IWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDL 310 WRR L +L + + + + ++W R L Sbjct: 229 AWRRHLVDGVLPGKALWTSLSAFLADEDAALWVADGQRYSVRKW-----WGSNRARTDAL 283 Query: 311 AKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAVERFF 369 +R DF G+M+W+KP L +R L L + FE E DG L HA ER Sbjct: 284 LRRVELDRSDTDFDFPAGSMYWMKPLLLGMIRALDLTEDLFEPESGQTDGTLAHAFERAI 343 Query: 370 ACSVRYTEFSIE 381 + + Sbjct: 344 GALAKAAGQEVR 355 Score = 54.6 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 36/114 (31%), Gaps = 9/114 (7%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 G I + +A ++G W S ++R + + F S Sbjct: 620 FAGLIYDYPAVARRSLDKGYRAGLPEKTIAGIMPSWDNSARRRARAHIARGANPATFRS- 678 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 WLR + S+ F + E +KA L +R + + +E Sbjct: 679 --WLRDL--QRERLAQSYRGE--LFINAWNEWGEKAMLEPSRTFGHLYLDILAE 726 >gi|312133752|ref|YP_004001091.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773032|gb|ADQ02520.1| Hypothetical protein BBMN68_1493 [Bifidobacterium longum subsp. longum BBMN68] Length = 651 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. 
Identities = 56/238 (23%), Positives = 94/238 (39%), Gaps = 10/238 (4%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYFP-SAQLYVM 206 +A+V H YY D + + D+ +TV E ++ + P + + V+ Sbjct: 309 KHVALVFHLYYIDLLDSSLQYISSMPEGCDVIITVGSEEKACIVKERCEGMPYNIDVRVI 368 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 EN+GRDV L V YD +C H KK + G + F ++L Sbjct: 369 ENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKVTQIKP-LSVGDGFAYKCFENILASKAY 426 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA-KRSEVYRRVIDLAKRAGFPTKRLH--- 322 II+ FE+ P LG++ + F + + + L + Sbjct: 427 VANIIDQFEREPHLGVLMPNPPEHGNYFPVFTLSWGDNFDGTVQLLRDIHKTVPLDKKKE 486 Query: 323 LDFFNGTMFWVKPKCL-EPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 + GTMFW +PK L + L N + +F +E N DG + H +ER + + + Sbjct: 487 VIAPLGTMFWFRPKALSDGLLNHNWQYSDFPKEPNKIDGTILHYIERAYCYVAQANGY 544 >gi|262038042|ref|ZP_06011449.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] gi|261747934|gb|EEY35366.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] Length = 629 Score = 234 bits (598), Expect = 1e-59, Method: Composition-based stats. 
Identities = 61/241 (25%), Positives = 101/241 (41%), Gaps = 13/241 (5%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLYVM 206 K+ + H Y++D E L + D+F+T + K + + K + V+ Sbjct: 303 PKVGLFFHIYFEDLIEECYRYALNMPEYADIFITTDKEEKKEKIEKIFSKMKNKIDIKVI 362 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 +N+GRDV FL +YDY C H KK+++ I+G ++ F ++LG ++ Sbjct: 363 QNRGRDVSAFLIP-NKEEILKYDYACFAHDKKTKQLQPE-IKGEDFKFRCFENILGSKEL 420 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-----VYRRVIDLAKRAGFPTK-- 319 II F +NP LG++ + + + Y +L K Sbjct: 421 VENIIGLFIENPRLGLLSPPSPNHAEFYGNLGREWGHSGNDNYEETCNLLKELVIEVNVD 480 Query: 320 -RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTE 377 GT+FW +PK LE L +F +E N DG L HA+ER + V+ Sbjct: 481 ISKAPVAPYGTIFWFRPKSLEKLLKKGWKYEDFPKEPNKVDGTLLHAIERVYPFVVQGAG 540 Query: 378 F 378 + Sbjct: 541 Y 541 >gi|260434430|ref|ZP_05788400.1| glycosyltransferase [Synechococcus sp. WH 8109] gi|260412304|gb|EEX05600.1| glycosyltransferase [Synechococcus sp. WH 8109] Length = 772 Score = 234 bits (598), Expect = 1e-59, Method: Composition-based stats. 
Identities = 52/241 (21%), Positives = 95/241 (39%), Gaps = 15/241 (6%) Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK---DFEQDVLKYFPSA 201 + I K+ + +H +Y + EI + +++++ + Sbjct: 530 MNIDEKVGLHIHVHYPELLDEILKAISMNKIRPEIYISCTNQAIRDLAIKNINEHGLILK 589 Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 ++ + N+GRD+ P L L + ++Y IH KKS H WR +L +L+ Sbjct: 590 KIILTPNRGRDIGPLLTCLGQELDEKYRIYGHIHTKKSIHIARHQSY--SWRTFLIENLI 647 Query: 262 GFSD--IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 G + + II+ ++ +G+ YR+ LA++ + Sbjct: 648 GNEENHMMDCIISAMIKDKTIGLAFPSDPHCP-------GWDANYRQAKLLAEKLNIKSL 700 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 +F GTMFW + L PL +L+L ++ E DG L H++ER F Sbjct: 701 TNEFNFPIGTMFWARKNALSPLYSLNLGWDDYPSEPIGYDGTLLHSIERLIPFVAESQGF 760 Query: 379 S 379 S Sbjct: 761 S 761 >gi|163853098|ref|YP_001641141.1| lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] gi|163664703|gb|ABY32070.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] Length = 916 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. 
Identities = 66/262 (25%), Positives = 113/262 (43%), Gaps = 13/262 (4%) Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA 186 + P++ K A +VH +Y + EI L + NF D++V+ ++ Sbjct: 227 PRNENDYAFSIPLPERLRSHPYKKAAAIVHGFYPELMEEILIYLGKSNFPIDIYVSTDDS 286 Query: 187 NK-DFEQDVLKYFPSAQ--LYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243 K + + K + + Q + ++ N+GRD+ P L VFD Y+ IH KKS G Sbjct: 287 KKAEQIISMGKKYHNGQLDVRIISNRGRDIGPMLTGFSD-VFDNYEAFLHIHTKKSPHGG 345 Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303 WR +LF +L+G ++I ++ +G + + + Sbjct: 346 DGLS---SWRDYLFKNLIGSAEIIDSNLHILGT-RNVGFVYPQHLYALRGIL---NWGYN 398 Query: 304 YRRVIDLAKRAGFPTK-RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGAL 361 + V L +R G + L+F +G+MFW + L L +L L + +F+ E DG L Sbjct: 399 FDTVSSLLRRVGVRLSKDMVLEFPSGSMFWARTAALHGLLSLDLKLEDFDNEAGQVDGTL 458 Query: 362 EHAVERFFACSVRYTEFSIESV 383 HA+ER F + +S V Sbjct: 459 GHAIERSFLYFAETSGYSWAKV 480 >gi|171779906|ref|ZP_02920810.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281254|gb|EDT46689.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 592 Score = 232 bits (593), Expect = 5e-59, Method: Composition-based stats. 
Identities = 63/270 (23%), Positives = 103/270 (38%), Gaps = 19/270 (7%) Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 + + + KIA+ +H +Y D + +F +DLF+T Sbjct: 266 NFPDFKYLLARKYVKEVPAVSLADKKIAVHLHVFYVDLLEDFLDAFENFHFVYDLFITTD 325 Query: 185 --EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 ++ E + AQ++V N GRDV P L L YDY+ H KKS+ Sbjct: 326 NATKKQEIESILRSNGKDAQIFVTGNVGRDVLPMLKL--KDYLSDYDYIGHFHTKKSKEA 383 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300 + G WR L L+ +D I+ F+ N LG++ + + R+ + Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFD-NDKLGIVIADIPTFFRFNKIVDAWNE 437 Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355 + + DL ++ G +F GT W K L+PL +L L E Sbjct: 438 HLIAPAMNDLWQQMGMTKAIDFNNFHNFVMSYGTYVWFKYDALKPLFDLGLTDEDVPAEP 497 Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESV 383 L ++ HA+ER + +F I Sbjct: 498 LPQNSILHAIERLLIYIAWNEHYDFRISKN 527 >gi|259414984|ref|ZP_05738907.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] gi|259349435|gb|EEW61182.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] Length = 680 Score = 232 bits (591), Expect = 1e-58, Method: Composition-based stats. 
Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 17/279 (6%) Query: 109 RFMSNSRMPFDS--EKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEI 166 + + ++ + +P + ++ A+V+H YY D W E Sbjct: 30 QRPFEHFLRAGRHEQRVTREHSATIAESGSAVAPLRGAGINQNLQAVVIHLYYTDLWDEF 89 Query: 167 SHILLRLNFDFDLFVTVVE---ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELG 223 L F FDL+VT+ E ++ + + +P A++ V+ N+GRD+ PFL+LL G Sbjct: 90 RDRLRSARFTFDLYVTLTEQGPETEETRARIAEDWPEARVLVLPNRGRDIYPFLHLLNAG 149 Query: 224 VFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGM- 282 D Y +CK+H KKS H +G +WR L +L + A ++ F G+ Sbjct: 150 WLDHYRAVCKLHSKKSP----HRQDGDVWRTHLTEGILPEGETAE-LLERFLAAEDCGLW 204 Query: 283 IGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLR 342 + ++ RW R +L R LDF G+++W+KP L+ LR Sbjct: 205 VADGQHYEGARW-----WGSNLERCRNLLARLELAASADTLDFPAGSIYWLKPAILDMLR 259 Query: 343 NLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 L L +F+ E+ DG L HA+ER I Sbjct: 260 GLALGFDDFDIEQGQTDGTLAHALERALGMICAAGGLQI 298 Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 11/117 (9%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAH-VSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 G I + R+ + A +PAH ++G W + ++ + FE Sbjct: 565 FGGVIYDY-DRVRARSQDPAYAGQLPAHTIAGTMPSWDNTARRGSAAHLAWGANPIRFER 623 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ LR+ S+ S + E +KA L + + Sbjct: 624 WLRELRT-----HRLPQSYRSE--IMINAWNEWAEKAVLEPSAQHGRGYLNALRRGL 673 >gi|154509526|ref|ZP_02045168.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] gi|153799160|gb|EDN81580.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] Length = 620 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. 
Identities = 66/254 (25%), Positives = 99/254 (38%), Gaps = 10/254 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195 + KI V H +Y D EI L L + L T E Sbjct: 286 ADQATLDAAASLKILAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEARAQ 345 Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253 + A + V+ N+GRD+ FL + YD + KIH KKS ++ Y+ +++ Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAA--QLFK 403 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L+ +LL SD I+ F +P LGM+ + A D AK+ Sbjct: 404 EHLYDNLLASSDHVASILAEFAAHPGLGMVIAPMPHMGYPTMGHA-WFANRAPARDFAKK 462 Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 G P G+MF +P+ L L L +F EE KDG+L H +ER + Sbjct: 463 VGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLLS 522 Query: 371 CSVRYTEFSIESVD 384 +V + + V Sbjct: 523 YAVLSRGYYVRPVM 536 >gi|221634566|ref|YP_002523254.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] gi|221163439|gb|ACM04401.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] Length = 755 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. 
Identities = 75/234 (32%), Positives = 109/234 (46%), Gaps = 15/234 (6%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208 A+ VH YY D W E + L RL FDL+VT+ E Q++ FP A + M N Sbjct: 139 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAQEIRADFPGAFVTPMPN 198 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + + Sbjct: 199 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 254 Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327 + + F + P G + ++ +W L +R P R L F Sbjct: 255 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 308 Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 G+++WVKP L LR+L L + +F+ E DG L HA+ER + Sbjct: 309 GSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 362 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 + G I + A ++G W + ++ + + F Sbjct: 626 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 684 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 WL L + S+ R F + E +KA L + + + + Sbjct: 685 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 738 Query: 127 KELFEGWNDRPSSPKKS 143 + P+ +S Sbjct: 739 EPATHLAEP-PAHGMRS 754 >gi|298290915|ref|YP_003692854.1| Rhamnan synthesis F [Starkeya novella DSM 506] gi|296927426|gb|ADH88235.1| Rhamnan synthesis F [Starkeya novella DSM 506] Length = 633 Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats. 
Identities = 64/257 (24%), Positives = 107/257 (41%), Gaps = 15/257 (5%) Query: 133 WNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-DFE 191 W P + + K+ + H +Y D E+ L DLF+T K + Sbjct: 378 WAVPVFGPPAAPVASPLKVGLHGHFFYPDLLPELLERLAANASRPDLFLTTDTPAKVEQL 437 Query: 192 QDVLKYFP-SAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEG 249 + + +P ++ V+ N GRD+ PFL L + YD L +HGKK++ G G Sbjct: 438 RALTAAWPAKVRIDVVPNSGRDIGPFLTALRDVLTGGEYDVLLHLHGKKTK--GRRRAIG 495 Query: 250 IIWRRWLFFDLLGFSD-IAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI 308 WR +L+ +L+G + ++ +P +G++ + R V Sbjct: 496 DPWRNFLWENLIGGDHPMLDAVLAYMAAHPQVGLVYPEDTHLLD-------WARNGRVVE 548 Query: 309 DLAKRAGFPTKR-LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVE 366 +L + G ++DF G MF V+P L P+ L L ++ E DG + H +E Sbjct: 549 ELRRDMGLTEPMGTYVDFPVGNMFAVRPAALAPVLALDLKWSDYPVEPIPLDGTVLHGIE 608 Query: 367 RFFACSVRYTEFSIESV 383 R VR F+ +V Sbjct: 609 RLLPTVVRKAGFTTAAV 625 >gi|291516581|emb|CBK70197.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum subsp. longum F8] Length = 688 Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats. 
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 PS + + A + H Y+ D + H + L + DL++T E ++ ++ Sbjct: 313 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 372 Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249 A + N+GRDV L V YD + H KKS + G+H E Sbjct: 373 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 432 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307 + L + LG I+ F +NP LG + + + Y Sbjct: 433 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 492 Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361 +L + R G P G+ +W + + L+PL +F E + +DG + Sbjct: 493 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 552 Query: 362 EHAVERFFACSVRYTEF 378 HA+ER + + Sbjct: 553 SHAIERANGYICQSRGY 569 >gi|126464825|ref|YP_001041801.1| lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] gi|126106640|gb|ABN79165.1| Lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] Length = 751 Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats. 
Identities = 74/234 (31%), Positives = 109/234 (46%), Gaps = 15/234 (6%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208 A+ VH YY D W E + L RL FDL+VT+ E +++ FP A + M N Sbjct: 135 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTPMPN 194 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + + Sbjct: 195 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 250 Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327 + + F + P G + ++ +W L +R P R L F Sbjct: 251 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 304 Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 G+++WVKP L LR+L L + +F+ E DG L HA+ER + Sbjct: 305 GSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 358 Score = 71.1 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 + G I + A ++G W + ++ + + F Sbjct: 622 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 680 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 WL L + S+ R F + E +KA L + + + + Sbjct: 681 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 734 Query: 127 KELFEGWNDRPSSPKKS 143 + P+ +S Sbjct: 735 EPATHLAEP-PAHGMRS 750 >gi|322690050|ref|YP_004209784.1| hypothetical protein BLIF_1872 [Bifidobacterium longum subsp. infantis 157F] gi|320461386|dbj|BAJ72006.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 672 Score = 230 bits (588), Expect = 2e-58, Method: Composition-based stats. 
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 PS + + A + H Y+ D + H + L + DL++T E ++ ++ Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350 Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249 A + N+GRDV L V YD + H KKS + G+H E Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307 + L + LG I+ F +NP LG + + + Y Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 470 Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361 +L + R G P G+ +W + + L+PL +F E + +DG + Sbjct: 471 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 530 Query: 362 EHAVERFFACSVRYTEF 378 HA+ER + + Sbjct: 531 SHAIERANGYICQSRGY 547 >gi|312866008|ref|ZP_07726229.1| rhamnan synthesis protein F [Streptococcus downei F0415] gi|311098412|gb|EFQ56635.1| rhamnan synthesis protein F [Streptococcus downei F0415] Length = 584 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. 
Identities = 60/256 (23%), Positives = 105/256 (41%), Gaps = 18/256 (7%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVL 195 + L +SK+A+ +H +Y D E +F +DLF+T + K + + + Sbjct: 274 EQAEAEELPAESKVAVHLHVFYVDLLQEFLDAFKTFHFAYDLFITTDKEEKRAEIQAILE 333 Query: 196 KYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + AQ++V N GRDV P L L YDY+ H KKS+ Y G WR+ Sbjct: 334 QNQVLAQIFVTGNIGRDVLPMLKL--KDQLKGYDYIGHFHTKKSKEADY--WAGQSWRQE 389 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L L+ + +I+ +N LG++ + + R+ + + + + +L ++ Sbjct: 390 LIAMLVKPA---NQILAQMAKNDRLGIVIADMPSFFRFNKIVVAWNENLIAPEMEELWEK 446 Query: 314 AGFPTK-----RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF 368 GT W K L PL +L L E+ L ++ HA+ER Sbjct: 447 MSLKKSIDFKAMDTFVMSYGTYAWFKYDALSPLFDLDLTDEYVPAEPLPQNSILHAIERL 506 Query: 369 FACSV--RYTEFSIES 382 ++ ++ I Sbjct: 507 LIYIAWDKHYDYRISP 522 >gi|189440434|ref|YP_001955515.1| lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|317482688|ref|ZP_07941702.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] gi|189428869|gb|ACD99017.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|316915934|gb|EFV37342.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] Length = 666 Score = 230 bits (587), Expect = 3e-58, Method: Composition-based stats. 
Identities = 56/257 (21%), Positives = 95/257 (36%), Gaps = 15/257 (5%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 PS + + A + H Y+ D + H + L + DL++T E ++ ++ Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350 Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249 A + N+GRDV L V YD + H KKS + G+H E Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307 + L + LG I+ F +NP LG + + + Y Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANYEIT 470 Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361 +L + R G P G+ +W + + L+PL +F E + +DG + Sbjct: 471 KELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGTI 530 Query: 362 EHAVERFFACSVRYTEF 378 HA+ER + + Sbjct: 531 SHAIERANGYICQSRGY 547 >gi|13476282|ref|NP_107852.1| hypothetical protein mlr7561 [Mesorhizobium loti MAFF303099] gi|14027043|dbj|BAB53997.1| mlr7561 [Mesorhizobium loti MAFF303099] Length = 609 Score = 230 bits (587), Expect = 3e-58, Method: Composition-based stats. 
Identities = 60/242 (24%), Positives = 98/242 (40%), Gaps = 11/242 (4%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFP--SAQLYV 205 +IA++ H Y+ D EI + +DL VT A+K +Q + K +A + V Sbjct: 298 RIAVLAHVYHLDMIDEILGYAENVPKGYDLIVTTDNADKQALIQQAIAKATNASNAVVLV 357 Query: 206 MENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + N GRD L V DRYD +C++H K+S ++G G +++ F +LL Sbjct: 358 VRNDGRDTSALLVGCRDYVLEDRYDLICRVHSKRSPQDGPR---GELFKLHTFENLLHTP 414 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRLH 322 ++ F NP LG++ + + V LA++ G Sbjct: 415 GYVSNLLELFANNPALGLVMPPLVHIGYP-TIGNSWAGNKANVAKLARQLGLIVHLDDST 473 Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIES 382 G M+W +P L L + +DG+L HA+ER A ++ Sbjct: 474 PVAPYGGMYWFRPAALRKLFEERWNWNDFANMDYRDGSLVHAIERIIAYVAIDAGYTFRH 533 Query: 383 VD 384 V Sbjct: 534 VM 535 >gi|148927812|ref|ZP_01811237.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] gi|147886838|gb|EDK72383.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] Length = 498 Score = 230 bits (586), Expect = 3e-58, Method: Composition-based stats. 
Identities = 70/237 (29%), Positives = 111/237 (46%), Gaps = 9/237 (3%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVME 207 ++A+VVH +Y + EI ++ + FDL +T + S + + E Sbjct: 240 RLAVVVHIFYPELANEIYDVIKNIVEPFDLIITTPHEGAVSELIDTFAPLASSVAIALSE 299 Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267 N+GRDV PFL + G+ +RYD + K+H KKS G W++ LF L G S I Sbjct: 300 NRGRDVGPFLAVHRSGLLERYDAVLKLHSKKSTY----SDSGQQWQQSLFRQLCGNSQIV 355 Query: 268 IRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327 R + ++ GM+G Y + A R V++ + L + + L FF Sbjct: 356 RRSV-ALLRDGKTGMVGPHDYYLTHPHYWGANRPAVHKLLQSLTA-TPLKEEDVPLRFFA 413 Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 GTMFW PK + L ++ + FE E +DG L HA+ER F + +++ S+ Sbjct: 414 GTMFWFAPKAIVALHDIPEALLNFESENGKQDGTLAHALERLFGIVPQLGGYNVTSL 470 >gi|125654691|ref|YP_001033885.1| hypothetical protein RSP_3918 [Rhodobacter sphaeroides 2.4.1] gi|77386351|gb|ABA81780.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 751 Score = 230 bits (586), Expect = 4e-58, Method: Composition-based stats. 
Identities = 74/234 (31%), Positives = 109/234 (46%), Gaps = 15/234 (6%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLYVMEN 208 A+ VH YY D W E + L RL FDL+VT+ E +++ FP A + M N Sbjct: 135 AVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTPMPN 194 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PF+ LL G FD Y +CK H KKS H +G +WR+ L +L + + Sbjct: 195 RGRDILPFVTLLNAGAFDGYRAVCKFHTKKSP----HRQDGDLWRKHLIEGILPETGLEE 250 Query: 269 RIINTFEQNPCLG-MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327 + + F + P G + ++ +W L +R P R L F Sbjct: 251 K-LEAFVEAPEAGFWVADGQHYTGTQW-----WGSNVEATRHLLQRIEIPLDREALSFPA 304 Query: 328 GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 G+++WVKP L LR+L L + +F+ E DG L HA+ER + Sbjct: 305 GSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTLAHAIERVLGYLTARAGQKV 358 Score = 71.1 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 10/137 (7%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 + G I + A ++G W + ++ + + F Sbjct: 622 AFSGLIYDYAAVARRALSETYVRTLPKATIAGVMPGWDNTARRGAAGHVAYGANPATFN- 680 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 WL L + S+ R F + E +KA L + + + + Sbjct: 681 --VWLAGALE--RRVPASY--RRELFVNAWNEWAEKAVLEPSLTFGDLNLQVMRQHLGAA 734 Query: 127 KELFEGWNDRPSSPKKS 143 + P+ +S Sbjct: 735 EPATHLAEP-PAHGMRS 750 >gi|298346187|ref|YP_003718874.1| hypothetical protein HMPREF0573_11061 [Mobiluncus curtisii ATCC 43063] gi|304390053|ref|ZP_07372007.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|298236248|gb|ADI67380.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 43063] gi|304326535|gb|EFL93779.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 680 Score = 230 bits (586), Expect = 4e-58, Method: Composition-based stats. 
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204 ++A+V+H YY D EI L + +FD+F+T + L + Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261 +EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRENHPDLEGSGAQWKDEFLDALL 170 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321 G D +I++ F +P LG++ + ++ +L +R K Sbjct: 171 GSKDSVEKIMSAFGADPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 L F G+M+WV+ ++ LR+L L +FE E D HA+ER + Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285 Query: 381 ES 382 Sbjct: 286 RE 287 Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%) Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98 G V + + +++ + +F WL + ++ RI F + Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651 Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122 E + A L + + + + Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677 >gi|219670466|ref|YP_002460901.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] gi|219540726|gb|ACL22465.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] Length = 606 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. 
Identities = 60/262 (22%), Positives = 102/262 (38%), Gaps = 10/262 (3%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDV 194 PS L + K+ + H YY+D H + + D+ +T + E+ + Sbjct: 279 PSDYVVKPLKRQPKVVVCFHVYYEDLLDSCFHYMQSIPQFADIVITTPKKELVGIIEEKI 338 Query: 195 LKY-FPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253 Y + + V+ +GR FL + + D YDY C +H KKS G+ + Sbjct: 339 KSYELNNTTIKVINARGRAESAFLVATKDFILD-YDYACIVHDKKSSFLRPG-CVGVEFG 396 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSE-VYRRVIDLAK 312 LL S I++ FE NP +G + + Y+ + K Sbjct: 397 LQNLDALLATSAYVENILSIFEDNPRIGALEPVHLLHANFRDLYGGEWGANYKGTEEFLK 456 Query: 313 RAGFPT---KRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERF 368 RAG + G MFW +P C++ + ++ +F EE DG+L H +ER Sbjct: 457 RAGIDLLISPDVPPLAPMGAMFWFRPICMKRILDMEWEYEDFPEEPLPLDGSLIHIIERA 516 Query: 369 FACSVRYTEFSIESVDCVAEYE 390 + V+ + V + + E Sbjct: 517 YPFIVQDAGYLTGWVSTIEDAE 538 >gi|315654770|ref|ZP_07907675.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] gi|315490731|gb|EFU80351.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] Length = 680 Score = 229 bits (585), Expect = 4e-58, Method: Composition-based stats. 
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204 ++A+V+H YY D EI L + +FD+F+T + L + Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261 +EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321 G D +I++ F +P LG++ + ++ +L +R K Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 L F G+M+WV+ ++ LR+L L +FE E D HA+ER + Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285 Query: 381 ES 382 Sbjct: 286 RE 287 Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%) Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98 G V + + +++ + +F WL + ++ RI F + Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651 Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122 E + A L + + + + Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677 >gi|293189412|ref|ZP_06608132.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] gi|292821502|gb|EFF80441.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] Length = 620 Score = 229 bits (583), Expect = 9e-58, Method: Composition-based stats. 
Identities = 65/254 (25%), Positives = 99/254 (38%), Gaps = 10/254 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVL 195 + K+ V H +Y D EI L L + L T E Sbjct: 286 ADQATLDAAASLKVLAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEAHAQ 345 Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253 + A + V+ N+GRD+ FL + YD + KIH KKS ++ Y+ +++ Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAA--QLFK 403 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L+ +LL SD I+ F +P LGM+ + A D AK+ Sbjct: 404 EHLYDNLLASSDHVASILAKFAAHPGLGMVIAPMPHMGYPTMGHA-WFANRAPARDFAKK 462 Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 G P G+MF +P+ L L L +F EE KDG+L H +ER + Sbjct: 463 VGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLLS 522 Query: 371 CSVRYTEFSIESVD 384 +V + + V Sbjct: 523 YAVLSRGYYVRPVM 536 >gi|258654317|ref|YP_003203473.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] gi|258557542|gb|ACV80484.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] Length = 631 Score = 229 bits (583), Expect = 9e-58, Method: Composition-based stats. 
Identities = 66/308 (21%), Positives = 107/308 (34%), Gaps = 28/308 (9%) Query: 101 QKAFLRLNRFMSNSRMPFDSEK-----FLYVKELFEGWNDR-----------PSSPKKSG 144 + +L N + M S ++ + P Sbjct: 232 EPTYLERNAILGRRVMEIVSRTDYPVDLIWRNVVRSAEPRTLYTNMSMLSVVPDVDTGFR 291 Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK---YFPSA 201 +I ++ H +Y+D E+ + + FDL VT A K + S Sbjct: 292 PDPPLRICVLAHIFYEDMTDEMMGWIGNIPVPFDLVVTTTSAAKKEAIESALEAYALKSV 351 Query: 202 QLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 ++ ++E N+GR FL + YD + KIH KKS + G + G +++ + Sbjct: 352 EVRLVESNRGRAESAFLIACRDVLTSGEYDLVLKIHSKKSPQNGANL--GQLFKHHSVDN 409 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-- 317 LL I+ F+ P LGM+ + +LA + G Sbjct: 410 LLSSPGYVASILGMFQSQPSLGMVFPPVVNIGFP-TLGHSWFTNREAAHELADQLGIHTI 468 Query: 318 TKRLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEE-RNLKDGALEHAVERFFACSVRY 375 R NGTMFW +P+ L L +F E DG L H +ER + +V Sbjct: 469 FDRTTPLAPNGTMFWARPESLAKLARHDFDYSQFAAEHEGWSDGMLGHVIERLYGYAVLD 528 Query: 376 TEFSIESV 383 I+ V Sbjct: 529 AGLRIQCV 536 >gi|261868364|ref|YP_003256286.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] gi|3132260|dbj|BAA28137.1| unnamed protein product [Actinobacillus actinomycetemcomitans] gi|261413696|gb|ACX83067.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] Length = 632 Score = 228 bits (582), Expect = 9e-58, Method: Composition-based stats. 
Identities = 55/250 (22%), Positives = 99/250 (39%), Gaps = 13/250 (5%) Query: 139 SPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD----- 193 S K + KI +V H YY D EI + +DL +T E + Sbjct: 284 SSKVEKVRSDIKILVVAHIYYSDMLDEIISYTQNIPCSYDLLITTANEKSKLEIESNPIL 343 Query: 194 VLKYFPSAQLYVME-NKGRDVRPFLYLLELGVFD-RYDYLCKIHGKKSQREGYHPIEGII 251 + + V+E N+GRD+ + + RYD++C++H KKS + ++ Sbjct: 344 KMSGAKGINVKVVEQNRGRDMSSLFITCKQEIISERYDWVCRLHSKKSPQNSHNMSI--H 401 Query: 252 WRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLA 311 ++ ++ ++L ++IN ++N +G A I +A Sbjct: 402 FKEMMYLNILKDKAYISKVINYLDKNKSIGFAMPSMVHIGHPTLGHA-WFTNRDLAIKIA 460 Query: 312 KRAGFPTK-RLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERF 368 +R G F GTMFW +P+ L+ L + +F +E +D +L H +ER Sbjct: 461 ERVGIKLPFDDISPFAAYGTMFWFRPEALKKLFEYNWKFEDFNKEPMHQDSSLAHILERL 520 Query: 369 FACSVRYTEF 378 + + Sbjct: 521 LVYAAHDAGY 530 >gi|254876593|ref|ZP_05249303.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254842614|gb|EET21028.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 765 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. 
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%) Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198 P S I K AI +H +Y D E + L +DL++T+ N +F ++ Sbjct: 520 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 579 Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR + Sbjct: 580 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 637 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 L +L+G ++ I + +G++ + E V +L G Sbjct: 638 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 690 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375 F G MFW + + + +L+ +EE +DG+ HA+ER V Sbjct: 691 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 750 Query: 376 TEFSIESVD 384 + +V Sbjct: 751 NGYKYVTVY 759 >gi|319939379|ref|ZP_08013739.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] gi|319811365|gb|EFW07660.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] Length = 587 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. 
Identities = 63/271 (23%), Positives = 99/271 (36%), Gaps = 21/271 (7%) Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 + + KI + +H +Y D + +F +DLF+T Sbjct: 266 NFPDFKYLLARKYIQTTAPTSLSNKKIGVHLHVFYVDLLEDFLKAFENFHFAYDLFITTD 325 Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 K + E + + +A ++V N GRDV P L L YDY+ H KKS+ Sbjct: 326 NDTKKLEIEAILNQNHKNAHIFVTGNIGRDVLPMLKL--KKYLSTYDYIGHFHTKKSKEA 383 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300 + G WR L L+ +D I+ FE N LG++ S + RY + Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFE-NDKLGLVISDIPTFFRYNKIVDAWNE 437 Query: 301 SEVYRRVIDLAKRAGFPTKRLHLDF-----FNGTMFWVKPKCLEPLRNLHLIG-EFEEER 354 + + DL + F GT W K L+PL +L L + E Sbjct: 438 HLIAPEMNDLWYKMKMTKPIDFNTFHTFVMSYGTFIWFKYDALKPLFDLDLTDKDVPIEP 497 Query: 355 NLKDGALEHAVERFFACSV--RYTEFSIESV 383 ++ HA+ER + +F I Sbjct: 498 LP-QNSILHAIERLIVYVAWNEHYDFRISKN 527 >gi|241668058|ref|ZP_04755636.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 756 Score = 227 bits (580), Expect = 2e-57, Method: Composition-based stats. 
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%) Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198 P S I K AI +H +Y D E + L +DL++T+ N +F ++ Sbjct: 511 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 570 Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR + Sbjct: 571 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 628 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 L +L+G ++ I + +G++ + E V +L G Sbjct: 629 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 681 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375 F G MFW + + + +L+ +EE +DG+ HA+ER V Sbjct: 682 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 741 Query: 376 TEFSIESVD 384 + +V Sbjct: 742 NGYKYVTVY 750 >gi|315657309|ref|ZP_07910191.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315491781|gb|EFU81390.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 680 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. 
Identities = 63/242 (26%), Positives = 100/242 (41%), Gaps = 14/242 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-----DFEQDVLKYFPSAQLY 204 ++A+V+H YY D EI L + +FD+F+T + L + Sbjct: 51 RLAVVMHVYYSDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP---IEGIIWRRWLFFDLL 261 +EN GRD+ P + L+ G D Y + K+H KKS HP G W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321 G D +I++ F +P LG++ + ++ +L +R K Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF-----WGGDQALTAELLRRLEMQLKPS 225 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLI-GEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 L F G+M+WV+ ++ LR+L L +FE E D HA+ER + Sbjct: 226 KLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAGLKL 285 Query: 381 ES 382 Sbjct: 286 RE 287 Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 27/86 (31%), Gaps = 8/86 (9%) Query: 39 SGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSR 98 G V + + +++ + +F WL + ++ RI F + Sbjct: 598 PGVMVNFDNTARRQWKPDVWYGANPYLFR---RWLAAA---ARSVLDRPAPERIVFINAW 651 Query: 99 KE--QKAFLRLNRFMSNSRMPFDSEK 122 E + A L + + + + Sbjct: 652 NEWAEGAILEPTQRFGKTYLQAVRDV 677 >gi|167627488|ref|YP_001677988.1| group 1 glycosyl transferase [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167597489|gb|ABZ87487.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 763 Score = 227 bits (579), Expect = 2e-57, Method: Composition-based stats. 
Identities = 55/249 (22%), Positives = 100/249 (40%), Gaps = 13/249 (5%) Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV-VEANKDFEQDVLKYF 198 P S I K AI +H +Y D E + L +DL++T+ N +F ++ Sbjct: 518 PINSEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSS 577 Query: 199 P--SAQLYVMENKGRDVRPFLYLLELGVF-DRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 + ++ ++N GRD+ P ++ L+ + Y+ + H KK+ H G WR + Sbjct: 578 GAANVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKT--VSAHDNLGDKWRAY 635 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 L +L+G ++ I + +G++ + E V +L G Sbjct: 636 LLNNLIGDNEQISNSILNLFNDEKIGLVFPE-------DRTYIDIGENKFYVDELCTAIG 688 Query: 316 FPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375 F G MFW + + + +L+ +EE +DG+ HA+ER V Sbjct: 689 LEKICETPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPRDGSYMHALERIIPNIVEK 748 Query: 376 TEFSIESVD 384 + +V Sbjct: 749 NGYKYVTVY 757 >gi|296876714|ref|ZP_06900762.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] gi|296432216|gb|EFH18015.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] Length = 582 Score = 225 bits (575), Expect = 6e-57, Method: Composition-based stats. 
Identities = 65/266 (24%), Positives = 107/266 (40%), Gaps = 18/266 (6%) Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-- 184 + + + ++ K+A+ +H +Y D E +FD+DL++T Sbjct: 262 PDFPYLLSRKYLKKQELAGDFDKKVAVHLHVFYVDLLEEFLDAFRDFHFDYDLWITTDVE 321 Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E + EQ + A++ V N GRDV P L L +YDY+ H KKS+ + Sbjct: 322 EKKQAIEQILSNRAQDARVVVTGNIGRDVLPMLLL--KEQLSKYDYVGHFHTKKSKEADF 379 Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE 302 G WR+ L L+ +D +I+ E NP +G+ + + RY R + Sbjct: 380 --WAGESWRKELIEMLVKPAD---QILANMEANPKVGITIADIPTFFRYNRIVVAWNEAL 434 Query: 303 VYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLK 357 + + L +R G K GT W K L+PL +L+L L Sbjct: 435 ISPEMNKLWQRMGATKTIDFEKINTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLP 494 Query: 358 DGALEHAVERFFACSV--RYTEFSIE 381 ++ HA+ER + +F I Sbjct: 495 QNSILHAIERLLIYIAWDQKYDFRIS 520 >gi|289678438|ref|ZP_06499328.1| glycosyl transferase, group 1 [Pseudomonas syringae pv. syringae FF5] Length = 774 Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats. 
Identities = 53/241 (21%), Positives = 90/241 (37%), Gaps = 11/241 (4%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFE-QDVLKYFPSA- 201 + +A+ +H +Y+D + SH L D+F+T+ +A + V P Sbjct: 260 PEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLADAKHQKKTIAVFSKHPRVK 319 Query: 202 --QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 ++ + N+GR+ P L YD C +H KKS G E W +L Sbjct: 320 NLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 LL ++I R++N F + LG+ + W + Sbjct: 376 LLRDANIITRLLNAFADHKDLGLYYPTTFWMMPSWVNHVTM--NKSFMAAWHNEWQIDPC 433 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L + G MFW +P+ L+ + F +E DG++ HA+ER + Sbjct: 434 DGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493 Query: 379 S 379 Sbjct: 494 K 494 >gi|262282406|ref|ZP_06060174.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] gi|262261697|gb|EEY80395.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] Length = 582 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. 
Identities = 63/244 (25%), Positives = 100/244 (40%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 KIA+ +H +Y D E H +F +DLF+T K + + A++ V Sbjct: 283 KKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDSEKKKNEILDILEGKQAKAEVLVT 342 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L +YDY+ H KKS+ Y G WR+ L L+ +D Sbjct: 343 GNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEADY--WAGESWRKELINMLVHPAD- 397 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK----- 319 +I++ Q+ LG++ + + R+ R + + + L +R + Sbjct: 398 --QIVSQLGQDDRLGLVIADIPSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEVDFKQ 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L PL +L+L E L ++ HA+ER + + Sbjct: 456 MNTFVMSYGTFVWFKYDALSPLFDLNLTEEDVPSEPLPQNSILHAIERLLVYIAWDKQYD 515 Query: 378 FSIE 381 F I Sbjct: 516 FKIS 519 >gi|55821450|ref|YP_139892.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] gi|55737435|gb|AAV61077.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] Length = 594 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. 
Identities = 64/247 (25%), Positives = 101/247 (40%), Gaps = 18/247 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 KIA+ +H YY D + +F +DLF+T +K + + + K A++++ Sbjct: 287 KKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEDKKAEIQSILDKNGKVARIFIT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRDV P L L YDY+ H KKS Y G WR LF L+ +D Sbjct: 347 GNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPEYPY--WVGDSWRNELFSMLIQPAD- 401 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321 II E++ LG++ + + RY + + + DL +R Sbjct: 402 --NIIANLERDDRLGLVIADIPSFFRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFDK 459 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC--SVRYTE 377 GT W K L+PL +L L E + + H++ER R + Sbjct: 460 MNTFIMSYGTFIWFKYDALKPLFDLDLQDEEIPAEPIPQHTILHSIERILVYLAWARRYD 519 Query: 378 FSIESVD 384 ++I D Sbjct: 520 YAIAKND 526 >gi|306831662|ref|ZP_07464819.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325978600|ref|YP_004288316.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] gi|304426087|gb|EFM29202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325178528|emb|CBZ48572.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] Length = 586 Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats. 
Identities = 59/239 (24%), Positives = 96/239 (40%), Gaps = 16/239 (6%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E + +FD+DLF+T K + E + K AQ+++ Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKIAQVFLT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I+ E N LG++ + + RY + + + +L +R Sbjct: 402 --NILANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWERMNLERQIDFNN 459 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GT W K L+PL +L L + + + H++ER + Sbjct: 460 LSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518 >gi|55823377|ref|YP_141818.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] gi|55739362|gb|AAV63003.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 65/244 (26%), Positives = 100/244 (40%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E +F +DL++T E ++ EQ + + A + V Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDIEEKKQEIEQILSRRSQDATIVVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321 +I+ E NP +G+ Y RY R + + + L +R G Sbjct: 399 --QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPEMNKLWQRMGATKNIDFKN 456 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+PL +L+L L ++ HA+ER + + Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLPQNSILHAIERLLVYIAWDQKYD 516 Query: 378 FSIE 381 F I Sbjct: 517 FRIS 520 >gi|94990172|ref|YP_598272.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] gi|94543680|gb|ABF33728.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I++ FE N +G+I + + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+ L +L L L ++ HA+ER + Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515 Query: 378 FSI 380 F I Sbjct: 516 FRI 518 >gi|94988294|ref|YP_596395.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94992170|ref|YP_600269.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] gi|94541802|gb|ABF31851.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94545678|gb|ABF35725.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I++ FE N +G+I + + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+ L +L L L ++ HA+ER + Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515 Query: 378 FSI 380 F I Sbjct: 516 FRI 518 >gi|330899783|gb|EGH31202.1| hypothetical protein PSYJA_20361 [Pseudomonas syringae pv. japonica str. M301072PT] Length = 626 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 53/241 (21%), Positives = 90/241 (37%), Gaps = 11/241 (4%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFE-QDVLKYFPSA- 201 + +A+ +H +Y+D + SH L D+F+T+ +A + V P Sbjct: 112 PEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLADAKHQKKTIAVFSKHPRVK 171 Query: 202 --QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 ++ + N+GR+ P L YD C +H KKS G E W +L Sbjct: 172 NLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 227 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 LL ++I R++N F + LG+ + W + Sbjct: 228 LLRDANIITRLLNAFADHKDLGLYYPTTFWMMPSWVNHVTM--NKSFMAAWHNEWQIAPC 285 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLH-LIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L + G MFW +P+ L+ + F +E DG++ HA+ER + Sbjct: 286 DGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 345 Query: 379 S 379 Sbjct: 346 K 346 >gi|21910063|ref|NP_664331.1| hypothetical protein SpyM3_0527 [Streptococcus pyogenes MGAS315] gi|28896239|ref|NP_802589.1| hypothetical protein SPs1327 [Streptococcus pyogenes SSI-1] gi|21904254|gb|AAM79134.1| putative protein [Streptococcus pyogenes MGAS315] gi|28811490|dbj|BAC64422.1| conserved hypothetical protein [Streptococcus pyogenes SSI-1] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I++ FE N +G+I + + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+ L +L L L ++ HA+ER + Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515 Query: 378 FSI 380 F I Sbjct: 516 FRI 518 >gi|319946716|ref|ZP_08020950.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] gi|319746764|gb|EFV99023.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 62/244 (25%), Positives = 97/244 (39%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E +F +DL++T E + E+ + A + V Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQAFHFAYDLWITTDVEEKKQAIEEILSNRAQVATVVVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNIGRDVLPMLLL--KEQLSHYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-----K 319 +I+ E NP +G+ + + RY R + + L +R G Sbjct: 399 --KILANMEANPKVGITIADIPTFFRYNRIVVAWNEVLISPEMNKLWQRMGATKTIDFKN 456 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+PL +L+L L ++ HA+ER + + Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLKAADVPAEPLPQNSILHAIERLLVYIAWDQKYD 516 Query: 378 FSIE 381 F I Sbjct: 517 FRIS 520 >gi|50913971|ref|YP_059943.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] gi|50903045|gb|AAT86760.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] Length = 581 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. 
Identities = 58/243 (23%), Positives = 100/243 (41%), Gaps = 19/243 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I++ FE N +G+I + + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETND-IGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQA 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+ L +L L L ++ HA+ER + Sbjct: 456 MDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYD 515 Query: 378 FSI 380 F I Sbjct: 516 FRI 518 >gi|322516362|ref|ZP_08069287.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] gi|322125095|gb|EFX96488.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] Length = 581 Score = 224 bits (570), Expect = 3e-56, Method: Composition-based stats. 
Identities = 63/244 (25%), Positives = 99/244 (40%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E +F +DL++T E + E+ + A + V Sbjct: 284 RKVAVHLHVFYVDLLEEFLDAFQAFHFIYDLWITTDVEEKKQAIEKILSNRVQDATVVVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D+ Sbjct: 344 GNIGRDVLPMLLL--KEQLSRYDYVGHFHTKKSKEADF--WAGESWRKELIEMLVKPADL 399 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-----K 319 I+ E NP +G+ + + RY R + + + L +R G Sbjct: 400 ---ILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPEMNKLWQRMGATKTIDFKS 456 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+PL +L+L L ++ HA+ER + + Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLPQNSILHAIERLLIYIAWDQKYD 516 Query: 378 FSIE 381 F I Sbjct: 517 FRIS 520 >gi|116628171|ref|YP_820790.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] gi|116101448|gb|ABJ66594.1| Lipopolysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] Length = 581 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. 
Identities = 65/244 (26%), Positives = 100/244 (40%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E +F +DL++T E ++ EQ + + A + V Sbjct: 284 KKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDVEEKKQEIEQILSRRSQDATIVVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L RYDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321 +I+ E NP +G+ Y RY R + + + L +R G Sbjct: 399 --QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPEMNKLWQRMGATKNIDFKN 456 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+PL +L+L L ++ HA+ER + + Sbjct: 457 LNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLPQNSILHAIERLLVYIAWDQKYD 516 Query: 378 FSIE 381 F I Sbjct: 517 FRIS 520 >gi|83950907|ref|ZP_00959640.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] gi|83838806|gb|EAP78102.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] Length = 752 Score = 223 bits (568), Expect = 4e-56, Method: Composition-based stats. 
Identities = 69/251 (27%), Positives = 106/251 (42%), Gaps = 13/251 (5%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFPSAQLY 204 K++ A+ H YY D W E + + DL++T+ E + ++ + FP A + Sbjct: 130 KARFALHAHIYYPDLWPEFATRFDEIGDGIDLYITLTWRGEETRWLADEITERFPRAFVT 189 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + N+GRD+ PFL L G FD YD LCKIH KKS H +G WRR L +L + Sbjct: 190 PVPNRGRDILPFLLLANAGAFDGYDALCKIHTKKSP----HRDDGDQWRRHLIDGVLPAT 245 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324 + R+ + + + + + W + + +R L Sbjct: 246 GLQERLQHFLADDAAAFWVADGQAYAARDW-----WGINRDKTAAVLRRVELDPLLDALR 300 Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 F G+++W+KP L ++ L L FE E+ DG L HAVER I Sbjct: 301 FPAGSIYWMKPLMLGMIKALDLDAPMFEPEKGQVDGTLAHAVERAIGGLALAAGQEIRET 360 Query: 384 DCVAEYERLLH 394 + R H Sbjct: 361 AALMRPRRAGH 371 Score = 73.8 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 32/110 (29%), Gaps = 9/110 (8%) Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73 I + + P ++G W + ++ + H + F + WL Sbjct: 633 IYDYRAIAARSLTPQYRDRLPPNTIAGIMPSWDNTARRGPRAHIAHGATPASFRN---WL 689 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 R LS F + E +KA L + + SE Sbjct: 690 RGLCG----GPLSQSYRGELFINAWNEWAEKAMLEPSTRFGRLYLDVLSE 735 >gi|288905572|ref|YP_003430794.1| polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] gi|288732298|emb|CBI13867.1| Putative polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] Length = 586 Score = 222 bits (567), Expect = 5e-56, Method: Composition-based stats. 
Identities = 58/239 (24%), Positives = 95/239 (39%), Gaps = 16/239 (6%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E + +FD+DLF+T K + E + K AQ+++ Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKIAQVFLT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 I+ E N LG++ + + RY + + + +L + Sbjct: 402 --NILANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWECMNLERQIDFNN 459 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GT W K L+PL +L L + + + H++ER + Sbjct: 460 LSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518 >gi|302337198|ref|YP_003802404.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] gi|301634383|gb|ADK79810.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] Length = 1808 Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats. 
Identities = 56/239 (23%), Positives = 93/239 (38%), Gaps = 17/239 (7%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV---EANKDFEQDVLKYFP---SAQL 203 I + +H +Y D E+ L+ + F LF++ + + ++ V K P + Sbjct: 1018 SIGVHLHLFYIDLAEELLSSLINIPVCFSLFISTSAGVKDQEYIKKIVNKKLPLCNECTV 1077 Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 EN+GRD+ PF+ ++D + H KKS H RR+L +LG Sbjct: 1078 IQTENRGRDIAPFIVEFGNS-LSQFDLILHFHSKKSL----HSDSLSDARRFLLHYILGN 1132 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYR-RVIDLAKRAGFPTKRLH 322 I I+ +N F +N +GM+ + + K+ G Sbjct: 1133 KAITIQNLNMFFENGSIGMVAPPYH----PSLRNMPNFGLQEYETKQFLKKMGINYSGKC 1188 Query: 323 LDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 DF G+ FW + + L ++ F EE+ DG L H +ER + F I Sbjct: 1189 TDFPAGSFFWCRKDAIRQLLTSNIRWNSFPEEKGQIDGTLAHVIERSLGIICKQNNFKI 1247 >gi|225868697|ref|YP_002744645.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] gi|225701973|emb|CAW99527.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] Length = 581 Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats. 
Identities = 58/274 (21%), Positives = 105/274 (38%), Gaps = 19/274 (6%) Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 ++ + + + + KIA+ +H +Y D E +FD+DL +T Sbjct: 260 HLPDAKYLLAHKYLPEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTD 319 Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+ Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300 + G WR L ++ +D +I+ + +G++ + + R+ + Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431 Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355 + + L + G GT W K L+PL +L L Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAEP 491 Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387 L ++ HA+ER R+ +F I + + Sbjct: 492 LPQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525 >gi|157151529|ref|YP_001450315.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] gi|157076323|gb|ABV11006.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] Length = 582 Score = 222 bits (566), Expect = 7e-56, Method: Composition-based stats. 
Identities = 63/244 (25%), Positives = 102/244 (41%), Gaps = 18/244 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 KIA+ +H +Y D E H +F +DLF+T K + + A+++V Sbjct: 283 KKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDSEKKKNEILGILEGKQAKAEVFVT 342 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N GRDV P L L +YDY+ H KKS+ Y G WR+ L L+ +D Sbjct: 343 GNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEADY--WAGESWRKELINMLVHPAD- 397 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK----- 319 +I++ Q+ CLG++ + + R+ R + + + L +R + Sbjct: 398 --QIVSQLGQDDCLGLVIADIPSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEVDFKQ 455 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L PL +L++ E L ++ HA+ER + + Sbjct: 456 MNTFVMSYGTFVWFKYDALSPLFDLNMTEEDVPSEPLPQNSILHAIERLLVYIAWDKQYD 515 Query: 378 FSIE 381 F I Sbjct: 516 FKIS 519 >gi|269978088|ref|ZP_06185038.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|306818459|ref|ZP_07452182.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307700705|ref|ZP_07637730.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] gi|269933597|gb|EEZ90181.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|304648632|gb|EFM45934.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307613700|gb|EFN92944.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] Length = 613 Score = 222 bits (566), Expect = 8e-56, Method: Composition-based stats. 
Identities = 69/254 (27%), Positives = 106/254 (41%), Gaps = 10/254 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVL 195 + K +IA V H +Y D EI L +F+T E EQ + Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344 Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGV-FDRYDYLCKIHGKKSQREGYHPIEGIIWR 253 A++ ++E N+GRDV FL + R+D + KIH KKS ++ Y+ +++ Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGRFDVVAKIHSKKSAQDAYNAA--ELFK 402 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 R LF +LL +++ F P LGM+ A + + L +R Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLGYPTLGHA-WFANKKPALALCER 461 Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 G P G+MF+ +P+ L PL H +F EE DG+L H +ER F+ Sbjct: 462 LGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVIERIFS 521 Query: 371 CSVRYTEFSIESVD 384 S +SV Sbjct: 522 YSSLSEGLICKSVM 535 >gi|195977971|ref|YP_002123215.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195974676|gb|ACG62202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 581 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. 
Identities = 59/274 (21%), Positives = 106/274 (38%), Gaps = 19/274 (6%) Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 ++ + + S + + KIA+ +H +Y D E +FD+DL +T Sbjct: 260 HLPDAKYLLAHKYLSNQPISIAPSKKIAVHLHVFYADLLSEFLEAFSHFHFDYDLLITTD 319 Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+ Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300 + G WR L ++ +D +I+ + +G++ + + R+ + Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431 Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355 + + L + G GT W K L+PL +L L Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLGLNEADIPAEP 491 Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387 L ++ HA+ER R+ +F I + + Sbjct: 492 LPQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525 >gi|222152862|ref|YP_002562039.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] gi|222113675|emb|CAR41606.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] Length = 585 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. 
Identities = 64/247 (25%), Positives = 101/247 (40%), Gaps = 19/247 (7%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYV 205 + IA+ +H +Y D E H F FDL++T E + + + + SA++ V Sbjct: 285 EHSIAVHLHVFYVDLLEEFLHAFTSFKFPFDLYITTDKSEKESEIKAILDSFRVSAKIVV 344 Query: 206 MENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSD 265 N GRDV P L L +YDY+ H KKS+ + G WR L L+ + Sbjct: 345 TGNIGRDVLPMLKL--KDELSQYDYIGHFHTKKSKEADF--WAGESWRNELIDMLIKPA- 399 Query: 266 IAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL 323 IIN FE +P +G+I + + R+ + + + L ++ Sbjct: 400 --NTIINQFE-DPAIGIIIADIPSFFRFNKIVTPLNEHLIAPEMNKLWEKMNLSKTIDFE 456 Query: 324 DF-----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYT 376 F GT W K L+PL +L+L + L ++ HAVER + Sbjct: 457 QFDTFVMSYGTFVWFKYDALKPLFDLNLKDGDVPKEPLPQNSILHAVERLLIYIAWDSHF 516 Query: 377 EFSIESV 383 +F I Sbjct: 517 DFRIAKN 523 >gi|225870347|ref|YP_002746294.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] gi|225699751|emb|CAW93520.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] Length = 581 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. 
Identities = 58/274 (21%), Positives = 104/274 (37%), Gaps = 19/274 (6%) Query: 125 YVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV 184 + + + + + KIA+ +H +Y D E +FD+DL +T Sbjct: 260 HPPDAKYLLAHKYLPEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTD 319 Query: 185 EANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 K + ++ + + SA + V N GRDV P L L +YDY+ H KKS+ Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKR 300 + G WR L ++ +D +I+ + +G++ + + R+ + Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAADA-IGIVIADIPSFFRFNKIVDAWNE 431 Query: 301 SEVYRRVIDLAKRAGFP-----TKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERN 355 + + L + G GT W K L+PL +L L Sbjct: 432 HLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAEP 491 Query: 356 LKDGALEHAVERFFACSV--RYTEFSIESVDCVA 387 L ++ HA+ER R+ +F I + + Sbjct: 492 LSQNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525 >gi|227875198|ref|ZP_03993340.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] gi|227844103|gb|EEJ54270.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] Length = 613 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. 
Identities = 68/254 (26%), Positives = 105/254 (41%), Gaps = 10/254 (3%) Query: 138 SSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVL 195 + K +IA V H +Y D EI L +F+T E EQ + Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344 Query: 196 KYFPSAQLYVME-NKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQREGYHPIEGIIWR 253 A++ ++E N+GRDV FL + +D + KIH KKS ++ Y+ +++ Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGCFDVVAKIHSKKSAQDAYNAA--ELFK 402 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 R LF +LL +++ F P LGM+ A + + L +R Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLGYPTLGHA-WFANKKPALALCER 461 Query: 314 AGF--PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 G P G+MF+ +P+ L PL H +F EE DG+L H +ER F+ Sbjct: 462 LGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVIERIFS 521 Query: 371 CSVRYTEFSIESVD 384 S +SV Sbjct: 522 YSSLSEGLICKSVM 535 >gi|306833804|ref|ZP_07466929.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] gi|304423998|gb|EFM27139.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] Length = 586 Score = 221 bits (564), Expect = 1e-55, Method: Composition-based stats. 
Identities = 58/239 (24%), Positives = 97/239 (40%), Gaps = 16/239 (6%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E + +FD+DLF+T K + E + K +AQ+++ Sbjct: 287 KKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESILEKNGKTAQVFLT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L YDY+ H KKS Y G WR L+ L+ +D Sbjct: 347 GNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPEYPY--WVGDSWRNELYQMLIQSAD- 401 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TK 319 ++ E N LG++ + + RY + + + +L +R Sbjct: 402 --NVLANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFADGMNELWERMNLGRQIDFNN 459 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GT W K L+PL +L L + + + H++ER + Sbjct: 460 LSTFIMSYGTFIWFKHDTLKPLFDLELTDDEIPSEPIPQHTILHSIERILVYLAWANNY 518 >gi|296135664|ref|YP_003642906.1| glycosyl transferase family 2 [Thiomonas intermedia K12] gi|295795786|gb|ADG30576.1| glycosyl transferase family 2 [Thiomonas intermedia K12] Length = 1414 Score = 221 bits (564), Expect = 1e-55, Method: Composition-based stats. 
Identities = 79/241 (32%), Positives = 109/241 (45%), Gaps = 19/241 (7%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGR 211 A+++H YY D W E L L D++V++ E ++ D+++ P A + NKGR Sbjct: 281 AVLLHLYYPDLWPEFLAHLKTLPAPCDVYVSLSEGREELLTDIVRDLPDAVVMRHPNKGR 340 Query: 212 DVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY---------HPIEGIIWRRWLFFDLLG 262 D+ P L LL L Y L +HGKKS +G WRR L LL Sbjct: 341 DIAPRLALLRLARAHNYKQLLFLHGKKSPHLKEVENIHIPFLQHKDGDRWRRELLAALL- 399 Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322 D + + I F Q P LG+IG + R + R+ A+R G Sbjct: 400 --DASEKTIAAFAQQPKLGLIGPHGFWLGLR------GDANFPRLSAQAQRMGITPDPAR 451 Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 +F G+MFW +P+ L+PL L L +FE+E DG L H VER FA S F I Sbjct: 452 HGYFAGSMFWCRPQALDPLLALDLKDADFEDETGQTDGTLAHVVERLFALSAEKAGFQIA 511 Query: 382 S 382 Sbjct: 512 D 512 >gi|322373386|ref|ZP_08047922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] gi|321278428|gb|EFX55497.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] Length = 594 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. 
Identities = 65/247 (26%), Positives = 101/247 (40%), Gaps = 18/247 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYFPSAQLYVM 206 KIA+ +H YY D + +F +DLF+T E K+ + + K+ A++++ Sbjct: 287 KKIAVHLHTYYVDLLDDFLRQFENFHFTYDLFLTTDSEEKKKEIQSILDKHGKEARIFIT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRDV P L L YDY+ H KKS Y G WR LF L+ +D Sbjct: 347 GNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPEYPY--WVGDSWRNELFSMLIQPAD- 401 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321 II E + LG++ + + RY + + + DL +R Sbjct: 402 --NIIANLEHDDRLGLVIADIPTFFRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFDK 459 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC--SVRYTE 377 GT W K L+PL +L L E + + H++ER R + Sbjct: 460 MNTFIMSYGTFIWFKYDTLKPLFDLDLQDEEIPAEPIPQHTILHSIERILVYLAWARRYD 519 Query: 378 FSIESVD 384 ++I D Sbjct: 520 YAIAKND 526 >gi|312867647|ref|ZP_07727853.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] gi|311096710|gb|EFQ54948.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] Length = 582 Score = 220 bits (561), Expect = 3e-55, Method: Composition-based stats. 
Identities = 63/252 (25%), Positives = 102/252 (40%), Gaps = 18/252 (7%) Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV--EANKDFEQDVLKYF 198 ++ K+A+ +H +Y D E +F +DL++T E + E+ + Sbjct: 276 QELAENFDRKVAVHLHVFYVDLLEEFLDAFQAFHFVYDLWITTDVEEKKQTIEKILSNRA 335 Query: 199 PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFF 258 A + V N GRDV P L L +YDY+ H KKS+ + G WR+ L Sbjct: 336 QDATVVVTGNIGRDVLPMLLL--KEQLSQYDYVGHFHTKKSKEADF--WAGESWRKELIE 391 Query: 259 DLLGFSDIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAG- 315 L+ +D +I+ E NP +G+ + + RY R + + + L +R G Sbjct: 392 MLVKPAD---QILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPEMNKLWERMGA 448 Query: 316 ---FPTKR-LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFAC 371 K GT W K L+PL +L+L L ++ HA+ER Sbjct: 449 AKTIDFKNLNTFVMSYGTFVWFKYDALKPLFDLNLTAANVPAEPLPQNSILHAIERLLIY 508 Query: 372 SV--RYTEFSIE 381 + +F I Sbjct: 509 IAWDQKYDFRIS 520 >gi|304309760|ref|YP_003809358.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] gi|301795493|emb|CBL43691.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] Length = 1315 Score = 219 bits (559), Expect = 4e-55, Method: Composition-based stats. 
Identities = 63/317 (19%), Positives = 111/317 (35%), Gaps = 27/317 (8%) Query: 84 KLSFPSCRIFFYGSRKEQKAFLR----LNRFMSNSRMPFDSEKFLYVK---ELFEGWNDR 136 + S + + F R + FL R+ + + + + L L + + Sbjct: 363 RNSQEAAAMLFPRLRTITRTFLEKLPTPLRYRLQAFLRTLAHRLLPNAVQGRLAQTATNH 422 Query: 137 PSSPKKSGL--------TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188 P + L T + IAI +H YY D L R+ FDL++++ Sbjct: 423 PYPEQLKQLHELTLPKHTSNATIAIHIHLYYADLAPTFVQALSRMERPFDLYISIQVRAN 482 Query: 189 DFEQDVLKY----FPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E + + + N GRD+ PF+ + +YD + +H KKS Y Sbjct: 483 PVEIEAVVRKIPCLRGLDIRATPNLGRDLYPFVCIFG-EALRKYDIIAHLHSKKSL---Y 538 Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY 304 + W ++ L + RI+ G++ + + + + Sbjct: 539 NQGATAGWLEYILDSLFRSPEDIARILERLSDASQTGIVYPQNFS-GLPYMAYT-WLANR 596 Query: 305 RRVIDLAKRAGFPTKRL-HLDFFNGTMFWVKPKCLEPLRNLHLIG-EFEEERNLKDGALE 362 R + R G + + D+ G+MFW + + P L +FE E DG L Sbjct: 597 SRAQQVQARFGLTSLPSGYFDYPAGSMFWARADAIAPFFEAQLNEDDFENESGQTDGTLA 656 Query: 363 HAVERFFACSVRYTEFS 379 H +ERF F Sbjct: 657 HTLERFLVLVPESLGFR 673 >gi|329116186|ref|ZP_08244903.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] gi|326906591|gb|EGE53505.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] Length = 589 Score = 219 bits (558), Expect = 6e-55, Method: Composition-based stats. 
Identities = 66/248 (26%), Positives = 106/248 (42%), Gaps = 19/248 (7%) Query: 147 IKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLY 204 I K+AI +H +Y D E +FD+DLF+T K + + + + A+++ Sbjct: 288 INKKVAIHLHTFYVDLLQEFLSAFENFHFDYDLFITTDIEEKKTQIENVLNENNQKAEVF 347 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 V N GRDV P L L YDY+ H KKS+ + G WR+ L L+ + Sbjct: 348 VTGNIGRDVLPML--LLKEKLSVYDYIGHFHTKKSKEADF--WAGESWRKELIKMLVLPA 403 Query: 265 DIAIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL- 321 D I+ T E+N +G++ + Y RY + + + + +L K+ G Sbjct: 404 D---SILATLEKN-KVGIVIADMPTYFRYNKIVTAWNENLIAPEMNELWKKMGLTKSIDF 459 Query: 322 ----HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RY 375 GT W K L+PL +L+L E L ++ HA+ER ++ Sbjct: 460 NHLHTFVMSYGTFVWFKYDALKPLFDLNLTVEDVPAEPLPQNSILHAIERLLIYIAWNQH 519 Query: 376 TEFSIESV 383 +F I Sbjct: 520 YDFRISKN 527 >gi|322385732|ref|ZP_08059376.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] gi|321270470|gb|EFX53386.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] Length = 598 Score = 219 bits (558), Expect = 7e-55, Method: Composition-based stats. 
Identities = 65/247 (26%), Positives = 102/247 (41%), Gaps = 18/247 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYFPSAQLYVM 206 KIA+ +H YY D + +F +DLF+T K E + +LK ++Y+ Sbjct: 287 KKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEKKKLEIEAVLLKRNQLGKIYIT 346 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 NKGRD+ P L L YDY+ H KKS Y G WR LF LL +D+ Sbjct: 347 GNKGRDIIPMLKL--REELCTYDYIGHFHTKKSPEYPY--WVGDSWRNELFDMLLKPADL 402 Query: 267 AIRIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL--- 321 I+ + E + LG++ + + RY + ++ + L +R Sbjct: 403 ---IMASLENDKRLGLVIADIPTFFRYTKIVDPWNENKFADDMNILWERMDINRSIDFNK 459 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTE 377 GT W K L+PL +L+L E L + H++ER + + Sbjct: 460 LNTFIMSYGTFIWFKYDALKPLFDLNLQDEDIPSEPLPQHTILHSIERILVYLAWSQRFD 519 Query: 378 FSIESVD 384 ++I D Sbjct: 520 YAISKND 526 >gi|32455988|ref|NP_861990.1| rb115 [Ruegeria sp. PR1b] gi|22726340|gb|AAN05136.1| RB115 [Ruegeria sp. PR1b] Length = 963 Score = 217 bits (554), Expect = 2e-54, Method: Composition-based stats. 
Identities = 63/264 (23%), Positives = 107/264 (40%), Gaps = 18/264 (6%) Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS- 200 + K ++ + +H YY D E+ +L RL F+L +++ E +++++ F + Sbjct: 148 RQPPLPKGRLVVQLHLYYVDMAAEMIALLARLPVTFELLLSLPETAVVADEEMISLFRAG 207 Query: 201 ------AQLYVMENKGRDVRPFLYLLELGV--FDRYDYLCKIHGKKSQREGYHPIEGIIW 252 L + N+GRDV P++ + D + +H KKS YH W Sbjct: 208 LERLGAITLRRVPNRGRDVAPWMVSFRSELRALADRDLVLHLHSKKSPHGNYHVG----W 263 Query: 253 RRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAK 312 R+L LLG + +A +++ F ++P LG++ + +R + K L + Sbjct: 264 GRYLGHSLLGSTAVAAQMLGLFAEDPELGLVAPAYWPALRRAPNYGKVG---DLCAHLFR 320 Query: 313 RAGF-PTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFA 370 R G + DF G+ F + L P L L +F E G L HAVER Sbjct: 321 RMGLGEVDPICADFPAGSFFCARAAVLRPFLTLGLEARDFPAEAGQICGTLAHAVERLLG 380 Query: 371 CSVRYTEFSIESVDCVAEYERLLH 394 + V +E H Sbjct: 381 QVPARLGLRFDMVAVDLPFEEAAH 404 >gi|192359986|ref|YP_001983898.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] gi|190686151|gb|ACE83829.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] Length = 872 Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats. 
Identities = 77/262 (29%), Positives = 112/262 (42%), Gaps = 14/262 (5%) Query: 127 KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE- 185 E + + + + + + +IA+V H YY+D EI L + FDL VT+ + Sbjct: 579 PEEAVRRDSQFAEIRAALEHSQKRIAVVAHLYYRDLVPEILSALETIPEAFDLIVTLPDW 638 Query: 186 ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYH 245 + EQ V + +P A Y N+GRD+ PF+ LL L YD L KI K+ Sbjct: 639 GTRHIEQMVREAYPEAVFYRAVNRGRDIGPFVDLLPLITEKNYDALLKIQTKRGYYRSGR 698 Query: 246 --PIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303 P G +WR F LLG I+ +P L M+G Y + + ++ Sbjct: 699 LLPQFGQLWRSETFRALLGNKSRVTDILEALRTDPSLNMVGPSPYFLSLTKYPYHDQGDL 758 Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLH--LIGEFEEERNLKDGAL 361 + +++ FF GTMFWV+P CL PL I FE E DGA Sbjct: 759 AQTILN---------NPTGNGFFAGTMFWVRPSCLRPLTEPEHLSITAFEPESGANDGAT 809 Query: 362 EHAVERFFACSVRYTEFSIESV 383 H +ER F+ + I V Sbjct: 810 AHLIERLFSQVAFANDGKIAGV 831 >gi|310286583|ref|YP_003937841.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] gi|309250519|gb|ADO52267.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] Length = 662 Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats. 
Identities = 52/257 (20%), Positives = 89/257 (34%), Gaps = 15/257 (5%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 P+ + + A + H Y+ D + + L + DL++T E D +D + Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351 Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249 + N+GRDV L V YD + H KKS + G+H E Sbjct: 352 SHGVNHPVTFISVVNRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 411 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307 + L + L D I+ F P LG + + + + Sbjct: 412 QGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGANFEIT 471 Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361 +L + R P G+ +W + + L+PL +F E +DG + Sbjct: 472 KELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGEDGTV 531 Query: 362 EHAVERFFACSVRYTEF 378 HA+ER + + Sbjct: 532 SHAIERANGYICQSQGY 548 >gi|224284010|ref|ZP_03647332.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum NCIMB 41171] gi|313141164|ref|ZP_07803357.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313133674|gb|EFR51291.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 662 Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats. 
Identities = 52/257 (20%), Positives = 89/257 (34%), Gaps = 15/257 (5%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLK 196 P+ + + A + H Y+ D + + L + DL++T E D +D + Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351 Query: 197 YF---PSAQLYVMENKGRDVRPFLYLLELGVFDR-YDYLCKIHGKKSQR---EGYHPIEG 249 + N+GRDV L V YD + H KKS + G+H E Sbjct: 352 SHGVNHPVTFISVVNRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 411 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRW--SFFAKRSEVYRRV 307 + L + L D I+ F P LG + + + + Sbjct: 412 QGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGANFEIT 471 Query: 308 IDLAK-RAGF--PTKRLHLDFFN-GTMFWVKPKCLEPLRNLHL-IGEFEEE-RNLKDGAL 361 +L + R P G+ +W + + L+PL +F E +DG + Sbjct: 472 KELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGEDGTV 531 Query: 362 EHAVERFFACSVRYTEF 378 HA+ER + + Sbjct: 532 SHAIERANGYICQSQGY 548 >gi|320330331|gb|EFW86314.1| hypothetical protein PsgRace4_09215 [Pseudomonas syringae pv. glycinea str. race 4] Length = 774 Score = 216 bits (551), Expect = 4e-54, Method: Composition-based stats. 
Identities = 52/241 (21%), Positives = 88/241 (36%), Gaps = 11/241 (4%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199 + +AI +H +Y+D + SH L D+F+T+ K + Sbjct: 260 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 319 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 + ++ + N+GR+ P L YD C +H KKS G E W +L Sbjct: 320 NLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 LL ++I R++N F + LG+ + W + Sbjct: 376 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 433 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L + G MFW +P+ L+ + F +E DG++ HA+ER + Sbjct: 434 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493 Query: 379 S 379 Sbjct: 494 K 494 >gi|330882679|gb|EGH16828.1| hypothetical protein Pgy4_27710 [Pseudomonas syringae pv. glycinea str. race 4] Length = 608 Score = 215 bits (548), Expect = 8e-54, Method: Composition-based stats. 
Identities = 52/241 (21%), Positives = 88/241 (36%), Gaps = 11/241 (4%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199 + +AI +H +Y+D + SH L D+F+T+ K + Sbjct: 94 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 153 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 + ++ + N+GR+ P L YD C +H KKS G E W +L Sbjct: 154 NLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 209 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 LL ++I R++N F + LG+ + W + Sbjct: 210 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 267 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L + G MFW +P+ L+ + F +E DG++ HA+ER + Sbjct: 268 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 327 Query: 379 S 379 Sbjct: 328 K 328 >gi|15674835|ref|NP_269009.1| hypothetical protein SPy_0792 [Streptococcus pyogenes M1 GAS] gi|71910421|ref|YP_281971.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] gi|13621968|gb|AAK33730.1| conserved hypothetical protein - possibly involved in cell wall localization and side chain formation of rhamnose-glucose polysaccharide [Streptococcus pyogenes M1 GAS] gi|71853203|gb|AAZ51226.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] Length = 581 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFENWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGDSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|71903253|ref|YP_280056.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] gi|71802348|gb|AAX71701.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] Length = 581 Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats. 
Identities = 54/241 (22%), Positives = 95/241 (39%), Gaps = 15/241 (6%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKRL 321 + + T + + + R+ + + + ++ L ++ Sbjct: 400 ILSVFETDDIGII--IADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAMD 457 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEFS 379 GT W K L+ L +L L L ++ HA+ER F +F Sbjct: 458 TFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLFVYIAWGNSYDFR 517 Query: 380 I 380 I Sbjct: 518 I 518 >gi|94994091|ref|YP_602189.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] gi|94547599|gb|ABF37645.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] Length = 581 Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats. 
Identities = 54/241 (22%), Positives = 95/241 (39%), Gaps = 15/241 (6%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKRL 321 + + T + + + R+ + + + ++ L ++ Sbjct: 400 ILSVFETDDIGII--IADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAMD 457 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEFS 379 GT W K L+ L +L L L ++ HA+ER F +F Sbjct: 458 TFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLFVYIAWGNSYDFR 517 Query: 380 I 380 I Sbjct: 518 I 518 >gi|325276923|ref|ZP_08142610.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] gi|324097938|gb|EGB96097.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] Length = 758 Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats. 
Identities = 70/325 (21%), Positives = 109/325 (33%), Gaps = 24/325 (7%) Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSR----KEQKAFLRLNRFM-----SNSRMPFD 119 F L +F + S+ S + F +++ + Sbjct: 164 FASELDAFKDYLHKSRFSPVNPSENFDNEIYHRCNIDVFHAQISPLFHYIISGQTEGRAY 223 Query: 120 SEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179 S FE +PK S KIAI +H YY D + L + DL Sbjct: 224 SSVMPKWTPKFEINPASELTPKAS----NQKIAICLHIYYDDYIERFAEALYTFPTEVDL 279 Query: 180 FVTVVEANKDFEQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIH 235 +T+ + ++ + + N+GR+ P L + YD LC +H Sbjct: 280 LITIANESFRDRAYQTFSKIQAVKKVTIKSVPNRGRNFGPLLVEFAQELLT-YDLLCHLH 338 Query: 236 GKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWS 295 KKS G E W +L LL + R++N F NP G+ + W Sbjct: 339 SKKSLYSG---REQTQWADYLSEYLLNDCSVVKRVLNAFSDNPQFGVYYPTTFWMMPSWV 395 Query: 296 FFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEER 354 + +L GF L + G MFW +PK L + N +F E Sbjct: 396 NHVTM--NKPHMRNLQTALGFGHFDDFLSYPAGGMFWARPKALVDILNKTYTYDDFPNEP 453 Query: 355 NLKDGALEHAVERFFACSVRYTEFS 379 DG++ HA+ER + Sbjct: 454 LPNDGSMLHALERVIGPVCEKNGYQ 478 >gi|209559162|ref|YP_002285634.1| RgpFc protein [Streptococcus pyogenes NZ131] gi|209540363|gb|ACI60939.1| RgpFc protein [Streptococcus pyogenes NZ131] Length = 581 Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|306827605|ref|ZP_07460885.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] gi|304430168|gb|EFM33197.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] Length = 581 Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|19745874|ref|NP_607010.1| hypothetical protein spyM18_0853 [Streptococcus pyogenes MGAS8232] gi|19748025|gb|AAL97509.1| conserved hypothetical protein [Streptococcus pyogenes MGAS8232] Length = 581 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|56808559|ref|ZP_00366292.1| COG3754: Lipopolysaccharide biosynthesis protein [Streptococcus pyogenes M49 591] Length = 581 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|139474025|ref|YP_001128741.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] gi|134272272|emb|CAM30524.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] Length = 581 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. 
Identities = 56/242 (23%), Positives = 97/242 (40%), Gaps = 17/242 (7%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVM 206 K+A+ +H +Y D E NF +DLF+T K+ ++ + + +A + V Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L L +YDY+ H KKS+ + G WR+ L L+ +D Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 267 AIRIINTFEQNPCLGMIGS-RRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-----TKR 320 I++ FE + +I + R+ + + + ++ L ++ Sbjct: 399 --SILSAFETDDIGIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQIDFQAM 456 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT W K L+ L +L L L ++ HA+ER +F Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLPQNSILHAIERLLVYIAWGNSYDF 516 Query: 379 SI 380 I Sbjct: 517 RI 518 >gi|71735705|ref|YP_273244.1| hypothetical protein PSPPH_0972 [Pseudomonas syringae pv. phaseolicola 1448A] gi|71556258|gb|AAZ35469.1| conserved hypothetical protein [Pseudomonas syringae pv. phaseolicola 1448A] Length = 1262 Score = 212 bits (540), Expect = 8e-53, Method: Composition-based stats. 
Identities = 53/237 (22%), Positives = 93/237 (39%), Gaps = 13/237 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-----NKDFEQDVLKYFPSAQLY 204 +I + +H YY D IS L + FDLF++ + D + + Sbjct: 211 RIGVYLHLYYTDLLGAISKHLNNIPLAFDLFISTPHELDHKKLRKIVSDSVTNVKEISIK 270 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + N+GRD+ PF+ YD +C IH KKS+ W + LLG Sbjct: 271 HVPNRGRDIAPFIIEFGNE-LQAYDAICHIHTKKSEHTKG----LSDWGDDILSSLLGSR 325 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324 + +I+ + + + + Y + +++ E+ + ++ + Sbjct: 326 EDVKKILTLLKGDAKIIYPEGQNYYMKDP-TGWSENHEIAKHILSDHLETDISNFP-KAE 383 Query: 325 FFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 F G+MFW + + ++ N+ L +F EE DG L HA+ER S I Sbjct: 384 FPEGSMFWARQEGIQSFLNIPLDWEDFPEEPIPTDGTLAHALERIILISAYAAPGRI 440 Score = 85.8 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 31/108 (28%), Gaps = 7/108 (6%) Query: 30 QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPS 89 A + W + + S VH F+ WL +AF+K Sbjct: 727 DAPKEFEYFRSLVPTWDNTARYGSESYVVHESTPEKFQG---WLEQSIAFTK--ANLPED 781 Query: 90 CRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWND 135 + + E + A L + + + + +K L + Sbjct: 782 RHLVVINAWNEWAEGAHLEPDTYSGYAYLNSVGRVLSGIKYLDDKPTA 829 >gi|320325880|gb|EFW81940.1| hypothetical protein PsgB076_04646 [Pseudomonas syringae pv. glycinea str. B076] Length = 774 Score = 211 bits (538), Expect = 1e-52, Method: Composition-based stats. 
Identities = 51/241 (21%), Positives = 87/241 (36%), Gaps = 11/241 (4%) Query: 144 GLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLK--YFP 199 + +AI +H +Y+D + SH L D+F+T+ K + Sbjct: 260 PEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVK 319 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFD 259 + ++ + N+ R+ P L YD C +H KKS G E W +L Sbjct: 320 NLKVSCVPNRERNFGPLLVEFSKD-LMAYDLFCHLHSKKSLYSG---REQTQWADYLTEY 375 Query: 260 LLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTK 319 LL ++I R++N F + LG+ + W + Sbjct: 376 LLRDANIITRLLNAFVDHKDLGLYYPTTFWMMPSWVNHVTM--NKAFMNAWHNEWQIDPC 433 Query: 320 RLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 L + G MFW +P+ L+ + F +E DG++ HA+ER + Sbjct: 434 EGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKNGY 493 Query: 379 S 379 Sbjct: 494 K 494 >gi|160936497|ref|ZP_02083865.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] gi|158440582|gb|EDP18320.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] Length = 373 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. 
Identities = 47/233 (20%), Positives = 91/233 (39%), Gaps = 9/233 (3%) Query: 158 YYQDTWIEISHILLRLNFDFDL-FVTVVEANKDFEQDVLKYFPSA--QLYVMENKGRDVR 214 +Y+D + + ++ D+ FVT + + ++ V EN+GRD+ Sbjct: 2 FYEDLLNQCYLYIEQIPKYIDVCFVTSNPKIAFKVKKYINNTKKINYKVLVKENRGRDMA 61 Query: 215 PFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTF 274 L + + Y+YLC +H KKS + G + +G + ++ +L+G + + I+ Sbjct: 62 ALLVTCHDFIME-YEYLCFVHDKKSLQMG-NDNDGCKFMELIWKNLIGSTGLIENILRYL 119 Query: 275 EQNPCLGMIGSRRYRRYKRWSFFAK-RSEVYRRVIDLAKRAGFP--TKRLHLDFFNGTMF 331 N +G++ F + Y VI+L + G F Sbjct: 120 GNNRDVGLMVPPIPYWGNYIGVFINPWTCNYDNVINLGNQLKLKKNVCYEKEYVTIGGAF 179 Query: 332 WVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 W + L+PL + +F +E DG + HA+ER + + + Sbjct: 180 WCRTNALKPLFEYKWKLEDFCQEPMAVDGTISHAIERILGFVALNNGYDVLEI 232 >gi|323135560|ref|ZP_08070643.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322398651|gb|EFY01170.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 812 Score = 210 bits (534), Expect = 3e-52, Method: Composition-based stats. 
Identities = 60/241 (24%), Positives = 100/241 (41%), Gaps = 13/241 (5%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS--AQLY 204 + IA +VH +Y + + L + DLF + K +DV + +P ++ Sbjct: 144 ERPIAAIVHGFYPEIAPLVLEKLKNVTGPVDLFFSTDTQEKKHALEDVCRDWPKGRVEIR 203 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + N+GRD+ + D YD +H K+S G WR +LF +LLG Sbjct: 204 ICPNRGRDIAAKFFGFRDVYAD-YDLFIHLHTKRSPHGG---AALARWRDYLFDNLLGSP 259 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL-HL 323 +I I++ F+ +P +G++ + + Y L KR G + L Sbjct: 260 EIVNSILSLFD-DPKIGVVFPQHLFELRGIL---NWGYDYDHARALMKRMGVEIDKNLVL 315 Query: 324 DFFNGTMFWVKPKCLEPLRNLHLIGEF-EEERNLKDGALEHAVERFFACSVRYTEFSIES 382 +F +G+MFW + PL +L + + +E DG L HA+ER F Sbjct: 316 EFPSGSMFWGRSAAFRPLLDLDIDFDDFPQEGGQVDGTLAHAIERSLLMIAESRGFEWLK 375 Query: 383 V 383 V Sbjct: 376 V 376 >gi|116071634|ref|ZP_01468902.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] gi|116065257|gb|EAU71015.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] Length = 934 Score = 210 bits (534), Expect = 4e-52, Method: Composition-based stats. 
Identities = 54/258 (20%), Positives = 98/258 (37%), Gaps = 15/258 (5%) Query: 131 EGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD- 189 S K+ + + ++AI +H YY ++ E L L L +T + K Sbjct: 26 HIDILDHSGKCKTSIFQECQVAIYLHIYYPESLHEFLEYLTVLPSQIRLVITTTTSEKKE 85 Query: 190 ------FEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG 243 ++ + ENKGRD+ F+ + +YD +CK+H KKS G Sbjct: 86 LIIEILERALLINRLDLCHV-YHENKGRDIGAFINIY--DELIKYDVVCKLHAKKSPHLG 142 Query: 244 YHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEV 303 G W R+L +G I+N + +G++ ++ +A ++ Sbjct: 143 E---FGKSWFRYLIRSTIGNQSAIENIVNILYHSKDIGILAPTSFQ-GTNNHDWASNFDI 198 Query: 304 YRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALE 362 + + D + + L + + T+FW KP+ L + + + F EE DG Sbjct: 199 SQSISDHIFNSELDINKEKLRYPSATVFWFKPEALNQQQFRSIQPDFFPEEPIPIDGTTA 258 Query: 363 HAVERFFACSVRYTEFSI 380 H++ER Sbjct: 259 HSLERLIPYISILNGLKT 276 >gi|222148479|ref|YP_002549436.1| hypothetical protein Avi_2007 [Agrobacterium vitis S4] gi|221735467|gb|ACM36430.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 513 Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats. 
Identities = 61/243 (25%), Positives = 109/243 (44%), Gaps = 14/243 (5%) Query: 146 TIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFP---SA 201 ++ + I VHC+Y + + EI+ L L F L VTV E++ +++L F + Sbjct: 252 ALQLSLCIHVHCFYVELFNEIADRLQCLTLPFYLVVTVCNESDAKVVENLLVDFNQRQNT 311 Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 + V+EN+GRD+ PFL ++ + D + +H KKS H G WRR+LF + Sbjct: 312 HILVVENRGRDIAPFLIDASP-IWRKSDLVLHLHTKKSP----HITWGDNWRRYLFDQTI 366 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321 G+ + II+ F+ +GM+ + K ++ + + + +A++ Sbjct: 367 GYEPLLKGIIDQFQDRDDMGMMYPENFCMIKHFT---EEEKNKDAIRYIAQKLRLECSFE 423 Query: 322 HLD-FFNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAVERFFACSVRYTEFS 379 L + G+M + + K L + + F E+ DG H +ER VR F Sbjct: 424 ALGAYAAGSMAFYRVKALASVLEYDALENLFGPEQGQLDGTAAHVLERLLPEMVRLNGFE 483 Query: 380 IES 382 + Sbjct: 484 TQP 486 >gi|332035169|gb|EGI71680.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis ANT/505] Length = 672 Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats. 
Identities = 55/254 (21%), Positives = 87/254 (34%), Gaps = 11/254 (4%) Query: 131 EGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDF 190 W + + +G K+A+ H +Y + L + D+FV+V Sbjct: 145 AKWYPKAIASSANGEPTTLKLAMCFHVFYGEFIDYYCGALAKFTQQVDVFVSVASEELAK 204 Query: 191 EQ----DVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHP 246 + + V+ N GR+ P L YD C +H KKS G Sbjct: 205 KAIHDFKACSKVNKVVVKVVPNHGRNFGPMLVEFASD-LQNYDLFCHMHSKKSLYSGRAQ 263 Query: 247 IEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRR 306 W +L LL + +++N F NP G+ + W + Sbjct: 264 T---QWADYLGEYLLNDPHVIKQVLNHFNDNPKSGLYYPTSFWMMPDWVNH--WLKNKPA 318 Query: 307 VIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAV 365 K+ K L + G MFW +P+ L+ L N +F E DG+ HA+ Sbjct: 319 AQKFTKKWNIELKDDFLAYPAGGMFWARPEALKQLLNKEYKYDDFPGEPLPNDGSQLHAL 378 Query: 366 ERFFACSVRYTEFS 379 ER V + Sbjct: 379 ERMLGLLVEKNGYK 392 >gi|281490695|ref|YP_003352675.1| bifunctional alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] gi|281374464|gb|ADA63985.1| Alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] Length = 589 Score = 202 bits (515), Expect = 7e-50, Method: Composition-based stats. 
Identities = 63/245 (25%), Positives = 102/245 (41%), Gaps = 19/245 (7%) Query: 151 IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFPSAQLYVMEN 208 +A+ +H YY + E +FD+DL++T K+ ++ + A+L N Sbjct: 289 VAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTDEKEEIIKEMLKCKDARAKLVRTPN 348 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 GRD+ PFL L +YD + H K+S + G WR L L+ + A Sbjct: 349 HGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAFF--AGESWRTELISMLI---EPAD 401 Query: 269 RIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLHLDF 325 I+ FEQ LG++ + + R+ + ++ + + D+ KR K DF Sbjct: 402 NIMAHFEQKQKLGIVIADIPSFFRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDF 461 Query: 326 -----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV--RYTEF 378 GT FW K + LEPL NL ++ L + HA+ER + +F Sbjct: 462 NTFTMSYGTFFWAKTEVLEPLFNLEIMDREIPNEPLPQNTILHAIERVLIYLAWDKEMDF 521 Query: 379 SIESV 383 I Sbjct: 522 KISPN 526 >gi|23009067|ref|ZP_00050256.1| COG3754: Lipopolysaccharide biosynthesis protein [Magnetospirillum magnetotacticum MS-1] Length = 486 Score = 202 bits (513), Expect = 1e-49, Method: Composition-based stats. Identities = 56/221 (25%), Positives = 90/221 (40%), Gaps = 13/221 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK-DFEQDVLKYF--PSAQLYVM 206 +I + H ++ D + + FD ++VT A+K DF + ++ + Sbjct: 274 RIGVFAHIFHTDLCEYVLKYTNNIPFDTTVYVTTSSASKADFIRKTFGRLSKHRYEIVIA 333 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N+GRD+ P L F DY +H KKS WR +LF LG +++ Sbjct: 334 PNRGRDIAPMLVGYRNA-FQNCDYAVHVHTKKSLHYSSGF---DAWRDYLFEMNLGSAEL 389 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH-LDF 325 I+N ++ +G + Y + + + L G + LDF Sbjct: 390 ITGIVNVLSRS-NIGAVAPDHYA---PIAKLIQWGGNIDAINGLLSFTGLSVASENVLDF 445 Query: 326 FNGTMFWVKPKCLEPLRNLHLIGE-FEEERNLKDGALEHAV 365 +G+MFW KP L L +HL F+ E DG L HA+ Sbjct: 446 PSGSMFWFKPDALSKLMEIHLQSYHFDPELGQVDGTLAHAI 486 >gi|116511036|ref|YP_808252.1| lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. 
cremoris SK11] gi|116106690|gb|ABJ71830.1| Lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. cremoris SK11] Length = 588 Score = 201 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 59/239 (24%), Positives = 106/239 (44%), Gaps = 20/239 (8%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS---AQLYVM 206 KI I +H +Y D E + + ++DL++T K +++LK +P ++ V Sbjct: 299 KIGIHLHAFYLDLIPEYLNYFDKYVQNYDLYITTDTEEK--YEEILKNYPLPQIKKVIVT 356 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 NKGRDV P++ + + YD H KKS+ I G WRR + + LL + Sbjct: 357 GNKGRDVLPWMQV--SELMTDYDLCGHFHTKKSKDND--WIVGESWRRDIEYSLL---EP 409 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSF--FAKRSEVYRRVIDLAKRAGFPTKRL--- 321 A I FE+NP LG+I + ++ + + +++ + ++ ++ F + Sbjct: 410 AQAIFQEFEKNPKLGLIIADVPSFFEHFYGPTYITERDIWPDMQEIWQKIDFENSKELKQ 469 Query: 322 --HLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTM W +P+ L L N+++ + EE ++ HA ER + Sbjct: 470 KDSYVMSYGTMIWYRPQALNNLLNVNIQADVPEEPLPY-NSILHAFERLLVYVSWANGY 527 >gi|15672189|ref|NP_266363.1| polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] gi|12723062|gb|AAK04305.1|AE006258_8 polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] Length = 589 Score = 200 bits (510), Expect = 2e-49, Method: Composition-based stats. 
Identities = 61/233 (26%), Positives = 98/233 (42%), Gaps = 17/233 (7%) Query: 151 IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD--FEQDVLKYFPSAQLYVMEN 208 +A+ +H YY + E +FD+DL++T K+ ++ + A+L N Sbjct: 289 VAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTDEKEEIIKEMLKCKDAKAKLVRTPN 348 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 GRD+ PFL L +YD + H K+S + G WR L L+ + A Sbjct: 349 HGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAFF--AGESWRTELISMLI---EPAD 401 Query: 269 RIINTFEQNPCLGMIGS--RRYRRYKRWSFFAKRSE-VYRRVIDLAKRAGFPTKRLHLDF 325 I+ FEQ LG++ + + R+ + ++ + + D+ KR K DF Sbjct: 402 NIMAHFEQKQKLGIVIADIPSFFRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDF 461 Query: 326 -----FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSV 373 GT FW K + LEPL NL ++ L + HA+ER Sbjct: 462 NTFTMSYGTFFWAKIEVLEPLFNLEIMDREIPNEPLPQNTILHAIERVLIYLA 514 >gi|88808074|ref|ZP_01123585.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] gi|88788113|gb|EAR19269.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] Length = 512 Score = 200 bits (509), Expect = 3e-49, Method: Composition-based stats. 
Identities = 55/241 (22%), Positives = 93/241 (38%), Gaps = 11/241 (4%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKY----FPSAQLY 204 KI +V+H YY ++ I L + FDL VTV +K+ ++ L+ + Sbjct: 50 KILVVIHAYYPESLATIFPSLRHMPCHFDLVVTVCSCGDKEVVKEYLEKVDLPIDVLDIK 109 Query: 205 VMENKGRDVRPFLYLLELGVFDR--YDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262 V+ N GRD+ PF+ +++ YD++ K+H K+S G W +LLG Sbjct: 110 VLTNLGRDLLPFVQVIKGLKLQNKAYDFVLKLHTKRSVASSKGKEFGGKWLEGSLSNLLG 169 Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLH 322 + I+ Q ++ R+ + + L R G Sbjct: 170 SPENVKYILLELLQTTNCALVSPLISLDVFRFCKWKNNLAP---ISHLLDRFGVRESPED 226 Query: 323 L-DFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 F G+MFWV K + + E +G+ HA ER + T+ ++ Sbjct: 227 FICFPAGSMFWVDFKAAVLIASCFEESRVPPEPLPSNGSYLHAFERLVPYILESTQKRMQ 286 Query: 382 S 382 S Sbjct: 287 S 287 >gi|125623094|ref|YP_001031577.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|124491902|emb|CAL96823.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|300069842|gb|ADJ59242.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris NZ9000] Length = 588 Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats. 
Identities = 60/239 (25%), Positives = 103/239 (43%), Gaps = 20/239 (8%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLY---VM 206 KIAI +H +Y D E + ++DLF+T +K + ++K +P Q+ V Sbjct: 299 KIAIHLHAFYLDLIPEYLDYFDKYVQNYDLFITTDTKDK--YEQIIKSYPLNQIKKVLVT 356 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 NKGRDV P++ + + YD H KKS+ I G WRR + + LL + Sbjct: 357 GNKGRDVLPWMEI--SELMADYDLCGHFHTKKSKDND--WIVGESWRRDIEYSLLKPAQ- 411 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSF--FAKRSEVYRRVIDLAKRAGFPTKR---- 320 I FE+NP LG++ + ++ + + +++ + ++ K+ F R Sbjct: 412 --AIFQEFEKNPKLGLMIADVPSFFEHFYGPTYITERDIWPDMEEIWKKINFENPRGLKQ 469 Query: 321 -LHLDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEF 378 GTM W +P+ L L + + EE ++ HA ER + + Sbjct: 470 KDSYVMSYGTMIWYRPQALNNLLKVDIEAAVPEEPLPY-NSILHAFERLLVYTSWANGY 527 >gi|209524107|ref|ZP_03272658.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] gi|209495482|gb|EDZ95786.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] Length = 2819 Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats. 
Identities = 69/240 (28%), Positives = 107/240 (44%), Gaps = 13/240 (5%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD-FEQDVLKYFPSAQLYVME 207 KIA+V+H YY + E+ L L D+DLFVT+ E D + KY + Q+ +++ Sbjct: 1737 PKIAVVLHAYYPELLPELFSKLDNL-SDYDLFVTIPENVVDSVTSALDKYTKNYQVSIVK 1795 Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267 N G D+ PFL ++ Y Y+CKIH K+ HP G +WR L +LG +I Sbjct: 1796 NIGYDILPFLEVISELDTLGYKYVCKIHTKR-----DHPDFGSLWRECLLDAVLGDKNIT 1850 Query: 268 IRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFN 327 +II F+ NP L ++G + + ++ + + D + FF Sbjct: 1851 EQIITAFDNNPSLQIVGPALLYMSMLGTIYDGHEKMKKMIHDFMEPLNL---IEDWGFFG 1907 Query: 328 GTMFWVKPKCLEPLRN---LHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384 G+MFW + L+ + + L I + L G H VER E + VD Sbjct: 1908 GSMFWSRITPLKYIADQILLKPIDWQASKSWLTTGFYYHIVERLLGLVSYINEGQVGLVD 1967 >gi|221634514|ref|YP_002523202.1| hypothetical protein RSKD131_4489 [Rhodobacter sphaeroides KD131] gi|221163387|gb|ACM04349.1| Hypothetical Protein RSKD131_4489 [Rhodobacter sphaeroides KD131] Length = 1042 Score = 193 bits (491), Expect = 4e-47, Method: Composition-based stats. 
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 11/234 (4%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYFPSAQLYVMENKG 210 A ++H ++ D +++ L L+ D FVT+ + ++ V FP A + +EN+G Sbjct: 8 AAIIHVWHLDVLDDLTEALEHLHGSADQFVTLPSSFRQEQRDRVTAAFPKATIVEVENRG 67 Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270 +D+ L++ RYD++CKIH KK WRR L +LG I Sbjct: 68 QDIGALFQLMQKVNLGRYDFICKIHTKKGPNMP------EEWRRALLDGVLGSKRQVTHI 121 Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTM 330 + +F +P + + G+R+ Y +V L F + F GT Sbjct: 122 VESFRADPKVMLAGARQLFVYGPAYLEPNADKVAEDYASLIG--DFDVRSEDWGFIAGTC 179 Query: 331 FWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384 FW++ L+ + +F + DGA HA ER F V ++ D Sbjct: 180 FWIRTSILQEMAAC--AVDFLPADYVTDGAPAHAAERMFGLCVALRGGTVLLQD 231 >gi|302337197|ref|YP_003802403.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] gi|301634382|gb|ADK79809.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] Length = 1100 Score = 192 bits (487), Expect = 1e-46, Method: Composition-based stats. Identities = 64/228 (28%), Positives = 94/228 (41%), Gaps = 14/228 (6%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208 I +V H Y++D + + + FDL VT E N D V +P A++ +N Sbjct: 186 SIVVVFHIYHEDLVGSCLQYISHIPYPFDLIVTTPLEENNDAILQVKSLYPDAEIVRSKN 245 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 GRD+ PFL + + + +YD CK+H KK + IWR +L D Sbjct: 246 AGRDIGPFLQVWDRVL--QYDLCCKVHTKK-----GNSAYSEIWRDLSLRGILETVDTVH 298 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPT-KRLHLDFFN 327 I+ FEQ L + G+ ++ + L K P + FF Sbjct: 299 GILRMFEQEDSLALAGAELLYGSYQFLLG----KNKDLSNSLIKDYNIPVNSYSNNGFFM 354 Query: 328 GTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375 GTMFW++ K L NL + F E DG EHA+ER + Sbjct: 355 GTMFWMRVKKFIFLSNLKQLQ-FPIEDGKNDGKYEHALERLLGSLSLH 401 >gi|78184217|ref|YP_376652.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. 
CC9902] gi|78168511|gb|ABB25608.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 519 Score = 190 bits (482), Expect = 5e-46, Method: Composition-based stats. Identities = 56/244 (22%), Positives = 100/244 (40%), Gaps = 19/244 (7%) Query: 151 IAIVVHCYYQDTWIEISHILLRL-----NFDFDLFVTVVEANKDFEQDVLKY--FPSAQL 203 +A+++H +Y D +I L DL+V+ D + L+ F +L Sbjct: 268 LALMIHGFYPDVLDDILLKLPSFCAGMVGTQLDLYVSTSMDQIDQVEKKLRDLDFACVRL 327 Query: 204 YVMENKGRDVRPFL-YLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLG 262 + +EN+GRDV PFL +LL + + K+H KKS + + W R L LL Sbjct: 328 FGVENRGRDVAPFLLHLLPAVAAAGHHFFVKLHTKKSLQ--FGIDGLDKWSRHLIESLL- 384 Query: 263 FSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVI--DLAKRAGFPTKR 320 + I F + LG + + F ++ ++ + ++ R Sbjct: 385 SAAGLEAIRYQFLDDEDLGCLCPSGTLLPLAIALFKNKTHLHHLLSHSEINGRWALMQT- 443 Query: 321 LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFS 379 F G+MF + + L + + +FE E DG HA+ER + V+ + + Sbjct: 444 ----FVAGSMFAGRVEAFRSLLDQGFSLDDFELEGGQFDGTFAHALERLISLEVKRSGWQ 499 Query: 380 IESV 383 I+ + Sbjct: 500 IKEM 503 >gi|14090418|gb|AAK53494.1| putative methyltransferase [Xanthomonas campestris pv. campestris] Length = 212 Score = 189 bits (481), Expect = 6e-46, Method: Composition-based stats. 
Identities = 42/235 (17%), Positives = 83/235 (35%), Gaps = 32/235 (13%) Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKS 149 + F + E + A L + + + + + ++ PS+ Sbjct: 1 MVFINAWNEWAEGAVLEPDARLGYAWLDATRQALTRAPDVA-TEICSPSA---------- 49 Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVV-EANKDFEQDVLKYFPSAQLYVMEN 208 +V+H +Y D E+ ++ + +T + + + + A++ EN Sbjct: 50 --CVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFEN 107 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +GRD+ PFL++ + + + K+H KKS H +G WR + LLG Sbjct: 108 RGRDILPFLHVANRLLDENVQLVLKLHTKKST----HRDDGNAWRGEMLTALLG-PQRVD 162 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL 323 I+N F +P +G+ + A+R G L Sbjct: 163 AIVNAFSTDPLVGLAAPEDHLLPVTEFIGGN-----------AERTGLSYCSHRL 206 >gi|148556902|ref|YP_001264484.1| glycosyl transferase family protein [Sphingomonas wittichii RW1] gi|148502092|gb|ABQ70346.1| glycosyl transferase, family 2 [Sphingomonas wittichii RW1] Length = 1301 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. 
Identities = 61/237 (25%), Positives = 102/237 (43%), Gaps = 12/237 (5%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEA-NKDFEQDVLKYFPSAQLYVMEN 208 K A+V+H +Y + +E+ + + D+FVT A ++ + + A++ + N Sbjct: 2 KAALVLHLFYPEVAVELIDRVAAIGASVDIFVTHSVALDETVLAALDRLPRKAEVVTVAN 61 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 +G D+ P LL L YD + K+H KK WRR + ++G + Sbjct: 62 RGWDIGPLFELLPLLAERGYDLIGKLHSKK-----GGSGYAPEWRRLAYDGMIGSPALVA 116 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFP-TKRLHLDFFN 327 I+ F+ +P L ++G++ + F + DLA R P FF Sbjct: 117 DIVAAFDAHPDLSLLGAKPLYKSVASHLFRNA----ELLSDLAPRLTAPAYPPADWGFFA 172 Query: 328 GTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESVD 384 GT FW + LE + L + ++ +DGAL HAVER F + I V+ Sbjct: 173 GTFFWARRTLLEKVAALADFRDAAPNQD-RDGALGHAVERLFGLAPIGLGGKIGLVE 228 >gi|146279467|ref|YP_001169625.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] gi|145557708|gb|ABP72320.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] Length = 823 Score = 172 bits (436), Expect = 9e-41, Method: Composition-based stats. 
Identities = 61/341 (17%), Positives = 98/341 (28%), Gaps = 23/341 (6%) Query: 49 PKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFL--- 105 P++ T+ +F L LA + R EQ AF Sbjct: 480 PRKTGTAGAAQPAGGLLFARIRRALFDRLAAQRRFVRGASDIDAPLLFPRPEQAAFRILE 539 Query: 106 -RLNRFMSNSR----MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160 + R + E + + A+ VH +Y Sbjct: 540 REKMQRYGRRRVWRDLAEVEETLSASDNWVHRALRLAPYATVADSSDLPPFALHVHAFYT 599 Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218 D + +T K + + ++ ++ N+GRD+ PF+ Sbjct: 600 DDLAADVRSHRAFRLARRIVITTDNERKASEIRTRMGAEGLYPEVILVPNRGRDILPFMQ 659 Query: 219 LLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQN 277 L G D C +H KKS G +WR +L LLG + ++ Sbjct: 660 LFLPGGPAGKDEIWCHLHQKKSLATSDS---GDVWRAFLLRILLGDDAGLSDAVGHL-RD 715 Query: 278 PCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKC 337 P +G++ + A R P L F G MFWV+ Sbjct: 716 PAVGLVAPFDPYHVP-------WDASRALLPRFAPRLPGPLPDNPLLFPVGNMFWVRAGV 768 Query: 338 LEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTE 377 + + +L E DG H VER + Sbjct: 769 VRAMNDLFGPSYPWPNEPIANDGTEFHLVERLWPTMAARCG 809 >gi|221218294|ref|YP_002524321.1| glycosyltransferase [Rhodobacter sphaeroides KD131] gi|221163321|gb|ACM04287.1| glycosyltransferase [Rhodobacter sphaeroides KD131] Length = 821 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. 
Identities = 58/284 (20%), Positives = 94/284 (33%), Gaps = 20/284 (7%) Query: 105 LRLNRFMSNSRMPFDSEKFLYVKELFEGWN-----DRPSSPKKSGLTIKSKIAIVVHCYY 159 L + M R + + L + N R + + T + ++ VH +Y Sbjct: 537 LEREKMMRYGRRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFY 596 Query: 160 QDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFL 217 D + + VT K + + + ++ V N+GRD+ PFL Sbjct: 597 TDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFL 656 Query: 218 YLLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276 L G D C +H KKS G IWR +L LLG + Sbjct: 657 QLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATHL-R 712 Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPK 336 NP +G++ + +A R P L F G MF+V+ + Sbjct: 713 NPGVGLVAPFDPYFIP-------WDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSR 765 Query: 337 CLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTEFS 379 + + +L G E DG H +ER + + Sbjct: 766 VVRAMNDLFGAGYPWPNEPIPNDGTEFHLIERLWPAMAAQCGLT 809 >gi|291520449|emb|CBK75670.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 486 Score = 159 bits (403), Expect = 7e-37, Method: Composition-based stats. Identities = 43/174 (24%), Positives = 71/174 (40%), Gaps = 6/174 (3%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKD---FEQDVLKYFPSAQLYVM 206 K+A+V H YY + + L ++ + D+ +T +K E K ++ V Sbjct: 291 KVAVVAHLYYVEMFELCMDYLAKVPYGIDIIITTNSDDKKQNIIEVASEKGVKLTEVIVA 350 Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 EN+GR++ L + +Y Y C +H KKS H G+ +R L+ L Sbjct: 351 ENRGRELAALLVGCGKFLL-KYKYFCFVHDKKSS-AKEHLSVGLAFRDILWDSSLYSEGY 408 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFF-AKRSEVYRRVIDLAKRAGFPTK 319 II+ FEQN C+G+ + F Y + I+L+K Sbjct: 409 IRNIIDMFEQNECMGLAVPPTVYCGSYFYPFPDYWVGNYEKTIELSKILNINVD 462 >gi|297182567|gb|ADI18727.1| lipopolysaccharide biosynthesis protein [uncultured Rhizobiales bacterium HF4000_32B18] Length = 887 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. 
Identities = 55/236 (23%), Positives = 80/236 (33%), Gaps = 21/236 (8%) Query: 153 IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQL--YVMENKG 210 + VH +Y D + E + T K E + V+ N+G Sbjct: 648 VHVHAHYTDGFAEDLAGFAAWRHAARVVATTDTEAKAAEIAAAGRNGGVAIETRVVANRG 707 Query: 211 RDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270 RDV PFL L + D C +H KKS G G +WR +L LLG + Sbjct: 708 RDVLPFLELFDGSEDDN-ALWCHVHLKKSVGLGP-TSPGAVWRAFLMRILLGGPERLSTA 765 Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG--------FPTKRLH 322 + + P G++G+ + R + L R P Sbjct: 766 L-ALIRAPEAGLVGAFDPYV-------MGWTGSRRLLAPLQARLDGWEADGGRRPLPDHP 817 Query: 323 LDFFNGTMFWVKPKCLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTE 377 L F G MFWVK + +R L E DG + H +ER + + Sbjct: 818 LLFPVGDMFWVKAGVVNAMRRLFGADYPWPGEPLPGDGTVYHLIERLWPTAAALAG 873 >gi|50982351|gb|AAT91804.1| hypothetical protein [Yersinia enterocolitica] Length = 358 Score = 152 bits (385), Expect = 8e-35, Method: Composition-based stats. Identities = 56/247 (22%), Positives = 96/247 (38%), Gaps = 16/247 (6%) Query: 142 KSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSA 201 K +K I+VH +YQ EI + L+ +D+ +T N + + Sbjct: 120 KIKPNTDNKKLIIVHAFYQREAEEIFNRLVAFTD-YDIVITSPYNNIICKAKEILGQERV 178 Query: 202 QLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLL 261 ++M N GRD+ PFL L+L V ++Y+Y K+H K+SQ H + W L+ Sbjct: 179 IGFIMPNYGRDILPFLICLQLIVIEKYEYFVKVHTKRSQ----HLNDNGAWFNNNLDYLV 234 Query: 262 GFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRL 321 G + + + + + Y + + + L + Sbjct: 235 GNKNATDGLFSIMSDDE---------PQIYGEYILPIQDHIAN-NIHWLTYLLEKEPASV 284 Query: 322 HLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSI 380 F GTMF L +R+L L + + E+E DG HA+ER+F Sbjct: 285 EASFIPGTMFIGNRAFLVLIRDLQLHLFQIEKENGQLDGCCVHAIERYFGYIASVNGGKC 344 Query: 381 ESVDCVA 387 S++ + Sbjct: 345 CSIETLI 351 >gi|301632931|ref|XP_002945533.1| PREDICTED: o-antigen export system ATP-binding protein rfbB-like, partial [Xenopus (Silurana) tropicalis] Length = 367 Score = 140 
bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 10/150 (6%) Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286 RY + ++H K+S G WR L+ L G I++TF +P LGM+ Sbjct: 203 RYALILRLHSKRSLHIPGQ--VGEEWRALLYTSLAGSRQRVNAIVDTFNTHPKLGMLCPA 260 Query: 287 RYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHL-DFFNGTMFWVKPKCLEPLRNLH 345 ++ Y+R+ L + G DF G+MFW +P+ L Sbjct: 261 ---VIDHYADCLHFGGNYKRMCALLQPHGITLPPDQPIDFPMGSMFWCRPQALSVWLEPG 317 Query: 346 L-IGEFEEERNL---KDGALEHAVERFFAC 371 +F +L +DG L HA+ER F Sbjct: 318 FTFDDFTPTNDLDTDRDGTLAHALERLFFF 347 >gi|77404644|ref|YP_345218.1| glycosyltransferase [Rhodobacter sphaeroides 2.4.1] gi|77390294|gb|ABA81477.1| possible glycosyltransferase [Rhodobacter sphaeroides 2.4.1] Length = 793 Score = 140 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 51/249 (20%), Positives = 83/249 (33%), Gaps = 19/249 (7%) Query: 105 LRLNRFMSNSRMPFDSEKFLYVKELFEGWN-----DRPSSPKKSGLTIKSKIAIVVHCYY 159 L + M R + + L + N R + + T + ++ VH +Y Sbjct: 537 LEREKMMRYGRRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFY 596 Query: 160 QDTWIEISHILLRLNFDFDLFVTVVEANK--DFEQDVLKYFPSAQLYVMENKGRDVRPFL 217 D + + VT K + + + ++ V N+GRD+ PFL Sbjct: 597 TDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFL 656 Query: 218 YLLELGVFDRYD-YLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276 L G D C +H KKS G IWR +L LLG + Sbjct: 657 QLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATNL-R 712 Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPK 336 NP +G++ + +A R P L F G MF+V+ Sbjct: 713 NPGVGLVAPFDPYFIP-------WDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSA 765 Query: 337 CLEPLRNLH 345 + + +L Sbjct: 766 VVRAMNDLF 774 >gi|224536718|ref|ZP_03677257.1| hypothetical protein BACCELL_01594 [Bacteroides cellulosilyticus DSM 14838] gi|224521634|gb|EEF90739.1| hypothetical protein BACCELL_01594 [Bacteroides cellulosilyticus DSM 14838] Length = 361 Score = 123 bits (310), Expect = 
4e-26, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 37/127 (29%), Gaps = 8/127 (6%) Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61 Y+ + + K+ + ++ + Y + W SP+ + ++ Sbjct: 241 YECAKWRHKIFRTPKIVEYKKASSFFVGEEEYDKEIIPTIIPNWDHSPRSLGKALVLNHA 300 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 E FE + + CR+ F S E + +L + + Sbjct: 301 EPRYFEK------HVKNVMIHIENKPFECRLAFVKSWNEWAEGNYLEPDLRYGKRYLEVM 354 Query: 120 SEKFLYV 126 E L Sbjct: 355 KECILKE 361 >gi|291520444|emb|CBK75665.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 424 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 29/144 (20%), Positives = 52/144 (36%), Gaps = 4/144 (2%) Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA-KRSEV 303 + G + ++ LLG ++ +++ F LG++ + + + Sbjct: 3 YESVGRDFNNRIWQSLLGSKELVEEVLSAFSDEKYLGLLMPSMVTHGEYFHTAIDSWTIC 62 Query: 304 YRRVIDLAKRAGFPTKR--LHLDFFNGTMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGA 360 Y ++LAK+ G GT FW + K LE L + F E DG+ Sbjct: 63 YDGTVELAKKIGLNVPIYGDRNPLSLGTAFWARTKALEKLFEYNFSYDMFPGEPFPVDGS 122 Query: 361 LEHAVERFFACSVRYTEFSIESVD 384 + H +ER F + V Sbjct: 123 ISHYIERIFPYVALDAGYYTGIVY 146 >gi|270294908|ref|ZP_06201109.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270274155|gb|EFA20016.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 358 Score = 115 bits (289), Expect = 1e-23, Method: Composition-based stats. 
Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 8/122 (6%) Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61 YK + K K+ +I ++ Y + W SP+ R S ++ Sbjct: 238 YKYAKWKHKIFRIPKVVEYKKASSFFVGDEEYEENIIPTIIPNWDHSPRSRGKSLVLNHA 297 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 E S F R K + R+ F S E + +L + + Sbjct: 298 EPSYF------ARHLKEAIKRIENKPLDHRLAFVKSWNEWAEGNYLEPDLHYGKRYLEVI 351 Query: 120 SE 121 + Sbjct: 352 KK 353 >gi|160888551|ref|ZP_02069554.1| hypothetical protein BACUNI_00968 [Bacteroides uniformis ATCC 8492] gi|317477905|ref|ZP_07937089.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides sp. 4_1_36] gi|156861865|gb|EDO55296.1| hypothetical protein BACUNI_00968 [Bacteroides uniformis ATCC 8492] gi|316905921|gb|EFV27691.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides sp. 4_1_36] Length = 358 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 8/122 (6%) Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61 YK + K K+ +I ++ Y + W SP+ R S ++ Sbjct: 238 YKYAKWKHKIFRIPKVVEYKKASSFFVGDEEYEENIIPTIIPNWDHSPRSRGKSLVLNHA 297 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 E S F R K + R+ F S E + +L + + Sbjct: 298 EPSYF------ARHMKEAIKRIENKPLDHRLAFVKSWNEWAEGNYLEPDLHYGKRYLEVI 351 Query: 120 SE 121 + Sbjct: 352 KK 353 >gi|75674736|ref|YP_317157.1| lipopolysaccharide biosynthesis protein [Nitrobacter winogradskyi Nb-255] gi|74419606|gb|ABA03805.1| lipopolysaccharide biosynthesis protein [Nitrobacter winogradskyi Nb-255] Length = 734 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. 
Identities = 20/122 (16%), Positives = 37/122 (30%), Gaps = 12/122 (9%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 +S GK+ + + + Y PA + W +P++ + FE Sbjct: 429 ESFTGKVYDYVDAVRSSLGKTYDFPYFPAVMP----RWDNTPRKGSRGHVFNRSSPEAFE 484 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 WLR ++ + P I F S E + A L + + + Sbjct: 485 ---VWLRDATGRARRGPFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASS 538 Query: 126 VK 127 Sbjct: 539 EP 540 >gi|92116633|ref|YP_576362.1| lipopolysaccharide biosynthesis protein [Nitrobacter hamburgensis X14] gi|91799527|gb|ABE61902.1| lipopolysaccharide biosynthesis protein [Nitrobacter hamburgensis X14] Length = 734 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 37/121 (30%), Gaps = 12/121 (9%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 S GK+ + + + Y PA + W +P++ + FE Sbjct: 430 SFTGKVYDYVDAVRSSLGKTYDFPYFPAVMP----RWDNTPRKGSRGHIFNRSSPEAFE- 484 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 WLR ++ S + P I F S E + A L + + + Sbjct: 485 --VWLRDAANRARKSAFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASSE 539 Query: 127 K 127 Sbjct: 540 P 540 >gi|148238469|ref|YP_001223856.1| sulfotransferase [Synechococcus sp. WH 7803] gi|147847008|emb|CAK22559.1| Possible sulfotransferase [Synechococcus sp. WH 7803] Length = 476 Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats. 
Identities = 34/160 (21%), Positives = 53/160 (33%), Gaps = 8/160 (5%) Query: 222 LGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG 281 +D + H K++ G WR+ L D T P G Sbjct: 4 RDRLKEFDLVVHCHTKRTPHAPD--GFGESWRQSLLQCTFPNPDRCQE-FQTLLHKPEAG 60 Query: 282 MIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD-FFNGTMFWVKPKCLEP 340 +I +R + R +++L G +R L F G+ FW + L Sbjct: 61 LIMPWPHRFVAHNVNWGSNFTQTRALMNL---MGHTIRRDTLLAFPAGSFFWARVDSLLA 117 Query: 341 LRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFS 379 L +L L +F E DG L H++ER + Sbjct: 118 LLDLTLRWEDFAAEPLPGDGRLAHSLERCLGLLPMLNDRR 157 >gi|85713620|ref|ZP_01044610.1| lipopolysaccharide biosynthesis protein [Nitrobacter sp. Nb-311A] gi|85699524|gb|EAQ37391.1| lipopolysaccharide biosynthesis protein [Nitrobacter sp. Nb-311A] Length = 734 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 33/121 (27%), Gaps = 12/121 (9%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 + GKI + + + Y W +P++ + FE Sbjct: 430 TFTGKIYDYVDAVRSSLGK----TYDFPCFPAVMPRWDNTPRKGSRGHIFNRSSPEAFE- 484 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 WLR ++ + P I F S E + A L + + + Sbjct: 485 --VWLRDAAGRARREPFAEP---IVFINSWNEWAEGAHLEPDSRYGRAFLEAVRRVASSE 539 Query: 127 K 127 Sbjct: 540 P 540 >gi|189426434|ref|YP_001953611.1| radical SAM protein [Geobacter lovleyi SZ] gi|189422693|gb|ACD97091.1| Radical SAM domain protein [Geobacter lovleyi SZ] Length = 843 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. 
Identities = 18/118 (15%), Positives = 39/118 (33%), Gaps = 14/118 (11%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 K+ + E+L+L L + + + W +P+ + +F Sbjct: 731 KVSRYEDLVLYLKQYQLSDNE-------YPLVVPNWDNTPRSGSNGFVLQGSTPELFGEM 783 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 L L + K P+ RI F + E + L + ++ + + L+ Sbjct: 784 ---LEDALRKVEQRKD--PADRIVFIKAWNEWAEGNHLEPDLLHGHAYLQALYKALLH 836 >gi|296445524|ref|ZP_06887480.1| lipopolysaccharide biosynthesis protein-like protein [Methylosinus trichosporium OB3b] gi|296256929|gb|EFH04000.1| lipopolysaccharide biosynthesis protein-like protein [Methylosinus trichosporium OB3b] Length = 431 Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 34/133 (25%), Gaps = 6/133 (4%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75 N++ + E G W ++ ++E WL Sbjct: 273 NVVAYEAMIEASLNHRPTGYKLFPGVCPSWDNEARRPGKGSCFAGASPRLYED---WLTG 329 Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133 + RI F + E + A+L +R + + + V+ + Sbjct: 330 ACRAVLTDAQTRDE-RIVFINAWNEWGEGAYLEPDRHYGYAYLVATANALRRVENQRDNE 388 Query: 134 NDRPSSPKKSGLT 146 + S Sbjct: 389 GAIEGAKGASNRN 401 >gi|312100417|gb|ADQ27813.1| glycosyltransferase [Burkholderia pseudomallei] gi|312100462|gb|ADQ27848.1| putative glycosyltransferase [Burkholderia pseudomallei] Length = 1738 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. 
Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%) Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84 E+ G W ++ ++E WL + A + Sbjct: 980 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 1035 Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 + P R+ F + E + A L +R + + Sbjct: 1036 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 1085 >gi|312100431|gb|ADQ27825.1| glycosyltransferase [Burkholderia pseudomallei] Length = 1706 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%) Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84 E+ G W ++ ++E WL + A + Sbjct: 948 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 1003 Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 + P R+ F + E + A L +R + + Sbjct: 1004 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 1053 >gi|150010201|ref|YP_001304944.1| hypothetical protein BDI_3624 [Parabacteroides distasonis ATCC 8503] gi|149938625|gb|ABR45322.1| conserved hypothetical protein [Parabacteroides distasonis ATCC 8503] Length = 370 Score = 101 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 11/109 (10%), Positives = 27/109 (24%), Gaps = 10/109 (9%) Query: 24 EEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81 + + Y W SP++ + H ++F+ Sbjct: 268 KAIEKIDTPYYEEDRVYPNIIPGWDNSPRRGPGAFIFHKATPALFKK------HVKMILN 321 Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 K ++ F S E + ++ + + E + Sbjct: 322 RIKDKPDEDKVIFLKSWNEWAEGNYMEPDLKWGKGYIRALREALEEDAK 370 >gi|308813905|ref|XP_003084258.1| conserved domain protein (ISS) [Ostreococcus tauri] gi|116056142|emb|CAL58323.1| conserved domain protein (ISS) [Ostreococcus tauri] Length = 684 Score = 101 bits (253), Expect = 2e-19, Method: Composition-based stats. 
Identities = 39/248 (15%), Positives = 76/248 (30%), Gaps = 55/248 (22%) Query: 175 FDFDLFVTVVE------ANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELG--VFD 226 L++++ F + L+ + ++ ++++G D+ FL L Sbjct: 103 VQLQLYLSLTPTVANAPEVAYFTERFLRNEKNIRVVHVKDEGYDIGAFLKQLHRFRHELQ 162 Query: 227 RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGS- 285 + Y+ K+H K IW L G I+ FE L ++ Sbjct: 163 VHQYILKVHSKSDP----------IWLERAVESLCGSEHQVKSILKAFETQSTLDIVSPM 212 Query: 286 ---------RRYRRYKRWSFFAKRSEVY--------RRVIDLAKRAGFPTKRLHLDF--- 325 + + + ++ + L + G + Sbjct: 213 GSTFSATTSKDAVFPHLKRKYFNKVDLATAFDDKTMHTMERLCAQLGLEACPYFEKYLAS 272 Query: 326 -FNGTMFWVK---------PKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRY 375 GTMFW + P+ E +RN L ++ + +EHA+ER R Sbjct: 273 ITAGTMFWARNSRLYTEHLPRLFESIRN-ELSQDY-----SNNNRIEHALERLIPTLSRL 326 Query: 376 TEFSIESV 383 I + Sbjct: 327 NGRMIGDI 334 Score = 43.8 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 13/88 (14%), Positives = 27/88 (30%), Gaps = 2/88 (2%) Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99 G V + P+ + + F S + + L+ ++ I + Sbjct: 594 GSTVRFDRRPRSGDYNFPILR-TPQEFGSAYSAMIARLSTMPGREIDVGFNFICAWNEWN 652 Query: 100 EQKAFLRLNRFMSNSRMPFDSEKFLYVK 127 EQ A L + + R+ + V Sbjct: 653 EQ-AVLEPDEWWGFQRLQEILKVVNNVP 679 >gi|322418494|ref|YP_004197717.1| group 1 glycosyl transferase [Geobacter sp. M18] gi|320124881|gb|ADW12441.1| glycosyl transferase group 1 [Geobacter sp. M18] Length = 708 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. 
Identities = 11/128 (8%), Positives = 31/128 (24%), Gaps = 11/128 (8%) Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSKDVH 59 ++ +L+ + L + + W +P+ +H Sbjct: 241 EILKLRFFSKEKPELPQVYSYKSFVANAFPDNTLRRDYYPCVVPNWDNTPRSGKNGFVLH 300 Query: 60 FQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMP 117 ++E + + R+ F S E + +L + + + Sbjct: 301 GSTPQLYEQHLEEAVDLVD------DRPEDERVIFVKSWNEWAETNYLEPDLRWGKAYLD 354 Query: 118 FDSEKFLY 125 Sbjct: 355 ATLRAVTR 362 >gi|293407666|gb|ADE44320.1| putative glycosyl transferase [Burkholderia pseudomallei] Length = 740 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 14/110 (12%), Positives = 28/110 (25%), Gaps = 6/110 (5%) Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84 E+ G W ++ ++E WL + A + Sbjct: 366 ERSRAYPDTEYRLFRGVTPSWDNEARKPGRGAVFVGSTPKLYEE---WLLNA-ATDTVER 421 Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 + P R+ F + E + A L +R + + Sbjct: 422 IDNPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATRNALTQANRTGAA 471 >gi|30248500|ref|NP_840570.1| hypothetical protein NE0485 [Nitrosomonas europaea ATCC 19718] gi|30138386|emb|CAD84396.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718] Length = 445 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. 
Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 7/113 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 +L D+ E P +W S ++ ++E WL Sbjct: 337 VLDYRDIVEHKKYFLYNHPKLHRAAMPMWDNSARRDNKGMIFEGASPDLYE---RWLTDI 393 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 L +K + F + E + A+L ++ + + + V+ Sbjct: 394 LLEAKNREDL--EDHYIFINAWNEWGEGAYLEPDKKYGYAYLNATRQAIEGVR 444 >gi|221201094|ref|ZP_03574134.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2M] gi|221206454|ref|ZP_03579467.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2] gi|221173763|gb|EEE06197.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2] gi|221178944|gb|EEE11351.1| glycosyl transferase, group 1 [Burkholderia multivorans CGD2M] Length = 1714 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 15/111 (13%), Positives = 32/111 (28%), Gaps = 6/111 (5%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 +L E+ G W ++ ++E WL + Sbjct: 948 ILDWTHYVERSRSYQDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE---WLCNA 1004 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 A +++ P R+ F + E + A L +R + + + Sbjct: 1005 -ATDTVRRIANPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATNNALSR 1054 >gi|330826738|ref|YP_004390041.1| family 2 glycosyl transferase [Alicycliphilus denitrificans K601] gi|329312110|gb|AEB86525.1| glycosyl transferase family 2 [Alicycliphilus denitrificans K601] Length = 1669 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. 
Identities = 18/133 (13%), Positives = 39/133 (29%), Gaps = 8/133 (6%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLR 74 NL + E + G W + ++R +H ++E WLR Sbjct: 678 NLADYAQLAEFWLDRPSPAYKRFRGIVPAWDNAARRRKGGATVIHGSTPQLYEK---WLR 734 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 +A + + RI F + E + +L + ++ + + Sbjct: 735 GTVA--RTLEEREGDERIVFINAWNEWGEGCYLEPDEKFGHAYLEATQRVLRDPPQALLE 792 Query: 133 WNDRPSSPKKSGL 145 R + + Sbjct: 793 DLRRERAAVAAPA 805 >gi|319764522|ref|YP_004128459.1| glycosyl transferase family 2 [Alicycliphilus denitrificans BC] gi|317119083|gb|ADV01572.1| glycosyl transferase family 2 [Alicycliphilus denitrificans BC] Length = 1669 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 39/133 (29%), Gaps = 8/133 (6%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLR 74 NL + E + G W + ++R +H ++E WLR Sbjct: 678 NLADYAQLAEFWLDRPSPAYKRFRGIVPAWDNAARRRKGGATVIHGSTPQLYEK---WLR 734 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 +A + + RI F + E + +L + ++ + + Sbjct: 735 GTVA--RTLEEREGDERIVFINAWNEWGEGCYLEPDEKFGHAYLEATQRVLRDPPQALLE 792 Query: 133 WNDRPSSPKKSGL 145 R + + Sbjct: 793 DLRRERAAVAAPA 805 >gi|217420529|ref|ZP_03452034.1| glycosyltransferase, group 1 [Burkholderia pseudomallei 576] gi|217395941|gb|EEC35958.1| glycosyltransferase, group 1 [Burkholderia pseudomallei 576] Length = 1736 Score = 99.6 bits (247), Expect = 7e-19, Method: Composition-based stats. 
Identities = 15/111 (13%), Positives = 32/111 (28%), Gaps = 6/111 (5%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 +L E+ G W ++ ++E WL + Sbjct: 969 ILDWTHYLERSRSYPDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE---WLFNA 1025 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 + ++ P R+ F + E + A L +R + + S+ Sbjct: 1026 -SVDTVRRIENPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATSDALSR 1075 >gi|237653904|ref|YP_002890218.1| lipopolysaccharide biosynthesis protein-like protein [Thauera sp. MZ1T] gi|237625151|gb|ACR01841.1| lipopolysaccharide biosynthesis protein-like protein [Thauera sp. MZ1T] Length = 358 Score = 99.3 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 14/122 (11%), Positives = 37/122 (30%), Gaps = 9/122 (7%) Query: 5 FRLKSKLGKIE-NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63 R++ GK + +L + + W +P+ + +H Sbjct: 239 ARMRMAKGKYKLTVLDYARIMSGLTRASPPQFTEYPTVLPNWDNTPRSGLNGLVLHGSTP 298 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 +F++ + + + RI F + E + +L ++ + + E Sbjct: 299 ELFKTVLRRGVDLV------QGYPAEQRIVFIKAWNEWAEGNYLEPDQRFGHGYLRAVRE 352 Query: 122 KF 123 Sbjct: 353 VL 354 >gi|294675724|ref|YP_003576339.1| family 2 glycosyl transferase [Rhodobacter capsulatus SB 1003] gi|294474544|gb|ADE83932.1| glycosyl transferase, family 2/group 1 [Rhodobacter capsulatus SB 1003] Length = 1993 Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats. 
Identities = 13/135 (9%), Positives = 36/135 (26%), Gaps = 10/135 (7%) Query: 21 LDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFS 80 E+ + W + +++ + + WL + + + Sbjct: 1237 RSYVERSRNYPMPDYKLYRSVCPSWDNTARRKNKGAIFANSNPAEYR---VWLENAVTRT 1293 Query: 81 KYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF--LYVKELFE--GWN 134 + R+ F + E + A L + + + + V + G + Sbjct: 1294 LADARTPDE-RVIFVNAWNEWAEGAHLEPDTKYGYAYLEASRAALNPVEVPRMVTLVGHD 1352 Query: 135 DRPSSPKKSGLTIKS 149 P + L + Sbjct: 1353 AHPHGAQILLLNLAR 1367 >gi|322418493|ref|YP_004197716.1| group 1 glycosyl transferase [Geobacter sp. M18] gi|320124880|gb|ADW12440.1| glycosyl transferase group 1 [Geobacter sp. M18] Length = 1687 Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 12/115 (10%), Positives = 33/115 (28%), Gaps = 8/115 (6%) Query: 16 NLLLRLD-VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74 N + D + + + W S +++ + + WL Sbjct: 1490 NYVHYYDNLANEMMAKPPVAYKRFRCATPSWDNSARRQEGANIFVGSTPEKYR---QWLE 1546 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 +++++ K +I F + E + L ++ + + V Sbjct: 1547 HIVSYTR--KTFKGDEQIAFVNAWNEWAEGNHLEPDQKYGRAYLEATRSAIAGVP 1599 >gi|264678899|ref|YP_003278806.1| hyaluronan synthase [Comamonas testosteroni CNB-2] gi|262209412|gb|ACY33510.1| hyaluronan synthase [Comamonas testosteroni CNB-2] Length = 795 Score = 97.7 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 11/117 (9%), Positives = 28/117 (23%), Gaps = 6/117 (5%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 + D+ + + + W ++ +E WL+S Sbjct: 39 YMHYDDLISRSLDEVPPSFELIKTLVPSWDNEARKPGRGMGFVGATPEKYE---RWLKSL 95 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFE 131 + L F + E + A L + + + + + Sbjct: 96 ARRAVERPLLGKQPY-VFVNAWNEWAEGALLEPDLHYGYAYLNATFRALTNTPRVSK 151 >gi|260174685|ref|ZP_05761097.1| hypothetical protein BacD2_22702 [Bacteroides sp. 
D2] gi|315922947|ref|ZP_07919187.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313696822|gb|EFS33657.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 372 Score = 97.3 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 9/115 (7%), Positives = 25/115 (21%), Gaps = 8/115 (6%) Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70 L + ++ + + W +P+ F Sbjct: 259 LKRPPRMIDYSKYYHSLITEDDQSVDVIPSIVPQWDHTPRSGWNGSLWVNSTPYFF---- 314 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + L K + +I S E + ++ + + + Sbjct: 315 --YKHVLEALDAIKNKPQNQQILLLKSWNEWGEGNYMEPDLKNGKGYIEALKKAL 367 >gi|46241633|gb|AAS83018.1| hypothetical protein pRhico010 [Azospirillum brasilense] Length = 1380 Score = 96.9 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 14/122 (11%), Positives = 41/122 (33%), Gaps = 10/122 (8%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 G+I + +D + + + W +++ H + Sbjct: 623 FSGEIRDYNAMVDAS---LNEPAPSFPLIKTVFPSWDNDARRQGRGAVYHGSTPENYR-- 677 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 W+ +A++K + F + R+ F + E + A+L + + + + + Sbjct: 678 -RWMEGVIAYAKANP--FHNERMMFINAWNEWAEGAYLEPDLHFGAAYLNATARAIYGRR 734 Query: 128 EL 129 ++ Sbjct: 735 QV 736 >gi|167903945|ref|ZP_02491150.1| glycosyl transferase, group 1 [Burkholderia pseudomallei NCTC 13177] Length = 1741 Score = 96.9 bits (240), Expect = 5e-18, Method: Composition-based stats. 
Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 6/113 (5%) Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70 G ++L E+ G W ++ ++E Sbjct: 968 TGYAGHILDWTHYLERSRSYPDAEYRLFRGVTPSWDNEARKPGRGTVFVGSTPKLYEE-- 1025 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 WL + + ++ P R+ F + E + A L +R + + S+ Sbjct: 1026 -WLFNA-SVDTVRRIENPDERLVFINAWNEWAEGAHLEPDRRYGYAWLQATSD 1076 >gi|313202892|ref|YP_004041549.1| hypothetical protein Palpr_0404 [Paludibacter propionicigenes WB4] gi|312442208|gb|ADQ78564.1| hypothetical protein Palpr_0404 [Paludibacter propionicigenes WB4] Length = 381 Score = 96.6 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 8/116 (6%), Positives = 26/116 (22%), Gaps = 9/116 (7%) Query: 11 LGKIENLLLRLDVEEK-GNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 L + + + + + W +P+ F Sbjct: 260 LHRPPRITDYRKYYKFLVDKSEDACEDVLPTIVPNWDHTPRSGWNGTLFVHATPEYFRKH 319 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + + P R+ S E + ++ + + + + Sbjct: 320 VDEVLDIVH------KKSPERRVVMLKSWNEWGEGNYMEPDLVFGKAYIRALRDAI 369 >gi|94972405|ref|YP_595623.1| hypothetical protein LIC007 [Lawsonia intracellularis PHE/MN1-00] gi|94731942|emb|CAJ53959.1| conserved hypothetical protein [Lawsonia intracellularis PHE/MN1-00] Length = 789 Score = 96.2 bits (238), Expect = 8e-18, Method: Composition-based stats. 
Identities = 22/151 (14%), Positives = 40/151 (26%), Gaps = 21/151 (13%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K G+I + + E + W +P++ S + Sbjct: 298 KRFKGRIRHYSM---FAEAVVKDYTTKYTLYPCVFPGWDNTPRRLYFSSIFACSTPQAYR 354 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 WL F+ S R F + E + A L N+ + + S Sbjct: 355 ---QWLTDACTFA--STTHEKDNRFVFINAWNEWAEGAHLEPNKAYGYAYLNATSRVVEN 409 Query: 126 VKELFEGWNDRPSSPKKSGLTIKSKIAIVVH 156 + P + K+ +V H Sbjct: 410 F-----------AVPPSTAENNPHKVLVVGH 429 >gi|86132907|ref|ZP_01051498.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85816613|gb|EAQ37800.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 361 Score = 95.4 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 37/127 (29%), Gaps = 11/127 (8%) Query: 2 YKVFRLKSKLGKIEN-LLLRLDVEEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDV 58 Y L+ I+N L D E+ ++Q G +W + +++ + Sbjct: 240 YTTALLRKFKWTIDNRYELFYDYEQFVDLQINTEFKSKVYPGITPMWDNTARRKKNYFAL 299 Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116 H + WL+ Y P + F + E + L + + Sbjct: 300 HNSTPQ---KYAKWLKHI--VLNYPWQKMPENYL-FINAWNEWAEGNHLEPCQKWGKQYL 353 Query: 117 PFDSEKF 123 + Sbjct: 354 EETYKAL 360 >gi|294672884|ref|YP_003573500.1| hypothetical protein PRU_0097 [Prevotella ruminicola 23] gi|294473985|gb|ADE83374.1| conserved hypothetical protein [Prevotella ruminicola 23] Length = 369 Score = 95.4 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 8/94 (8%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 Y W SP+ + +FE + K P +I F Sbjct: 281 VYPAIYPNWDHSPRSGRNGFIIVDSTPDLFEKHVAQ------VLDEVKSKQPEHQIAFIK 334 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 S E + ++ + N + S + V+ Sbjct: 335 SWNEWGEGNYIEPDLKFGNGYLEALSRQIEKVRY 368 >gi|253565823|ref|ZP_04843278.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] gi|251946102|gb|EES86509.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 362 Score = 95.0 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 10/117 (8%), Positives = 29/117 (24%), Gaps = 12/117 (10%) Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74 ++ + E Y W +P+ + +FE Sbjct: 256 YKKVIPTLIGELERNCDNY----FPTIIPNWDHTPRSGVNGDLFTKSTPDLFE------I 305 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129 + + ++ F S E + ++ + + + ++ L Sbjct: 306 HCMDVLSSVTKKNTNRQVCFLKSWNEWGEGNYMEPDLKYGKGYIYALRKVVDTLESL 362 >gi|218244934|ref|YP_002370305.1| polysaccharide biosynthesis protein [Cyanothece sp. PCC 8801] gi|218165412|gb|ACK64149.1| polysaccharide biosynthesis protein [Cyanothece sp. PCC 8801] Length = 383 Score = 94.6 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 29/91 (31%), Gaps = 8/91 (8%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G W + ++++ + + I+E +WL++ + + P I F + Sbjct: 298 FPGVTPSWDNTARRQVAATILKDSTPEIYE---YWLKAVIEKTISKPELPP---IIFINA 351 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 E + L + S + Sbjct: 352 WNEWAEGNHLEPCQRWGRSYLEATQRAIKQF 382 >gi|148264392|ref|YP_001231098.1| lipopolysaccharide biosynthesis protein-like protein [Geobacter uraniireducens Rf4] gi|146397892|gb|ABQ26525.1| Lipopolysaccharide biosynthesis protein-like protein [Geobacter uraniireducens Rf4] Length = 368 Score = 94.6 bits (234), Expect = 2e-17, Method: Composition-based stats. 
Identities = 13/123 (10%), Positives = 33/123 (26%), Gaps = 9/123 (7%) Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 + + K GK D + ++ + + W +P+ + +H Sbjct: 238 QYQVKTGKPAIFSYEKDFADLQPIKIAHG-DNYPCLLPNWDNTPRSKSNGLVLHDSTPEA 296 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + S+ ++ F S E + L + + + + Sbjct: 297 FRKHVKKALEI------SRDKPDERKLVFIKSWNEWAEGNHLEPDLKFGRAYLEILRNEI 350 Query: 124 LYV 126 Sbjct: 351 SNE 353 >gi|87201246|ref|YP_498503.1| polysaccharide biosynthesis protein [Novosphingobium aromaticivorans DSM 12444] gi|87136927|gb|ABD27669.1| polysaccharide biosynthesis protein [Novosphingobium aromaticivorans DSM 12444] Length = 377 Score = 93.9 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 28/86 (32%), Gaps = 7/86 (8%) Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99 W S +++ + WLR +A+++ + P R F + Sbjct: 287 CVTPGWDNSARKKNRPLIFVGSTPERYG---RWLREMVAWTRR--NAPPERRFIFINAWN 341 Query: 100 E--QKAFLRLNRFMSNSRMPFDSEKF 123 E + L ++ ++ + + Sbjct: 342 EWAEGNHLEPDQRNGHANLEATARAL 367 >gi|288803153|ref|ZP_06408588.1| glycosyltransferase [Prevotella melaninogenica D18] gi|288334414|gb|EFC72854.1| glycosyltransferase [Prevotella melaninogenica D18] Length = 381 Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats. 
Identities = 18/121 (14%), Positives = 32/121 (26%), Gaps = 9/121 (7%) Query: 7 LKSKLGKIENL-LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 L KL + +L L V W +P+ + Sbjct: 267 LHKKLSFLPSLKLDYSKVVSNFFAPEDKWDNVYPMIIPGWDRTPRAGNSEGIYINSTPEN 326 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F+ I S + +I F S E + ++ N ++ + E Sbjct: 327 FKKHIKQALSIVD------SKPQDHKILFLKSWNEWGEGNYVEPNLKFGHAYLDAIKENL 380 Query: 124 L 124 L Sbjct: 381 L 381 >gi|241763180|ref|ZP_04761239.1| Methyltransferase type 12 [Acidovorax delafieldii 2AN] gi|241367679|gb|EER61945.1| Methyltransferase type 12 [Acidovorax delafieldii 2AN] Length = 1786 Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 40/136 (29%), Gaps = 12/136 (8%) Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74 + + V + Y ++V +W + + + +H F+ W+ Sbjct: 1243 YDQVRDYYVAQNDRKSFDYFRSNVP----MWDNTARYGTGALLLHGSTPQSFQ---QWME 1295 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 +A ++ R + E + A L + S + + E Sbjct: 1296 HSIADAQ--ANLPADRRFVVVNAWNEWAEGAHLEPDTRYGYSYLNSVGRALAGLPYAHEL 1353 Query: 133 WNDRPSSPKKSGLTIK 148 P P+ L ++ Sbjct: 1354 NATAPL-PQGLCLQVQ 1368 >gi|302186464|ref|ZP_07263137.1| glycosyl transferase family 2 [Pseudomonas syringae pv. syringae 642] Length = 1318 Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 + D+ + + G W + ++ TS F+ WL Sbjct: 1206 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1262 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 +A +K + +F R+ F + E + A+L +R ++ + Sbjct: 1263 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1308 >gi|257481987|ref|ZP_05636028.1| glycosyl transferase family 2 [Pseudomonas syringae pv. 
tabaci ATCC 11528] Length = 1360 Score = 93.5 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 + D+ + + G W + ++ TS F+ WL Sbjct: 1248 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1304 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 +A +K + +F R+ F + E + A+L +R ++ + Sbjct: 1305 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1350 >gi|86140376|ref|ZP_01058935.1| Glycosyltransferase [Leeuwenhoekiella blandensis MED217] gi|85832318|gb|EAQ50767.1| Glycosyltransferase [Leeuwenhoekiella blandensis MED217] Length = 380 Score = 93.5 bits (231), Expect = 6e-17, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 39/125 (31%), Gaps = 15/125 (12%) Query: 6 RLKSKLGKIENLLLRL-----DVEEKGNMQAIYIP--AHVSGYYVLWSFSPKQRITSKDV 58 + KS +G + R D ++ + + IP ++ + W SP+ S Sbjct: 255 KYKSLIGHTNKIGERKRPLIFDYKKGARLLSQNIPHKKYIPCVFPNWDNSPRSGKKSLIF 314 Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116 + W + K + +I S E + +L ++ S + Sbjct: 315 KNATPN------AWKEHLKHTIEVLKSKPENPQIIIIKSWNEWAEGNYLEPDQEFGISML 368 Query: 117 PFDSE 121 E Sbjct: 369 KVVKE 373 >gi|330989699|gb|EGH87802.1| glycosyl transferase family 2 [Pseudomonas syringae pv. lachrymans str. M301315] Length = 1301 Score = 93.1 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 + D+ + + G W + ++ TS F+ WL Sbjct: 1189 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 1245 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 +A +K + +F R+ F + E + A+L +R ++ + Sbjct: 1246 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 1291 >gi|262383300|ref|ZP_06076436.1| conserved hypothetical protein [Bacteroides sp. 
2_1_33B] gi|262294198|gb|EEY82130.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] Length = 387 Score = 93.1 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 37/128 (28%), Gaps = 15/128 (11%) Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP-----AHVSGYYVLWSFSPKQRITSKD 57 +V R + + +++ + IY+P W S + ++ Sbjct: 264 RVIRW--LMFNLFKYRTLSKCDQRVINKYIYVPEDKWDNVYPILLPQWDRSARAGKMARI 321 Query: 58 VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSR 115 +F S I S L + +I F S E + ++ + + Sbjct: 322 YVGSTPDVFRSQIQSALSLL------ENKTDEHKILFLRSWNEWAEGNYVEPDLKYGHGY 375 Query: 116 MPFDSEKF 123 + E Sbjct: 376 LDVLRECL 383 >gi|94497762|ref|ZP_01304329.1| hypothetical protein SKA58_12300 [Sphingomonas sp. SKA58] gi|94422811|gb|EAT07845.1| hypothetical protein SKA58_12300 [Sphingomonas sp. SKA58] Length = 1425 Score = 93.1 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 22/229 (9%), Positives = 57/229 (24%), Gaps = 46/229 (20%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K G+I + +D + Y G + W ++ ++ H F Sbjct: 596 KDFGGEIFDYGAVVDGD-VERYADGYEWPVHRGAMLGWDNMARRLTDARVFHGATPQGFR 654 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 W++ L + + F + E + +L ++ + + Sbjct: 655 ---RWIKGILDQESRHNSAP--ETLMFINAWNEWAEGTYLEPDQRWGRTNLAAFRSAVDA 709 Query: 126 VKELFEGWND-----------------RPSSPK-------------KSGLTIKSKIAIVV 155 + P +P + K I + Sbjct: 710 TPGMKAVTLPAGIAAAPKQEGRLAHLGSPLAPDGTMPRGPVWYRGYREVDPTKPTILLCA 769 Query: 156 HCYYQDT------WIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYF 198 H +++ L + + + +T+ N + ++ Sbjct: 770 HISGHQLFGGERSLLDVLEALATMPVN--VIMTLPSDNNRAYIEAIQKL 816 >gi|331008848|gb|EGH88904.1| glycosyl transferase family 2 [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 846 Score = 92.7 bits (229), Expect = 8e-17, Method: Composition-based stats. 
Identities = 17/108 (15%), Positives = 37/108 (34%), Gaps = 7/108 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 + D+ + + G W + ++ TS F+ WL Sbjct: 734 VADYRDLAVRYATRPAPGYTRFKGVMPGWDNTARRPHTSFCFENATPGAFQ---AWLEES 790 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 +A +K + +F R+ F + E + A+L +R ++ + Sbjct: 791 IADTK--EQNFGDERLVFVNAWNEWAEGAYLEPDRRFGHTYLEAVKNA 836 >gi|330996421|ref|ZP_08320304.1| hypothetical protein HMPREF9442_01389 [Paraprevotella xylaniphila YIT 11841] gi|329573279|gb|EGG54893.1| hypothetical protein HMPREF9442_01389 [Paraprevotella xylaniphila YIT 11841] Length = 367 Score = 92.7 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 12/121 (9%), Positives = 24/121 (19%), Gaps = 8/121 (6%) Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64 + + K + + W SP+ + Sbjct: 247 AKFQRIALKRGRHIEYSRASQYFQGPEEQANDCYPTLIPNWDHSPRSGRAGHILIRSTPE 306 Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 F+ A RI F S E + ++ + + E Sbjct: 307 KFKKHAQ------ASFNNISHKAMEDRIVFLKSWNEWAEGNYMEPDLKFGKGYLKALKEA 360 Query: 123 F 123 Sbjct: 361 I 361 >gi|255014255|ref|ZP_05286381.1| hypothetical protein B2_10114 [Bacteroides sp. 2_1_7] Length = 392 Score = 92.7 bits (229), Expect = 1e-16, Method: Composition-based stats. 
Identities = 15/129 (11%), Positives = 37/129 (28%), Gaps = 10/129 (7%) Query: 1 MYKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHF 60 +++V + + ++ L + + + W SP+ + Sbjct: 272 IHRVLSSRFHISSLDKY-DYLKIIKHYYVPEDKWDNVYPSLLPQWDRSPRSGVNG-IYVN 329 Query: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPF 118 F+ I+ + L RI F S E + ++ + + + Sbjct: 330 STPVNFKKMIYEALNLLN------NKQDEHRILFLKSWNEWAEGNYVEPDLKYGHGYLDV 383 Query: 119 DSEKFLYVK 127 E + K Sbjct: 384 LRECLVNDK 392 >gi|118580521|ref|YP_901771.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM 2379] gi|118503231|gb|ABK99713.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM 2379] Length = 363 Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 37/119 (31%), Gaps = 13/119 (10%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 LK ++ + +L+ + +E + SP+++ S H ++ Sbjct: 252 LKHQIYEYSSLVDAMLGKELPTYPF------YRCVCPSFDNSPRRKTDSVVFHNSTPELY 305 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 WL + ++ P R+ F + E + L + + + Sbjct: 306 ---FRWLNEVVEWTSC--NHSPEERLVFVNAWNEWGEGNHLEPDLRWGKQYLEKTRQAI 359 >gi|254411253|ref|ZP_05025030.1| hypothetical protein MC7420_1744 [Microcoleus chthonoplastes PCC 7420] gi|196181754|gb|EDX76741.1| hypothetical protein MC7420_1744 [Microcoleus chthonoplastes PCC 7420] Length = 379 Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats. 
Identities = 14/135 (10%), Positives = 37/135 (27%), Gaps = 23/135 (17%) Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIY---------------IPAHVSGYYVLWSFSPK 50 ++K KL + ++ + +Y + W +P+ Sbjct: 249 KVKQKLSAFSSRRFYQKYKQFSDYPLLYSYEKAIKCAFKGSHPYFVTYPCIFPNWDNTPR 308 Query: 51 QRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLN 108 I +F + ++ + K R+ F S E + +L + Sbjct: 309 TGIYGLVFLKSTPDLFRVHLQEAIETVSERESEK------RLIFIRSWNEWAEGNYLEPD 362 Query: 109 RFMSNSRMPFDSEKF 123 + + ++ Sbjct: 363 LKFGKAFLEVIRDEI 377 >gi|256827944|ref|YP_003156672.1| glycosyl transferase family 2 [Desulfomicrobium baculatum DSM 4028] gi|256577120|gb|ACU88256.1| glycosyl transferase family 2 [Desulfomicrobium baculatum DSM 4028] Length = 1077 Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 12/118 (10%), Positives = 32/118 (27%), Gaps = 14/118 (11%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 + ++ + + R + A Y W +P++ S + + Sbjct: 800 EHRVYDYDEFVAR----QLTKPAASY--RRYPCVTPRWDNTPRRPKDSVVLLDPSPDRYR 853 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 WL + R+ F + E + L + ++ + + Sbjct: 854 ---RWLSHAVESVT---KLPADERLVFINAWNEWGEGCALEPDLLRGDAYLKATAAAL 905 >gi|149276164|ref|ZP_01882308.1| hypothetical protein PBAL39_00552 [Pedobacter sp. BAL39] gi|149232684|gb|EDM38059.1| hypothetical protein PBAL39_00552 [Pedobacter sp. BAL39] Length = 399 Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats. 
Identities = 21/124 (16%), Positives = 38/124 (30%), Gaps = 11/124 (8%) Query: 5 FRLKSKLGKIENLLLRLDVEEKG--NMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62 ++ ++ L L EK + + + V W SP+ S V Sbjct: 281 YQFVHFTEVNKDYLDILTAVEKEWARIDTAFEFNYYPHISVGWDNSPRTG-KSAVVKNNT 339 Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120 FE LR A++ P + S E + ++L+ + + Sbjct: 340 PENFEKG---LRMAKAYADAHPKQVP---LITINSWNEWTETSYLQPDNVYGYGYLDAIK 393 Query: 121 EKFL 124 FL Sbjct: 394 RVFL 397 >gi|163814421|ref|ZP_02205810.1| hypothetical protein COPEUT_00572 [Coprococcus eutactus ATCC 27759] gi|158450056|gb|EDP27051.1| hypothetical protein COPEUT_00572 [Coprococcus eutactus ATCC 27759] Length = 387 Score = 92.3 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 14/109 (12%), Positives = 37/109 (33%), Gaps = 12/109 (11%) Query: 20 RLDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 R+D ++ P +V G +V W +P+ + + FE ++ Sbjct: 276 RVDYDKAWETILNTTPESIINVPGAFVDWDNTPRHGERGRVYIGKTPEKFEKYLSE---- 331 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + +K + + F + E + +L ++ + + + Sbjct: 332 --QIRRAKNVYHKD-MIFMYAWNEWAEGGYLEPDQTSGYAYLEAIKKAL 377 >gi|298377838|ref|ZP_06987788.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_19] gi|298265284|gb|EFI06947.1| glycosyl transferase, group 2 family [Bacteroides sp. 3_1_19] Length = 366 Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 8/90 (8%), Positives = 19/90 (21%), Gaps = 8/90 (8%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 W SP+ + +F+ + + RI S Sbjct: 280 YPTIIPNWDHSPRTGRYGAILKDSTPQLFQKHVEQTVHLIL------NKDDDHRIVILKS 333 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 E + ++ + + Sbjct: 334 WNEWAEGNYVEPDLNFGRGYLEALRTALQK 363 >gi|253578786|ref|ZP_04856057.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251849729|gb|EES77688.1| conserved hypothetical protein [Ruminococcus sp. 
5_1_39BFAA] Length = 387 Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 13/111 (11%), Positives = 34/111 (30%), Gaps = 12/111 (10%) Query: 18 LLRLDVEEKGNMQAIYIP---AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74 +L+ D +E +IP ++ G +V W +P++ + + Sbjct: 276 VLKTDYDEAWKAILEHIPENEKNIPGAFVGWDNTPRKGHRGQVYIGDTPEKLNKY----- 330 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ S + F + E + +L + + + Sbjct: 331 --MSKQIQRAKSIYKKDMIFMYAWNEWAEGGYLEPDERTGYKNLEAIRDAL 379 >gi|313203439|ref|YP_004042096.1| hypothetical protein Palpr_0961 [Paludibacter propionicigenes WB4] gi|312442755|gb|ADQ79111.1| hypothetical protein Palpr_0961 [Paludibacter propionicigenes WB4] Length = 378 Score = 91.9 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 10/90 (11%), Positives = 18/90 (20%), Gaps = 8/90 (8%) Query: 36 AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95 W +P+ +FE L RI F Sbjct: 290 NIFPTLIPNWDHTPRSGYNGYLYTKSTPELFEK------HALQVFNMINSKPEDDRICFL 343 Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + ++ + + Sbjct: 344 KSWNEWGEGNYMEPDLKFGKKYIYALRSAL 373 >gi|146281782|ref|YP_001171935.1| hypothetical protein PST_1402 [Pseudomonas stutzeri A1501] gi|145569987|gb|ABP79093.1| conserved hypothetical protein [Pseudomonas stutzeri A1501] Length = 1615 Score = 91.6 bits (226), Expect = 2e-16, Method: Composition-based stats. 
Identities = 21/135 (15%), Positives = 35/135 (25%), Gaps = 15/135 (11%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFE 67 LG L E+ W S ++R + + Sbjct: 611 QFLGDYGKLADY--WSERPRPHY----KRFRCLVPSWDNSARRRKGRAGLFVNATPERYG 664 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 WL LA K + R+ F + E + L + + + Sbjct: 665 ---QWLEHTLA--KTCEEFAGDERLVFINAWNEWGEGCHLEPDVRHGRAYLEATRNALDK 719 Query: 126 VKELFEGWNDRPSSP 140 +K E RP +P Sbjct: 720 LKAATEI-PVRPYNP 733 >gi|256819540|ref|YP_003140819.1| hypothetical protein Coch_0700 [Capnocytophaga ochracea DSM 7271] gi|256581123|gb|ACU92258.1| conserved hypothetical protein [Capnocytophaga ochracea DSM 7271] Length = 366 Score = 91.6 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 14/124 (11%), Positives = 36/124 (29%), Gaps = 8/124 (6%) Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 ++ K+ + +++ + + W +P+ Sbjct: 249 KIYRKIFSVPDIVDYSKIYKSFITPLEAQENIFPTIIPNWDHTPRSGKGGTVFKNTNGEN 308 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F+ + + ++ +Y K RI F S E + +L + + E Sbjct: 309 FKQHVMEVLKIISQKEYDK------RIVFIKSWNEWGEGNYLEPDLKNGYLYLDILQELL 362 Query: 124 LYVK 127 + K Sbjct: 363 VSQK 366 >gi|220926122|ref|YP_002501424.1| group 1 glycosyl transferase [Methylobacterium nodulans ORS 2060] gi|219950729|gb|ACL61121.1| glycosyl transferase group 1 [Methylobacterium nodulans ORS 2060] Length = 787 Score = 91.2 bits (225), Expect = 2e-16, Method: Composition-based stats. 
Identities = 15/132 (11%), Positives = 40/132 (30%), Gaps = 10/132 (7%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 ++ +G +E+ + V G + +W + ++R + + Sbjct: 646 ENFVGYLEDYV---GVASSSINSPPTDYVRYRGCFPMWDNTARRRNAGHVFINEST---K 699 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 + +WLR + + + + F + E + +L + + + E Sbjct: 700 GYAYWLRFLVHEALVRRDQVEP--MVFINAWNEWAEGTYLEPDEHYGRAFLEVTREALAQ 757 Query: 126 VKELFEGWNDRP 137 F P Sbjct: 758 GIADFVVGVRNP 769 >gi|282879758|ref|ZP_06288488.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] gi|281306427|gb|EFA98457.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] Length = 381 Score = 91.2 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 10/106 (9%), Positives = 27/106 (25%), Gaps = 8/106 (7%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 + + + W +P+ + + F+ I + + Sbjct: 281 YEKITQHFFAPEDSWQNVYPSIFPQWDRTPRAGNSEGVYVNATPTTFKKHIQNALNVI-- 338 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K RI F + E + ++ + + + E Sbjct: 339 ----KNKDMEHRILFLRAWNEWGEGNYVEPDLKYGHGFLDAIKEAI 380 >gi|302880031|ref|YP_003848595.1| lipopolysaccharide biosynthesis protein-like protein [Gallionella capsiferriformans ES-2] gi|302582820|gb|ADL56831.1| lipopolysaccharide biosynthesis protein-like protein [Gallionella capsiferriformans ES-2] Length = 364 Score = 90.8 bits (224), Expect = 3e-16, Method: Composition-based stats. 
Identities = 10/89 (11%), Positives = 23/89 (25%), Gaps = 8/89 (8%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 Y W +P++ + ++FE+ + L ++ F S Sbjct: 282 YPCIYPNWDNTPRKGRKGLVLANSTPALFEAHLNDAVGALGERD------DEHKLVFVKS 335 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 E + L + + Sbjct: 336 WNEWAEGNHLEPDTKWGLQYLQALKRVIE 364 >gi|312130478|ref|YP_003997818.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM 17132] gi|311907024|gb|ADQ17465.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM 17132] Length = 380 Score = 90.8 bits (224), Expect = 4e-16, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 41/125 (32%), Gaps = 10/125 (8%) Query: 6 RLKSKLG--KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63 R+K+KLG + + +V ++ + + H W S +++ + +H Sbjct: 261 RIKNKLGWGQTYRKIDYAEVVQRMKSKPSFTQKHFKALVPGWDNSARRKNDAFIMHDATP 320 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 ++E WL + + F + E + L ++ + + + Sbjct: 321 ELYED---WLDHTCKTTT---IYSEEENFLFINAWNEWAEGNHLEPDKKWGRAFLETTKK 374 Query: 122 KFLYV 126 Sbjct: 375 ILSKY 379 >gi|298384772|ref|ZP_06994332.1| glycosyl transferase, group 2 family [Bacteroides sp. 1_1_14] gi|298263051|gb|EFI05915.1| glycosyl transferase, group 2 family [Bacteroides sp. 1_1_14] Length = 369 Score = 90.8 bits (224), Expect = 4e-16, Method: Composition-based stats. 
Identities = 13/127 (10%), Positives = 34/127 (26%), Gaps = 9/127 (7%) Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61 +K+ + + ++ D+ + + W SP+ + Sbjct: 250 HKLRKYFPSIAPLDKY-KYKDIIKNFYTDYDRLENSYPSIIPNWDRSPRGGRRAVIYTGS 308 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 +F+ R K + +I F S E + ++ + + + Sbjct: 309 TPELFK------RHIEDAIKIVENKKAEHKIIFLRSWNEWAEGNYVEPDIKFGHGYLDSL 362 Query: 120 SEKFLYV 126 L Sbjct: 363 RSVILEE 369 >gi|113476766|ref|YP_722827.1| hypothetical protein Tery_3239 [Trichodesmium erythraeum IMS101] gi|110167814|gb|ABG52354.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101] Length = 955 Score = 90.4 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 33/119 (27%), Gaps = 8/119 (6%) Query: 17 LLLRLDVEEKGNMQAIY-IPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75 + +Q W + +++ + E +E FWLR Sbjct: 644 FVYDYKQTAINTIQEKLPDYQVFLSVMTSWDNTARRQQNATVWLNSEPEDYE---FWLRG 700 Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 K K S I F + E + A+L ++ + + L + Sbjct: 701 TTE--KALKNYGDSENIVFINAWNEWAEGAYLEPDKKYGCAYLEATQRVLLGQHSIQTA 757 >gi|323139972|ref|ZP_08074990.1| glycosyl transferase family 2 [Methylocystis sp. ATCC 49242] gi|322394772|gb|EFX97355.1| glycosyl transferase family 2 [Methylocystis sp. ATCC 49242] Length = 984 Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats. 
Identities = 13/124 (10%), Positives = 43/124 (34%), Gaps = 7/124 (5%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75 + ++ + + V W +P+ S + F++++ W Sbjct: 866 EVHDYRELALAFMRRVEPGFPRIRSVLVGWDNTPRHPDNSLILEQSTPGAFQAWLEW--- 922 Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133 + + + ++ RI F + E + ++L +R ++ + + + + Sbjct: 923 --TYRRTIEQNYGDARIVFINAWNEWCEGSYLEPDRHFGHAYLQALRNAQESIASGSDSF 980 Query: 134 NDRP 137 ++P Sbjct: 981 VEKP 984 >gi|281424202|ref|ZP_06255115.1| glycosyltransferase [Prevotella oris F0302] gi|281401471|gb|EFB32302.1| glycosyltransferase [Prevotella oris F0302] Length = 361 Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 29/124 (23%), Gaps = 8/124 (6%) Query: 5 FRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELS 64 FR K G + LD + + H Y W S + + E Sbjct: 239 FRTLRKFGGVVFGNNYLDYCNFFIKKYTPMAKHFPCIYPNWDHSARSGKIATIFRNVEPE 298 Query: 65 IFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 I W + F S E + +L +R + + Sbjct: 299 I------WGDFCKRLFVKCSRQPTEENLIFIKSWNEWGEGNYLEPDRRYGRGYLEELKKA 352 Query: 123 FLYV 126 Sbjct: 353 LSSF 356 >gi|300728262|ref|ZP_07061630.1| conserved hypothetical protein [Prevotella bryantii B14] gi|299774497|gb|EFI71121.1| conserved hypothetical protein [Prevotella bryantii B14] Length = 371 Score = 90.0 bits (222), Expect = 5e-16, Method: Composition-based stats. 
Identities = 14/122 (11%), Positives = 30/122 (24%), Gaps = 11/122 (9%) Query: 7 LKSKLGKIEN-LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK--DVHFQEL 63 K I L +K + + + W SP+ T + E Sbjct: 250 WNQKFRGIPKGALDYRKKYKKFILPKDKEIGVIPEIFPNWDHSPRSGKTGASTIYYNSEP 309 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 F + + K S ++ S E + ++ + + + Sbjct: 310 EFFYKHVKEALDAI------KDKPESDQMLILKSWNEWGEGNYMEPDLRYGRGYIKALRK 363 Query: 122 KF 123 Sbjct: 364 AI 365 >gi|327312342|ref|YP_004327779.1| hypothetical protein HMPREF9137_0027 [Prevotella denticola F0289] gi|326944812|gb|AEA20697.1| conserved hypothetical protein [Prevotella denticola F0289] Length = 381 Score = 89.2 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 10/107 (9%), Positives = 22/107 (20%), Gaps = 8/107 (7%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 V + W +P+ F+ I + Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVI- 338 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K +I F S E + ++ + + + Sbjct: 339 -----KEKPKEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIKASV 380 >gi|295087225|emb|CBK68748.1| hypothetical protein [Bacteroides xylanisolvens XB1A] Length = 369 Score = 89.2 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 7/89 (7%), Positives = 23/89 (25%), Gaps = 10/89 (11%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 + W +P+ + + F + + + + ++ F Sbjct: 283 VIPCIVPNWDHTPRSGMKGSMFLNESPEFFRLHVEDALKTVQYKR--------NKLIFLK 334 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + ++ + + E Sbjct: 335 SWNEWGEGNYMEPDLTFGKGYINALHEAL 363 >gi|325853275|ref|ZP_08171333.1| hypothetical protein HMPREF9303_1037 [Prevotella denticola CRIS 18C-A] gi|325484364|gb|EGC87289.1| hypothetical protein HMPREF9303_1037 [Prevotella denticola CRIS 18C-A] Length = 381 Score = 89.2 bits (220), Expect = 1e-15, Method: Composition-based stats. 
Identities = 10/107 (9%), Positives = 22/107 (20%), Gaps = 8/107 (7%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 V + W +P+ F+ I + Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVI- 338 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K +I F S E + ++ + + + Sbjct: 339 -----KEKPKEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIKASV 380 >gi|255526750|ref|ZP_05393652.1| glycosyltransferase [Clostridium carboxidivorans P7] gi|296187044|ref|ZP_06855444.1| hypothetical protein CLCAR_2519 [Clostridium carboxidivorans P7] gi|255509585|gb|EET85923.1| glycosyltransferase [Clostridium carboxidivorans P7] gi|296048482|gb|EFG87916.1| hypothetical protein CLCAR_2519 [Clostridium carboxidivorans P7] Length = 374 Score = 88.9 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 37/125 (29%), Gaps = 12/125 (9%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAH---VSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 G ++R+ + P + G +V W + ++ F+ Sbjct: 257 GMRPGGVIRVSYDAIWKEILKRKPQDEKCIPGAFVDWDNTSRKGEKGSIYEGATPEKFQK 316 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 ++ A + ++ + + F + E + +L + + + L Sbjct: 317 YLT------AQIRRARDVYKKD-MLFIFAWNEWAECGYLEPDEKFGYGYLEAIKQALLDN 369 Query: 127 KELFE 131 E E Sbjct: 370 DEFSE 374 >gi|294775796|ref|ZP_06741298.1| conserved hypothetical protein [Bacteroides vulgatus PC510] gi|294450382|gb|EFG18880.1| conserved hypothetical protein [Bacteroides vulgatus PC510] Length = 364 Score = 88.5 bits (218), Expect = 2e-15, Method: Composition-based stats. 
Identities = 13/94 (13%), Positives = 26/94 (27%), Gaps = 9/94 (9%) Query: 38 VSGYYVLWSFSPKQRITS-KDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 W SP+++ +F+ WL+ L K + F Sbjct: 276 FPCVSPGWDNSPRRKKPPYMAFVGSTPELFKK---WLKDTL---VRFKPFSKEENLVFIN 329 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 + E + L ++ + E L + Sbjct: 330 AWNEWAEGNHLEPDQKWGRRYLEVTKEAILETSK 363 >gi|295084063|emb|CBK65586.1| hypothetical protein [Bacteroides xylanisolvens XB1A] Length = 367 Score = 88.5 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 26/95 (27%), Gaps = 8/95 (8%) Query: 31 AIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSC 90 Y W SP+ + ++FE I ++ + Sbjct: 277 YDYREDVYPSIIPNWDRSPRGGRRAVIYTDSTPALFEEHIKTALEIISKKQ------DEH 330 Query: 91 RIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 +I F S E + ++ + + + E Sbjct: 331 KILFLRSWNEWAEGNYVEPDLKFGHGYLDALKESI 365 >gi|325270047|ref|ZP_08136655.1| glycosyltransferase [Prevotella multiformis DSM 16608] gi|324987632|gb|EGC19607.1| glycosyltransferase [Prevotella multiformis DSM 16608] Length = 381 Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 9/104 (8%), Positives = 21/104 (20%), Gaps = 8/104 (7%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 V + W +P+ F+ I + Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVIN 339 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120 +I F S E + ++ + + + Sbjct: 340 ------DKPNEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIK 377 >gi|190572675|ref|YP_001970520.1| putative glycosyl transferase [Stenotrophomonas maltophilia K279a] gi|190010597|emb|CAQ44206.1| putative glycosyl transferase [Stenotrophomonas maltophilia K279a] Length = 436 Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats. 
Identities = 16/109 (14%), Positives = 38/109 (34%), Gaps = 7/109 (6%) Query: 17 LLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSF 76 L+ V + + G W + +++ TS + SI++ +WLR Sbjct: 245 LVDYRKVVAQSISRPKPDFRWYRGIVPSWDNTARRQHTSHTLVDASPSIYQ---YWLRRL 301 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + +++ + P +I F + E + L + + + Sbjct: 302 VEYTRV--NNAPEDQILFINAWNEWGEGCHLEPDLKHGLAYLEATHAAL 348 >gi|260593223|ref|ZP_05858681.1| glycosyltransferase [Prevotella veroralis F0319] gi|260534780|gb|EEX17397.1| glycosyltransferase [Prevotella veroralis F0319] Length = 381 Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 9/104 (8%), Positives = 21/104 (20%), Gaps = 8/104 (7%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLA 78 V + W +P+ F+ I + Sbjct: 280 DYAKVTKHFFSPEDRWDNVYPTIMPQWDRTPRAGKHEGIYINSTPENFKKHIEEALEVIN 339 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120 +I F S E + ++ + + + Sbjct: 340 ------DKPNEHKILFLRSWNEWGEGNYVEPDLKYGHKFLEAIK 377 >gi|256830319|ref|YP_003159047.1| lipopolysaccharide biosynthesis protein-like protein [Desulfomicrobium baculatum DSM 4028] gi|256579495|gb|ACU90631.1| lipopolysaccharide biosynthesis protein-like protein [Desulfomicrobium baculatum DSM 4028] Length = 364 Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 13/117 (11%), Positives = 30/117 (25%), Gaps = 8/117 (6%) Query: 8 KSKLGKIENLLLRLD-VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 + +LG+ ++ ++ + W +P+ + F Sbjct: 239 RQRLGRFPRWVIDYSSLDRYFKNHLCDGITTLPTAIPNWDNTPRIGRRGLVFANSSPARF 298 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 + S + K RI F S E + +L + + Sbjct: 299 ADHLRRSVSGFTAANDGK-----DRILFIKSWNEWAEGNYLEPDLVHDRGWLEAVRS 350 >gi|237714668|ref|ZP_04545149.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262406534|ref|ZP_06083083.1| conserved hypothetical protein [Bacteroides sp. 
2_1_22] gi|294645683|ref|ZP_06723370.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294806952|ref|ZP_06765775.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|229445437|gb|EEO51228.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262355237|gb|EEZ04328.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292638962|gb|EFF57293.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294445839|gb|EFG14483.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] Length = 368 Score = 88.1 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 13/107 (12%), Positives = 27/107 (25%), Gaps = 8/107 (7%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 D+ Y W SP+ + ++FE I + Sbjct: 266 YKDIISNFYTSYDYREDVYPSIIPNWDRSPRAGRRAVIYTGSTPALFEEHIKKALEVILQ 325 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + +I F S E + ++ + + + L Sbjct: 326 KQ------DQHKILFLRSWNEWAEGNYVEPDLKFGHGYLDVLKSSIL 366 >gi|168218133|ref|ZP_02643758.1| conserved hypothetical protein [Clostridium perfringens NCTC 8239] gi|182625720|ref|ZP_02953488.1| conserved hypothetical protein [Clostridium perfringens D str. JGS1721] gi|177908982|gb|EDT71464.1| conserved hypothetical protein [Clostridium perfringens D str. JGS1721] gi|182379836|gb|EDT77315.1| conserved hypothetical protein [Clostridium perfringens NCTC 8239] Length = 353 Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats. 
Identities = 17/119 (14%), Positives = 36/119 (30%), Gaps = 10/119 (8%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 K KLG ++ L N Y G +V W + +++ ++ S F Sbjct: 240 FKKKLGVLDKLNYDNLWNAVINKNEDYGKKKFLGAFVSWDNTARKKNKGLVLNEDSPSKF 299 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + K F + E + +L ++ + + +E Sbjct: 300 KKYFKKQYD--------KAIEIGSEYIFINAWNEWAEGTYLEPDKENEHGYIEALNEVL 350 >gi|55846838|gb|AAV67424.1| glycosyltransferase [Xanthomonas oryzae pv. oryzicola] Length = 464 Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 7/88 (7%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G W + +++ TS + SI++ +WL + +++ + P ++ F + Sbjct: 266 YRGIVPSWDNTARRQHTSHILLNSSPSIYQ---YWLGRLVDYTRV--NNAPEDQLIFINA 320 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + L + + + Sbjct: 321 WNEWGEGCHLEPDLKHGLAYLEATHAAV 348 >gi|166713475|ref|ZP_02244682.1| Tetratricopeptide TPR_2 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 374 Score = 87.7 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 7/88 (7%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G W + +++ TS + SI++ +WL + +++ + P ++ F + Sbjct: 203 YRGIVPSWDNTARRQHTSHILLNSSPSIYQ---YWLGRLVDYTRV--NNAPEDQLIFINA 257 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + L + + + Sbjct: 258 WNEWGEGCHLEPDLKHGLAYLEATHAAV 285 >gi|325300544|ref|YP_004260461.1| hypothetical protein Bacsa_3463 [Bacteroides salanitronis DSM 18170] gi|324320097|gb|ADY37988.1| hypothetical protein Bacsa_3463 [Bacteroides salanitronis DSM 18170] Length = 385 Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats. 
Identities = 10/105 (9%), Positives = 25/105 (23%), Gaps = 8/105 (7%) Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81 V + + W +P+ + + F+ + + Sbjct: 286 KVSKLLFAEEDKWNNVYPTLIPNWDRTPRNGKNAIVWYHNNPEFFKQEVEIALDVI---- 341 Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 K +I F S E + ++ + + E Sbjct: 342 --KDKPMEHKILFLMSWNEWGEGNYMEPDIEFGKGYIHALREAIE 384 >gi|251771739|gb|EES52314.1| Lipopolysaccharide biosynthesis protein-like protein [Leptospirillum ferrodiazotrophum] Length = 360 Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 7/89 (7%), Positives = 21/89 (23%), Gaps = 8/89 (8%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 + +P+ + +F + + S + + F Sbjct: 276 LHPCVINSFDNTPRSGVNGVVYKNATPDLFRNHLREAIS------SIENYPTERKFIFLK 329 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + L + + + + Sbjct: 330 SWNEWAEGNHLEPDLRYGHGWLKAIQDVL 358 >gi|237725325|ref|ZP_04555806.1| conserved hypothetical protein [Bacteroides sp. D4] gi|229436012|gb|EEO46089.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] Length = 383 Score = 87.3 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 40/127 (31%), Gaps = 10/127 (7%) Query: 1 MYKVFRLKSKLG-KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKD-V 58 +Y + R KLG +I D+ ++ + SP+ + Sbjct: 262 IYYIKRFLMKLGIRILVKCQYKDIISNYYVEQDRWENVYPTIIPNFDRSPRSGWKTNILW 321 Query: 59 HFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRM 116 + ++F+ I + L + +I F S E + ++ + ++ + Sbjct: 322 YGSTPTLFKKHIIQALNLL------EGRSAEHKILFLQSWNEWGEGNYVEPDLKFGHAYL 375 Query: 117 PFDSEKF 123 E Sbjct: 376 EVLREVI 382 >gi|68643200|emb|CAI33488.1| conserved hypothetical protein [Streptococcus pneumoniae] Length = 366 Score = 86.9 bits (214), Expect = 5e-15, Method: Composition-based stats. 
Identities = 12/90 (13%), Positives = 21/90 (23%), Gaps = 9/90 (10%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 G +V W +P++ S FE + A F Sbjct: 281 KNISPGAFVSWDNTPRRGNRSLVFDGANPKKFEKYF-------AKQVQRAKEEYHSDFIF 333 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 + E + A L + + Sbjct: 334 INAWNEWAEGAHLEPDEQYGYGYLEAVRAV 363 >gi|168214851|ref|ZP_02640476.1| conserved hypothetical protein [Clostridium perfringens CPE str. F4969] gi|170713695|gb|EDT25877.1| conserved hypothetical protein [Clostridium perfringens CPE str. F4969] Length = 353 Score = 86.9 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 36/119 (30%), Gaps = 10/119 (8%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 K KLG ++ L N Y G +V W + +++ ++ S F Sbjct: 240 FKKKLGVLDKLNYDNLWNAVINKNEDYGKKKFLGAFVSWDNTARKKNKGLVLNEDSPSKF 299 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + K F + E + +L ++ + + +E Sbjct: 300 KKYFKKQYD--------KAIEIGSEYIFINAWNEWAEGTYLEPDKENEHGYIKALNEVL 350 >gi|212694326|ref|ZP_03302454.1| hypothetical protein BACDOR_03852 [Bacteroides dorei DSM 17855] gi|212662827|gb|EEB23401.1| hypothetical protein BACDOR_03852 [Bacteroides dorei DSM 17855] Length = 370 Score = 86.5 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 8/111 (7%), Positives = 27/111 (24%), Gaps = 11/111 (9%) Query: 18 LLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLR 74 L++ D + + + +P+ + F+ + Sbjct: 261 LMKYDYNKVVRNYDTPENKLENCYPVITPGFDRTPRAGRRAGIYVNSSPKNFKKHVAE-- 318 Query: 75 SFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K + R+ F + E + ++ + + + Sbjct: 319 ----VCKSIQDKDDDHRLVFLSAWNEWGEGNYMEPDLKWGHGYLEALKSVV 365 >gi|265751844|ref|ZP_06087637.1| radical SAM domain-containing protein [Bacteroides sp. 3_1_33FAA] gi|263236636|gb|EEZ22106.1| radical SAM domain-containing protein [Bacteroides sp. 
3_1_33FAA] Length = 367 Score = 86.5 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 34/122 (27%), Gaps = 10/122 (8%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYI--PAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 S L ++R + I Y W SP+ ++ +H ++ Sbjct: 243 SYLFPFPINVIRYSKAIDKMVDDILFRKSKIYPIIYPNWDHSPRAGNSASIMHGSTPQLW 302 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + + S + +I F S E + +L + + ++ Sbjct: 303 GKLLEKVISLIH------DKDEGDQIIFIKSWNEWGEGNYLEPDLKYGRGYLDVMNKMLR 356 Query: 125 YV 126 Sbjct: 357 KE 358 >gi|322433407|ref|YP_004210624.1| lipopolysaccharide biosynthesis protein-like protein [Acidobacterium sp. MP5ACTX9] gi|321165796|gb|ADW71497.1| lipopolysaccharide biosynthesis protein-like protein [Acidobacterium sp. MP5ACTX9] Length = 381 Score = 86.5 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 9/113 (7%), Positives = 31/113 (27%), Gaps = 8/113 (7%) Query: 13 KIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFW 72 + + DV + + W +P+ + +F + + Sbjct: 269 RRPTRIRYKDVVARALEDMPQEERFLPCVLPGWDNTPRSSHRGVIFEGETPELFRTLLQ- 327 Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ ++ RI F + E + ++ + ++ + Sbjct: 328 -----KAVQHVSVNSVEQRIVFLKAWNEWAEGNYVEPDVLHGHAYLDVIRSVV 375 >gi|325105038|ref|YP_004274692.1| polysaccharide biosynthesis protein [Pedobacter saltans DSM 12145] gi|324973886|gb|ADY52870.1| polysaccharide biosynthesis protein [Pedobacter saltans DSM 12145] Length = 368 Score = 86.2 bits (212), Expect = 8e-15, Method: Composition-based stats. 
Identities = 9/117 (7%), Positives = 25/117 (21%), Gaps = 8/117 (6%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 K K ++ E + W + +++ + F Sbjct: 256 KKRVKQPTIIDYAKFTEFDSSLVNKPYKLYPCVSPGWDNTARKKENGIVFINSTPTNF-- 313 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 W + + + + F + E + L + + Sbjct: 314 -YNWTKKKIKKFQ---PYSKEENLLFINAWNEWAEGNHLEPCNKNGLGYLKALKKAL 366 >gi|167745516|ref|ZP_02417643.1| hypothetical protein ANACAC_00207 [Anaerostipes caccae DSM 14662] gi|167655237|gb|EDR99366.1| hypothetical protein ANACAC_00207 [Anaerostipes caccae DSM 14662] Length = 382 Score = 85.4 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 35/122 (28%), Gaps = 13/122 (10%) Query: 7 LKSKLGKIENLLLRLDVEEKGN---MQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63 L+ G I +L R + N M + + SP+ + + Sbjct: 264 LRKYFGGI--VLDRYKYDTIMNHFIMPEDFEESIYPQLIPKRDRSPRSGRKAMIYYGSTP 321 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 F+S + + R+ F + E + A++ + + + E Sbjct: 322 EKFKSAAENAIKCV------EGRDKEHRLIFLNAWNEWGEGAYMEPDLKFGHGYLEALKE 375 Query: 122 KF 123 Sbjct: 376 IL 377 >gi|225548129|ref|ZP_03769414.1| hypothetical protein RUMHYD_00108 [Blautia hydrogenotrophica DSM 10507] gi|225040805|gb|EEG51051.1| hypothetical protein RUMHYD_00108 [Blautia hydrogenotrophica DSM 10507] Length = 379 Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats. 
Identities = 13/120 (10%), Positives = 33/120 (27%), Gaps = 8/120 (6%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K G + + D+ + Y SP+ + + +F+ Sbjct: 266 KYFGGMVLDKYRYSDIIKHFITPEDYSERIYPQLIPRRDRSPRSGRKAMIYYDSTPELFK 325 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 + K + + R+ F + E + A++ + + + E Sbjct: 326 ------LAAENAVKCVEKRDKNHRLIFLNAWNEWGEGAYMEPDLRFGHKYIEALREVLTN 379 >gi|332180567|gb|AEE16255.1| hypothetical protein Trebr_0819 [Treponema brennaborense DSM 12168] Length = 366 Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 32/117 (27%), Gaps = 8/117 (6%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 K KI L+ ++ + + G W +P+ L +F+ Sbjct: 252 KFLKIPRLVNYKEIVKYAVSEKDKRNDFYPGIVCTWDHTPRSGRNGMVFINFSLKLFKE- 310 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + K +I F S E + F+ + ++ E Sbjct: 311 -----HICTVLELVKNKPEQEQIVFLKSWNEWGEGNFMEPDIEYGKGKVDTLKEAIH 362 >gi|237808791|ref|YP_002893231.1| polysaccharide biosynthesis protein [Tolumonas auensis DSM 9187] gi|237501052|gb|ACQ93645.1| polysaccharide biosynthesis protein [Tolumonas auensis DSM 9187] Length = 370 Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats. 
Identities = 12/124 (9%), Positives = 35/124 (28%), Gaps = 9/124 (7%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 LK+K+ + + V + W + +++ + + Sbjct: 254 LKTKVSAVNKVNYAALVSNMVKKSWPKTYRKFPCVFPSWDNTARRKTPTVIQNLDS---- 309 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + WL + + +I F + E + L +R + + + + Sbjct: 310 NVYARWLEYAVDSVSS---YPENEKIVFINAWNEWAEGCHLEPDRKVGRAFLEATKQVVE 366 Query: 125 YVKE 128 + Sbjct: 367 RPSK 370 >gi|23098585|ref|NP_692051.1| hypothetical protein OB1130 [Oceanobacillus iheyensis HTE831] gi|22776811|dbj|BAC13086.1| hypothetical protein [Oceanobacillus iheyensis HTE831] Length = 531 Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 33/114 (28%), Gaps = 9/114 (7%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71 GK + L E + G + W + + + + H + F+++ Sbjct: 421 GKAKYLDYDRIWESILSRNNKQHKKVFLGAFTDWDNTARMQSSGTIYHGATPAKFKNY-- 478 Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 L+ + F + E + A+L ++ + + Sbjct: 479 -----LSRQIDRANNVYDSEFLFINAWNEWAEGAYLEPDKKFKYGYLEAVRDAL 527 >gi|58038685|ref|YP_190649.1| hypothetical protein GOX0204 [Gluconobacter oxydans 621H] gi|58001099|gb|AAW59993.1| Hypothetical protein GOX0204 [Gluconobacter oxydans 621H] Length = 1260 Score = 85.0 bits (209), Expect = 2e-14, Method: Composition-based stats. 
Identities = 18/147 (12%), Positives = 48/147 (32%), Gaps = 17/147 (11%) Query: 10 KLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESF 69 G++ + +D K Q + W +++ +H ++E Sbjct: 489 FSGQVYDYGEVVD---KALAQPRTPFPLIRTAAPSWDNDARRQGKGLVLHGSTPELYE-- 543 Query: 70 IFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 WL + ++ +F + + E + A+L ++ ++ + + Sbjct: 544 -RWLSGLIEQAQSR--TFFGDPVVCINAWNEWAKGAYLEPDQHFGSAYLNATARACTGAG 600 Query: 128 E-------LFEGWNDRPSSPKKSGLTI 147 + L G + P+ ++ L I Sbjct: 601 KNRSRSGILLIGHDAFPAGAQRLLLEI 627 >gi|237712790|ref|ZP_04543271.1| conserved hypothetical protein [Bacteroides sp. D1] gi|237718379|ref|ZP_04548860.1| radical SAM [Bacteroides sp. 2_2_4] gi|262408851|ref|ZP_06085396.1| radical SAM domain-containing protein [Bacteroides sp. 2_1_22] gi|293370137|ref|ZP_06616700.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|294643855|ref|ZP_06721647.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294810735|ref|ZP_06769383.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|229447118|gb|EEO52909.1| conserved hypothetical protein [Bacteroides sp. D1] gi|229452312|gb|EEO58103.1| radical SAM [Bacteroides sp. 2_2_4] gi|262353062|gb|EEZ02157.1| radical SAM domain-containing protein [Bacteroides sp. 2_1_22] gi|292634789|gb|EFF53315.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|292640797|gb|EFF59023.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294442068|gb|EFG10887.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] Length = 360 Score = 84.6 bits (208), Expect = 2e-14, Method: Composition-based stats. 
Identities = 11/124 (8%), Positives = 29/124 (23%), Gaps = 8/124 (6%) Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 ++ KL + + + I + SP+ + + Sbjct: 243 KILRKLLRKPITIEYSQYSQYLLNNYIVNENVYPSICPNYDHSPRSKFRGTIIVNSTPQ- 301 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 W + + + F + E + +L + + + Sbjct: 302 -----KWKKLCHEMFSKVSVRSAEDNLVFIKAWNEWGEGNYLEPDLKYGTQFLDVIRDVL 356 Query: 124 LYVK 127 VK Sbjct: 357 EKVK 360 >gi|329944274|ref|ZP_08292533.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] gi|328531004|gb|EGF57860.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] Length = 699 Score = 84.2 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 63/209 (30%), Gaps = 33/209 (15%) Query: 163 WIEISHILLRLNFDFDLFVTVVEA--NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLL 220 +++ L L + + VT D E+ + ++ ++ +G FL Sbjct: 341 ADDLAERLASLPEHWRVVVTSPSELNAADLERVTGRRTTFRKVRDLDPRG--TIAFLTEC 398 Query: 221 ELG------------------------VFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWL 256 + DR D + I G + RR + Sbjct: 399 DDLWDPAHAGDVGASDGGDGTDTTDTAEVDRVDLVLTI--SAGPLSGSSERADDVARRQV 456 Query: 257 FFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF 316 LL +++ F ++P LG++ + + + + L++R G Sbjct: 457 LDCLLASPGYVAGLLDLFGRHPSLGVVMPAACHIGQPYV-GPQWDGLVGAADALSRRLGL 515 Query: 317 PTKRLH--LDFFNGTMFWVKPKCLEPLRN 343 G+MF +P+ L L Sbjct: 516 TAALDEIAPVAPVGSMFLARPEALRTLSE 544 >gi|313890159|ref|ZP_07823794.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN 20026] gi|313121520|gb|EFR44624.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN 20026] Length = 359 Score = 83.8 bits (206), Expect = 4e-14, Method: Composition-based stats. 
Identities = 11/120 (9%), Positives = 30/120 (25%), Gaps = 8/120 (6%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 +K K+ + + + + + + W SP+ + + + F Sbjct: 242 IKRKVFRRPTVFKYKEAIKYMIDDSAKDENVIPVVAPNWDHSPRSGNNAMILDNAKPKYF 301 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + K + S + S E + L + + + Sbjct: 302 ADLLKE------TVKTVRSKPRSKQQVIIKSWNEWGEGNHLEPDLKYGLGYLEAVKKSIE 355 >gi|317476949|ref|ZP_07936191.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides eggerthii 1_2_48FAA] gi|316906742|gb|EFV28454.1| lipopolysaccharide biosynthesis protein-like protein [Bacteroides eggerthii 1_2_48FAA] Length = 360 Score = 83.8 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 29/114 (25%), Gaps = 9/114 (7%) Query: 3 KVF-RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQ 61 K F +L S + I + V + W +P+ ++ Sbjct: 250 KCFDKLYSIVTGIPRIANYKSVSSHFIGKEEMEDNIYPTIIPNWDHTPRSGFNGYVLNNS 309 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSN 113 +F + + + I F S E + ++ + Sbjct: 310 TPELFRFHVRKALATTLQKR------ADNMIVFLKSWNEWGEGNYMEPDLKYGK 357 >gi|326403402|ref|YP_004283483.1| putative glycosyltransferase [Acidiphilium multivorum AIU301] gi|325050263|dbj|BAJ80601.1| putative glycosyltransferase [Acidiphilium multivorum AIU301] Length = 1247 Score = 83.1 bits (204), Expect = 7e-14, Method: Composition-based stats. 
Identities = 11/130 (8%), Positives = 41/130 (31%), Gaps = 13/130 (10%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 + + + ++++ + Y + W P++ +H + + Sbjct: 482 FSADVYRYDDIV----AASLADPDPAY--PLIRTAVPGWDNDPRREGAGVVLHEATPAAY 535 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + WL + + ++ + + I + E + A+L + + + + Sbjct: 536 Q---AWLAALIERARRAPV--HGEPIVCINAWNEWAEGAYLEPDLHFGAAFLNATARAIT 590 Query: 125 YVKELFEGWN 134 + + N Sbjct: 591 GRADAADAQN 600 >gi|148259629|ref|YP_001233756.1| glycosyl transferase, group 1 [Acidiphilium cryptum JF-5] gi|146401310|gb|ABQ29837.1| glycosyl transferase, group 1 [Acidiphilium cryptum JF-5] Length = 1247 Score = 83.1 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 11/130 (8%), Positives = 41/130 (31%), Gaps = 13/130 (10%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 + + + ++++ + Y + W P++ +H + + Sbjct: 482 FSADVYRYDDIV----AASLADPDPAY--PLIRTAVPGWDNDPRREGAGVVLHEATPAAY 535 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + WL + + ++ + + I + E + A+L + + + + Sbjct: 536 Q---AWLAALIERARRAPV--HGEPIVCINAWNEWAEGAYLEPDLHFGAAFLNATARAIT 590 Query: 125 YVKELFEGWN 134 + + N Sbjct: 591 GRADAADAQN 600 >gi|237727673|ref|ZP_04558154.1| polysaccharide biosynthesis protein [Bacteroides sp. D4] gi|229434529|gb|EEO44606.1| polysaccharide biosynthesis protein [Bacteroides dorei 5_1_36/D4] Length = 363 Score = 83.1 bits (204), Expect = 8e-14, Method: Composition-based stats. Identities = 11/95 (11%), Positives = 27/95 (28%), Gaps = 9/95 (9%) Query: 38 VSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 W SP+++ +++ WL+ L + + F Sbjct: 275 FPCVSPGWDNSPRRKKPPYTAFIGSTPCLYKK---WLKDTL---IRFQPFSEEENLVFIN 328 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129 + E + L ++ + E K++ Sbjct: 329 AWNEWAEGNHLEPDQKWGRKYLEVTKEAIDETKDI 363 >gi|306831232|ref|ZP_07464393.1| glycosyltransferase [Streptococcus gallolyticus subsp. 
gallolyticus TX20005] gi|304426798|gb|EFM29909.1| glycosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] Length = 381 Score = 83.1 bits (204), Expect = 8e-14, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 29/107 (27%), Gaps = 8/107 (7%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 D+ N + + SP+ + + F + S + Sbjct: 278 YKDIIRSFNTKEDFQENIYPQLIPGRDRSPRSGKKAVIYYENTPEEFRIAVKNAISCV-- 335 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + P RI F S E + A++ + + E+ Sbjct: 336 ----EKRNPEHRIIFLNSWNEWAEGAYMEPDTTYGKRYIQVLREELE 378 >gi|300728504|ref|ZP_07061863.1| conserved hypothetical protein [Prevotella bryantii B14] gi|299774222|gb|EFI70855.1| conserved hypothetical protein [Prevotella bryantii B14] Length = 369 Score = 82.7 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 8/94 (8%), Positives = 22/94 (23%), Gaps = 10/94 (10%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 + W +P+ + + F + + +I F Sbjct: 284 VIPQLLPQWDHTPRSGWNGTLLINCKPEYFYEHSKEALNIV--------KNKQNKIIFLK 335 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 S E + + + + + +E Sbjct: 336 SWNEWGEGNMMEPDLTYGRGFINALRKAVDEYEE 369 >gi|68643231|emb|CAI33513.1| conserved hypothetical protein [Streptococcus pneumoniae] Length = 381 Score = 82.7 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 31/112 (27%), Gaps = 13/112 (11%) Query: 17 LLLRLDVEEKGNMQAIY---IPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73 LL R D + ++G +V W + + + FE ++ L Sbjct: 274 LLDRRDYDATWTNIINRPIKDNKMIAGAFVDWDNTAR-NKNGRVFDGANPEKFEGYMRQL 332 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + I F + E + A+L ++ + Sbjct: 333 IEKI-------QKEYQSEIVFINAWNEWAEGAYLEPDKKHGYGYLEALKTVI 377 >gi|320531345|ref|ZP_08032317.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] gi|320136436|gb|EFW28412.1| rhamnan synthesis protein F [Actinomyces sp. 
oral taxon 171 str. F0337] Length = 678 Score = 82.7 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 39/287 (13%), Positives = 70/287 (24%), Gaps = 52/287 (18%) Query: 102 KAFLRLNRFMSNSRMPFDSEKFLYVKE-------------LFEGWNDRPSSPKKSGLTIK 148 L S S+ + P+ Sbjct: 243 GELLEDAARAGYSEDLILSDVVHNAPARDLIVNAGLTEVVVEAAPAPDEPDPEAGSTAPT 302 Query: 149 SKIAIVVHCYYQD--------TWIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYF 198 +VVH ++ L L + + VT D E+ + Sbjct: 303 PSGCVVVHV--PAGGEGVERAEADGLAQRLASLPAHWRVVVTSPTHLDAADLERLTGRRP 360 Query: 199 PSA------------QLYVMENKGRDVRPFLYLLELGVFDRY--------DYLCKIHGKK 238 + ++ +G PFL D + +I Sbjct: 361 ADEAAAPGGAAVAFRAVRDLDPRG--TIPFLTECGDLWDPGRATGSDGGGDLVLRI-TVG 417 Query: 239 SQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFA 298 S + + RR + LL +I+ FE++P LG+ + Sbjct: 418 SPSGPESKAD-DVARRQVLDCLLASPGYTAGLIDLFERHPGLGVAMPAASHIGQAH-GGP 475 Query: 299 KRSEVYRRVIDLAKRAGF--PTKRLHLDFFNGTMFWVKPKCLEPLRN 343 + L++R G + G MF +P+ L L Sbjct: 476 TWDGLAGAAKTLSRRLGLTVELDPVAPVVPVGAMFMARPEALRTLSE 522 >gi|24637409|gb|AAN63687.1|AF454495_12 Eps4K [Streptococcus thermophilus] Length = 384 Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 22/88 (25%), Gaps = 9/88 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 + G +V W + + + F+ ++ L S F + Sbjct: 295 IPGAFVEWDNTSRHGDRGRVYDGATPQKFQKYMSALI-------KKTKSEYHKDYIFINA 347 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + A L + + Sbjct: 348 WNEWAEGAHLEPDEKNKYGYLEALKNAL 375 >gi|288803643|ref|ZP_06409073.1| glycosyl transferase, group 2 family [Prevotella melaninogenica D18] gi|288333883|gb|EFC72328.1| glycosyl transferase, group 2 family [Prevotella melaninogenica D18] Length = 369 Score = 82.3 bits (202), Expect = 1e-13, Method: Composition-based stats. 
Identities = 10/95 (10%), Positives = 24/95 (25%), Gaps = 9/95 (9%) Query: 37 HVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95 + W SP+ + + F L + K +I Sbjct: 281 IIPQIVPQWDHSPRSEHAADLIYYNSTPESF------YLHCLDAFEVLKDKSEDEQILIL 334 Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 S E + ++ + + + + V + Sbjct: 335 KSWNEWGEGNYMEPDISNGDGYIKALRKALNKVSK 369 >gi|256392765|ref|YP_003114329.1| lipopolysaccharide biosynthesis protein-like protein [Catenulispora acidiphila DSM 44928] gi|256358991|gb|ACU72488.1| lipopolysaccharide biosynthesis protein-like protein [Catenulispora acidiphila DSM 44928] Length = 357 Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 12/94 (12%), Positives = 29/94 (30%), Gaps = 9/94 (9%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 + +P+ +H + IFE + + + + + P R+ F S Sbjct: 271 HPCVVPGFDNTPRSGRRGVLLHHPDPEIFE-------AAVTEAVRREQAMPDPRMLFIKS 323 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKEL 129 E + + + ++ S + L Sbjct: 324 WNEWAEGSVMEPDQHFGRSFLRALRRGLDVRPPL 357 >gi|330996598|ref|ZP_08320478.1| hypothetical protein HMPREF9442_01565 [Paraprevotella xylaniphila YIT 11841] gi|329572832|gb|EGG54459.1| hypothetical protein HMPREF9442_01565 [Paraprevotella xylaniphila YIT 11841] Length = 386 Score = 81.9 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 11/112 (9%), Positives = 31/112 (27%), Gaps = 12/112 (10%) Query: 16 NLLLRLDVEEKGNM---QAIYIPAHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIF 71 + +L +D + + + + SP+ + H +F + Sbjct: 278 DYVLHIDYAKIIRNYYVENDKMENIYPTIIPNFDRSPRSGKKTNNIWHGSTPKLFGKMVE 337 Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 + K +I F S E + ++ + + + + Sbjct: 338 QALDLI------KDKQDEHKILFLQSWNEWGEGNYMEPDLKFGHGYIDILGK 383 >gi|228937557|ref|ZP_04100197.1| Glycosyltransferase [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970444|ref|ZP_04131097.1| Glycosyltransferase [Bacillus thuringiensis serovar thuringiensis str. 
T01001] gi|228789273|gb|EEM37199.1| Glycosyltransferase [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228822111|gb|EEM68099.1| Glycosyltransferase [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326938048|gb|AEA13944.1| glycosyltransferase [Bacillus thuringiensis serovar chinensis CT-43] Length = 120 Score = 81.5 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 23 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 75 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 76 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 107 >gi|291520445|emb|CBK75666.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 109 Score = 81.1 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 5/94 (5%) Query: 148 KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQD--VLKYF--PSAQL 203 +++ A+ + ++ D + E L D++V K + + K + ++ Sbjct: 13 QNRYAVFAYLFFDDLFEESLRYFSNLPNYVDIYVATNTEEKVDVINGYIPKMLFRHNVKV 72 Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGK 237 + NKGRDV L LL+ YD +C +H K Sbjct: 73 LLHNNKGRDVSALLVLLKRYY-SNYDVICFVHDK 105 >gi|319784640|ref|YP_004144116.1| hypothetical protein Mesci_4961 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170528|gb|ADV14066.1| hypothetical protein Mesci_4961 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 936 Score = 81.1 bits (199), Expect = 3e-13, Method: Composition-based stats. 
Identities = 13/120 (10%), Positives = 42/120 (35%), Gaps = 10/120 (8%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 + W + + + S V ++ +E WLR + ++ ++ ++ F Sbjct: 262 IYRTVFPDWDNTARVKNRSLIVLGSTVANYE---RWLRGSSSLTRANRA--EGDQLVFIN 316 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIV 154 + E + +L +R + + + +P++ ++ ++A + Sbjct: 317 AWNEWAEGCYLEPDRRHGRGFLEAT---LRVKNGMSMVDDIYDVAPERVRFELRQQLAAI 373 >gi|42779379|ref|NP_976626.1| hypothetical protein BCE_0298 [Bacillus cereus ATCC 10987] gi|42735295|gb|AAS39234.1| conserved domain protein [Bacillus cereus ATCC 10987] Length = 358 Score = 81.1 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 L+ S + F + E + +L ++ + + + Sbjct: 297 ------LSKQIQRTYSLYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345 >gi|229194650|ref|ZP_04321445.1| Glycosyltransferase [Bacillus cereus m1293] gi|228588820|gb|EEK46843.1| Glycosyltransferase [Bacillus cereus m1293] Length = 358 Score = 80.8 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 L+ S + F + E + +L ++ + + + Sbjct: 297 ------LSKQIQRTYSVYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345 >gi|313199878|ref|YP_004038536.1| polysaccharide biosynthesis protein [Methylovorus sp. MP688] gi|312439194|gb|ADQ83300.1| polysaccharide biosynthesis protein [Methylovorus sp. 
MP688] Length = 379 Score = 80.8 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 27/88 (30%), Gaps = 8/88 (9%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 W S ++R + + + +FE WLR+ S RI F + Sbjct: 293 FPCVVPSWDKSARRRAGATVIQNHDPKLFE---LWLRNA---SSRVSKYPKDERIIFINA 346 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + L + + + F Sbjct: 347 WNEWAEGCHLEPDLRHGHQFLEAVRNVF 374 >gi|114571025|ref|YP_757705.1| glycosyl transferase family protein [Maricaulis maris MCS10] gi|114341487|gb|ABI66767.1| glycosyl transferase, family 2 [Maricaulis maris MCS10] Length = 882 Score = 80.8 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 14/117 (11%), Positives = 32/117 (27%), Gaps = 10/117 (8%) Query: 8 KSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K GK+ ++ E A H + W S ++ F+ Sbjct: 768 KDFYGKLYSV--DGAYEALVRRGAPAW-RHFHSAFTGWDNSARRGDRGDIFLGDCPGKFQ 824 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 + + + K L + F + E + +L + ++ + Sbjct: 825 ALLE-----VQMRKAKALGAAGEKAIFINAWNEWAEGTYLEPDLHHGHAWLEAVRNA 876 >gi|312131802|ref|YP_003999142.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM 17132] gi|311908348|gb|ADQ18789.1| polysaccharide biosynthesis protein [Leadbetterella byssophila DSM 17132] Length = 361 Score = 80.8 bits (198), Expect = 4e-13, Method: Composition-based stats. 
Identities = 15/121 (12%), Positives = 35/121 (28%), Gaps = 9/121 (7%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 L+ + + +E+ + I V + + ++ + + Q + F Sbjct: 245 LQGVINPTLKIYDYKQYKERAKIHKIKYKG-FPCPIVGFDNTARKGKNAVILKNQNVEDF 303 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 ++ S + + K +I F S E + L + E F Sbjct: 304 KA------SLIDAVEDVKEFPEEEQIVFINSWNEWAEGNHLEPCVKFGRQFLEAVKEVFS 357 Query: 125 Y 125 Sbjct: 358 K 358 >gi|30018522|ref|NP_830153.1| glycosyltransferase [Bacillus cereus ATCC 14579] gi|29894062|gb|AAP07354.1| Glycosyltransferase [Bacillus cereus ATCC 14579] Length = 358 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345 >gi|325265289|ref|ZP_08132014.1| glycosyl transferase, group 2 family [Clostridium sp. D5] gi|324029468|gb|EGB90758.1| glycosyl transferase, group 2 family [Clostridium sp. D5] Length = 369 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. 
Identities = 17/126 (13%), Positives = 38/126 (30%), Gaps = 14/126 (11%) Query: 4 VFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP---AHVSGYYVLWSFSPKQRITSKDVHF 60 V +LK K K+ ++ D ++ P + G +V W +P+ + + Sbjct: 244 VNKLKIKQTKLSTIIF--DYDKAWKNILDMKPRDDKMIPGAFVDWDNTPRYKKLASVFRG 301 Query: 61 QELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPF 118 F+ + L+ + I F + E + +L + + Sbjct: 302 VTPEKFKYY-------LSRQIQNAKRVYRKDIIFMFAWNEWGEGGYLEPDEKNGYKMLDA 354 Query: 119 DSEKFL 124 Sbjct: 355 IKSALE 360 >gi|296501094|ref|YP_003662794.1| glycosyltransferase [Bacillus thuringiensis BMB171] gi|296322146|gb|ADH05074.1| glycosyltransferase [Bacillus thuringiensis BMB171] Length = 358 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345 >gi|229042160|ref|ZP_04189916.1| Glycosyltransferase [Bacillus cereus AH676] gi|228727172|gb|EEL78373.1| Glycosyltransferase [Bacillus cereus AH676] Length = 358 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345 >gi|47570410|ref|ZP_00241048.1| glycosyltransferase [Bacillus cereus G9241] gi|47552914|gb|EAL11327.1| glycosyltransferase [Bacillus cereus G9241] Length = 182 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. 
Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 62 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 120 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 L+ Y S + F + E + +L ++ + + + + Sbjct: 121 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 174 Query: 129 LFEG 132 ++ Sbjct: 175 AYKK 178 >gi|218231858|ref|YP_002365107.1| hypothetical protein BCB4264_A0319 [Bacillus cereus B4264] gi|229148661|ref|ZP_04276913.1| Glycosyltransferase [Bacillus cereus m1550] gi|218159815|gb|ACK59807.1| conserved hypothetical protein [Bacillus cereus B4264] gi|228634798|gb|EEK91375.1| Glycosyltransferase [Bacillus cereus m1550] Length = 358 Score = 80.4 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345 >gi|206972729|ref|ZP_03233664.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|206732341|gb|EDZ49528.1| conserved hypothetical protein [Bacillus cereus AH1134] Length = 358 Score = 80.0 bits (196), Expect = 6e-13, Method: Composition-based stats. 
Identities = 11/92 (11%), Positives = 25/92 (27%), Gaps = 10/92 (10%) Query: 35 PAHVSGYYVLWSFSPKQRI-TSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIF 93 G +V W + +++ S F ++ SF + Sbjct: 261 KKTFPGAFVDWDNTARRKNANSSIFVGSTPEKFTIYLSKQIH-------RTYSFYNSEFL 313 Query: 94 FYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + +L ++ S + Sbjct: 314 FINAWNEWAEGTYLEPDKKYGFSYLEGVKNAI 345 >gi|324324277|gb|ADY19537.1| hypothetical protein YBT020_01425 [Bacillus thuringiensis serovar finitimus YBT-020] Length = 358 Score = 80.0 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 10/98 (10%), Positives = 29/98 (29%), Gaps = 10/98 (10%) Query: 29 MQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSF 87 ++ G +V W + +++ + S F + L+ S Sbjct: 255 KRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY-------LSKQIQRTYSL 307 Query: 88 PSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + F + E + +L ++ + + + Sbjct: 308 YNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345 >gi|253581532|ref|ZP_04858757.1| methyltransferase type 11 [Fusobacterium varium ATCC 27725] gi|251836602|gb|EES65137.1| methyltransferase type 11 [Fusobacterium varium ATCC 27725] Length = 356 Score = 79.6 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 9/117 (7%), Positives = 27/117 (23%), Gaps = 15/117 (12%) Query: 17 LLLRLDVEEKGNM-----QAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71 + + EE G + W + + + +F+ ++ Sbjct: 247 FVQKYKYEEFLKKSIDISNEFLNKKIYPGIFTGWDNTSRHGRRGYVIERNTPKLFKKYLL 306 Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 + + + F + E + +L + + E Sbjct: 307 EEKKIM--------KEKNIDYIFLNAWNEWAEGMYLEPDEKFKYGYLEAIKEVMETE 355 >gi|212694719|ref|ZP_03302847.1| hypothetical protein BACDOR_04251 [Bacteroides dorei DSM 17855] gi|237727302|ref|ZP_04557783.1| conserved hypothetical protein [Bacteroides sp. 
D4] gi|212662698|gb|EEB23272.1| hypothetical protein BACDOR_04251 [Bacteroides dorei DSM 17855] gi|229434158|gb|EEO44235.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] Length = 352 Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 16/118 (13%), Positives = 31/118 (26%), Gaps = 11/118 (9%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIF 66 K KLG + D Q + W SP+ S + ++F Sbjct: 237 FKHKLGALHTY-KYEDALRYFVSQEDKAENIIPTIISGWDHSPRAGENSLILTNYTPALF 295 Query: 67 ESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 + + + L +I F + E + L + + + Sbjct: 296 QKHLENVFDIL--------VQKENKICFIKAWNEWGEGNHLEPDLKYGLDFLKTLKQV 345 >gi|148927813|ref|ZP_01811238.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] gi|147886839|gb|EDK72384.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] Length = 468 Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats. 
Identities = 12/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 G W + +++ T + F S++ +LR++ ++ F Sbjct: 307 YTLYRGIIPSWDNTARRQDTGTIIVNATPEFFGSWLKFLRAYTRETRPGASDP----FIF 362 Query: 95 YGSRKE--QKAFLRLNRFMSNSRM-PFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSK 150 + E + L + + ++ ++L R ++ ++ Sbjct: 363 VNAWNEWGEGCHLEPDVQWGLGYLDEVARSSYISSEDLLPVDQARAAAFRRIEQIAARD 421 >gi|206978430|ref|ZP_03239298.1| conserved hypothetical protein [Bacillus cereus H3081.97] gi|217957833|ref|YP_002336377.1| hypothetical protein BCAH187_A0334 [Bacillus cereus AH187] gi|222094032|ref|YP_002528086.1| glycosyltransferase [Bacillus cereus Q1] gi|206743362|gb|EDZ54801.1| conserved hypothetical protein [Bacillus cereus H3081.97] gi|217068322|gb|ACJ82572.1| conserved hypothetical protein [Bacillus cereus AH187] gi|221238084|gb|ACM10794.1| glycosyltransferase [Bacillus cereus Q1] Length = 358 Score = 79.2 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 10/98 (10%), Positives = 29/98 (29%), Gaps = 10/98 (10%) Query: 29 MQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSF 87 ++ G +V W + +++ + S F + L+ S Sbjct: 255 KRSPSEKKTFPGAFVDWDNTARRKDLNSSIFVGSTPEKFTIY-------LSKQIQRTYSL 307 Query: 88 PSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + F + E + +L ++ + + + Sbjct: 308 YNSEFLFINAWNEWAEGTYLEPDKKHGFAYLEGVKQAI 345 >gi|254724735|ref|ZP_05186518.1| hypothetical protein BantA1_20079 [Bacillus anthracis str. A1055] Length = 358 Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats. 
Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 L+ Y S + F + E + +L ++ + + + + Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 350 Query: 129 LFEG 132 ++ Sbjct: 351 AYKK 354 >gi|218901466|ref|YP_002449300.1| hypothetical protein BCAH820_0304 [Bacillus cereus AH820] gi|228925519|ref|ZP_04088610.1| Glycosyltransferase [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|218535510|gb|ACK87908.1| conserved hypothetical protein [Bacillus cereus AH820] gi|228834134|gb|EEM79680.1| Glycosyltransferase [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 358 Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 13/124 (10%), Positives = 35/124 (28%), Gaps = 10/124 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F ++ Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIYL 297 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 S + F + E + +L ++ + + + + Sbjct: 298 SKQIH-------RTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAIKRGMK 350 Query: 129 LFEG 132 ++ Sbjct: 351 AYKK 354 >gi|196036928|ref|ZP_03104311.1| conserved hypothetical protein [Bacillus cereus W] gi|228944071|ref|ZP_04106452.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|195990465|gb|EDX54450.1| conserved hypothetical protein [Bacillus cereus W] gi|228815598|gb|EEM61838.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 358 Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats. 
Identities = 13/124 (10%), Positives = 35/124 (28%), Gaps = 10/124 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F ++ Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIYL 297 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 S + F + E + +L ++ + + + + Sbjct: 298 SKQIH-------RTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAIKRGMK 350 Query: 129 LFEG 132 ++ Sbjct: 351 AYKK 354 >gi|163938265|ref|YP_001643149.1| hypothetical protein BcerKBAB4_0253 [Bacillus weihenstephanensis KBAB4] gi|163860462|gb|ABY41521.1| conserved hypothetical protein [Bacillus weihenstephanensis KBAB4] Length = 358 Score = 78.8 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 33/115 (28%), Gaps = 10/115 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPRKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 L+ S + F + E + +L ++ + + + Sbjct: 297 ------LSKQIQRTYSLYNSEFLFINAWNEWAEGTYLEPDKRHGFAYLEGVKQAI 345 >gi|257468312|ref|ZP_05632408.1| hypothetical protein FulcA4_03172 [Fusobacterium ulcerans ATCC 49185] gi|317062590|ref|ZP_07927075.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313688266|gb|EFS25101.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 355 Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats. 
Identities = 7/91 (7%), Positives = 23/91 (25%), Gaps = 10/91 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G + W + + + +F+ ++ + + + F + Sbjct: 272 FPGVFTGWDNTSRHGRRGYVIKGNTPKLFKEYLLEQKKIM--------KEKNIEYIFLNA 323 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 E + +L + + E Sbjct: 324 WNEWAEGMYLEPDEKFEYGYLEAVKEIMETE 354 >gi|229154032|ref|ZP_04282159.1| Glycosyltransferase [Bacillus cereus ATCC 4342] gi|228629429|gb|EEK86129.1| Glycosyltransferase [Bacillus cereus ATCC 4342] Length = 358 Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 10/124 (8%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPPEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 L+ Y S + F + E + +L ++ + + + + Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNAWNEWAEGTYLEPDKKHGFAYLEGVKQAISRGMK 350 Query: 129 LFEG 132 ++ Sbjct: 351 AYKK 354 >gi|75758487|ref|ZP_00738608.1| Hypothetical protein RBTH_07389 [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74494014|gb|EAO57109.1| Hypothetical protein RBTH_07389 [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 353 Score = 78.5 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82 + N Q G +V W SP+++ ++ + F+ ++ Sbjct: 260 WKRILNRQIKECENIYKGAFVDWDNSPRKKESALIMKGANPDKFKKYLL----------- 308 Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 F + E + +L + + E Sbjct: 309 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 347 >gi|260172490|ref|ZP_05758902.1| polysaccharide biosynthesis protein [Bacteroides sp. D2] gi|315920784|ref|ZP_07917024.1| polysaccharide biosynthesis protein [Bacteroides sp. 
D2] gi|313694659|gb|EFS31494.1| polysaccharide biosynthesis protein [Bacteroides sp. D2] Length = 367 Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 10/96 (10%), Positives = 25/96 (26%), Gaps = 8/96 (8%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 G +W + +++ + E + WL S + F Sbjct: 275 YKMYPGVTPMWDNTSRRKQKMFILDKSTP---EKYGEWLYSVMNKFV---PYSKDENFVF 328 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 + E + L + + + ++E Sbjct: 329 VNAWNEWAEGNHLEPDLKWGFRYLEETEKVVKSMQE 364 >gi|228904942|ref|ZP_04068994.1| Glycosyltransferase [Bacillus thuringiensis IBL 4222] gi|228854684|gb|EEM99290.1| Glycosyltransferase [Bacillus thuringiensis IBL 4222] Length = 340 Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82 + N Q G +V W SP+++ ++ + F+ ++ Sbjct: 247 WKRILNRQIKECENIYKGAFVDWDNSPRKKESALIMKGANPDKFKKYLL----------- 295 Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 F + E + +L + + E Sbjct: 296 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 334 >gi|187732137|ref|YP_001879843.1| WbwX [Shigella boydii CDC 3083-94] gi|187429129|gb|ACD08403.1| WbwX [Shigella boydii CDC 3083-94] Length = 361 Score = 78.1 bits (191), Expect = 2e-12, Method: Composition-based stats. 
Identities = 13/92 (14%), Positives = 30/92 (32%), Gaps = 10/92 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G +V W S +++ + +H F ++ L Y + +C F + Sbjct: 277 YPGAFVDWDNSARKKSRALVIHGGSPKKFGLYLDKL--------YKRSIENNCPFLFINA 328 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 E + +L + S + + + Sbjct: 329 WNEWAEGTYLEPDEKNKYSYLEELKKVIEKYE 360 >gi|213962348|ref|ZP_03390611.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] gi|213955014|gb|EEB66333.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] Length = 368 Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 36/120 (30%), Gaps = 10/120 (8%) Query: 4 VFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQEL 63 + RLK K+ K + + E + V W +P+ + SK Sbjct: 254 IGRLKFKMEKSQKVDYVAFGEALLTLAQQTQDKTYQSIIVDWDNTPRYKNRSKFFVNATP 313 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 + FE F+ L A F + E + A+L + + + Sbjct: 314 ANFEHFLKELSLIEAAK--------GNEFVFINAWNEWSEGAYLEPDTTYEYQYLDVVKK 365 >gi|62955962|gb|AAY23338.1| WbwX [Shigella boydii] Length = 327 Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 30/92 (32%), Gaps = 10/92 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G +V W S +++ + +H F ++ L Y + +C F + Sbjct: 243 YPGAFVDWDNSARKKSRALVIHGGSPKKFGLYLDKL--------YKRSIENNCPFLFINA 294 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKFLYVK 127 E + +L + S + + + Sbjct: 295 WNEWAEGTYLEPDEKNKYSYLEELKKVIEKYE 326 >gi|298480506|ref|ZP_06998703.1| glycosyl transferase, group 2 family [Bacteroides sp. D22] gi|298273327|gb|EFI14891.1| glycosyl transferase, group 2 family [Bacteroides sp. D22] Length = 365 Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats. 
Identities = 9/91 (9%), Positives = 22/91 (24%), Gaps = 8/91 (8%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 G +W + +++ + E + WL S + F Sbjct: 275 YKMYPGVTPMWDNTSRRKQKMFILDKSTP---EKYGEWLYSVMNKFV---PYSKDENFVF 328 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + E + L + + + Sbjct: 329 VNAWNEWAEGNHLEPDLKWGLRYLEETKKVV 359 >gi|313203616|ref|YP_004042273.1| polysaccharide biosynthesis protein [Paludibacter propionicigenes WB4] gi|312442932|gb|ADQ79288.1| polysaccharide biosynthesis protein [Paludibacter propionicigenes WB4] Length = 383 Score = 77.7 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 7/94 (7%), Positives = 26/94 (27%), Gaps = 8/94 (8%) Query: 32 IYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCR 91 Y + W + ++ ++ +F+ ++ + F S + Sbjct: 252 NYNYPVFRCVFPSWDNTARKNSKGTIFINNDIDVFKYYLQRIVEFTQQSTNK------EK 305 Query: 92 IFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 F + E + + + + + + Sbjct: 306 YIFINAWNEWGEGCHIEPDCRTNFKYLEVIKQTL 339 >gi|39996608|ref|NP_952559.1| hypothetical protein GSU1508 [Geobacter sulfurreducens PCA] gi|39983489|gb|AAR34882.1| conserved hypothetical protein [Geobacter sulfurreducens PCA] Length = 381 Score = 76.9 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 12/112 (10%), Positives = 26/112 (23%), Gaps = 9/112 (8%) Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWL 73 + + V+ G W S ++R T+ IF+ ++ Sbjct: 273 DVYVYSHLVDNDLKYDFQQGWPIFPGVCPGWDNSARRRDTTAIIFDKSTPEIFKLWVREK 332 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ R F + E + L + Sbjct: 333 IRITDWNLL------PERFLFVNAWNEWAEGNHLEPCEKWGTQYLAALQAGI 378 >gi|298505623|gb|ADI84346.1| conserved hypothetical protein [Geobacter sulfurreducens KN400] Length = 372 Score = 76.5 bits (187), Expect = 6e-12, Method: Composition-based stats. 
Identities = 12/112 (10%), Positives = 26/112 (23%), Gaps = 9/112 (8%) Query: 15 ENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFIFWL 73 + + V+ G W S ++R T+ IF+ ++ Sbjct: 264 DVYVYSHLVDNDLKYDFQQGWPIFPGVCPGWDNSARRRDTTAIIFDKSTPEIFKLWVREK 323 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ R F + E + L + Sbjct: 324 IRITDWNLL------PERFLFVNAWNEWAEGNHLEPCEKWGTQYLAALQAGI 369 >gi|228912320|ref|ZP_04076015.1| Glycosyltransferase [Bacillus thuringiensis IBL 200] gi|228847303|gb|EEM92262.1| Glycosyltransferase [Bacillus thuringiensis IBL 200] Length = 340 Score = 76.5 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 28/101 (27%), Gaps = 15/101 (14%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKY 82 + N Q G +V W SP+++ ++ + F+ ++ Sbjct: 247 WKRILNRQIKERENIYKGAFVDWDNSPRKKESALIMEGASPDKFKKYLL----------- 295 Query: 83 SKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 F + E + +L + + E Sbjct: 296 --QHSKDTDFLFINAWNEWAEGTYLEPDEKYGYKYLEALME 334 >gi|212694325|ref|ZP_03302453.1| hypothetical protein BACDOR_03851 [Bacteroides dorei DSM 17855] gi|212662826|gb|EEB23400.1| hypothetical protein BACDOR_03851 [Bacteroides dorei DSM 17855] Length = 359 Score = 75.8 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 8/94 (8%), Positives = 21/94 (22%), Gaps = 9/94 (9%) Query: 38 VSGYYVLWSFSPKQRITS-KDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 + + ++ ++ WL S K F Sbjct: 271 FPCVTPNFDNASRRMHKGFTAFIGSTPQLYGK---WLSSVFEKF---KPYSQEENFIFIN 324 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 + E + L ++ + + K+ Sbjct: 325 AWNEWAEGNHLEPDQKWGRKYLEETKKNIDQYKK 358 >gi|227890975|ref|ZP_04008780.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] gi|227867384|gb|EEJ74805.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] Length = 370 Score = 75.0 bits (183), Expect = 2e-11, Method: Composition-based stats. 
Identities = 12/113 (10%), Positives = 32/113 (28%), Gaps = 12/113 (10%) Query: 16 NLLLRLDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFW 72 N + ++ + P G +V W +P+++ FE ++ Sbjct: 255 NTIRHYKYDDIWKIILKQQPKGDDWYPGAFVDWDNTPRRKNKGSFCDGTSPEKFEYYLTQ 314 Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K ++ + + F + E + +L + + Sbjct: 315 ------QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVRNAL 360 >gi|114328198|ref|YP_745355.1| glycosyltransferase [Granulibacter bethesdensis CGDNIH1] gi|114316372|gb|ABI62432.1| glycosyltransferase [Granulibacter bethesdensis CGDNIH1] Length = 946 Score = 74.6 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 13/132 (9%), Positives = 36/132 (27%), Gaps = 7/132 (5%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRS 75 ++ + + + W +P+ + + + + WL Sbjct: 696 RIVDYHKFASYHMGRPMPEYRRHRTVMLPWDNTPRYGSRAMVHVNTSNNAYRT---WLTQ 752 Query: 76 FLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGW 133 + + P RI F S E + ++ + + V+++ Sbjct: 753 AMLDTHRR--HVPEERIVFLHSWNEWCEGTYVEPDGRYGRHYLNETRAAVQDVRDILSLA 810 Query: 134 NDRPSSPKKSGL 145 + S + L Sbjct: 811 SSGESVNALAKL 822 >gi|228946140|ref|ZP_04108475.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228813553|gb|EEM59839.1| Glycosyltransferase [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 340 Score = 74.6 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 8/86 (9%), Positives = 24/86 (27%), Gaps = 15/86 (17%) Query: 36 AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95 G ++ W SP+++ ++ + F+ ++ F Sbjct: 260 NIYKGAFIDWDNSPRKKESALILKGANPDKFKKYLL-------------QHSKDTDFLFI 306 Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFD 119 + E + +L + + Sbjct: 307 NAWNEWAEGTYLEPDSKYGYKYLEAL 332 >gi|295085474|emb|CBK66997.1| hypothetical protein [Bacteroides xylanisolvens XB1A] Length = 389 Score = 73.4 bits (179), Expect = 5e-11, Method: Composition-based stats. 
Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%) Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + W +P+ + + F +F+ + ++ ++ Sbjct: 297 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 350 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + ++L + + Sbjct: 351 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVI 381 >gi|237717351|ref|ZP_04547832.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262406116|ref|ZP_06082666.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|229443334|gb|EEO49125.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262356991|gb|EEZ06081.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] Length = 401 Score = 73.1 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%) Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + W +P+ + + F +F+ + ++ ++ Sbjct: 309 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 362 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + ++L + + Sbjct: 363 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVM 393 >gi|218257974|ref|ZP_03474434.1| hypothetical protein PRABACTJOHN_00087 [Parabacteroides johnsonii DSM 18315] gi|218225847|gb|EEC98497.1| hypothetical protein PRABACTJOHN_00087 [Parabacteroides johnsonii DSM 18315] Length = 404 Score = 73.1 bits (178), Expect = 7e-11, Method: Composition-based stats. 
Identities = 10/108 (9%), Positives = 28/108 (25%), Gaps = 11/108 (10%) Query: 21 LDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFL 77 E + + + W +P+ ++ Q F SF+ + + Sbjct: 293 HTWEYVQKWDEAVMIPYFPNASIGWDDTPRFPHKTRKDVVHLNQSPQSFSSFLQKAKEYC 352 Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ + E + A+L + + + Sbjct: 353 DKH------PDQPKLITVYAWNEWVEGAYLLPDMKYGFDYLNAVKDVM 394 >gi|294647019|ref|ZP_06724633.1| putative lipoprotein [Bacteroides ovatus SD CC 2a] gi|294807810|ref|ZP_06766599.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b] gi|292637628|gb|EFF56032.1| putative lipoprotein [Bacteroides ovatus SD CC 2a] gi|294444986|gb|EFG13664.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b] Length = 407 Score = 73.1 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 7/91 (7%), Positives = 24/91 (26%), Gaps = 11/91 (12%) Query: 38 VSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + W +P+ + + F +F+ + ++ ++ Sbjct: 315 FPNASIGWDDTPRFPNKTAKEVVHYNDSPESFAAFLQKTKEYVD------QRPDRPKLIT 368 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 S E + ++L + + Sbjct: 369 INSWNEWVEGSYLLPDMKHGYGYLNAVKRVM 399 >gi|260172434|ref|ZP_05758846.1| hypothetical protein BacD2_11264 [Bacteroides sp. D2] gi|315920729|ref|ZP_07916969.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313694604|gb|EFS31439.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 403 Score = 72.7 bits (177), Expect = 1e-10, Method: Composition-based stats. 
Identities = 11/110 (10%), Positives = 30/110 (27%), Gaps = 11/110 (10%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSF 76 R +E + + W +P+ +K + F +++ + + Sbjct: 292 RESMERMEKWVEALSVPYFPNASIGWDDTPRFPHKTKKDVVHYNNSPQSFATYLQKAKEY 351 Query: 77 LAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFL 124 + ++ S E + +L + + E L Sbjct: 352 VDAR------PDLPKLITVFSWNEWIEGGYLLPDMKYGFGYLEAVKEVML 395 >gi|295103156|emb|CBL00700.1| hypothetical protein [Faecalibacterium prausnitzii SL3/3] Length = 372 Score = 72.7 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 12/87 (13%), Positives = 26/87 (29%), Gaps = 10/87 (11%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G + W SP++ + F+++ L Y K + + Sbjct: 286 FLGCFCDWDNSPRKSYNCNVMMGVTAEKFKNYFRKL--------YIKAQTIGSPMIVINA 337 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEK 122 E + A+L + + + E Sbjct: 338 WNEWAEGAYLEPDEKNGYAFLEAIKEA 364 >gi|269839527|ref|YP_003324219.1| hypothetical protein Tter_2508 [Thermobaculum terrenum ATCC BAA-798] gi|269791257|gb|ACZ43397.1| hypothetical protein Tter_2508 [Thermobaculum terrenum ATCC BAA-798] Length = 381 Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats. 
Identities = 17/122 (13%), Positives = 37/122 (30%), Gaps = 23/122 (18%) Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK----------QRITSKDVHFQEL 63 + ++ R+ ++ G Y P G +P+ + ++ V + Sbjct: 269 VREVVERVWPKQAGLSALPYWPCVSPGC----DDTPRHLLPRDLEHPRSWRTRPVVGETP 324 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 +FE F+ FL R+ GS E + +L + + + Sbjct: 325 EVFEGFVRAGVEFL-------QGRGGPRVLLIGSWNEWTEGHYLLPDTRLGFGMLRALQR 377 Query: 122 KF 123 Sbjct: 378 AL 379 >gi|302873795|ref|YP_003842428.1| hypothetical protein Clocel_0894 [Clostridium cellulovorans 743B] gi|307689965|ref|ZP_07632411.1| hypothetical protein Ccel74_17519 [Clostridium cellulovorans 743B] gi|302576652|gb|ADL50664.1| hypothetical protein Clocel_0894 [Clostridium cellulovorans 743B] Length = 367 Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 10/114 (8%), Positives = 33/114 (28%), Gaps = 10/114 (8%) Query: 11 LGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFI 70 +G +E+ + E + + G + W +P++ + +F+ ++ Sbjct: 255 IGILESSFSYKNCWENIINRTPKQDNTILGGFTDWDNTPRRSYDGMIMKGTTPELFQYYM 314 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEK 122 + + + E + A+L + + + Sbjct: 315 E--------KQMERCKEYKSPFVVINAWNEWAEGAYLEPDEKYGYAFLNAIKNC 360 >gi|218257975|ref|ZP_03474435.1| hypothetical protein PRABACTJOHN_00088 [Parabacteroides johnsonii DSM 18315] gi|218225848|gb|EEC98498.1| hypothetical protein PRABACTJOHN_00088 [Parabacteroides johnsonii DSM 18315] Length = 414 Score = 72.3 bits (176), Expect = 1e-10, Method: Composition-based stats. 
Identities = 9/106 (8%), Positives = 27/106 (25%), Gaps = 11/106 (10%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK---DVHFQELSIFESFIFWLRSFLAF 79 +E + W +P+ ++ Q F +F+ + + Sbjct: 305 LERLQKWDEAVSIPFFPNASIGWDDTPRFPHKTQKDVVHLNQSPQSFAAFLQKAKEYCDK 364 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ + E + A+L + + + Sbjct: 365 H------PDQPKLITVYAWNEWVEGAYLLPDMKYGFGYLDALKDVM 404 >gi|255015690|ref|ZP_05287816.1| hypothetical protein B2_17433 [Bacteroides sp. 2_1_7] Length = 400 Score = 71.9 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 13/113 (11%), Positives = 31/113 (27%), Gaps = 11/113 (9%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITS--KDVH-FQELSIFESFIFWLRSFLAF 79 E + + W +P+ + VH Q F +F+ + + Sbjct: 293 FERLEKWSEAVSIPYFPNASIGWDDTPRFPHKTQKDVVHFNQSPEAFAAFLQKAKEYCDR 352 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKELF 130 ++ + E + A+L + + + F+ K Sbjct: 353 H------PEQPKLITVYAWNEWVEGAYLLPDVKYGFGYLNAVKDVFVNGKYQA 399 >gi|90961958|ref|YP_535874.1| glycosyltransferase [Lactobacillus salivarius UCC118] gi|90821152|gb|ABD99791.1| Glycosyltransferase [Lactobacillus salivarius UCC118] gi|300214668|gb|ADJ79084.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713] Length = 371 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. 
Identities = 12/106 (11%), Positives = 29/106 (27%), Gaps = 9/106 (8%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 D+ + Q G +V W +P+++ FE ++ Sbjct: 262 YDDIWKIILKQQPKGKNWYPGAFVDWDNTPRRKHQGSFCDGTSPEKFEYYLT------KQ 315 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K + + + F + E + +L + + Sbjct: 316 IKRVRDVYHKDYL-FMFAWNEWGESGYLEPDVKNGYKMLEGVRNAL 360 >gi|301301020|ref|ZP_07207181.1| conserved hypothetical protein [Lactobacillus salivarius ACS-116-V-Col5a] gi|300851377|gb|EFK79100.1| conserved hypothetical protein [Lactobacillus salivarius ACS-116-V-Col5a] Length = 371 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 9/88 (10%), Positives = 26/88 (29%), Gaps = 9/88 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G + W +P+++ FE ++ K ++ + + F + Sbjct: 280 YPGAFADWDNTPRRKNKGVFCDGTSPEKFEYYLTQ------QIKRARDIYYKDYL-FMFA 332 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + +L + + + Sbjct: 333 WNEWGESGYLEPDTKNGYKMLEAVRKAL 360 >gi|300214669|gb|ADJ79085.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713] Length = 371 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 11/108 (10%), Positives = 30/108 (27%), Gaps = 12/108 (11%) Query: 21 LDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77 ++ + P G +V W +P+++ FE ++ Sbjct: 260 YSYDDIWKIILKQKPKGKDWYPGSFVDWDNTPRRKNRGSFCDGTSPEKFEYYLTQ----- 314 Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K ++ + + F + E + +L + + Sbjct: 315 -QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVKNAL 360 >gi|90961959|ref|YP_535875.1| glycosyltransferase [Lactobacillus salivarius UCC118] gi|90821153|gb|ABD99792.1| Glycosyltransferase [Lactobacillus salivarius UCC118] Length = 371 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. 
Identities = 11/108 (10%), Positives = 30/108 (27%), Gaps = 12/108 (11%) Query: 21 LDVEEKGNMQAIYIPA---HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77 ++ + P G +V W +P+++ FE ++ Sbjct: 260 YSYDDIWKIILKQKPKGKDWYPGSFVDWDNTPRRKNRGSFCDGTSPEKFEYYLTQ----- 314 Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 K ++ + + F + E + +L + + Sbjct: 315 -QIKRARNVYHKDYL-FMFAWNEWGESGYLEPDTKNGYKMLEAVKNAL 360 >gi|324991549|gb|EGC23482.1| rhamnosyltransferase [Streptococcus sanguinis SK353] Length = 556 Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 74/241 (30%), Gaps = 29/241 (12%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ + VT E K + + Q+ + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTTDQPEVLKQLQTALGHLGNKVQIVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K R L + + Y Y+ + S G + R L ++ D Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 A I E+ +G++ R + + R+ + + AG + Sbjct: 396 ADASIEALEKESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAVWQEAGLHKSFDFIITP 453 Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEE-ERNLKDGALEHAVERFFACSVRYTEFSI 380 G+ W K L L + + E+ L D +E + + Sbjct: 454 SLTRVYGSFVWFKYSALASLFQMKSLESLPSFEQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 381 E 381 + Sbjct: 509 K 509 >gi|269839540|ref|YP_003324232.1| hypothetical protein Tter_2521 [Thermobaculum terrenum ATCC BAA-798] gi|269791270|gb|ACZ43410.1| hypothetical protein Tter_2521 [Thermobaculum terrenum ATCC BAA-798] Length = 381 Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats. 
Identities = 16/122 (13%), Positives = 37/122 (30%), Gaps = 23/122 (18%) Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK----------QRITSKDVHFQEL 63 + ++ R+ ++ G Y P G +P+ + ++ V + Sbjct: 269 VREVVERVWPKQAGLSALPYWPCVSPGC----DDTPRHLLPRDLEHPRSWRTRPVVGETP 324 Query: 64 SIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 +FE F+ FL ++ GS E + +L + + + Sbjct: 325 EVFEGFVRAGVEFL-------QGRGGPKVLLIGSWNEWTEGHYLLPDTRLGFGMLRALQR 377 Query: 122 KF 123 Sbjct: 378 AL 379 >gi|323694861|ref|ZP_08109014.1| hypothetical protein HMPREF9475_03878 [Clostridium symbiosum WAL-14673] gi|323501087|gb|EGB16996.1| hypothetical protein HMPREF9475_03878 [Clostridium symbiosum WAL-14673] Length = 374 Score = 71.1 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 28/86 (32%), Gaps = 10/86 (11%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G +V W SP++ + + F ++ L K + + Sbjct: 284 FLGAFVAWDNSPRKSYNATVITGATPEKFGEYMCKLM--------KKAQELHSPVIVINA 335 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSE 121 E + AFL ++ + + S+ Sbjct: 336 WNEWAEGAFLEPDKEYGTAYLEQISK 361 >gi|307340772|gb|ADN43835.1| WegG [Escherichia coli] Length = 357 Score = 70.7 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 10/108 (9%), Positives = 26/108 (24%), Gaps = 8/108 (7%) Query: 20 RLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAF 79 + + N + W + + + F ++ ++ L+ Sbjct: 254 YSKLSKGFNTFVENSNRVIPVIIPRWDSTVRHGKNGWVLTGSTPKEFAKHVYDVKKILSK 313 Query: 80 SKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 RI S E + F+ + + +F Sbjct: 314 RDIK------YRIAIVKSWNEWAEGNFIEPDNIYGKRYLEILKSEFTN 355 >gi|227891408|ref|ZP_04009213.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] gi|227866797|gb|EEJ74218.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] Length = 357 Score = 70.7 bits (172), Expect = 4e-10, Method: Composition-based stats. 
Identities = 17/121 (14%), Positives = 33/121 (27%), Gaps = 10/121 (8%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPA-HVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 KL + N + Y +SG + W S ++ S V + + F+ Sbjct: 242 KKLKMTDYQSFDKIWSYILNRKRTYDSKTIISGAFSGWDNSARKGKESMIVKGKTVPKFK 301 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 + + S S + E + A+L + + E Sbjct: 302 KYFEKFYT-------SDRENISEEFCVINAWNEWSEGAYLEPDDKDGFGYLEAIKEVVDK 354 Query: 126 V 126 Sbjct: 355 Y 355 >gi|91201537|emb|CAJ74597.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 369 Score = 70.4 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 30/98 (30%), Gaps = 19/98 (19%) Query: 36 AHVSGYYVLWSFSPKQRITSK----------DVHFQELSIFESFIFWLRSFLAFSKYSKL 85 A+ W SP+ + V + +F F LR + ++ + Sbjct: 273 AYYPSVSPGWDASPRGELHGNQKPFCYPWWPIVVNEHPELFSGF---LRKAIHYTMRNNT 329 Query: 86 SFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 + + F S E + +L + + + + Sbjct: 330 TP----LCFIASWNEWSEGHYLEPDARFGTAWLEAVRQ 363 >gi|227890976|ref|ZP_04008781.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] gi|227867385|gb|EEJ74806.1| glycosyltransferase [Lactobacillus salivarius ATCC 11741] Length = 371 Score = 70.0 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 26/88 (29%), Gaps = 9/88 (10%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 G + W +P+++ FE ++ K ++ + + F + Sbjct: 280 YPGAFADWDNTPRRKNKGVFCDGTSPEKFEYYLTQ------QIKRARNVYHKNYL-FMFA 332 Query: 98 RKE--QKAFLRLNRFMSNSRMPFDSEKF 123 E + +L + S + Sbjct: 333 WNEWGESGYLEPDTKNSYKMLEAVRNAL 360 >gi|291520448|emb|CBK75669.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 625 Score = 70.0 bits (170), Expect = 7e-10, Method: Composition-based stats. 
Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 1/56 (1%) Query: 329 TMFWVKPKCLEPLRNLHL-IGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 + FW + + L+ L F +E + HA+ER F + ++ Sbjct: 4 SCFWCRTEALKKLLEYDFSYNFFPKEPMDANLTTSHAIERIFPYVACDAGYYTSTI 59 Score = 43.0 bits (100), Expect = 0.086, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 55/146 (37%), Gaps = 15/146 (10%) Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVY-RRVIDLAK 312 ++ F +L+ + I F++N +G++G+ + + Y +++ K Sbjct: 421 QYTFDELIKNNGYISAICEVFKENQSVGVVGNIYGEIIFQINSNMNIYSKYEDEILEFEK 480 Query: 313 RAGFPTKRLH----LDFFNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERF 368 R F R L++ FW++ L+ + + I + + L D + Sbjct: 481 RFNFDFNRGGKHSLLNYNG---FWLRRDALQMIADCEDI--YISAKKLCD---AEWI--V 530 Query: 369 FACSVRYTEFSIESVDCVAEYERLLH 394 +R F + +V C E + + Sbjct: 531 LPELLRDKGFLLATVFCKREMNKAFY 556 >gi|326772087|ref|ZP_08231372.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505] gi|326638220|gb|EGE39121.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505] Length = 681 Score = 68.4 bits (166), Expect = 2e-09, Method: Composition-based stats. 
Identities = 33/210 (15%), Positives = 55/210 (26%), Gaps = 32/210 (15%) Query: 163 WIEISHILLRLNFDFDLFVTVVE--ANKDFEQDVLKYFPSAQLYVMENKG---------R 211 ++ L L + + VT E D E+ + G R Sbjct: 323 ADGLAQRLASLPAHWRVVVTSPERLDAADLERVTGRRPSQEDTQEDSAHGEGDVSFRLVR 382 Query: 212 DVRP-----FLYLL-----------ELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRW 255 D+ P FL D + +I + + R Sbjct: 383 DLDPRGTIAFLTQCDDLWDPGRAAGGDEGGDSGPLVLRI-TVGPPPVPGTRAD-DVAHRQ 440 Query: 256 LFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAG 315 LL +I+ F ++P LG+ + + L++R G Sbjct: 441 ALDCLLDSPGYTAGLIDLFARHPGLGVAMPAAGHIGQAH-GGPTWDGLAGAAKALSRRLG 499 Query: 316 F--PTKRLHLDFFNGTMFWVKPKCLEPLRN 343 L G MF +P+ L L Sbjct: 500 LSAELDPLAPVAPPGAMFMARPEALRTLSE 529 >gi|320198724|gb|EFW73324.1| Hypothetical protein ECoL_04149 [Escherichia coli EC4100B] Length = 355 Score = 66.9 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 32/120 (26%), Gaps = 11/120 (9%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYI-PAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 +K+K ++ N Y + W +P+ + + Sbjct: 240 IKNKRATYNQYKYSDYIQSMKNDVTEYKGKPVYPVVFPDWDNAPRYKENATFFCESSAYG 299 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 FE + ++ F + E + A+L + S + + F Sbjct: 300 FEKALNIACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMHKYSSLEIIKKVF 351 >gi|168481345|gb|ACA24831.1| WbsX [Escherichia coli] Length = 378 Score = 66.9 bits (162), Expect = 5e-09, Method: Composition-based stats. 
Identities = 14/120 (11%), Positives = 32/120 (26%), Gaps = 11/120 (9%) Query: 7 LKSKLGKIENLLLRLDVEEKGNMQAIYI-PAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 +K+K ++ N Y + W +P+ + + Sbjct: 263 IKNKRATYNQYKYSDYIQSMKNDVTEYKGKPVYPVVFPDWDNAPRYKENATFFCESSAYG 322 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 FE + ++ F + E + A+L + S + + F Sbjct: 323 FEKALNIACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMHKYSSLEIIKKVF 374 >gi|46451858|gb|AAS98033.1| WbsX [Shigella boydii] Length = 378 Score = 66.5 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 12/114 (10%), Positives = 27/114 (23%), Gaps = 12/114 (10%) Query: 14 IENLLLRLDVEEKGNMQAIYIPA--HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIF 71 N D + + W +P+ + + FE + Sbjct: 269 TYNHYKYSDYIQSMKNDVTEYKGKPIYPVVFPDWDNAPRYKENATFFCESSAFDFEKALN 328 Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ F + E + A+L + S + + F Sbjct: 329 IACDI--------TRNHDDKLIFINAWNEWSEGAYLEPDEMYKYSNLEIIKKVF 374 >gi|332180195|gb|AEE15883.1| hypothetical protein Trebr_0439 [Treponema brennaborense DSM 12168] Length = 376 Score = 65.4 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 29/112 (25%), Gaps = 12/112 (10%) Query: 16 NLLLRLDVEEKGNMQAIYIP--AHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73 N + ++ P + W +P+ R + + L Sbjct: 270 NKIWDYLLKNACVNDYPMFPNLKIFESAFWGWDNTPRYRNRATIFSELTRFEKRKYFSDL 329 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 Y K+S F+ + E + A+L + + E Sbjct: 330 --------YKKVSNSDSEFIFFNAWNEWSEGAYLEPDDKYGFENLEIIYEVL 373 >gi|325685344|gb|EGD27453.1| group 2 glycosyl transferase [Lactobacillus delbrueckii subsp. lactis DSM 20072] Length = 359 Score = 64.2 bits (155), Expect = 3e-08, Method: Composition-based stats. 
Identities = 9/94 (9%), Positives = 26/94 (27%), Gaps = 9/94 (9%) Query: 37 HVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYG 96 G W + ++ V + F+ + + S + Sbjct: 272 IFKGCTSGWDNTARKGKQGMVVKGKTPKKFKKYFNQFLT-------KPRQDASDEFYVIN 324 Query: 97 SRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 + E + A+L + ++ + E ++ Sbjct: 325 AWNEWSEGAYLEPDEKDGDTYLEIIKEAVEKEEK 358 >gi|324993910|gb|EGC25829.1| rhamnosyltransferase [Streptococcus sanguinis SK405] gi|324994771|gb|EGC26684.1| rhamnosyltransferase [Streptococcus sanguinis SK678] Length = 556 Score = 63.8 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 72/244 (29%), Gaps = 35/244 (14%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ ++ +T E K + + QL + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K R L + + Y Y+ + S G + R L ++ D Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325 A I EQ +G++ R + E + L DF Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450 Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377 G W K L L + + E+ L D +E V + Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIVWDSH 505 Query: 378 FSIE 381 + + Sbjct: 506 YDFK 509 >gi|319788852|ref|YP_004090167.1| glycosyltransferase [Ruminococcus albus 7] gi|315450719|gb|ADU24281.1| glycosyltransferase [Ruminococcus albus 7] Length = 360 Score = 63.4 bits (153), Expect = 6e-08, Method: Composition-based stats. 
Identities = 15/117 (12%), Positives = 32/117 (27%), Gaps = 10/117 (8%) Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73 + N V + + P H G + W SP+ + F+ Sbjct: 250 VTNYFNYDSVCDLIEKRIDNDPNHYLGLFAEWDNSPRHSHNCTIFKNFSIPRFKQ----- 304 Query: 74 RSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 L +S+ K + E + A+L + ++ + + Sbjct: 305 ---LVYSQIKKSVSVGKGFLIIDAWNEWGEGAYLEPDNISGFEKLNTIRDVLSGFMQ 358 >gi|327463172|gb|EGF09493.1| rhamnosyltransferase [Streptococcus sanguinis SK1] gi|327474781|gb|EGF20186.1| rhamnosyltransferase [Streptococcus sanguinis SK408] Length = 556 Score = 62.7 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 71/244 (29%), Gaps = 35/244 (14%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ ++ +T E K + + QL + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K R L + + Y Y+ + S G + R L ++ D Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325 A I EQ +G++ R + E + L DF Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450 Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377 G W K L L + + E+ L D +E + Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIAWDSH 505 Query: 378 FSIE 381 + + Sbjct: 506 YDFK 509 >gi|327489888|gb|EGF21677.1| rhamnosyltransferase [Streptococcus sanguinis SK1058] Length = 556 Score = 62.7 bits (151), Expect = 1e-07, Method: Composition-based stats. 
Identities = 38/244 (15%), Positives = 71/244 (29%), Gaps = 35/244 (14%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ ++ +T E K + + QL + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K R L + + Y Y+ + S G + R L ++ D Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF- 325 A I EQ +G++ R + E + L DF Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRD-----GLFESEPPLPSLTAVWQEAVLHKSFDFM 450 Query: 326 -------FNGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTE 377 G W K L L + + E+ L D +E + Sbjct: 451 TAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIAWDSH 505 Query: 378 FSIE 381 + + Sbjct: 506 YDFK 509 >gi|325694904|gb|EGD36809.1| rhamnosyltransferase [Streptococcus sanguinis SK150] Length = 556 Score = 62.3 bits (150), Expect = 1e-07, Method: Composition-based stats. 
Identities = 40/289 (13%), Positives = 81/289 (28%), Gaps = 26/289 (8%) Query: 104 FLRLNRFMSNSRMPFD-SEKFLYV--KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160 +L ++S E Y +L D+ S S + + +H Sbjct: 236 YLLEELETNSSYPTSLIREHLFYHFGPDLPCLLQDKYLSQSTSSYRTNQSVLLHIHVTNF 295 Query: 161 DTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218 + + L L + VT E K + + Q+ + + K + L Sbjct: 296 PIFQQYQEKLFSLASQYQYLVTTNLPEMLKQLQTALAHLDDKVQIVLSQ-KSHALLAMLE 354 Query: 219 LLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNP 278 + + Y Y+ + + + R L ++ D A I EQ Sbjct: 355 --QKEILQNYVYIGHLSTHR--IMENQAVFDQAMRSDLINMMV---DYADASIEALEQES 407 Query: 279 CLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-----NGTMFWV 333 +G++ R + + + + + AG + G W Sbjct: 408 AVGLVIPDLPRLVRD--GLFESEPPLPSLTAVWQEAGLHKSFDFMTAPSLTRVYGGFLWF 465 Query: 334 KPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381 K L L + + E+ L D +E + + + Sbjct: 466 KYSALTSLFQMKSLESLPSSEQELSD-----VLEHLLVYIAWDSHYDFK 509 >gi|325690859|gb|EGD32860.1| rhamnosyltransferase [Streptococcus sanguinis SK115] Length = 556 Score = 61.1 bits (147), Expect = 3e-07, Method: Composition-based stats. 
Identities = 36/239 (15%), Positives = 74/239 (30%), Gaps = 25/239 (10%) Query: 153 IVVHCYYQDT--WIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + + L L+ + VTV E K + + QL + + Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 K L + + Y Y+ + + + R L ++ D A Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINLMV---DYAD 397 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326 I EQ +G++ R + + + R+ + + AG + Sbjct: 398 ASIEALEQESAVGLVIPDLPRLVRD--GLFESEPLRPRLAAIWQEAGLHKSFDFMTPPSL 455 Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L L + + E+ L D +E + + + Sbjct: 456 TRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDFK 509 >gi|29348315|ref|NP_811818.1| hypothetical protein BT_2906 [Bacteroides thetaiotaomicron VPI-5482] gi|29340219|gb|AAO78012.1| conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] Length = 436 Score = 61.1 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 35/127 (27%), Gaps = 26/127 (20%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK------QRITSK--------DVHFQ 61 ++ +L E G Y+PA G W +P+ + + + Sbjct: 312 DVAFKLWDEHHGQFDIPYVPAVAPG----WDSTPRYIAPANRPAKADRSQWPGCTIFKNE 367 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 + F++F+ + Y RI E + +L + + Sbjct: 368 NPASFKAFVQ------SSFVYLNKHPEVPRILTIACFNEWSEGHYLLPDNRFGYGMLDAL 421 Query: 120 SEKFLYV 126 E Sbjct: 422 GEALGKE 428 >gi|325067617|ref|ZP_08126290.1| hypothetical protein AoriK_07344 [Actinomyces oris K20] Length = 233 Score = 60.7 bits (146), Expect = 4e-07, Method: Composition-based stats. 
Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 3/82 (3%) Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF--PTKRL 321 +I+ F ++P LG+ + A + L++R G L Sbjct: 1 PGYVAGLIDLFARHPGLGVAMPAAGHIGQAH-GGATWDGLAGAATALSRRLGLTVELDPL 59 Query: 322 HLDFFNGTMFWVKPKCLEPLRN 343 G MF +P L L Sbjct: 60 APVVPVGAMFLARPAALRTLSE 81 >gi|253569319|ref|ZP_04846729.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251841338|gb|EES69419.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 415 Score = 60.7 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 35/127 (27%), Gaps = 26/127 (20%) Query: 16 NLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK------QRITSK--------DVHFQ 61 ++ +L E G Y+PA G W +P+ + + + Sbjct: 291 DVAFKLWDEHHGQFDIPYVPAVAPG----WDSTPRYIAPANRPAKADRSQWPGCTIFKNE 346 Query: 62 ELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFD 119 + F++F+ + Y RI E + +L + + Sbjct: 347 NPASFKAFVQ------SSFVYLNKHPEVPRILTIACFNEWSEGHYLLPDNRFGYGMLDAL 400 Query: 120 SEKFLYV 126 E Sbjct: 401 GEALGKE 407 >gi|13474019|ref|NP_105587.1| hypothetical protein mll4797 [Mesorhizobium loti MAFF303099] gi|14024771|dbj|BAB51373.1| mll4797 [Mesorhizobium loti MAFF303099] Length = 467 Score = 60.3 bits (145), Expect = 5e-07, Method: Composition-based stats. 
Identities = 14/96 (14%), Positives = 28/96 (29%), Gaps = 10/96 (10%) Query: 60 FQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMP 117 S + WL + +A + F S R+ F + E + A+L + + + Sbjct: 2 NASPSRY---AEWLANAVADTCDRFADFDS-RLIFVNAWNEWAEGAYLEPDARYGYAYLQ 57 Query: 118 FDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAI 153 P+ L + A+ Sbjct: 58 ETRNVL----SAPSAAGKFPTGASWRVLFVSHDAAL 89 >gi|323351266|ref|ZP_08086922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sanguinis VMC66] gi|322122490|gb|EFX94201.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sanguinis VMC66] Length = 556 Score = 60.3 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 72/240 (30%), Gaps = 27/240 (11%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ ++ +T E K + + Q+ + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQIVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K R L + + Y Y+ + S + R L ++ D Sbjct: 345 KSRAWLAMLE--QKEILQDYAYIGHL----STHRLVENQAVFDQAMRSDLINLMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 A I EQ +G++ R + F + + + + + AG + Sbjct: 396 ADASIEALEQESAVGLVIPDLPRLVRDGLFETEP--LRPSLSAVWQEAGLHKSFDFMTAS 453 Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L L + + D L +E + + + Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509 >gi|327470704|gb|EGF16160.1| rhamnosyltransferase [Streptococcus sanguinis SK330] Length = 556 Score = 60.0 bits (144), Expect = 6e-07, Method: Composition-based stats. 
Identities = 35/239 (14%), Positives = 71/239 (29%), Gaps = 25/239 (10%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ + VTV E K + + QL + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 K L + + Y Y+ + + + R L ++ A Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINMMVY---YAD 397 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326 I EQ +G++ R + + R+ + + AG + Sbjct: 398 TSIEALEQESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAIWQEAGLHKSFDFMTPPSL 455 Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L L + + E+ L D +E + + + Sbjct: 456 TRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDFK 509 >gi|328946538|gb|EGG40677.1| rhamnosyltransferase [Streptococcus sanguinis SK1087] Length = 556 Score = 59.6 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 40/290 (13%), Positives = 84/290 (28%), Gaps = 28/290 (9%) Query: 104 FLRLNRFMSNSR-MPFDSEKFLYV--KELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQ 160 +L + ++S + E Y +L D+ S S + + + +H Sbjct: 236 YLLEDLETNSSYPILLIREHLFYHFGPDLPCLLEDKYLSQSTSNYCTEQPVLLHIHVTDF 295 Query: 161 DTWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLY 218 + + L L+ + VT E K + + Q+ + + K L Sbjct: 296 PIFQQYQDNLFSLSSQYQYLVTTGQPEVLKQLQTSLAHLGNKVQIVLSQ-KSHAWLAMLE 354 Query: 219 LLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276 + + Y Y+ + S + R L ++ +D + I E+ Sbjct: 355 --QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMVDSADAS---IEALEK 405 Query: 277 NPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-----NGTMF 331 N LG++ R + + R+ + + AG + G Sbjct: 406 NSDLGLVIPDLPRLVRD--GLFESEPPRPRLTSVWQDAGLHKSFNFMSTPSLTRVYGGFL 463 Query: 332 WVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 W K L + + D L +E + + + Sbjct: 464 WFKYSALASWFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509 >gi|283456866|ref|YP_003361430.1| putative glycosyltransferase 
[Bifidobacterium dentium Bd1] gi|283103500|gb|ADB10606.1| Putative glycosyltransferase [Bifidobacterium dentium Bd1] Length = 349 Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 32/120 (26%), Gaps = 15/120 (12%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K+ +++ L N Q Y V + + SP++ + + F Sbjct: 239 KKMKRLDCLDYDYLWNRILNKQRKYGTRQIVRSAFTNFDNSPRKGTRAFITQGSSYTKFA 298 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFF--YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ L S + F + E + A L + + Sbjct: 299 DYLNQLIH----------SNRQDYMDFTVINAWNEWGEGAILEPTESDQYGWLQAVKDAV 348 >gi|171741995|ref|ZP_02917802.1| hypothetical protein BIFDEN_01098 [Bifidobacterium dentium ATCC 27678] gi|171277609|gb|EDT45270.1| hypothetical protein BIFDEN_01098 [Bifidobacterium dentium ATCC 27678] Length = 356 Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 32/120 (26%), Gaps = 15/120 (12%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPKQRITSKDVHFQELSIFE 67 K+ +++ L N Q Y V + + SP++ + + F Sbjct: 246 KKMKRLDCLDYDYLWNRILNKQRKYGTRQIVRSAFTNFDNSPRKGTRAFITQGSSYTKFA 305 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFF--YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++ L S + F + E + A L + + Sbjct: 306 DYLNQLIH----------SNRQDYMDFTVINAWNEWGEGAILEPTESDQYGWLQAVKDAV 355 >gi|281355222|ref|ZP_06241716.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548] gi|281318102|gb|EFB02122.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548] Length = 375 Score = 59.2 bits (142), Expect = 1e-06, Method: Composition-based stats. 
Identities = 13/117 (11%), Positives = 25/117 (21%), Gaps = 23/117 (19%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSK----------DVHFQELSIFES 68 +E Y + W SP+ +T + + F Sbjct: 267 YWRKWDEIER---QYRIPYFPNVTAGWDPSPRTLMTDRWEPVGYPYTCTLSENTPENFRR 323 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + +L R F E + + L + F Sbjct: 324 AL--------AATRDRLLKSEIRTFSINCWNEWTEGSMLEPEARYGYGYLDALKAVF 372 >gi|327461067|gb|EGF07400.1| rhamnosyltransferase [Streptococcus sanguinis SK1057] Length = 556 Score = 58.0 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 72/239 (30%), Gaps = 25/239 (10%) Query: 153 IVVHCYYQDT--WIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + + L L+ + VTV E K + + QL + + Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAI 268 K L + + Y Y+ + + + R L ++ D A Sbjct: 345 KSHAWLAMLE--QKEILQDYAYIGHLSTHR--IMENQAVFDQAMRSDLINLMV---DYAD 397 Query: 269 RIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF-- 326 I EQ +G++ R + + + R+ + + AG + Sbjct: 398 ASIEALEQESAVGLVIPDLPRLVRD--GLFESEPLRPRLAAIWQEAGLHKSFDFMTPPSL 455 Query: 327 ---NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L + + + E+ D +E + + Sbjct: 456 TRVYGGFVWFKYSALASVFRMKSLESLPSSEQEFSD-----VLEHLLVYLAWDNHYDFK 509 >gi|125718317|ref|YP_001035450.1| lipopolysaccharide biosynthesis protein, putative [Streptococcus sanguinis SK36] gi|125498234|gb|ABN44900.1| Lipopolysaccharide biosynthesis protein, putative [Streptococcus sanguinis SK36] Length = 556 Score = 57.7 bits (138), Expect = 4e-06, Method: Composition-based stats. 
Identities = 33/240 (13%), Positives = 69/240 (28%), Gaps = 27/240 (11%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ + +T E K + + Q+ + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K L + + Y Y+ + S + R L ++ D Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 A I EQ+ G++ R + F + + + + AG + Sbjct: 396 ADASIEALEQDSAEGLVIPDLPRLVRDGLFEIEP--PRPSLSAVWQEAGLHKSFDFMTAS 453 Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L L + + D L +E + + + Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509 >gi|325687210|gb|EGD29232.1| rhamnosyltransferase [Streptococcus sanguinis SK72] Length = 556 Score = 57.3 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 69/240 (28%), Gaps = 27/240 (11%) Query: 153 IVVHCYYQD--TWIEISHILLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVMEN 208 +++H + D + + L L+ + +T E K + + Q+ + + Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQ- 344 Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFSDI 266 K L + + Y Y+ + S + R L ++ D Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQTMRSDLINMMV---DY 395 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 A I EQ+ G++ R + F + + + + AG + Sbjct: 396 ADASIEALEQDSAEGLVIPDLPRLVRDGLFEIEP--PRPSLSAVWQEAGLHKSFDFMTAS 453 Query: 327 -----NGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIE 381 G W K L L + + D L +E + + + Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPS----SDQELSDVLEHLLVYLAWDSHYDFK 509 >gi|29348316|ref|NP_811819.1| hypothetical protein BT_2907 [Bacteroides thetaiotaomicron VPI-5482] gi|29340220|gb|AAO78013.1| glycosyltransferase-like protein [Bacteroides thetaiotaomicron VPI-5482] Length = 452 Score = 
56.9 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 11/111 (9%), Positives = 29/111 (26%), Gaps = 18/111 (16%) Query: 27 GNMQAIYIPAHVSGYYVLWSFSPK---------QRITSK-----DVHFQELSIFESFIFW 72 + ++ W +P+ Q + + + F++ + Sbjct: 334 PKHHDDFAIPYLPSLSPGWDSTPRYIPPVSRPDQPNRDAWPNCVILDNENPASFKALVQ- 392 Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123 S A+ K P I + + +L + + +E Sbjct: 393 --SAFAYLNKHKDVPPILTIACFNEW-TEGHYLLPDNRFGYGMLDALAEAV 440 >gi|253569318|ref|ZP_04846728.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251841337|gb|EES69418.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 441 Score = 56.9 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 11/111 (9%), Positives = 29/111 (26%), Gaps = 18/111 (16%) Query: 27 GNMQAIYIPAHVSGYYVLWSFSPK---------QRITSK-----DVHFQELSIFESFIFW 72 + ++ W +P+ Q + + + F++ + Sbjct: 323 PKHHDDFAIPYLPSLSPGWDSTPRYIPPVSRPDQPNRDAWPNCVILDNENPASFKALVQ- 381 Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKF 123 S A+ K P I + + +L + + +E Sbjct: 382 --SAFAYLNKHKDVPPILTIACFNEW-TEGHYLLPDNRFGYGMLDALAEAV 429 >gi|325696073|gb|EGD37964.1| rhamnosyltransferase [Streptococcus sanguinis SK160] Length = 556 Score = 56.5 bits (135), Expect = 7e-06, Method: Composition-based stats. 
Identities = 36/243 (14%), Positives = 72/243 (29%), Gaps = 33/243 (13%) Query: 153 IVVHCYYQDTWIEISHI---LLRLNFDFDLFVTV--VEANKDFEQDVLKYFPSAQLYVME 207 +++H + D + H L L+ + VTV E K + + QL + + Sbjct: 286 VLLHIHVTD-FPIFQHYQDKLFSLSSQYQYLVTVAQPEMLKQLQTALAHLGDKVQLVLSQ 344 Query: 208 NKGRDVRPFLYLL-ELGVFDRYDYLCKIHGKKSQRE--GYHPIEGIIWRRWLFFDLLGFS 264 +L +L + + Y Y+ + S + R L ++ Sbjct: 345 ----ASHAWLAMLDQKEILQDYAYIGHL----STHRLVENQAVFDQAMRSDLINMMVY-- 394 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLD 324 A I EQ +G++ R + + R+ + + A + Sbjct: 395 -YADTSIEALEQESAVGLVIPDLPRLVRD--GLFESEPPRPRLAAIWQEADLHKSFDCMT 451 Query: 325 FF-----NGTMFWVKPKCLEPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEF 378 G W K L L + + E+ L D +E + + Sbjct: 452 PPSLTRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHY 506 Query: 379 SIE 381 + Sbjct: 507 DFK 509 >gi|218282206|ref|ZP_03488505.1| hypothetical protein EUBIFOR_01087 [Eubacterium biforme DSM 3989] gi|218216808|gb|EEC90346.1| hypothetical protein EUBIFOR_01087 [Eubacterium biforme DSM 3989] Length = 355 Score = 55.0 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 11/102 (10%), Positives = 26/102 (25%), Gaps = 16/102 (15%) Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81 + + N + + + G W +P+ + F ++ Sbjct: 263 KMYKVANDTKLNVNNVIRGLCFEWDNTPRHGYRGYVITPPSKESFFKYM----------- 311 Query: 82 YSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 S S F + E + L + + + E Sbjct: 312 ---DSVQSDEYLFINAWNEWCEGMVLEPTQEKKYKYLEWIKE 350 >gi|296163856|ref|ZP_06846524.1| glycosyltransferase [Burkholderia sp. Ch1-1] gi|295885899|gb|EFG65849.1| glycosyltransferase [Burkholderia sp. Ch1-1] Length = 187 Score = 53.4 bits (127), Expect = 6e-05, Method: Composition-based stats. 
Identities = 11/72 (15%), Positives = 23/72 (31%), Gaps = 4/72 (5%) Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 + WL + + P +I F S E + +L + + SE Sbjct: 11 YKQWLSQAILDTHDR--YSPDEQIVFLHSWNEWCEGTYLEPDGKSGRRFLEETSEAIKDA 68 Query: 127 KELFEGWNDRPS 138 + + +D + Sbjct: 69 ESVLALSDDSQA 80 >gi|315221431|ref|ZP_07863352.1| rhamnan synthesis protein F [Streptococcus anginosus F0211] gi|315189550|gb|EFU23244.1| rhamnan synthesis protein F [Streptococcus anginosus F0211] Length = 555 Score = 52.6 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 47/178 (26%), Gaps = 23/178 (12%) Query: 215 PFLYLL-ELGVFDRYDYLCKI--HGKKSQREGYHPIEGIIW-RRWLFFDLLGFSDIAIRI 270 P L + + Y Y+ + H W R LF ++ + Sbjct: 348 PLLAMFAQAERLKTYKYIGHLSTHT-----LIPEVAGLDQWMRDDLFNMMI---ENMNYS 399 Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDF----- 325 IN E LG+I + F+ K + + L K D Sbjct: 400 INALEHCSNLGLIIPDLPSVVRNGLFYQKP--LKEEMEKLWKLLSCRKSFKFTDAVTLTR 457 Query: 326 FNGTMFWVKPKCLEPLRNLHLIGEFEEERNLKDGALEHAVERFFACSVRYTEFSIESV 383 G W K + +E L F + + +E + + + Sbjct: 458 VYGGWMWFKYEAVESLFKASFKT-FSSYSLQEQSTI---LENLLVYVAWDKNYDFQII 511 >gi|283785857|ref|YP_003365722.1| hypothetical protein ROD_21731 [Citrobacter rodentium ICC168] gi|282949311|emb|CBG88922.1| conserved hypothetical protein [Citrobacter rodentium ICC168] Length = 346 Score = 52.6 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 30/115 (26%), Gaps = 14/115 (12%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 LG I + + + + I + + W + + FE Sbjct: 239 KFLGPI-RYNYKKMISSLWHNETKDI-KEIPIIFSGWDTTIRHGKQGVFYSNFSEHSFE- 295 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 + + P I F S E + + + S+ + S+ Sbjct: 296 ---------VNVQNAINYNPQQDIVFLKSWNEWAEGNTVEPDTIFSDKLLRIISK 341 >gi|160894490|ref|ZP_02075266.1| hypothetical protein CLOL250_02042 [Clostridium sp. 
L2-50] gi|156863801|gb|EDO57232.1| hypothetical protein CLOL250_02042 [Clostridium sp. L2-50] Length = 783 Score = 51.1 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 69/218 (31%), Gaps = 33/218 (15%) Query: 139 SPKKSGLTIKSKIAIVV-HCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197 P I KIA+V+ +Y +I L DL+ E + +++ + Sbjct: 291 IPDSECERIAEKIAVVIDEDFYLQHQPDI----DDLETHADLYYWGSEESFHQKKNWEEM 346 Query: 198 F-------PSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEG- 249 A++Y V F Y+Y+C + + + G Sbjct: 347 HLLECTTGNFAEVYY------AVGAF--------AKEYEYICFLVNEDRSYIAENLDNGH 392 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSF-FAKRSEVYRRVI 308 W +LG I++ N +G++ + +S + +R + + Sbjct: 393 TGWIIE--NSILGKGVSLGNIVSCLNDNSGIGLVYPPNSSQSLYYSRQYKERELISCEIQ 450 Query: 309 DLAKRAGFPTKRLHLDFFNG---TMFWVKPKCLEPLRN 343 + + + + G FW + + L+ L Sbjct: 451 QILEDSDIHLNIAKVRGSIGQYTGCFWCRSQVLQNLTE 488 >gi|168481320|gb|ACA24808.1| WfgB [Shigella dysenteriae] gi|168481331|gb|ACA24818.1| WfgB [Escherichia coli] Length = 345 Score = 50.7 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 29/115 (25%), Gaps = 14/115 (12%) Query: 9 SKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFES 68 LG I + + Q I + + W + + FE Sbjct: 239 KFLGPI-RYNYEKMISSLWHNQTKDI-KEIPIIFSGWDTTIRHGKQGVFYSDFSEHSFE- 295 Query: 69 FIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSE 121 K + P I F S E + + + S+ + S+ Sbjct: 296 ---------VNVKNAINYNPQQDIVFLKSWNEWAEGNTVEPDTIFSDKLLRIISK 341 >gi|127512343|ref|YP_001093540.1| tetratricopeptide TPR_2 [Shewanella loihica PV-4] gi|126637638|gb|ABO23281.1| tetratricopeptide TPR_2 [Shewanella loihica PV-4] Length = 372 Score = 50.3 bits (119), Expect = 5e-04, Method: Composition-based stats. 
Identities = 10/92 (10%), Positives = 23/92 (25%), Gaps = 10/92 (10%) Query: 18 LLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFL 77 V+ S W SP+ + + + F+ + Sbjct: 268 NYASTVKTLEYAHQNISGTVHSTIVTGWDNSPRSNRRALVLTNFNENSFK-------YAI 320 Query: 78 AFSKYSKLSFPSCRIFFYGSRKE--QKAFLRL 107 + ++ ++ F S E + L Sbjct: 321 DIAISNE-KNNENKLLFIKSWNEWAEGNTLEP 351 >gi|254431846|ref|ZP_05045549.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001] gi|197626299|gb|EDY38858.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001] Length = 205 Score = 50.3 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 45/178 (25%), Gaps = 15/178 (8%) Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGF 263 + N G D F +L G F K+ KKS G G+ W + Sbjct: 28 ERVTNYGEDWSSFHHLFYSGAFSSRGATFKLQTKKSSNLG--ADGGMAWVDEALQPIASS 85 Query: 264 SDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-PTKRLH 322 +I + + + V + R G Sbjct: 86 YRATATVIKNLKAG--------TIKLAASKLCKRTGFGANPQLVAEYIHRLGLNEQSAKR 137 Query: 323 LDFFNGTMFWVKPKCLEPLR-NLHLIGEFEEERN--LKDGAL-EHAVERFFACSVRYT 376 F G+MF ++ +L + G HA+ER F Sbjct: 138 QSFCMGSMFAADNDLIQLFYSSLGDVDYRITSDGGSQFCGRYPGHAIERAFFYYSYQA 195 >gi|323483798|ref|ZP_08089177.1| hypothetical protein HMPREF9474_00926 [Clostridium symbiosum WAL-14163] gi|323402883|gb|EGA95202.1| hypothetical protein HMPREF9474_00926 [Clostridium symbiosum WAL-14163] Length = 358 Score = 49.2 bits (116), Expect = 0.001, Method: Composition-based stats. 
Identities = 8/80 (10%), Positives = 20/80 (25%), Gaps = 16/80 (20%) Query: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99 G + W +P+ I + + ++ ++ S F + Sbjct: 278 GVFFEWDNTPRHSIRGTII---TPPDKKRYLQYMDSIKDT-----------EYLFINAWN 323 Query: 100 E--QKAFLRLNRFMSNSRMP 117 E + L + Sbjct: 324 EWAEGMMLEPTVENKYKYLE 343 >gi|322510485|gb|ADX05799.1| putative N-acetyl glucosaminyl transferase [Organic Lake phycodnavirus 1] Length = 690 Score = 48.0 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 81/268 (30%), Gaps = 63/268 (23%) Query: 141 KKSGLTIKSKIAIVVHCYYQDTWIEIS-HILLRLNFDFDLFVTVVEANKDFEQDVLKYFP 199 +KS L KS A +HCY + I + L+ F + VT D + + + Sbjct: 302 EKSELYSKSLFA-HLHCYDISQFTTIYKDYIYDLSKYFHIIVTYTIGYLDKKNEYITLLK 360 Query: 200 SAQLYVMENKGRDVRP--FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLF 257 + N G D+ + Y Y+ +H K + R ++ Sbjct: 361 ------IPNNGYDIGAKMMMVKYLKDKNIDYKYIYFMHSK-----------SDVNLRHIY 403 Query: 258 FDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKR-----SEVYRRVIDLAK 312 FD L D I+ E G + Y+ Y +++ Y +L Sbjct: 404 FDTLY--DHVDDIVKYIEDYD--GYFPNLLYKLYNQYNIKQSNKIKQPDYNYVYTNEL-- 457 Query: 313 RAGFPTKRLHLD-FFNGTMFWVKPKC---------LEPLRNLHLIGEF---------EEE 353 + K + F G ++ ++ L L N ++ E Sbjct: 458 KHYLNVKDTQFNTFVEGNVYILRRNICETIFGDERLYRLLNESDENDYVHLQNIYRKPLE 517 Query: 354 RNL------------KDGALEHAVERFF 369 DG LEHA ER Sbjct: 518 EIYHKLKYNYQTKMIHDGQLEHAFERVV 545 >gi|53803315|ref|YP_114969.1| glycosyl transferase group 2 family protein [Methylococcus capsulatus str. Bath] gi|53757076|gb|AAU91367.1| glycosyl transferase, group 2 family protein [Methylococcus capsulatus str. Bath] Length = 957 Score = 48.0 bits (113), Expect = 0.002, Method: Composition-based stats. 
Identities = 8/62 (12%), Positives = 19/62 (30%), Gaps = 3/62 (4%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + W S ++ + + L+ S+ WL + + + R+ F Sbjct: 853 YKLFRSVTLAWDNSARRGKRATILRNFSLT---SYAQWLLTACKATLADHNLTENERLVF 909 Query: 95 YG 96 Sbjct: 910 IN 911 >gi|293611242|ref|ZP_06693540.1| predicted protein [Acinetobacter sp. SH024] gi|292826493|gb|EFF84860.1| predicted protein [Acinetobacter sp. SH024] Length = 347 Score = 48.0 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 10/126 (7%), Positives = 24/126 (19%), Gaps = 14/126 (11%) Query: 3 KVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQE 62 K+ K+ + + W S + ++ + Sbjct: 233 KINNFTKGRFKLGPFFYSYKRMMMLEKNLKNSSGEIPVIFSGWDTSIRHATNGIVLNEFD 292 Query: 63 LSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDS 120 +F + S S E + L + + + Sbjct: 293 SQVFNEHV------------SHNLNFESDFLIVKSWNEWAEGNLLEPDSIYGFTMLKVMK 340 Query: 121 EKFLYV 126 E Sbjct: 341 EALRKY 346 >gi|302024024|ref|ZP_07249235.1| polysaccharide biosynthesis protein [Streptococcus suis 05HAS68] Length = 587 Score = 47.3 bits (111), Expect = 0.005, Method: Composition-based stats. 
Identities = 33/232 (14%), Positives = 81/232 (34%), Gaps = 25/232 (10%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-KDFEQD 193 P ++ T++S ++++H + + + E L +++ L +T+ E + + Sbjct: 282 PIRVSQTTETVRSSTSVLLHVHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNKCSI 341 Query: 194 VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253 V +Y + +L K D F ++ + D YL + K+++ Y + I R Sbjct: 342 VERYLSTYKLRAQIAKLTDELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-R 399 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L +I+ FE L ++ + + +L ++ Sbjct: 400 HQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQ 451 Query: 314 AGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353 + + G + +W+K + + + +F +E Sbjct: 452 LNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEDYKKIEEKFRNIDFSKE 503 >gi|223933250|ref|ZP_03625240.1| Rhamnan synthesis F [Streptococcus suis 89/1591] gi|330832517|ref|YP_004401342.1| Rhamnan synthesis F [Streptococcus suis ST3] gi|223898064|gb|EEF64435.1| Rhamnan synthesis F [Streptococcus suis 89/1591] gi|329306740|gb|AEB81156.1| Rhamnan synthesis F [Streptococcus suis ST3] Length = 574 Score = 46.9 bits (110), Expect = 0.005, Method: Composition-based stats. 
Identities = 33/232 (14%), Positives = 81/232 (34%), Gaps = 25/232 (10%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-KDFEQD 193 P ++ T++S ++++H + + + E L +++ L +T+ E + + Sbjct: 269 PIRVSQTTETVRSSTSVLLHVHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNKCSI 328 Query: 194 VLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWR 253 V +Y + +L K D F ++ + D YL + K+++ Y + I R Sbjct: 329 VERYLSTYKLRAQIAKLTDELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-R 386 Query: 254 RWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKR 313 L +I+ FE L ++ + + +L ++ Sbjct: 387 HQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQ 438 Query: 314 AGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353 + + G + +W+K + + + +F +E Sbjct: 439 LNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEDYKKIEEKFRNIDFSKE 490 >gi|146318939|ref|YP_001198651.1| polysaccharide biosynthesis protein [Streptococcus suis 05ZYH33] gi|145689745|gb|ABP90251.1| polysaccharide biosynthesis protein [Streptococcus suis 05ZYH33] gi|319758378|gb|ADV70320.1| polysaccharide biosynthesis protein [Streptococcus suis JS14] Length = 587 Score = 46.9 bits (110), Expect = 0.006, Method: Composition-based stats. 
Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 31/221 (14%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN-----KDFEQDVLKYFPSAQLY 204 + + VH + E L ++ L +T+ EA+ E+ + Y AQ+ Sbjct: 297 SVLLHVHIESVSIFEEYIEELCKIADRCQLLITLPEADFSNKCSIVERCLFTYQLRAQIA 356 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + D F ++ + D YL + K+++ Y + I R L Sbjct: 357 KLT----DELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-RHQLRKMFFTS- 409 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-------- 316 +I+ FE L ++ + + +L ++ Sbjct: 410 --FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQLNILYESLVRT 462 Query: 317 -PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353 + + G + +W+K + + + +F +E Sbjct: 463 KKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 503 >gi|253752012|ref|YP_003025153.1| rhamnan synthesis protein F family protein [Streptococcus suis SC84] gi|253753837|ref|YP_003026978.1| rhamnan synthesis protein F family protein [Streptococcus suis P1/7] gi|251816301|emb|CAZ51929.1| rhamnan synthesis protein F family protein [Streptococcus suis SC84] gi|251820083|emb|CAR46353.1| rhamnan synthesis protein F family protein [Streptococcus suis P1/7] Length = 574 Score = 46.5 bits (109), Expect = 0.007, Method: Composition-based stats. 
Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 31/221 (14%) Query: 150 KIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEAN-----KDFEQDVLKYFPSAQLY 204 + + VH + E L ++ L +T+ EA+ E+ + Y AQ+ Sbjct: 284 SVLLHVHIESVSIFEEYIEELCKIADRCQLLITLPEADFSNKCSIVERCLFTYQLRAQIA 343 Query: 205 VMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFS 264 + D F ++ + D YL + K+++ Y + I R L Sbjct: 344 KLT----DELHFFEIVNNYMGDA-KYLAHVTVKQTKEIKYSVEDIID-RHQLRKMFFTS- 396 Query: 265 DIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGF-------- 316 +I+ FE L ++ + + +L ++ Sbjct: 397 --FDAVISNFESQSNLAVVIPDLTTNQRYDRQSLREGNP-----ELIRQLNILYESLVRT 449 Query: 317 -PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353 + + G + +W+K + + + +F +E Sbjct: 450 KKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 490 >gi|322418496|ref|YP_004197719.1| hypothetical protein GM18_0965 [Geobacter sp. M18] gi|320124883|gb|ADW12443.1| hypothetical protein GM18_0965 [Geobacter sp. M18] Length = 393 Score = 46.1 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 13/107 (12%), Positives = 26/107 (24%), Gaps = 14/107 (13%) Query: 26 KGNMQAIYIPAHVSGYYVLWSFSPKQ-------RITSKDVHFQELSIFESFIFWLRSFLA 78 V V W P++ ++ S LR + Sbjct: 271 FWEECKALAQQTVPVVNVGWDNRPRRTSPEQALKLRGPWYVPPTPDELASH---LRMAIQ 327 Query: 79 FSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + + + I + E + L R N+R+ Sbjct: 328 WERENPAYTEANAIL-IYAWNELDEGG-LVPTRSEGNARLQAVKTAL 372 >gi|253755287|ref|YP_003028427.1| rhamnan synthesis protein F family protein [Streptococcus suis BM407] gi|251817751|emb|CAZ55503.1| rhamnan synthesis protein F family protein [Streptococcus suis BM407] Length = 574 Score = 45.7 bits (107), Expect = 0.014, Method: Composition-based stats. 
Identities = 34/236 (14%), Positives = 81/236 (34%), Gaps = 33/236 (13%) Query: 137 PSSPKKSGLTIKSKIAIVVHCYYQDT--WIEISHILLRLNFDFDLFVTVVEAN-----KD 189 P ++ T++S ++++H + + + E L +++ L +T+ E + Sbjct: 269 PIRVSQTTETVRSSTSVLLHIHIESVSIFEEYIEELCKISDRCQLLITLPETDFSNNFSI 328 Query: 190 FEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEG 249 E+ + Y AQ+ + D F ++ + D YL I K++ + Y + Sbjct: 329 VERYLSTYKLRAQIVKLT----DELHFFEIVNNYMGDA-KYLAHITVKQTNKTKYSVEDI 383 Query: 250 IIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVID 309 I R L +I+ FE L ++ + + + Sbjct: 384 ID-RYQLRKMFFTS---FDAVISNFESQSNLAVVIPDLTTNQRYDRKSLREGNP-----E 434 Query: 310 LAKRAGF---------PTKRLHLDFFNG---TMFWVKPKCLEPLRNLHLIGEFEEE 353 L ++ + + G + +W+K + + + +F +E Sbjct: 435 LIRQLNILYESLVRTKKVDFYKVPYIIGEEVSWYWIKTEHYKKIEEKFRNIDFSKE 490 >gi|30260448|ref|NP_842825.1| hypothetical protein BA_0273 [Bacillus anthracis str. Ames] gi|47525533|ref|YP_016882.1| hypothetical protein GBAA_0273 [Bacillus anthracis str. 'Ames Ancestor'] gi|49183290|ref|YP_026542.1| hypothetical protein BAS0259 [Bacillus anthracis str. Sterne] gi|65317702|ref|ZP_00390661.1| COG1882: Pyruvate-formate lyase [Bacillus anthracis str. A2012] gi|227812939|ref|YP_002812948.1| hypothetical protein BAMEG_0323 [Bacillus anthracis str. CDC 684] gi|254736984|ref|ZP_05194689.1| hypothetical protein BantWNA_17640 [Bacillus anthracis str. Western North America USA6153] gi|254756036|ref|ZP_05208066.1| hypothetical protein BantV_26524 [Bacillus anthracis str. Vollum] gi|254761686|ref|ZP_05213703.1| hypothetical protein BantA9_25534 [Bacillus anthracis str. Australia 94] gi|30253769|gb|AAP24311.1| conserved domain protein [Bacillus anthracis str. Ames] gi|47500681|gb|AAT29357.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames Ancestor'] gi|49177217|gb|AAT52593.1| conserved domain protein [Bacillus anthracis str. Sterne] gi|227005477|gb|ACP15220.1| conserved hypothetical protein [Bacillus anthracis str. 
CDC 684] Length = 317 Score = 45.3 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 13/87 (14%), Positives = 25/87 (28%), Gaps = 8/87 (9%) Query: 12 GKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQR-ITSKDVHFQELSIFESFI 70 GK N V ++ G +V W + +++ + S F + Sbjct: 238 GKQINAWDYDKVWMYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIY- 296 Query: 71 FWLRSFLAFSKYSKLSFPSCRIFFYGS 97 L+ Y S + F + Sbjct: 297 ------LSKQIYRTYSLYNSEFLFMNA 317 >gi|86144017|ref|ZP_01062355.1| hypothetical protein MED217_13656 [Leeuwenhoekiella blandensis MED217] gi|85829477|gb|EAQ47941.1| hypothetical protein MED217_13656 [Leeuwenhoekiella blandensis MED217] Length = 361 Score = 44.2 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 10/91 (10%), Positives = 27/91 (29%), Gaps = 6/91 (6%) Query: 36 AHVSGYYVLWSFSPKQRITSK-DVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + + W P Q+ + + + R ++ K S ++ Sbjct: 269 KQIPTVTLNWDPRPMQKHSGAKIFSGFSAKSVKKAVLATRVWVDTH---KESVSKKKLIM 325 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + E + A+L + + + + E Sbjct: 326 LYAWNEYAEGAWLTPSEVLGTTLLDGLKEGL 356 >gi|325106250|ref|YP_004275904.1| hypothetical protein Pedsa_3552 [Pedobacter saltans DSM 12145] gi|324975098|gb|ADY54082.1| hypothetical protein Pedsa_3552 [Pedobacter saltans DSM 12145] Length = 355 Score = 43.8 bits (102), Expect = 0.045, Method: Composition-based stats. 
Identities = 10/120 (8%), Positives = 35/120 (29%), Gaps = 20/120 (16%) Query: 6 RLKSKLGKIENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSI 65 R + ++ L + + ++P GY P + Sbjct: 251 RWWAFYSFVD-LNWQNWKASLDKLNVEFVPCIFPGY-----NEP------------SAAT 292 Query: 66 FESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 ++++ ++ +K + ++ S + + L ++ + + +F Sbjct: 293 QRIIDRTEKNYVDYANVAKRNMGVNQMVIINSWNDFSKGTALEPSKKYNKQFLELTKREF 352 >gi|254788145|ref|YP_003075574.1| glycosyltransferase family 2 domain-containing protein [Teredinibacter turnerae T7901] gi|237685039|gb|ACR12303.1| glycosyltransferase family 2 domain protein [Teredinibacter turnerae T7901] Length = 307 Score = 43.4 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 32/126 (25%) Query: 166 ISHILLRLNFDFDLFVTVVEANKD----FEQDVLKYFPSAQLYVMENKG-RDVRP----- 215 + DL+V V + + D D + +P Q+ EN+G R V P Sbjct: 19 TLDSVCNQTVPPDLWVVVDDGSTDETPAILADYSERYPFIQVITRENRGHRSVGPGVIEA 78 Query: 216 FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFE 275 F Y + ++DY+CK L +I+ E Sbjct: 79 FYYGYDKIDVSQFDYVCKFD---------------------LDLDLP-PRYFEILIDRME 116 Query: 276 QNPCLG 281 +NP LG Sbjct: 117 KNPRLG 122 >gi|261338088|ref|ZP_05965972.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum DSM 20093] gi|270276707|gb|EFA22561.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum DSM 20093] Length = 639 Score = 43.4 bits (101), Expect = 0.070, Method: Composition-based stats. 
Identities = 19/101 (18%), Positives = 28/101 (27%), Gaps = 17/101 (16%) Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREG---------YHPIEGIIWRR 254 +E+ G DV + L + + IH K E W+ Sbjct: 214 RYVEH-GNDVHAMIQALREVKDTDHPVVLHIHTCKGLGLDQQDAHYGVLEGRCEANHWQN 272 Query: 255 WLFFDL--LGFSDIAIRII-----NTFEQNPCLGMIGSRRY 288 L LG R I F++ P L +I Sbjct: 273 PLAQANAPLGSRKTYGRAIMAMLEQRFDEEPGLMVISPATP 313 >gi|257125628|ref|YP_003163742.1| 1-deoxy-D-xylulose-5-phosphate synthase [Leptotrichia buccalis C-1013-b] gi|257049567|gb|ACV38751.1| 1-deoxy-D-xylulose-5-phosphate synthase [Leptotrichia buccalis C-1013-b] Length = 582 Score = 43.0 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 44/117 (37%), Gaps = 13/117 (11%) Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E+N + + K +YV +KG D+ + + E + + +H +K + Y Sbjct: 193 ESNGQAQNNYFKSLGLDYVYV--DKGNDLDALIEVFEKVKDINHPIVVHVHTQKGKGLPY 250 Query: 245 HPIEGIIWRRWL-FFDLLG----------FSDIAIRIINTFEQNPCLGMIGSRRYRR 290 + W + F G +D A +++ E++P + ++ S Sbjct: 251 AEKDKETWHYGMPFDPKTGESKVNYSGGLSNDTAEFLMDKMEKDPTIAVVTSGTPTV 307 >gi|255533245|ref|YP_003093617.1| hypothetical protein Phep_3361 [Pedobacter heparinus DSM 2366] gi|255346229|gb|ACU05555.1| hypothetical protein Phep_3361 [Pedobacter heparinus DSM 2366] Length = 355 Score = 43.0 bits (100), Expect = 0.089, Method: Composition-based stats. Identities = 9/101 (8%), Positives = 30/101 (29%), Gaps = 19/101 (18%) Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84 + Y+P GY P + ++++ ++ +K Sbjct: 269 SLDKLNVEYVPCIFPGY-----NEP------------SAATQRIIERTEKNYVDYTNVAK 311 Query: 85 LSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 + + ++ S + + L ++ + + +F Sbjct: 312 RNMGTNQMVIINSWNDFSKGTALEPSKKFNKQFLGITRREF 352 >gi|149279792|ref|ZP_01885919.1| hypothetical protein PBAL39_02720 [Pedobacter sp. BAL39] gi|149229382|gb|EDM34774.1| hypothetical protein PBAL39_02720 [Pedobacter sp. 
BAL39] Length = 357 Score = 42.2 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 40/132 (30%), Gaps = 15/132 (11%) Query: 2 YKVFRLKSKLGKIENLLLRLDVEEKGNMQAIYIP-AHVSGYYVLWSFSPK---QRITSKD 57 Y K+ +I ++ + N A P + + W P+ D Sbjct: 228 YHSSGFKAGSTEIPISNMQAAENQMWNNIAYVSPLKFIPVATLNWD--PRPWANAGNGYD 285 Query: 58 ----VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFM 111 +S + S + + +K P RI + E + A+L ++ Sbjct: 286 KAPYFVGYS---EKSVYKSVSSLIDWINKNKWETPKERIGLLYAWNENGEGAYLTPSQEG 342 Query: 112 SNSRMPFDSEKF 123 ++ + + Sbjct: 343 DDNLLRGVQKAL 354 >gi|238916219|ref|YP_002929736.1| polysaccharide biosynthesis protein [Eubacterium eligens ATCC 27750] gi|238871579|gb|ACR71289.1| polysaccharide biosynthesis protein [Eubacterium eligens ATCC 27750] Length = 621 Score = 42.2 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 67/230 (29%), Gaps = 35/230 (15%) Query: 122 KFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTW-IEISHILLRLNFDFD-L 179 + +LFE N + + I K AIVV C EI + R+ + + Sbjct: 267 RLYNHADLFEKLNLQYVLQTRGEKEISLKNAIVVICGNVKLISNEIDEYIQRIKDEIKVI 326 Query: 180 FVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKS 239 F+T +K+ +++ E D + Sbjct: 327 FIT---ESKEGCEELKNQIR-------------------EYEYVCLINCDIIL------- 357 Query: 240 QREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG-MIGSRRYRRYKRWSFFA 298 + +L+ + ++ F++N +G + + Sbjct: 358 -ENNTFSCVNKSALYGVLENLIKSNSYISNVMGIFKRNKKIGALTIPELIHADFLGKAWK 416 Query: 299 KRSEVYRRVIDLA--KRAGFPTKRLHLDFFNGTMFWVKPKCLEPLRNLHL 346 + ++ +++ + K+ + N WV+ + LE + Sbjct: 417 RWVQIRQKISYILDSKQIHCIFSMDKMPIVNSDNLWVRRELLEQAIEYND 466 >gi|149198517|ref|ZP_01875562.1| hypothetical protein LNTAR_06784 [Lentisphaera araneosa HTCC2155] gi|149138523|gb|EDM26931.1| hypothetical protein LNTAR_06784 [Lentisphaera araneosa HTCC2155] Length = 441 Score = 41.9 bits (97), Expect = 0.16, Method: Composition-based stats. 
Identities = 7/68 (10%), Positives = 17/68 (25%), Gaps = 9/68 (13%) Query: 58 VHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSR 115 H ++ + R + + G E + A+L + Sbjct: 375 YHGGTPKLYGESLKLARESVE-------KNNGRKFITVGIWNEFYEDAYLEPDVKYGYEY 427 Query: 116 MPFDSEKF 123 + + F Sbjct: 428 LKQIEKNF 435 >gi|87312199|ref|ZP_01094301.1| hypothetical protein DSM3645_13248 [Blastopirellula marina DSM 3645] gi|87285075|gb|EAQ77007.1| hypothetical protein DSM3645_13248 [Blastopirellula marina DSM 3645] Length = 349 Score = 41.9 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 12/104 (11%), Positives = 30/104 (28%), Gaps = 21/104 (20%) Query: 25 EKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSK 84 + Q ++ P + Y F K+ +T ++ + +A + Sbjct: 260 QTWAEQTVFCPTLMPKY---HDFRGKRTLTG------TPEQYQ-------TMIAMMQALP 303 Query: 85 LSFPSC---RIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKF 123 I+ S E + + + + + + E F Sbjct: 304 KQPVGHGIGSIYLITSWNEWWEGTTIEPDTTDGEAFLKANREAF 347 >gi|75674738|ref|YP_317159.1| hypothetical protein Nwi_0540 [Nitrobacter winogradskyi Nb-255] gi|74419608|gb|ABA03807.1| hypothetical protein Nwi_0540 [Nitrobacter winogradskyi Nb-255] Length = 381 Score = 41.5 bits (96), Expect = 0.26, Method: Composition-based stats. Identities = 12/117 (10%), Positives = 25/117 (21%), Gaps = 19/117 (16%) Query: 23 VEEKGNMQAIYIPAHVSGYYVLWSFSPK-----------QRITSKDVHFQELSIFESFIF 71 E GN V W P+ + + F + + Sbjct: 261 WNELGNGHLP----VVPTVMTGWDRRPRIENPVPWEKKQRPGEGIENFFAAPTK-KELAD 315 Query: 72 WLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYV 126 L L + + + + E + +L R+ + Sbjct: 316 HLARALDWVGARPQGEQAP-VVLIYAWNENDEGGWLMPTLPCQTDRLDALRQVLKKT 371 >gi|295425765|ref|ZP_06818450.1| conserved hypothetical protein [Lactobacillus amylolyticus DSM 11664] gi|295064573|gb|EFG55496.1| conserved hypothetical protein [Lactobacillus amylolyticus DSM 11664] Length = 433 Score = 41.1 bits (95), Expect = 0.32, Method: Composition-based stats. 
Identities = 24/113 (21%), Positives = 41/113 (36%), Gaps = 16/113 (14%) Query: 97 SRKEQKAFLRLNRFMSNSR----MPFDSEKFLYVKELF-EGWNDRPSSPKKSGLTIK--- 148 + E+ FL + + +K + V E+ E W + +GL +K Sbjct: 122 ALNEKGDFLLPAAGHEKGYTRPIIAAEYKKPIKVGEITMEIWPSDHDAYGATGLIVKTPD 181 Query: 149 SKIA----IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197 KI+ I +H Y+ D E + DLF+T + E+ K Sbjct: 182 KKISFTGDIRLHGYHPDWVHEFLAA----SKGADLFITEATGSSWPERKNEKQ 230 >gi|328948788|ref|YP_004366125.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens DSM 2489] gi|328449112|gb|AEB14828.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens DSM 2489] Length = 589 Score = 41.1 bits (95), Expect = 0.35, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 15/46 (32%), Gaps = 1/46 (2%) Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 EN G D+ + L E + + IH K + W Sbjct: 215 EN-GNDIGAMIALFEKVKDIDHPVVLHIHTLKGKGYAPAEKNKEAW 259 >gi|313123698|ref|YP_004033957.1| metallo-beta-lactamase superfamily hydrolase [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|312280261|gb|ADQ60980.1| Metallo-beta-lactamase superfamily hydrolase [Lactobacillus delbrueckii subsp. bulgaricus ND02] Length = 412 Score = 40.7 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 41/113 (36%), Gaps = 16/113 (14%) Query: 97 SRKEQKAFLRLNRFMSNSR----MPFDSEKFLYVKELF-EGWNDRPSSPKKSGLTIK--- 148 + E+ FL + + +K + V E+ E W + +GL +K Sbjct: 101 ALNEKGDFLLPAAGHEKGYTRPIIAAEYKKPIKVGEITMEIWPSDHDAYGATGLIVKTPD 160 Query: 149 SKIA----IVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKY 197 KI+ I +H Y+ D E + DLF+T + E+ K Sbjct: 161 KKISFTGDIRLHGYHPDWVHEFLAA----SKGADLFITEATGSSWPERKNEKQ 209 >gi|300727407|ref|ZP_07060816.1| 1-deoxy-d-xylulose-5-phosphate synthase 2 [Prevotella bryantii B14] gi|299775287|gb|EFI71886.1| 1-deoxy-d-xylulose-5-phosphate synthase 2 [Prevotella bryantii B14] Length = 584 Score = 40.3 bits (93), Expect = 0.46, Method: Composition-based stats. 
Identities = 18/148 (12%), Positives = 40/148 (27%), Gaps = 14/148 (9%) Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E N ++ K F +YV E G +V + + + + IH +K Sbjct: 194 ETNGQAANNLFKAFGLDYIYVEE--GNNVGKLVEAFQKVKDIDHPIVVHIHTEKGHGYAP 251 Query: 245 HPIEGIIWRRWL----FFDLLGFSDIAIR--------IINTFEQNPCLGMIGSRRYRRYK 292 W + L + +++P + I + + Sbjct: 252 AVENKEGWHYHMPFNREDGSLKNPGNGENMTALLGQWMAEQLKKDPKMVCIAAGTAPAFY 311 Query: 293 RWSFFAKRSEVYRRVIDLAKRAGFPTKR 320 + + + +A+ G Sbjct: 312 FDKERREEAGKQFIDVGIAEEEGVAIAS 339 >gi|312373266|gb|EFR21041.1| hypothetical protein AND_17673 [Anopheles darlingi] Length = 1344 Score = 40.3 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 9/65 (13%), Positives = 21/65 (32%), Gaps = 4/65 (6%) Query: 98 RKEQKAFLRLNRFMSNSRM----PFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAI 153 E+ +L + + + + + E P++ ++ IA+ Sbjct: 67 WIEEGGYLEKELAYTRKALGETDSSTHQLLIQTPKDMEASILHPTALLTHLDVVRKAIAV 126 Query: 154 VVHCY 158 VH Y Sbjct: 127 TVHMY 131 >gi|297569722|ref|YP_003691066.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2] gi|296925637|gb|ADH86447.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2] Length = 318 Score = 40.3 bits (93), Expect = 0.57, Method: Composition-based stats. 
Identities = 24/166 (14%), Positives = 46/166 (27%), Gaps = 35/166 (21%) Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANKD----FEQDVLKYFPSAQLYVMENKG-RDVRP 215 D ++ DL+V V + + D + + ++ N+G R V P Sbjct: 17 DYMRHTLDSMVAQTVRPDLWVIVDDGSTDQTPQILAEYAAKYDFIKIVPKANRGHRSVGP 76 Query: 216 -----FLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRI 270 F D ++Y+CK+ L + Sbjct: 77 GVIEAFYAGYRAVRPDDFEYICKLD---------------------LDLELP-PRYFEIL 114 Query: 271 INTFEQNPCLGMIGSRRYRRYKRWSFFAKR---SEVYRRVIDLAKR 313 + E+NP +G + Y E + L ++ Sbjct: 115 LKRLEENPRIGTCSGKPYFLDNESGKLISEKCGDENSVGMTKLFRK 160 >gi|153811516|ref|ZP_01964184.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174] gi|149832257|gb|EDM87342.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174] Length = 589 Score = 39.9 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 16/46 (34%), Gaps = 1/46 (2%) Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 EN G D+ + L + + IH +K + + W Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263 >gi|153814764|ref|ZP_01967432.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756] gi|145847795|gb|EDK24713.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756] Length = 589 Score = 39.9 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 16/46 (34%), Gaps = 1/46 (2%) Query: 207 ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 EN G D+ + L + + IH +K + + W Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263 >gi|255533249|ref|YP_003093621.1| hypothetical protein Phep_3365 [Pedobacter heparinus DSM 2366] gi|255346233|gb|ACU05559.1| hypothetical protein Phep_3365 [Pedobacter heparinus DSM 2366] Length = 348 Score = 39.9 bits (92), Expect = 0.65, Method: Composition-based stats. 
Identities = 9/88 (10%), Positives = 21/88 (23%), Gaps = 12/88 (13%) Query: 38 VSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGS 97 V ++ + + F + +K + S RI S Sbjct: 268 VPCIAPGYNDKAMTPASKMYDIGYTPEFYTDF----------TNVAKRNMSSKRIVLINS 317 Query: 98 RK--EQKAFLRLNRFMSNSRMPFDSEKF 123 + + N + ++F Sbjct: 318 WNNFQLGTAIEPTETYGNIFLQMTRKQF 345 >gi|229112680|ref|ZP_04242216.1| Glycosytransferase [Bacillus cereus Rock1-15] gi|228670812|gb|EEL26120.1| Glycosytransferase [Bacillus cereus Rock1-15] Length = 355 Score = 39.5 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 37/89 (41%), Gaps = 6/89 (6%) Query: 122 KFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVH-----CYYQDTWIEISHILLRLNFD 176 K +++ G R K G K K+ + +H +YQ++ +I + + + Sbjct: 73 KIMHIHTASRGSFFRKRIFVKLGKLFKKKVVLHIHGAEFMVFYQESSEDIRNQIREILNQ 132 Query: 177 FDLFVTVVEANKDFEQDVLKYFPSAQLYV 205 D+ +T+ + K+ + + + ++ Sbjct: 133 VDVIITLSQKWKEDIESITNN-RNVKVIY 160 >gi|145591836|ref|YP_001153838.1| hypothetical protein Pars_1633 [Pyrobaculum arsenaticum DSM 13514] gi|145283604|gb|ABP51186.1| hypothetical protein Pars_1633 [Pyrobaculum arsenaticum DSM 13514] Length = 609 Score = 39.5 bits (91), Expect = 0.97, Method: Composition-based stats. Identities = 8/103 (7%), Positives = 20/103 (19%), Gaps = 11/103 (10%) Query: 22 DVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSK 81 E + + + T + F + R + + Sbjct: 515 KYGEWSEATNALGVGFIPSAMPGFDD--RAIRTGHIPLPKSTERFRKQLIIARQYTNINT 572 Query: 82 YSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFL 124 I + E + + S + + L Sbjct: 573 IL--------ITTFNEWHENTN-IEPSVKDGFSYLQVLKQVLL 606 >gi|260889525|ref|ZP_05900788.1| 1-deoxy-D-xylulose 5-phosphate synthase [Leptotrichia hofstadii F0254] gi|260860936|gb|EEX75436.1| 1-deoxy-D-xylulose 5-phosphate synthase [Leptotrichia hofstadii F0254] Length = 592 Score = 39.5 bits (91), Expect = 0.99, Method: Composition-based stats. 
Identities = 13/72 (18%), Positives = 28/72 (38%), Gaps = 2/72 (2%) Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E+N + + K +YV +KG D+ + + E + + +H +K + Y Sbjct: 203 ESNGQAQNNYFKSLGLDYIYV--DKGNDLEALIEVFEKVKDINHPIVVHVHTQKGKGLPY 260 Query: 245 HPIEGIIWRRWL 256 + W + Sbjct: 261 AEKDKETWHYGM 272 >gi|193216921|ref|YP_002000163.1| ribosome biogenesis GTP-binding protein YsxC [Mycoplasma arthritidis 158L3-1] gi|238692481|sp|B3PN57|ENGB_MYCA5 RecName: Full=Probable GTP-binding protein EngB gi|193002244|gb|ACF07459.1| GTPase protein YihA (EngB) [Mycoplasma arthritidis 158L3-1] Length = 183 Score = 39.2 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 8/126 (6%) Query: 73 LRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEG 132 L + LA K +K S R + Q+ + ++ + + + + Sbjct: 35 LINALASQKIAKTSSTPGRTRLINYFETQRKKIIVDLP-GYGFASMSKKAQSKISGIIDF 93 Query: 133 WNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVT-VVEANKDFE 191 + + K + I +KI Y D E+ L L FD+ +T + +AN+ + Sbjct: 94 YFRNSKNSKNICILIDAKIGFS----YIDL--EMIDYLKSLGLLFDIIITKIDKANQSQK 147 Query: 192 QDVLKY 197 V + Sbjct: 148 HRVKQQ 153 >gi|94310676|ref|YP_583886.1| glycosyl transferase family protein [Cupriavidus metallidurans CH34] gi|93354528|gb|ABF08617.1| Cellulose synthase (UDP-forming), putative glycosyl transferase [Cupriavidus metallidurans CH34] Length = 658 Score = 38.8 bits (89), Expect = 1.4, Method: Composition-based stats. 
Identities = 32/182 (17%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 176 DFDLFVTVVEA-----NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY-- 228 D+F+ K + +P+ +++V+++ RD +L V RY Sbjct: 117 PVDIFIATYNEGLDVLEKTIVAALDIDYPNFRVWVLDDTRRD---WLREFCDQVGARYVT 173 Query: 229 --DYLCKIHGK-----KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLG 281 D H K R G + L D +I +RI+ F+ +P +G Sbjct: 174 RPDNA---HAKAGNLNNGLRHSAELDGGAPFIMVLDADFAPNRNILLRIVGLFD-DPQVG 229 Query: 282 MI-GSRRYRRYKRWSFFAKRSEVY-RRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLE 339 ++ + Y + + +E + F GT F V+ + L+ Sbjct: 230 VVQTPQFYYNADPIQYNLRSTECWVDEQRAFFDVMQPSKDAWGTAFCIGTSFVVRREALD 289 Query: 340 PL 341 + Sbjct: 290 RI 291 >gi|118400046|ref|XP_001032346.1| hypothetical protein TTHERM_00636850 [Tetrahymena thermophila] gi|89286687|gb|EAR84683.1| hypothetical protein TTHERM_00636850 [Tetrahymena thermophila SB210] Length = 420 Score = 38.8 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 21/78 (26%), Gaps = 8/78 (10%) Query: 161 DTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLL 220 D +I L L + + + + ++ + Q+ P Sbjct: 17 DLHKKIIKYLAPLPDQRAFIIYTSQEQNEDDTYIICNLLNGQIKY--------GPLFSKA 68 Query: 221 ELGVFDRYDYLCKIHGKK 238 E D + + KK Sbjct: 69 EKIENINDDLIVFVDSKK 86 >gi|253682781|ref|ZP_04863576.1| putative formyl-CoA transferase [Clostridium botulinum D str. 1873] gi|253560980|gb|EES90434.1| putative formyl-CoA transferase [Clostridium botulinum D str. 1873] Length = 391 Score = 38.8 bits (89), Expect = 1.7, Method: Composition-based stats. 
Identities = 22/108 (20%), Positives = 34/108 (31%), Gaps = 21/108 (19%) Query: 236 GKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA-----IRIINTF--------EQNPCL-- 280 KKS YH EG + L+ +D+ + +F E NP L Sbjct: 63 SKKSITINYHKSEGA----EIIKRLVKNTDMIIFNEPEEKLKSFGLGFPELKEVNPKLVY 118 Query: 281 GMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFFNG 328 G++ + W + L ++ G P K F G Sbjct: 119 GILTP--FGEEGPWKDMPDYDLIIMARTGLLEKTGMPEKPTKFGFPLG 164 >gi|321451847|gb|EFX63374.1| hypothetical protein DAPPUDRAFT_335541 [Daphnia pulex] Length = 337 Score = 38.4 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 65/221 (29%), Gaps = 20/221 (9%) Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVK 127 F +R+ L L + + R F G+ E+ +L R ++ F Sbjct: 31 QFYSGVRTALGLQSNQLLIYYTER-VFAGTANEEPNYLERGRCRKRRKLRFVVVNRRLPS 89 Query: 128 ELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-- 185 + P + S + + Y++ RL FD+ + Sbjct: 90 TKLWCDFVNTNYPHRLMSLNSSDSLVQMQIYFR----TCMQPFGRLLASFDIHLNGPLKL 145 Query: 186 --ANKDFEQDVLKYFPSAQLYVMENKGRDVRP--FLYLLELGVFDRYDYLCKIHGKKSQR 241 + + A V + P F L E + C+ H KK + Sbjct: 146 LLDRVAKREYASQEIKEAYFLVFTS-----YPSTFKPLFEKTALVEDELYCEFHDKK-RD 199 Query: 242 EGYHPIEGIIWRRWLF---FDLLGFSDIAIRIINTFEQNPC 279 + ++ + +R LL + II F +N Sbjct: 200 KVFNSRQAKSYRLKNLAKTERLLSTKNGVNSIIYHFAENEK 240 >gi|239616887|ref|YP_002940209.1| hypothetical protein Kole_0482 [Kosmotoga olearia TBF 19.5.1] gi|239505718|gb|ACR79205.1| hypothetical protein Kole_0482 [Kosmotoga olearia TBF 19.5.1] Length = 715 Score = 38.4 bits (88), Expect = 1.9, Method: Composition-based stats. 
Identities = 7/88 (7%), Positives = 18/88 (20%), Gaps = 17/88 (19%) Query: 37 HVSGYYVLWSFS-PKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFY 95 HV + + + S D L + + Sbjct: 384 HVMNIMPGYDDTHVRVPGFSVDREN------GKLYEELWKLV--------LNLDPDMVII 429 Query: 96 GSRKE--QKAFLRLNRFMSNSRMPFDSE 121 S E + + + + + + Sbjct: 430 TSWNEWHEGSEIEPSLEYGRKYLEITKK 457 >gi|310831260|ref|YP_003969903.1| hypothetical protein crov271 [Cafeteria roenbergensis virus BV-PW1] gi|309386444|gb|ADO67304.1| hypothetical protein crov271 [Cafeteria roenbergensis virus BV-PW1] Length = 821 Score = 38.4 bits (88), Expect = 2.2, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 57/203 (28%), Gaps = 47/203 (23%) Query: 197 YFPSAQLYVMEN-KGRDVRPFLYLLE-LGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRR 254 F + + N G D+ P + F ++Y+ K+ K I WR Sbjct: 210 NFNNKYFVIETNEYGNDIIPTIIGFNFANTFLNFNYILKLQTK----------SDIKWRN 259 Query: 255 WLFFDLL-GFSDIAIRIIN--TFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLA 311 L L I +++ F +P + + L Sbjct: 260 PLINFFLNKSKTDLINLLDNQEFICHPKF----------------------ISKITSTLI 297 Query: 312 KRAGFPTKR-LHLDFFNGTMFWVKPKCLEPL---------RNLHLIGEFEEERNLKDGAL 361 + F G++++ K + + + ++ L+ + Sbjct: 298 NKLFLQNLNWNDKSFPAGSIYFCKKHKFDNMIKFINYSSPHKYFIQTMYDTHYVLRGNSS 357 Query: 362 EHAVERFFACSVRYTEFSIESVD 384 H +ER ++ F+ S Sbjct: 358 VHFLERLVGINLDKHIFTTSSNY 380 >gi|281492254|ref|YP_003354234.1| 1-deoxy-D-xylulose 5-phosphate synthase [Lactococcus lactis subsp. lactis KF147] gi|281375925|gb|ADA65419.1| 1-deoxy-D-xylulose 5-phosphate synthase [Lactococcus lactis subsp. lactis KF147] Length = 580 Score = 38.0 bits (87), Expect = 2.4, Method: Composition-based stats. 
Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 1/57 (1%) Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260 +EN G D+ ++L E + + IH +K + + + FDL Sbjct: 211 KYLEN-GNDIESLIHLFEEVKDINHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266 >gi|310831259|ref|YP_003969902.1| hypothetical protein crov270 [Cafeteria roenbergensis virus BV-PW1] gi|309386443|gb|ADO67303.1| hypothetical protein crov270 [Cafeteria roenbergensis virus BV-PW1] Length = 781 Score = 38.0 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 48/176 (27%), Gaps = 43/176 (24%) Query: 208 NKGRDVRPFLYLLELGVFD-RYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDI 266 N G D+ P L + + Y+ KIH K ++ I + +L Sbjct: 327 NIGNDLIPSLKIFNDNYSKFNFKYVLKIHTK------HNQIFNELTDFFLINY------- 373 Query: 267 AIRIINTFEQNPCLGMIGSRRYRRYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 +IN E N + I +Y + K+ + +L F Sbjct: 374 -DNLINVMEDNHQIDFITKHKYCYNIEKDCYNKKITNKIIINK------------NLFFC 420 Query: 327 NGTMFWVKPKCLEP------------LRNLHLIGEFEEERNLKDGALEHAVERFFA 370 + F K L N + + H +ER + Sbjct: 421 AISFFIGKKDIFIKNLNKVAFLFKPSLLNCFYYDNI----MFINNSPVHTIERVIS 472 >gi|85713618|ref|ZP_01044608.1| hypothetical protein NB311A_03739 [Nitrobacter sp. Nb-311A] gi|85699522|gb|EAQ37389.1| hypothetical protein NB311A_03739 [Nitrobacter sp. Nb-311A] Length = 387 Score = 38.0 bits (87), Expect = 2.9, Method: Composition-based stats. 
Identities = 12/120 (10%), Positives = 24/120 (20%), Gaps = 15/120 (12%) Query: 19 LRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPK-----------QRITSKDVHFQELSIFE 67 L E N + V W P+ + + F + E Sbjct: 259 LARFAERGWNALSHGRLPVVPTVMTGWDRRPRIEHPVPWETKQRPGEGMENFFTAPTKKE 318 Query: 68 SFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLY 125 R+ + + E + +L R+ + Sbjct: 319 LADHLARALGWVAARPPD--EQAPAVLIYAWNENDEGGWLMPTLPCQTDRLDALRQVLKK 376 >gi|85105803|ref|XP_962039.1| peroxisomal hydratase-dehydrogenase-epimerase [Neurospora crassa OR74A] gi|3929350|sp|Q01373|FOX2_NEUCR RecName: Full=Peroxisomal hydratase-dehydrogenase-epimerase; Short=HDE; AltName: Full=Multifunctional beta-oxidation protein; Short=MFP; Includes: RecName: Full=2-enoyl-CoA hydratase; Includes: RecName: Full=(3R)-3-hydroxyacyl-CoA dehydrogenase gi|510867|emb|CAA56355.1| multifunctional beta-oxidation protein [Neurospora crassa] gi|28923632|gb|EAA32803.1| peroxisomal hydratase-dehydrogenase-epimerase [Neurospora crassa OR74A] Length = 894 Score = 37.6 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 40/106 (37%), Gaps = 15/106 (14%) Query: 185 EANKDFEQDVLKYFPSAQLYVMENKG--RDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 E + +K F + ++ N G RD+ + + +D + K+H K S + Sbjct: 77 ENGDKIIETAIKEFGRIDI-LINNAGILRDIS-----FKNMKDEDWDLIFKVHVKGSYKT 130 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ----NPCLGMIG 284 +R+ F ++ + A + F Q LGM+G Sbjct: 131 ARAAWP--YFRKQKFGRVI-NTASAAGLFGNFGQANYSAAKLGMVG 173 >gi|312863645|ref|ZP_07723883.1| conserved hypothetical protein [Streptococcus vestibularis F0396] gi|311101181|gb|EFQ59386.1| conserved hypothetical protein [Streptococcus vestibularis F0396] Length = 262 Score = 37.6 bits (86), Expect = 3.2, Method: Composition-based stats. 
Identities = 28/195 (14%), Positives = 60/195 (30%), Gaps = 23/195 (11%) Query: 152 AIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVM----E 207 AI VH + +L+ + ++ E ++L F + ++ + Sbjct: 2 AIHVHISDLERLKVFFD--SKLSAFYYFTLSGHLDKNQVENNLLNSFDKDRFQIVSQKFD 59 Query: 208 NKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIA 267 N + YD++ H + + R L LL Sbjct: 60 NHYHALVSL-----ASQLSEYDFIGHFHT--ADFGNEGKLVDEATRLALIDMLL-DEKKV 111 Query: 268 IRIINTFEQNPCLGMIGSRRYR-RYKRWSFFAKRSEVYRRVIDLAKRAGFPTKRLHLDFF 326 I F P +G++ + + Y + ++ + ++ + L F Sbjct: 112 SSIFADF---PEVGLVFADLSKELYWTDAIGTLNQNQAAKLDNECQKT----IKNSLHVF 164 Query: 327 NGTMFWVKPKCLEPL 341 G+M W+ LE + Sbjct: 165 QGSM-WLSKDFLEKI 178 >gi|213406643|ref|XP_002174093.1| DNA polymerase epsilon catalytic subunit A [Schizosaccharomyces japonicus yFS275] gi|212002140|gb|EEB07800.1| DNA polymerase epsilon catalytic subunit A [Schizosaccharomyces japonicus yFS275] Length = 2185 Score = 37.6 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 86 SFPSCRIFFYGSRKEQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRP-SSPKKSG 144 + + + E ++ ++ M + + N P S P Sbjct: 28 AKANEEVVLENIWNEIQSKNEIDTKMGFDNIEA------GPPRIGWLLNVHPTSVPSDDN 81 Query: 145 LTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE-ANKDFEQDVLKYFPS 200 KS IA Y+ E + + + L+V+ E + E + K FP+ Sbjct: 82 ANGKSAIA----LYFIQEDGETFR--VTVPYRPYLYVSTKEGKEAEVEDYLKKAFPN 132 >gi|73539059|ref|YP_299426.1| cellulose synthase (UDP-forming) [Ralstonia eutropha JMP134] gi|72122396|gb|AAZ64582.1| Cellulose synthase (UDP-forming) [Ralstonia eutropha JMP134] Length = 659 Score = 37.6 bits (86), Expect = 3.6, Method: Composition-based stats. 
Identities = 30/181 (16%), Positives = 59/181 (32%), Gaps = 24/181 (13%) Query: 178 DLFVTVVEA-----NKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRY---- 228 D+F+ K + +P+ +++V+++ RD +L V Y Sbjct: 119 DIFIATYNEGLDVLEKTIVSALAIDYPNFRVWVLDDTRRD---WLKAYCARVGACYVTRP 175 Query: 229 DYLCKIHGKKS------QREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGM 282 D H K G + L D +I +RI+ F+ +P +G+ Sbjct: 176 DNA---HAKAGNLNNGLMHSAAQRGGGAPFIMVLDADFAPNRNILLRIVGLFD-DPAVGV 231 Query: 283 I-GSRRYRRYKRWSFFAKRSEVY-RRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEP 340 + + Y + + +E + F GT F V+ + L Sbjct: 232 VQTPQFYYNADPIQYNLRSTECWVDEQRAFFDIMQPAKDAWGTAFCIGTSFVVRREALAR 291 Query: 341 L 341 + Sbjct: 292 I 292 >gi|228477260|ref|ZP_04061898.1| rhamnosyltransferase [Streptococcus salivarius SK126] gi|228251279|gb|EEK10450.1| rhamnosyltransferase [Streptococcus salivarius SK126] Length = 547 Score = 37.6 bits (86), Expect = 3.7, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 73/222 (32%), Gaps = 26/222 (11%) Query: 78 AFSKYSKLSFPSCRIFFYG----SRKEQKAFLRLN-RFMSNSRMPFDSEKFL---YVKEL 129 FS Y +S ++ F + E+K L L+ ++ + L + + Sbjct: 204 DFSYYRPISTLEHKVPFIKLKAFTDNEKKGRLLLDYLANLSTYPVALIKSHLNRYHSPDS 263 Query: 130 FEGWNDRPSSPKKSGLTI-KSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANK 188 +++ P L+ + ++ I VH + +L+ + ++ Sbjct: 264 LVISDEKIIGPSFITLSKHEYRMVIHVHISDLERLKVFFD--SKLSAFYYFTLSSHLDKN 321 Query: 189 DFEQDVLKYFPSAQLYVM----ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E +L F + ++ EN + + YD++ H + G Sbjct: 322 KVENTLLNSFDKDRFQLVSKTFENHYHAL-----VFLASHLSEYDFVGHFHT---EAFGN 373 Query: 245 HPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSR 286 R ++L + + I + F P +G++ + Sbjct: 374 EGKLVDEDTRHALVNMLSDEEKVVSIFDHF---PEVGLVFAD 412 >gi|294499860|ref|YP_003563560.1| hypothetical protein BMQ_3104 [Bacillus megaterium QM B1551] gi|294349797|gb|ADE70126.1| hypothetical protein BMQ_3104 [Bacillus megaterium QM B1551] Length = 123 Score = 37.6 bits (86), Expect = 3.8, Method: Composition-based stats. 
Identities = 10/31 (32%), Positives = 15/31 (48%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179 KIA ++H YY E + +L + DL Sbjct: 43 KKIAQLIHLYYPGMLFEFATLLTNPTYRIDL 73 >gi|116512536|ref|YP_811443.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp. cremoris SK11] gi|116108190|gb|ABJ73330.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp. cremoris SK11] Length = 580 Score = 37.2 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 27/76 (35%), Gaps = 2/76 (2%) Query: 185 EANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGY 244 E N E + K F +EN G D+ + L E + + IH +K + Sbjct: 193 ETNGQVENNFFKTF-GLDYKYLEN-GNDIESLVNLFEEVKDIDHPIVLHIHTEKGRGYQP 250 Query: 245 HPIEGIIWRRWLFFDL 260 + + FDL Sbjct: 251 ALENKEAFHWHMPFDL 266 >gi|306823338|ref|ZP_07456713.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium ATCC 27679] gi|309802562|ref|ZP_07696666.1| putative 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium dentium JCVIHMP022] gi|304553045|gb|EFM40957.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium ATCC 27679] gi|308220626|gb|EFO76934.1| putative 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium dentium JCVIHMP022] Length = 648 Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 13/42 (30%) Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250 +G DV + L + + +H K +G Sbjct: 224 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 265 >gi|283455635|ref|YP_003360199.1| 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium Bd1] gi|283102269|gb|ADB09375.1| dxs 1-deoxy-D-xylulose 5-phosphate synthase [Bifidobacterium dentium Bd1] Length = 651 Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats. 
Identities = 7/42 (16%), Positives = 13/42 (30%) Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250 +G DV + L + + +H K +G Sbjct: 227 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 268 >gi|171740975|ref|ZP_02916782.1| hypothetical protein BIFDEN_00037 [Bifidobacterium dentium ATCC 27678] gi|171276589|gb|EDT44250.1| hypothetical protein BIFDEN_00037 [Bifidobacterium dentium ATCC 27678] Length = 648 Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 13/42 (30%) Query: 209 KGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGI 250 +G DV + L + + +H K +G Sbjct: 224 RGNDVHALVEALRAVKDIDHPIVVHVHTTKGLGFDEAAGDGN 265 >gi|295705244|ref|YP_003598319.1| hypothetical protein BMD_3129 [Bacillus megaterium DSM 319] gi|294802903|gb|ADF39969.1| hypothetical protein BMD_3129 [Bacillus megaterium DSM 319] Length = 123 Score = 37.2 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 11/31 (35%), Positives = 15/31 (48%) Query: 149 SKIAIVVHCYYQDTWIEISHILLRLNFDFDL 179 KIA ++H YY E S +L + DL Sbjct: 43 KKIAQLIHLYYPGMLFEFSTLLTNPTYRIDL 73 >gi|325062498|gb|ADY66188.1| two component sensor kinase [Agrobacterium sp. H13-3] Length = 345 Score = 37.2 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 19/116 (16%), Positives = 36/116 (31%), Gaps = 4/116 (3%) Query: 116 MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNF 175 + S + L + +S K + I ++ H + T +E + L Sbjct: 15 LARLSSRILGRHGMEVVHAASVASGLKMFQDEQFDIVVLDHYFQTSTGMEFLAAIQSLPG 74 Query: 176 DFD-LFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDY 230 L+VT + + YV++N G D P L + + Sbjct: 75 RVPVLYVTGSNEAQIAIDALKAGAAD---YVIKNVGDDFFPLLLTAIDQSLENHRL 127 >gi|15673655|ref|NP_267829.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp. lactis Il1403] gi|12724687|gb|AAK05771.1|AE006398_2 1-deoxyxylulose-5-phosphate synthase [Lactococcus lactis subsp. lactis Il1403] Length = 580 Score = 37.2 bits (85), Expect = 4.5, Method: Composition-based stats. 
Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 1/57 (1%) Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260 +EN G D+ ++L E + + IH +K + + + FDL Sbjct: 211 KYLEN-GNDIESLIHLFEEVKDIDHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266 >gi|312278781|gb|ADQ63438.1| Rhamnosyltransferase [Streptococcus thermophilus ND03] Length = 547 Score = 37.2 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 43/280 (15%), Positives = 83/280 (29%), Gaps = 36/280 (12%) Query: 78 AFSKYSKLSFPSCRIFFYG----SRKEQKAFLRLN--RFMSNSRMPFDSEKFLYVKELFE 131 FS Y +S ++ F + E+K L L+ +S + Sbjct: 204 DFSYYRPISTLEHKVPFIKLKAFTDNEKKGRLLLDYITKLSAYPLALIKSHLNSYHSPDS 263 Query: 132 GWNDRPSSPKKSGLTIKSK---IAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVE--A 186 + S ++ K AI VH D E + + T+ Sbjct: 264 LVILDEKIIEPSFHSVSGKGYHSAIHVHI--SDL--ERLKVFSDKKLSAFYYFTLSSHLD 319 Query: 187 NKDFEQDVLKYFPSAQLYVM----ENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQRE 242 E +L F + ++ +N + L YD++ H + Sbjct: 320 KNIVENTLLNSFDKDRFQLVSQKFDNH---YYALVSLASQF--SEYDFVGHFHTE--DFG 372 Query: 243 GYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPCLGMIGSRRYR-RYKRWSFFAKRS 301 R L LL + I + F P +G++ + + Y + Sbjct: 373 NEGKFVDEATRLALVNMLL-DEERVASIFDHF---PEVGLVFADLSKELYWTDAIGTLNQ 428 Query: 302 EVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCLEPL 341 ++ + ++ + L F G+M W+ LE + Sbjct: 429 NQAAKLDNECQKT----IKNSLHVFQGSM-WLSKDFLEKI 463 >gi|77920184|ref|YP_357999.1| hypothetical protein Pcar_2591 [Pelobacter carbinolicus DSM 2380] gi|77546267|gb|ABA89829.1| hypothetical protein Pcar_2591 [Pelobacter carbinolicus DSM 2380] Length = 262 Score = 37.2 bits (85), Expect = 4.9, Method: Composition-based stats. 
Identities = 18/105 (17%), Positives = 35/105 (33%), Gaps = 13/105 (12%) Query: 139 SPKKSGLTIKSKIAIVV--HCYYQDTWIEISHILLRLNFD-----FDLFVTVVEANKDFE 191 SP + L +IA+V+ D + ++ +N +DLF+ Sbjct: 5 SPSTNCLPANGRIAVVISTWIGNPD--DYLLRLMDSMNTHSAGMDYDLFLCANGETYKLP 62 Query: 192 QDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYLCKIHG 236 ++ F + EN G ++ + Y Y Y + Sbjct: 63 ANLQASFKKIFIR--ENSGFNLGAWDYAWRR--LSNYRYFLFLQD 103 >gi|146099084|ref|XP_001468551.1| ATP-dependent RNA helicase [Leishmania infantum] gi|134072919|emb|CAM71636.1| putative ATP-dependent RNA helicase [Leishmania infantum JPCM5] Length = 803 Score = 36.9 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%) Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203 YY D I L DL T + E + E D LK + Sbjct: 381 YYIDLLQFIGRPLQSAPVPGDLLFTPDNGCYGRLPEEDIQLELDFLKRLHENDVEVRSMA 440 Query: 204 YVMEN 208 V+EN Sbjct: 441 RVVEN 445 >gi|322502575|emb|CBZ37658.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 803 Score = 36.9 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%) Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203 YY D I L DL T + E + E D LK + Sbjct: 381 YYIDLLQFIGRPLQSAPVPGDLLFTPDNGCYGRLPEEDIQLELDFLKRLHENDVEVRSMA 440 Query: 204 YVMEN 208 V+EN Sbjct: 441 RVVEN 445 >gi|1764094|gb|AAB39865.1| ATP-dependent RNA helicase [Leishmania amazonensis] Length = 855 Score = 36.9 bits (84), Expect = 6.0, Method: Composition-based stats. 
Identities = 16/65 (24%), Positives = 20/65 (30%), Gaps = 14/65 (21%) Query: 158 YYQDTWIEISHILLRLNFDFDLFVT--------VVEANKDFEQDVLKYFPSAQL------ 203 YY D I L DL T + E + E D LK + Sbjct: 384 YYVDLMQFIGRPLQSSPVPGDLLFTADDGCYGRLPEEDIQLELDFLKRLHENDVEVRNMA 443 Query: 204 YVMEN 208 V+EN Sbjct: 444 RVVEN 448 >gi|313905641|ref|ZP_07839002.1| 1-deoxy-D-xylulose-5-phosphate synthase [Eubacterium cellulosolvens 6] gi|313469465|gb|EFR64806.1| 1-deoxy-D-xylulose-5-phosphate synthase [Eubacterium cellulosolvens 6] Length = 588 Score = 36.5 bits (83), Expect = 8.1, Method: Composition-based stats. Identities = 6/43 (13%), Positives = 15/43 (34%) Query: 210 GRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIW 252 G D+ ++ L + + +H +K + + W Sbjct: 219 GNDIPMLIHALREVKDVDHPIVLHVHTQKGKGYKPAEEDRESW 261 >gi|18976927|ref|NP_578284.1| hypothetical protein PF0555 [Pyrococcus furiosus DSM 3638] gi|18892545|gb|AAL80679.1| hypothetical protein PF0555 [Pyrococcus furiosus DSM 3638] Length = 257 Score = 36.5 bits (83), Expect = 8.4, Method: Composition-based stats. Identities = 7/96 (7%), Positives = 25/96 (26%), Gaps = 12/96 (12%) Query: 35 PAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFF 94 + + + + + ++ F + L +C+ Sbjct: 171 KCFIPTVSPGFDRTFDKSFNQQFPIPRDPKRFAEMLKIALDSLG----------NCKEIR 220 Query: 95 YGSRKE--QKAFLRLNRFMSNSRMPFDSEKFLYVKE 128 + + + F+ + + + E +KE Sbjct: 221 IDTWNDFYEGTFIEPSVSDGFTYLEVLEEFIKELKE 256 >gi|16519726|ref|NP_443846.1| probable methyl-accepting membrane chemoreceptor [Sinorhizobium fredii NGR234] gi|2497833|sp|P55439|Y4FA_RHISN RecName: Full=Probable chemoreceptor y4fA; AltName: Full=Methyl-accepting chemotaxis protein gi|2182384|gb|AAB91658.1| probable methyl-accepting membrane chemoreceptor [Sinorhizobium fredii NGR234] Length = 845 Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats. 
Identities = 31/235 (13%), Positives = 62/235 (26%), Gaps = 39/235 (16%) Query: 61 QELSIFESFIFWLRSFLAFSKYSKL-----SFPSCRIFFYGSRKEQKAFLRLNRFMSNSR 115 L F+ + FL + R + A L + Sbjct: 54 ASLRGFKDVYAAMIGFLDQTTEENRALVFSKLDEQRAALDAA----GARLTPKAE-GWAE 108 Query: 116 MPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNF 175 + S + + + + IK + ++ ++ Sbjct: 109 LESASAALSAINGRMDDLWALHADEARLEAGIKEALGVIS----TSQADLLTAATA---- 160 Query: 176 DFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYLLELGVFDRYDYL---- 231 FD ++ E + + + SA +V+ RD F + Y + Sbjct: 161 -FDKSISTQEDDAKEKLRDAQRILSATSFVV--ALRD--AF--AARKDDGEGYRAIAAAM 213 Query: 232 --CKIHGK--------KSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQ 276 KIH K KS+ G + L D + +I++ F + Sbjct: 214 GDLKIHQKLLPIALPKKSKPLGKAFSANVRALSALVDDKARPPENIEKILDVFAE 268 >gi|308479828|ref|XP_003102122.1| hypothetical protein CRE_06803 [Caenorhabditis remanei] gi|308262277|gb|EFP06230.1| hypothetical protein CRE_06803 [Caenorhabditis remanei] Length = 1266 Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 33/93 (35%), Gaps = 4/93 (4%) Query: 140 PKKSGLTIKSKIAIVVHCYYQDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFP 199 K A V H +Y +T EI +L ++ DL+ ++ ++ V P Sbjct: 908 GMAQEFMAKETKAYVCHVHYAETLDEIYSMLK-MSCPEDLYNCTLDQMENVLITVTALRP 966 Query: 200 SAQLYVMENKGRDVRPFLYLLELGVFDRYDYLC 232 L +++ + F + +YD Sbjct: 967 DITLDQLKSI---LFKFAQRYKHLKETKYDLFG 996 >gi|301067040|ref|YP_003789063.1| glycosyl transferase, family 2 [Lactobacillus casei str. Zhang] gi|300439447|gb|ADK19213.1| Glycosyl transferase, family 2 [Lactobacillus casei str. Zhang] Length = 313 Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats. 
Identities = 21/93 (22%), Positives = 37/93 (39%), Gaps = 14/93 (15%) Query: 150 KIAIVVHCY----YQDTWIEISHILLRLNFDFDLFV---TVVEANKDFEQDVLKYFPSAQ 202 KIAI++ + Y ++ IL + + DLF+ + +++ + + P Sbjct: 2 KIAILLSVFNGELY--LGKQVKSILEQKDVKLDLFIRDDGSTDGSRELVESIAATDPRVH 59 Query: 203 LYVMENKG--RDVRPFLYLLELGVFDRYDYLCK 233 L + N G R FL L+ YDY Sbjct: 60 LIIGHNVGYKR---SFLELVNEPSMSDYDYFAF 89 >gi|291518896|emb|CBK74117.1| Domain of unknown function (DUF1975) [Butyrivibrio fibrisolvens 16/4] Length = 320 Score = 36.1 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 7/68 (10%), Positives = 25/68 (36%), Gaps = 13/68 (19%) Query: 150 KIAIVVHC-YYQD--------TWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPS 200 ++ +V+H ++ + W ++ D ++T + ++ + + + Sbjct: 234 RVGVVIHADHFSEGATDDDNILWNNFYEYTFSMHRHIDFYITATDDQRNLLIEQFEKY-- 291 Query: 201 AQLYVMEN 208 + V N Sbjct: 292 --VGVTPN 297 >gi|125623603|ref|YP_001032086.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp. cremoris MG1363] gi|124492411|emb|CAL97353.1| 1-deoxyxylulose-5-phosphate synthase [Lactococcus lactis subsp. cremoris MG1363] gi|300070369|gb|ADJ59769.1| 1-deoxy-D-xylulose-5-phosphate synthase [Lactococcus lactis subsp. cremoris NZ9000] Length = 580 Score = 36.1 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 1/57 (1%) Query: 204 YVMENKGRDVRPFLYLLELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDL 260 +EN G D+ + L E + + IH +K + + + FDL Sbjct: 211 KYLEN-GNDIESLVNLFEEVKDIDHPIVLHIHTEKGRGYQPALENKEAFHWHMPFDL 266 >gi|89073969|ref|ZP_01160475.1| hypothetical protein SKA34_15405 [Photobacterium sp. SKA34] gi|89050297|gb|EAR55801.1| hypothetical protein SKA34_15405 [Photobacterium sp. SKA34] Length = 579 Score = 36.1 bits (82), Expect = 10.0, Method: Composition-based stats. 
Identities = 14/82 (17%), Positives = 27/82 (32%) Query: 14 IENLLLRLDVEEKGNMQAIYIPAHVSGYYVLWSFSPKQRITSKDVHFQELSIFESFIFWL 73 + N +++ D+ K Y+ H W + K I S H + F + Sbjct: 392 VPNSIMKKDIMPKVRRNRNYLANHNYEVIKGWDNAQKIAINSIPYHLLSPNRFPYRLRQK 451 Query: 74 RSFLAFSKYSKLSFPSCRIFFY 95 K +FP+ + + Sbjct: 452 PGKRNALGLYKFNFPNNQAIYL 473

  Database: nr
    Posted date: May 13, 2011 4:10 AM
  Number of letters in database: 999,999,932
  Number of sequences in database: 2,987,209

  Database: /data/usr2/db/fasta/nr.01
    Posted date: May 13, 2011 4:17 AM
  Number of letters in database: 999,998,956
  Number of sequences in database: 2,896,973

  Database: /data/usr2/db/fasta/nr.02
    Posted date: May 13, 2011 4:23 AM
  Number of letters in database: 999,999,979
  Number of sequences in database: 2,907,862

  Database: /data/usr2/db/fasta/nr.03
    Posted date: May 13, 2011 4:29 AM
  Number of letters in database: 999,999,513
  Number of sequences in database: 2,932,190

  Database: /data/usr2/db/fasta/nr.04
    Posted date: May 13, 2011 4:33 AM
  Number of letters in database: 792,586,372
  Number of sequences in database: 2,260,650

Lambda     K        H
0.312      0.145    0.443

Lambda     K        H
0.267      0.0443   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,672,879,989
Number of Sequences: 13984884
Number of extensions: 148137242
Number of successful extensions: 589719
Number of sequences better than 10.0: 468
Number of HSP's better than 10.0 without gapping: 340
Number of HSP's successfully gapped in prelim test: 128
Number of HSP's that attempted gapping in prelim test: 587837
Number of HSP's gapped (non-prelim): 604
length of query: 394
length of database: 4,792,584,752
effective HSP length: 141
effective length of query: 253
effective length of database: 2,820,716,108
effective search space: 713641175324
effective search space used: 713641175324
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (20.8 bits)
S2: 82 (36.1 bits)