BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780923|ref|YP_003065336.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] (365 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780923|ref|YP_003065336.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] gi|254040600|gb|ACT57396.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] Length = 365 Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust. Identities = 365/365 (100%), Positives = 365/365 (100%) Query: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV 60 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV Sbjct: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV 60 Query: 61 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI 120 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI Sbjct: 61 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI 120 Query: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV 180 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV Sbjct: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV 180 Query: 181 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT 240 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT Sbjct: 181 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT 240 Query: 241 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW 300 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW Sbjct: 241 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW 300 Query: 301 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK 360 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK Sbjct: 301 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK 360 Query: 361 SLSQN 365 SLSQN Sbjct: 361 SLSQN 365 >gi|315122651|ref|YP_004063140.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496053|gb|ADR52652.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 405 Score = 465 bits (1196), Expect = e-129, Method: Compositional matrix adjust. Identities = 221/357 (61%), Positives = 280/357 (78%), Gaps = 7/357 (1%) Query: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLF-YKRSKKLCYDENY 59 +PVSGC KGY LF S ++WNF S+H I KV+ + GFFFW +LF +KR + L YDEN Sbjct: 38 LPVSGCRKGYLLFVSRRENWNFDSNHLVIRKVSFFLGFFFWIRSLFLFKRYQTLRYDENR 97 Query: 60 VVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIA 119 ++AYGSR GKKFFA SN M+ R + FDG++IH FP+LLHGW+SP+ K+ + I++++A Sbjct: 98 IIAYGSRIGKKFFACSNKDMLARGVPFDGEKIHRFPRLLHGWDSPSSEKIASVKIQSRVA 157 Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRD 179 IVVH+YY DLW EIANLLS L+ SFDLH+TLVTE ASIKSEILK FP A I++MEN+GRD Sbjct: 158 IVVHIYYADLWAEIANLLSGLNFSFDLHITLVTEIASIKSEILKRFPNAHIYVMENYGRD 217 Query: 180 VLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIR 239 + FL LLE +L +YDYVCKIHGKKSKR G+ WW+GDLWRRWLF+DLLGAPG+ +II+ Sbjct: 218 IRSFLKLLEGGKLDSYDYVCKIHGKKSKRNGHVWWDGDLWRRWLFFDLLGAPGIALEIIK 277 Query: 240 TFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMF 299 TF+ + IGMIGSR YRY D SLG NRE +C +A +MG++F+D K+DFF GTMF Sbjct: 278 TFEKYPKIGMIGSRTYRY-----DQKISLGNNREFVCAIANKMGVSFEDTKIDFFGGTMF 332 Query: 300 WVRTEALDPIKNLRLSRYFEPKVHK-ALDGEIEHAVERCFSLSVKKANFRISDVDCI 355 WVR +ALDPIKNL L++YF+ KV LDG +EHA+ERCFS+SV+KANF ++ VDC+ Sbjct: 333 WVRPQALDPIKNLALTQYFKSKVDMVGLDGCLEHAIERCFSISVEKANFDLAYVDCL 389 >gi|315122628|ref|YP_004063117.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496030|gb|ADR52629.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 399 Score = 272 bits (695), Expect = 7e-71, Method: Compositional matrix adjust. Identities = 148/358 (41%), Positives = 216/358 (60%), Gaps = 5/358 (1%) Query: 9 GYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWT-LFYKRSKKLCYDENYVVAYGSRS 67 GY++ S K S+ E+V + FW + L + + +L + + YGSR Sbjct: 45 GYYMLWSLSKEQKITSEDVFFEEVTTFKACLFWLRSFLTFSKYSQLSFPSCRIFFYGSRK 104 Query: 68 GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWES-PAMGKVMQIAIKAKIAIVVHLYY 126 KK F + N +M + FDG++ + +L GW++ ++ +I I +KIAIVVH YY Sbjct: 105 DKKAFFRLNRFMSNSRMPFDGKKFLYIKELFEGWKNLSSLDNKGKIKINSKIAIVVHCYY 164 Query: 127 IDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 D W EI++LL L+ FDL +T V ++ + ++LK FP+AR+++MEN GRDVLPFL L Sbjct: 165 QDTWDEISHLLLRLNFDFDLFITTVKKNKDFEQDVLKNFPSARLYVMENKGRDVLPFLCL 224 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 LE +YDY+CKIHGKKS R+ Y +EG LWRRW+F+DLLG + +II F+ + Sbjct: 225 LELGVFYDYDYLCKIHGKKSARRNYHPFEGILWRRWIFFDLLGFSDIALRIINKFEQNPS 284 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 IGMIGS +R KY + K + + LA R+ ++ LDFF GTMFWVR + L Sbjct: 285 IGMIGSGRFRRYKKYSFFK-KRSKVYKRVVDLARRIDFPVEELDLDFFNGTMFWVRPKCL 343 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKSLSQ 364 +P++N+ L+ FE + + DG +EHAVER F LSV++A F + VDC+ Y + LSQ Sbjct: 344 EPLRNIHLTGEFEEECNLE-DGALEHAVERFFPLSVQRAGFSLESVDCVAEYDQ-LSQ 399 >gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62] gi|254039878|gb|ACT56674.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62] Length = 394 Score = 268 bits (686), Expect = 7e-70, Method: Compositional matrix adjust. Identities = 146/356 (41%), Positives = 212/356 (59%), Gaps = 4/356 (1%) Query: 9 GYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWT-LFYKRSKKLCYDENYVVAYGSRS 67 GY++ S S + ++++++ F FW + L + + KL + + YGSR Sbjct: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99 Query: 68 GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW-ESPAMGKVMQIAIKAKIAIVVHLYY 126 +K F + N +M + FD ++ + +L GW + P+ K + IK+KIAIVVH YY Sbjct: 100 EQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYY 159 Query: 127 IDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 D WIEI+++L L+ FDL VT+V + + ++LK FP+A++++MEN GRDV PFL L Sbjct: 160 QDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYL 219 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 LE YDY+CKIHGKKS+R+GY EG +WRRWLF+DLLG + +II TF+ + Sbjct: 220 LELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPC 279 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 +GMIGSR YR ++ + R +I LA R G + LDFF GTMFWV+ + L Sbjct: 280 LGMIGSRRYRRYKRWSFFAKRSEVYRRVI-DLAKRAGFPTKRLHLDFFNGTMFWVKPKCL 338 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKSL 362 +P++NL L FE + DG +EHAVER F+ SV+ F I VDC+ Y + L Sbjct: 339 EPLRNLHLIGEFEEE-RNLKDGALEHAVERFFACSVRYTEFSIESVDCVAEYERLL 393 >gi|13474020|ref|NP_105588.1| hypothetical protein mll4799 [Mesorhizobium loti MAFF303099] gi|14024772|dbj|BAB51374.1| mll4799 [Mesorhizobium loti MAFF303099] Length = 386 Score = 185 bits (470), Expect = 8e-45, Method: Compositional matrix adjust. Identities = 108/300 (36%), Positives = 154/300 (51%), Gaps = 16/300 (5%) Query: 62 AYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIV 121 A GS +G F S+ HF+G + P+ L ++ KIA+ Sbjct: 99 ALGSLAGWGFGTASDNLQKTLSRHFNGPQAEA-PERLP-------------TVEPKIAVA 144 Query: 122 VHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVL 181 +HL+Y DLW E LL F L +TL A++ + FP A I + EN GRDV Sbjct: 145 LHLHYPDLWPEFEALLEATGRQFQLFLTLTRPDAALAQRVQARFPGAEITVYENRGRDVG 204 Query: 182 PFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTF 241 PF+ LL + +D +CK+HGKKS + G G++WR+ +DL+G+ GVV +II F Sbjct: 205 PFIQLLREGKFDPFDLICKLHGKKSGQSGPRMVLGEIWRQVSAFDLIGSRGVVDRIIANF 264 Query: 242 DTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWV 301 + D MIGSR +R PN++ + G+NR M L MG+ +LDFFAGTMFWV Sbjct: 265 ERSPDTQMIGSRRFRLPNEWKGEKSAWGENRAMALNLLETMGMP-SSSRLDFFAGTMFWV 323 Query: 302 RTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKS 361 R AL+P++ L L P+ DG ++HA+ER + K SD D R++ Sbjct: 324 RRGALEPLRRLDLPLAAFPEETGQQDGTLQHALERVLGMICTKIGV-ASDGDMTTDSREA 382 >gi|299133415|ref|ZP_07026610.1| Rhamnan synthesis F [Afipia sp. 1NLS2] gi|298593552|gb|EFI53752.1| Rhamnan synthesis F [Afipia sp. 1NLS2] Length = 408 Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 87/236 (36%), Positives = 132/236 (55%), Gaps = 5/236 (2%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P K +Q+ + I+VHL+Y D+W + L NL+ F L VTL + + + Sbjct: 153 PGAPKPLQLNGRIATGIIVHLHYCDVWPDFEKRLRNLTCPFSLIVTLNESNPDFAARVAG 212 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 FP A++ + N GRDV PF+ LL L +++ +CK+HGKK+ G G++WRR L Sbjct: 213 QFPNAKVLVYPNRGRDVGPFIQLLREGHLDDFELICKLHGKKTVSLGPRMIFGEIWRRLL 272 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 DL+G+ +V I++ F + +G++GS +R +Y + +N + LA R+G Sbjct: 273 LNDLVGSDELVRAILQRFISQPGLGLVGSSHFR-----GNYLGTWPRNAALTLELAKRLG 327 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 + KLDFFAGTMFWVR E LD +K+L LS+ P DG ++HA+ER F Sbjct: 328 CPEERFKLDFFAGTMFWVRRELLDLLKSLNLSQDDFPVEAGQTDGTLQHALERIFG 383 >gi|312962408|ref|ZP_07776899.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] gi|311283335|gb|EFQ61925.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] Length = 1308 Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 85/255 (33%), Positives = 139/255 (54%), Gaps = 24/255 (9%) Query: 106 MGKVMQIAIK-----AKIAIVVHLYYIDLWIEIANLLSNL-SISFDLHVTLVTESASIKS 159 + KV ++A K A A+V+HL+Y DLW +I + L + + +DL+VT+ + SA ++ Sbjct: 1070 ISKVSRMAFKRSVRQADYAVVLHLHYDDLWDDIKSYLDSFGQLEYDLYVTVTSSSAGVR- 1128 Query: 160 EILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 + + +P A I ++EN GRDVLPFL +L+ + Y VCKIH K+S + +GD Sbjct: 1129 -VAQEYPKAHIQLVENRGRDVLPFLKILQVIKDMGYVAVCKIHSKRSLYRD----DGDKI 1183 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYT-CSLGKNREMICTL 278 R L LLG+ + ++ F+ +DIG+I Y P+ + T C ++ L Sbjct: 1184 RGELIGSLLGSKETILSVVDRFERQKDIGVIVPVKYLIPHTDHNMTYCG-----AIVTEL 1238 Query: 279 AGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKAL-DGEIEHAVERC 337 + ++G F +F AG+MFW R +AL+ + ++ S + +V L DG I H +ER Sbjct: 1239 SSKLGFNF--SYCEFIAGSMFWFRPKALEALLSIDESSF---EVEDGLADGTIAHGIERV 1293 Query: 338 FSLSVKKANFRISDV 352 VKKAN+ + + Sbjct: 1294 LCNVVKKANYTVETI 1308 >gi|218455303|gb|AAX19606.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 96/284 (33%), Positives = 146/284 (51%), Gaps = 29/284 (10%) Query: 79 MMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYIDLW 130 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++IDL Sbjct: 272 MIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHIDLI 331 Query: 131 IEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIMENHGRDVLPFLIL 186 I + N+ + +D+ V++ +S S + ++ F A + I N GRDV PF+ L Sbjct: 332 DAICAYMRNVIVDYDVFVSV--KSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSK-RKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHR 245 L T L YD VCKIH KKS R G W DL + LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYRDGGGQWRDDLMKA-----LLGSSFDVLRVLRAFDDHP 444 Query: 246 DIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEA 305 G++G + Y G N E + LA GI + +L FFAGTMFW R A Sbjct: 445 ACGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAA 499 Query: 306 LDPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 L + +++ LS F+P+ + D + H +ER F L V++A F Sbjct: 500 LSALRARSIGLSE-FDPEAGQR-DATLAHVIERLFVLWVEQAGF 541 >gi|218455307|gb|AAX19610.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455309|gb|AAX19612.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 95/283 (33%), Positives = 144/283 (50%), Gaps = 27/283 (9%) Query: 79 MMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYIDLW 130 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++IDL Sbjct: 272 MIEREVARMRQTRKNIMPIVTGGDSPASDELERYGVGAIDAESLSSRFAIVLHLFHIDLI 331 Query: 131 IEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIMENHGRDVLPFLIL 186 I + N+ + +D+ V++ +S S + ++ F A + I N GRDV PF+ L Sbjct: 332 DAICAYMRNVIVDYDVFVSV--KSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHD----GGGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + +++ LS F+P+ + D + H +ER F L V++A F Sbjct: 501 SALRARSIGLSE-FDPEAGQR-DATLAHVIERLFVLWVEQAGF 541 >gi|218455296|gb|AAV67426.2| glycosyltransferase [Xanthomonas oryzae pv. oryzicola] gi|218455299|gb|AAX19602.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455301|gb|AAX19604.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 95/283 (33%), Positives = 144/283 (50%), Gaps = 27/283 (9%) Query: 79 MMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYIDLW 130 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++IDL Sbjct: 272 MIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHIDLI 331 Query: 131 IEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIMENHGRDVLPFLIL 186 I + N+ + +D+ V++ +S S + ++ F A + I N GRDV PF+ L Sbjct: 332 DAICAYMRNVIVDYDVFVSV--KSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHD----GGGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + +++ LS F+P+ + D + H +ER F L V++A F Sbjct: 501 SALRARSIGLSE-FDPEAGQR-DATLAHVIERLFVLWVEQAGF 541 >gi|166713474|ref|ZP_02244681.1| hypothetical protein Xoryp_19045 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 568 Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 95/283 (33%), Positives = 144/283 (50%), Gaps = 27/283 (9%) Query: 79 MMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYIDLW 130 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++IDL Sbjct: 272 MIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHIDLI 331 Query: 131 IEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIMENHGRDVLPFLIL 186 I + N+ + +D+ V++ +S S + ++ F A + I N GRDV PF+ L Sbjct: 332 DAICAYMRNVIVDYDVFVSV--KSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHD----GGGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + +++ LS F+P+ + D + H +ER F L V++A F Sbjct: 501 SALRARSIGLSE-FDPEAGQR-DATLAHVIERLFVLWVEQAGF 541 >gi|218455305|gb|AAX19608.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 94/283 (33%), Positives = 144/283 (50%), Gaps = 27/283 (9%) Query: 79 MMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYIDLW 130 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++IDL Sbjct: 272 MIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHIDLI 331 Query: 131 IEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIMENHGRDVLPFLIL 186 I + N+ + +D+ V++ +S S + ++ F A + I N GRDV PF+ L Sbjct: 332 DAICAYMRNVIVDYDVFVSV--KSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCK+H KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKVHSKKSVYHD----GGGQWRDDLMKALLGSSFNVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + +++ LS F+P+ + D + H +ER F L V++A F Sbjct: 501 SALRARSIGLSE-FDPEAGQR-DATLAHVIERLFVLWVEQAGF 541 >gi|84501312|ref|ZP_00999517.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] gi|84390603|gb|EAQ03091.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] Length = 741 Score = 122 bits (306), Expect = 8e-26, Method: Compositional matrix adjust. Identities = 82/240 (34%), Positives = 119/240 (49%), Gaps = 17/240 (7%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIH 171 +A+ AI +HLYY DLW E + L L +SFDL+VTL E+ + I + P A++ Sbjct: 127 RARFAIHLHLYYPDLWPEFSERLDRLDLSFDLYVTLTWRGPETEWLADIIREAHPRAQVF 186 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N GRD+LPFL LL Y+ +CK+HGKKS + +GD WRR L +L P Sbjct: 187 PVANRGRDILPFLRLLNAGAFDGYEAICKLHGKKSPHRD----DGDAWRRHLVDGVL--P 240 Query: 232 G-VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 G ++ + F D + + RY + G NR L R+ + D Sbjct: 241 GKALWTSLSAFLADEDAALWVADGQRYSVRKW-----WGSNRARTDALLRRVELDRSDTD 295 Query: 291 LDFFAGTMFWVRTEALDPIKNLRLSR-YFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 DF AG+M+W++ L I+ L L+ FEP+ + DG + HA ER K A + Sbjct: 296 FDFPAGSMYWMKPLLLGMIRALDLTEDLFEPESGQT-DGTLAHAFERAIGALAKAAGQEV 354 >gi|190572676|ref|YP_001970521.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] gi|190010598|emb|CAQ44207.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] Length = 566 Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 84/248 (33%), Positives = 124/248 (50%), Gaps = 17/248 (6%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI 164 +G + +K++ AIV+HLY++DL I + N+ + DL V++ +S + + ++ Sbjct: 304 GVGAIPADKLKSRFAIVLHLYHLDLIESIQGYMKNMIVDHDLFVSV--KSVADRRVAVRF 361 Query: 165 FPAARIH----IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 F ++ + N GRDV PF+ LL T L YD VCKIH KKS G WR Sbjct: 362 FEERKVRAFVFVHPNIGRDVGPFVSLLNTGLLDRYDAVCKIHSKKSVYHD----GGGQWR 417 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 L LLG+ V KI+R F G++G + Y G N E + LA Sbjct: 418 DELMKSLLGSSHTVLKILRAFRHDSSCGIVGPE-----HAYVSNARFWGGNEERLRRLAA 472 Query: 281 RMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLS-RYFEPKVHKALDGEIEHAVERCFS 339 GI +L FFAGTMFW R AL ++ L+ F+P+ + LD + H +ER F Sbjct: 473 ETGIDDARIRLGFFAGTMFWFRPAALYALRERALALSEFDPEAGQ-LDATLAHVIERLFV 531 Query: 340 LSVKKANF 347 L V++A + Sbjct: 532 LWVEQAGY 539 >gi|145588508|ref|YP_001155105.1| methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145046914|gb|ABP33541.1| Methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 1082 Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 83/258 (32%), Positives = 143/258 (55%), Gaps = 25/258 (9%) Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSN-LSI-SFDLHVTLVT-- 152 +L + +P ++++ + + IAIVVH++Y+D W +I ++ LS+ D+++T+ Sbjct: 838 ILKNYINPEHAEIIRESQENSIAIVVHIHYMDTWEDIKKIIKKILSVHDSDIYITITNLE 897 Query: 153 ESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYS 212 + SIK++ FP+A I ++EN GRD+LPF+ +L+ NY +CKIH KKS+ + Sbjct: 898 QYQSIKND----FPSANIELVENRGRDILPFINVLKKIIHKNYVAICKIHSKKSEYRS-- 951 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK-- 270 +G++ R+ L++ L+ + KI + F+ ++ +GM+ P KY + Sbjct: 952 --DGEVIRKELYFSLINNEITLEKIPKFFEVNKKLGML------VPGKYFLQHNDINMYF 1003 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKAL-DGE 329 NRE I + +G+ F++ K F AG+MFW R AL + L F+ V + L DG Sbjct: 1004 NRENISKVCSVIGVNFKESK--FPAGSMFWARPAALQKLLKLESGELFD--VEEGLADGT 1059 Query: 330 IEHAVERCFSLSVKKANF 347 + HAVER F L + + F Sbjct: 1060 VAHAVERLFGLVSESSGF 1077 >gi|332561391|ref|ZP_08415706.1| hypothetical protein RSWS8N_20164 [Rhodobacter sphaeroides WS8N] gi|332274190|gb|EGJ19507.1| hypothetical protein RSWS8N_20164 [Rhodobacter sphaeroides WS8N] Length = 751 Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 97/323 (30%), Positives = 142/323 (43%), Gaps = 40/323 (12%) Query: 53 LCYDENYVVAYGSRSGKK----FFAQSNLYMME--------------RELHFDGQRIHHF 94 L Y E + +AYG R G + F Q+ L R H + + Sbjct: 50 LKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKEL 109 Query: 95 PQLLHGWESPAMGKVMQIAIK-----AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT 149 P+++ PA G + Q+ + A A+ VH+YY DLW E A L L I FDL+VT Sbjct: 110 PEVV---ALPARG-MPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVT 165 Query: 150 LV---TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKS 206 L E+ ++ EI FP A + M N GRD+LPF+ LL Y VCK H KKS Sbjct: 166 LTYRGEETDALAEEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKS 225 Query: 207 KRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTC 266 + +GDLWR+ L +L G+ K + F + G + Y T Sbjct: 226 PHRQ----DGDLWRKHLIEGILPETGLEEK-LEAFVEAPEAGFWVADGQHYTG-----TQ 275 Query: 267 SLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKAL 326 G N E L R+ I + L F AG+++WV+ L +++L+L + Sbjct: 276 WWGSNVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDLEEGQV 335 Query: 327 DGEIEHAVERCFSLSVKKANFRI 349 DG + HA+ER +A ++ Sbjct: 336 DGTLAHAIERVLGYLTARAGQKV 358 >gi|126464825|ref|YP_001041801.1| lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] gi|126106640|gb|ABN79165.1| Lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] Length = 751 Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 79/237 (33%), Positives = 112/237 (47%), Gaps = 13/237 (5%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIHI 172 A A+ VH+YY DLW E A L L I FDL+VTL E+ ++ EI FP A + Sbjct: 132 APYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTP 191 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 M N GRD+LPF+ LL Y VCK H KKS + +GDLWR+ L +L G Sbjct: 192 MPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ----DGDLWRKHLIEGILPETG 247 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD 292 + K + F + G + Y T G N E L R+ I + L Sbjct: 248 LEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGSNVEATRHLLQRIEIPLDREALS 301 Query: 293 FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 F AG+++WV+ L +++L+L +DG + HA+ER +A ++ Sbjct: 302 FPAGSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 358 >gi|125654691|ref|YP_001033885.1| hypothetical protein RSP_3918 [Rhodobacter sphaeroides 2.4.1] gi|77386351|gb|ABA81780.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 751 Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 79/237 (33%), Positives = 112/237 (47%), Gaps = 13/237 (5%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIHI 172 A A+ VH+YY DLW E A L L I FDL+VTL E+ ++ EI FP A + Sbjct: 132 APYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAEEIRADFPGAFVTP 191 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 M N GRD+LPF+ LL Y VCK H KKS + +GDLWR+ L +L G Sbjct: 192 MPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ----DGDLWRKHLIEGILPETG 247 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD 292 + K + F + G + Y T G N E L R+ I + L Sbjct: 248 LEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGSNVEATRHLLQRIEIPLDREALS 301 Query: 293 FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 F AG+++WV+ L +++L+L +DG + HA+ER +A ++ Sbjct: 302 FPAGSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTLAHAIERVLGYLTARAGQKV 358 >gi|221634566|ref|YP_002523254.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] gi|221163439|gb|ACM04401.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] Length = 755 Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 79/237 (33%), Positives = 112/237 (47%), Gaps = 13/237 (5%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIHI 172 A A+ VH+YY DLW E A L L I FDL+VTL E+ ++ EI FP A + Sbjct: 136 APYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYRGEETDALAQEIRADFPGAFVTP 195 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 M N GRD+LPF+ LL Y VCK H KKS + +GDLWR+ L +L G Sbjct: 196 MPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ----DGDLWRKHLIEGILPETG 251 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD 292 + K + F + G + Y T G N E L R+ I + L Sbjct: 252 LEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGSNVEATRHLLQRIEIPLDREALS 305 Query: 293 FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 F AG+++WV+ L +++L+L +DG + HA+ER +A ++ Sbjct: 306 FPAGSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTLAHAIERVLGYLTARAGQKV 362 >gi|259414984|ref|ZP_05738907.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] gi|259349435|gb|EEW61182.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] Length = 680 Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 112/234 (47%), Gaps = 13/234 (5%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIHIMEN 175 A+V+HLYY DLW E + L + +FDL+VTL E+ ++ I + +P AR+ ++ N Sbjct: 75 AVVIHLYYTDLWDEFRDRLRSARFTFDLYVTLTEQGPETEETRARIAEDWPEARVLVLPN 134 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 GRD+ PFL LL L +Y VCK+H KKS + +GD+WR L +L G Sbjct: 135 RGRDIYPFLHLLNAGWLDHYRAVCKLHSKKSPHRQ----DGDVWRTHLTEGIL-PEGETA 189 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFA 295 +++ F D G+ + Y G N E L R+ + LDF A Sbjct: 190 ELLERFLAAEDCGLWVADGQHYEGARW-----WGSNLERCRNLLARLELAASADTLDFPA 244 Query: 296 GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 G+++W++ LD ++ L L DG + HA+ER + +I Sbjct: 245 GSIYWLKPAILDMLRGLALGFDDFDIEQGQTDGTLAHALERALGMICAAGGLQI 298 >gi|289668432|ref|ZP_06489507.1| hypothetical protein XcampmN_08015 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 945 Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 77/256 (30%), Positives = 125/256 (48%), Gaps = 18/256 (7%) Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS-- 159 ++P V ++ K+ ++VH++Y DL E A L + + +DL V+++ +A ++ Sbjct: 69 QAPFGRPVPSAQLQVKVGVMVHVFYPDLIDEFAQSLQQMPVGYDLLVSVMDNAAEAQARD 128 Query: 160 --EILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 L+ I I+ N GRD+ P L+ EQ+ D V +H KKS Y+ E Sbjct: 129 RFSKLQQIEKLDIRIVPNRGRDIAPLLVTFR-EQILALDVVGHLHTKKSL---YTGSEQG 184 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTC---SLGKNREM 274 WRR+L L+G+ + + F +GM+ YP Y + N E+ Sbjct: 185 QWRRYLVSSLMGSAERIAWQLGMFQAEPRLGML------YPESYERVPLWAHTWLSNFEV 238 Query: 275 ICTLAGRMGITFQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 TLA R+G + +DF AG+MFW + +AL P+ L L P+ H +DG + HA Sbjct: 239 CRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLELKDFPEEHGQIDGTLHHA 298 Query: 334 VERCFSLSVKKANFRI 349 +ER F V+ ++RI Sbjct: 299 MERMFVAVVRHQHYRI 314 >gi|289662624|ref|ZP_06484205.1| hypothetical protein XcampvN_05932 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 945 Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 77/256 (30%), Positives = 125/256 (48%), Gaps = 18/256 (7%) Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS-- 159 ++P V ++ K+ ++VH++Y DL E A L + + +DL V+++ +A ++ Sbjct: 69 QAPFGRPVPSAQLQLKVGVMVHVFYPDLIDEFAQSLQQMPVGYDLLVSVMDNAAEAQARD 128 Query: 160 --EILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 L+ I I+ N GRD+ P L+ EQ+ D V +H KKS Y+ E Sbjct: 129 RFSKLQQIEKLDIRIVPNRGRDIAPLLVTFR-EQILALDVVGHLHTKKSL---YTGSEQG 184 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTC---SLGKNREM 274 WRR+L L+G+ + + F +GM+ YP Y + N E+ Sbjct: 185 QWRRYLVSSLMGSAERIAWQLGMFQAEPRLGML------YPESYERVPLWAHTWLSNFEV 238 Query: 275 ICTLAGRMGITFQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 TLA R+G + +DF AG+MFW + +AL P+ L L P+ H +DG + HA Sbjct: 239 CRTLAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLELKDFPEEHGQIDGTLHHA 298 Query: 334 VERCFSLSVKKANFRI 349 +ER F V+ ++RI Sbjct: 299 MERMFVAVVRHQHYRI 314 >gi|83950907|ref|ZP_00959640.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] gi|83838806|gb|EAP78102.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] Length = 752 Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 78/252 (30%), Positives = 121/252 (48%), Gaps = 17/252 (6%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIH 171 KA+ A+ H+YY DLW E A + DL++TL E+ + EI + FP A + Sbjct: 130 KARFALHAHIYYPDLWPEFATRFDEIGDGIDLYITLTWRGEETRWLADEITERFPRAFVT 189 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N GRD+LPFL+L YD +CKIH KKS + +GD WRR L +L A Sbjct: 190 PVPNRGRDILPFLLLANAGAFDGYDALCKIHTKKSPHRD----DGDQWRRHLIDGVLPAT 245 Query: 232 GVVFKIIRTF-DTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 G+ ++ D + +AY + + G NR+ + R+ + Sbjct: 246 GLQERLQHFLADDAAAFWVADGQAYAARDWW-------GINRDKTAAVLRRVELDPLLDA 298 Query: 291 LDFFAGTMFWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 L F AG+++W++ L IK L L + FEP+ + +DG + HAVER A I Sbjct: 299 LRFPAGSIYWMKPLMLGMIKALDLDAPMFEPEKGQ-VDGTLAHAVERAIGGLALAAGQEI 357 Query: 350 SDVDCILGYRKS 361 + ++ R++ Sbjct: 358 RETAALMRPRRA 369 >gi|258591058|emb|CBE67353.1| protein of unknown function [NC10 bacterium 'Dutch sediment'] Length = 1460 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 72/242 (29%), Positives = 125/242 (51%), Gaps = 16/242 (6%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIKSEILKIFPAAR--- 169 I ++IA+ H YY DL E+A+ L N+ +FDL V++ + A + + P AR Sbjct: 589 ISSRIAVHAHAYYPDLTKELASYLKNMPFAFDLFVSVSNDEARDVCRQAFAGLPQARRVI 648 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 + ++ N GRD+ P ++ +L+ YDY+C +H KKS Y+ + D W +L L+G Sbjct: 649 VDVVANRGRDIAP-MVCHFGGRLATYDYICHLHTKKSM---YAQGKMDGWLEYLLRQLMG 704 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT-FQD 288 + V +I F + G+I + Y Y + + S N+ + + +MGIT + Sbjct: 705 SEDQVRRIFSMFQSDPRAGIIYPQNYEYLPYWGNTWLS---NKALGAQMCRQMGITDVPE 761 Query: 289 QKLDFFAGTMFWVRTEALDPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 D+ AG+MFW R+EA+ + ++RL+ + P+ DG + H +ER L + A Sbjct: 762 GYFDYPAGSMFWARSEAIRNLFSADIRLTDF--PEEAGQTDGSLAHCIERLLVLVARHAG 819 Query: 347 FR 348 ++ Sbjct: 820 YK 821 >gi|310816773|ref|YP_003964737.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] gi|308755508|gb|ADO43437.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] Length = 726 Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 75/250 (30%), Positives = 125/250 (50%), Gaps = 24/250 (9%) Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI 161 E+PA + I + +HLYY +L A L+ + + L+V+ T++A ++I Sbjct: 459 EAPAPAR--------PIGVFLHLYYQELAPVFAKRLAQIPLPLSLYVS--TDTAEKAAQI 508 Query: 162 LKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRR 221 + P A++ ++ N GRD+ P L + +++D V +HGKKS D W Sbjct: 509 ERALPQAQVRVLPNRGRDIFPKLYGF-GDAYADHDIVLHLHGKKSLHSSML----DEWLS 563 Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 + LLG P V +I+ FD+ +G++ +R + G NR++ LA R Sbjct: 564 HILDCLLGDPADVNRILSLFDSVPRLGIVMPVVHRSVLNAAHW----GFNRDIGAELAYR 619 Query: 282 MGITF---QDQKLDFFAGTMFWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERC 337 MG+ ++ L F AG+MFW RT AL PI +L L + +F P+ + +DG + HAVER Sbjct: 620 MGMATPLPENDALQFPAGSMFWARTAALQPILDLALEASHFPPEAGQ-VDGTLAHAVERM 678 Query: 338 FSLSVKKANF 347 + + + Sbjct: 679 LGVVCRAGGY 688 >gi|134297301|ref|YP_001121036.1| lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] gi|134140458|gb|ABO56201.1| Lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] Length = 1231 Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 76/243 (31%), Positives = 120/243 (49%), Gaps = 12/243 (4%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARIHIM 173 K + A+V HLYY DL E+ +L+ +++ D +T+ S EIL + + Sbjct: 994 KNENALVAHLYYFDLLPELLSLIER-NVNLDAFITIPVHFSREQVGEILASLDNVYVLRV 1052 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 +N GRD+LPFL + + +Y + K+H KKS ++ +G L R+ +LL P + Sbjct: 1053 QNRGRDILPFLNIYPIIKSYSYANLVKVHSKKSPQRA----DGALLRKRALLELLD-PSI 1107 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V ++R +T IG+I DY L NR+ + R+G+ +F Sbjct: 1108 VPGVLRALNTDPKIGLIAPSNSLCSLSNSDY---LINNRKQLNYCLSRLGLVDSSLNFEF 1164 Query: 294 FAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 AG+MFW R +AL + +L L FE ++ + LDG + HA+ER F K +R V Sbjct: 1165 IAGSMFWARVDALRMLSDLSLREEDFEEELGQ-LDGTLAHAIERLFCFLGKHVGYRTLPV 1223 Query: 353 DCI 355 D I Sbjct: 1224 DQI 1226 >gi|325928537|ref|ZP_08189725.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] gi|325541076|gb|EGD12630.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] Length = 1415 Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 71/249 (28%), Positives = 127/249 (51%), Gaps = 15/249 (6%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIKSEILK 163 A+ + + ++A+V+H +Y ++ E+ L + + + L ++ V + A ++ + Sbjct: 1156 ALRRPVMPRTPERVAVVIHAFYPEILPEMLKELQSWDVPYFLIISTVADKADEVRGYLAD 1215 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 + A + + EN GRD+LPFL +++ + V K+H K+S + +G+ WRR + Sbjct: 1216 LSVVADVRVFENRGRDILPFLEIMK-DLRGRESLVLKLHTKRSLHR----QDGESWRRDM 1270 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP V +I F +G+ + T G N + + L+ +M Sbjct: 1271 LEKLL-APKVASEIFAAFREQERLGLAAPEGH-----ILSMTTYWGANADTVHRLSKQMH 1324 Query: 284 ITFQDQKLDFFA-GTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLS 341 + + FA G+MF+VR EA+D I +L L R FEP+ + +DG + HA+ERCFSL+ Sbjct: 1325 VDPVNPVTAMFAAGSMFYVRPEAIDSIMDLDLRREDFEPEAGQ-VDGTLAHAIERCFSLA 1383 Query: 342 VKKANFRIS 350 V + I+ Sbjct: 1384 VCSTGYYIA 1392 >gi|21109952|gb|AAM38419.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 296 Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 73/244 (29%), Positives = 120/244 (49%), Gaps = 21/244 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + +++ +S L VT +T ++ + + A++ EN GR Sbjct: 51 VVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFENRGR 110 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR +F LL P Sbjct: 111 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDAWRREMFSALL-TPQH 160 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 I+R F +G+ + P T +G N + + LA R G D+ F Sbjct: 161 ADAIMRGFTDDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAIDEHSVF 215 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V ++ +D Sbjct: 216 ASGSMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATID 275 Query: 354 CILG 357 +LG Sbjct: 276 QLLG 279 >gi|77748730|ref|NP_643883.2| hypothetical protein XAC3576 [Xanthomonas axonopodis pv. citri str. 306] Length = 546 Score = 102 bits (254), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 73/250 (29%), Positives = 122/250 (48%), Gaps = 21/250 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + +++ +S L VT +T ++ + + A++ EN GR Sbjct: 301 VVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFENRGR 360 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR +F LL P Sbjct: 361 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDAWRREMFSALL-TPQH 410 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 I+R F +G+ + P T +G N + + LA R G D+ F Sbjct: 411 ADAIMRGFTDDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAIDEHSVF 465 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V ++ +D Sbjct: 466 ASGSMFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATID 525 Query: 354 CILGYRKSLS 363 +LG + + Sbjct: 526 QLLGIPQPTA 535 >gi|166713445|ref|ZP_02244652.1| hypothetical protein Xoryp_18900 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 695 Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 74/250 (29%), Positives = 124/250 (49%), Gaps = 21/250 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + L++ +S L VT +T ++ + + A++ EN GR Sbjct: 450 VVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQRGLQAQVEGFENRGR 509 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR + LL AP Sbjct: 510 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDTWRRDMLSGLL-APQH 559 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V I+R F +G++ + P T +G N + + L R G + F Sbjct: 560 VAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFIGGNADALDYLTVRTGTDAINAHSLF 614 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V + R++ ++ Sbjct: 615 ASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVAHSGQRVATIE 674 Query: 354 CILGYRKSLS 363 +LG K + Sbjct: 675 QLLGIPKPAT 684 >gi|58425017|gb|AAW74054.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 727 Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 73/244 (29%), Positives = 122/244 (50%), Gaps = 21/244 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + L++ +S L VT +T ++ + + A++ EN GR Sbjct: 482 VVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQRGLQAQVEGFENRGR 541 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR + LL AP Sbjct: 542 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDTWRRDMLSGLL-APQH 591 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V I+R F +G++ + P T +G N + + L R G + F Sbjct: 592 VAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTGTDAINAHSLF 646 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V + R++ ++ Sbjct: 647 ASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVAHSGQRVATIE 706 Query: 354 CILG 357 +LG Sbjct: 707 QLLG 710 >gi|122879048|ref|YP_199439.6| hypothetical protein XOO0800 [Xanthomonas oryzae pv. oryzae KACC10331] Length = 546 Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 73/250 (29%), Positives = 124/250 (49%), Gaps = 21/250 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + L++ +S L VT +T ++ + + A++ EN GR Sbjct: 301 VVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQRGLQAQVEGFENRGR 360 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR + LL AP Sbjct: 361 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDTWRRDMLSGLL-APQH 410 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V I+R F +G++ + P T +G N + + L R G + F Sbjct: 411 VAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTGTDAINAHSLF 465 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V + R++ ++ Sbjct: 466 ASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVAHSGQRVATIE 525 Query: 354 CILGYRKSLS 363 +LG + + Sbjct: 526 QLLGIPQPAT 535 >gi|325928558|ref|ZP_08189746.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] gi|325541097|gb|EGD12651.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] Length = 695 Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 73/246 (29%), Positives = 122/246 (49%), Gaps = 21/246 (8%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 + +V+H +Y+D+ E +++ +S L +T +T ++ + + A++ EN Sbjct: 448 VCVVLHAWYLDVLDEALEAIAHCGLSLRLVITTDITMVEQVRQRLQQRGVQAQVEGFENR 507 Query: 177 GRDVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 GRD+LPFL +L E EQ+ V K+H KKS + +GD WRR +F LL AP Sbjct: 508 GRDILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDTWRREMFSALL-AP 557 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 V I+R F +G+ + P T +G N + + LA R G ++ Sbjct: 558 QHVDAIMRGFADDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAINEHS 612 Query: 292 DFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISD 351 F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V ++ Sbjct: 613 MFASGSMFWVKLEALRPLLDAHLHPSEFEDEQGQIDGTLAHAIERFLAVAVGHCGHHVAT 672 Query: 352 VDCILG 357 V+ +LG Sbjct: 673 VEQLLG 678 >gi|84622385|ref|YP_449757.1| hypothetical protein XOO_0728 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188578640|ref|YP_001915569.1| hypothetical protein PXO_03177 [Xanthomonas oryzae pv. oryzae PXO99A] gi|84366325|dbj|BAE67483.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188523092|gb|ACD61037.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 695 Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 73/244 (29%), Positives = 122/244 (50%), Gaps = 21/244 (8%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E + L++ +S L VT +T ++ + + A++ EN GR Sbjct: 450 VVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQRGLQAQVEGFENRGR 509 Query: 179 DVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 D+LPFL +L E EQ+ V K+H KKS + +GD WRR + LL AP Sbjct: 510 DILPFLRVANRLLDEGEQV-----VLKLHTKKSTHRE----DGDTWRRDMLSGLL-APQH 559 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V I+R F +G++ + P T +G N + + L R G + F Sbjct: 560 VAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTGTDAINAHSLF 614 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 +G+MFWV+ EAL P+ + L +DG + HA+ER +++V + R++ ++ Sbjct: 615 ASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVAHSGQRVATIE 674 Query: 354 CILG 357 +LG Sbjct: 675 QLLG 678 >gi|148556902|ref|YP_001264484.1| glycosyl transferase family protein [Sphingomonas wittichii RW1] gi|148502092|gb|ABQ70346.1| glycosyl transferase, family 2 [Sphingomonas wittichii RW1] Length = 1301 Score = 99.8 bits (247), Expect = 5e-19, Method: Composition-based stats. Identities = 67/227 (29%), Positives = 112/227 (49%), Gaps = 14/227 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT-LVTESASIKSEILKIFPAARIHIMEN 175 K A+V+HL+Y ++ +E+ + ++ + S D+ VT V ++ + + ++ A + + N Sbjct: 2 KAALVLHLFYPEVAVELIDRVAAIGASVDIFVTHSVALDETVLAALDRLPRKAEVVTVAN 61 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 G D+ P LL YD + K+H KK GY+ WRR + ++G+P +V Sbjct: 62 RGWDIGPLFELLPLLAERGYDLIGKLHSKKGG-SGYA----PEWRRLAYDGMIGSPALVA 116 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM-GITFQDQKLDFF 294 I+ FD H D+ ++G++ Y L +N E++ LA R+ + FF Sbjct: 117 DIVAAFDAHPDLSLLGAKPL-----YKSVASHLFRNAELLSDLAPRLTAPAYPPADWGFF 171 Query: 295 AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLS 341 AGT FW R L+ + L R P + DG + HAVER F L+ Sbjct: 172 AGTFFWARRTLLEKVAALADFRDAAPNQDR--DGALGHAVERLFGLA 216 >gi|296135664|ref|YP_003642906.1| glycosyl transferase family 2 [Thiomonas intermedia K12] gi|295795786|gb|ADG30576.1| glycosyl transferase family 2 [Thiomonas intermedia K12] Length = 1414 Score = 99.8 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 85/247 (34%), Positives = 118/247 (47%), Gaps = 29/247 (11%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGR 178 A+++HLYY DLW E L L D++V+L + ++I++ P A + N GR Sbjct: 281 AVLLHLYYPDLWPEFLAHLKTLPAPCDVYVSLSEGREELLTDIVRDLPDAVVMRHPNKGR 340 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKG---------YSWWEGDLWRRWLFYDLLG 229 D+ P L LL + NY + +HGKKS +GD WRR L LL Sbjct: 341 DIAPRLALLRLARAHNYKQLLFLHGKKSPHLKEVENIHIPFLQHKDGDRWRRELLAALLD 400 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMICTLAGRMGIT 285 A K I F +G+IG + LG N + A RMGIT Sbjct: 401 ASE---KTIAAFAQQPKLGLIGPHGFW-----------LGLRGDANFPRLSAQAQRMGIT 446 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKK 344 + +FAG+MFW R +ALDP+ L L FE + + DG + H VER F+LS +K Sbjct: 447 PDPARHGYFAGSMFWCRPQALDPLLALDLKDADFEDETGQT-DGTLAHVVERLFALSAEK 505 Query: 345 ANFRISD 351 A F+I+D Sbjct: 506 AGFQIAD 512 >gi|192359986|ref|YP_001983898.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] gi|190686151|gb|ACE83829.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] Length = 872 Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust. Identities = 82/236 (34%), Positives = 111/236 (47%), Gaps = 28/236 (11%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFPAARIHIM 173 + +IA+V HLYY DL EI + L + +FDL VTL + I+ + + +P A + Sbjct: 600 QKRIAVVAHLYYRDLVPEILSALETIPEAFDLIVTLPDWGTRHIEQMVREAYPEAVFYRA 659 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKS-KRKGYSWWE-GDLWRRWLFYDLLGAP 231 N GRD+ PF+ LL NYD + KI K+ R G + G LWR F LLG Sbjct: 660 VNRGRDIGPFVDLLPLITEKNYDALLKIQTKRGYYRSGRLLPQFGQLWRSETFRALLGNK 719 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAY-----RYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 V I+ T + M+G Y +YP Y D G + I G Sbjct: 720 SRVTDILEALRTDPSLNMVGPSPYFLSLTKYP--YHDQ----GDLAQTILNNPTGNG--- 770 Query: 287 QDQKLDFFAGTMFWVRTEALDPI---KNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 FFAGTMFWVR L P+ ++L ++ FEP+ A DG H +ER FS Sbjct: 771 ------FFAGTMFWVRPSCLRPLTEPEHLSIT-AFEPE-SGANDGATAHLIERLFS 818 >gi|295687882|ref|YP_003591575.1| rhamnan synthesis protein F [Caulobacter segnis ATCC 21756] gi|295429785|gb|ADG08957.1| Rhamnan synthesis F [Caulobacter segnis ATCC 21756] Length = 818 Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 72/238 (30%), Positives = 113/238 (47%), Gaps = 12/238 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIM 173 +A ++HL+Y +L A L+ + DL +T+ T S + + FP A + I Sbjct: 588 RADAVTLLHLFYPELIDWFAERLAATADVLDLMITVPETWSEADLARARATFPMAHLAIA 647 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 EN GRD+ PF+ L + Y CK+H K+S + +GD WR L LLG Sbjct: 648 ENRGRDIRPFVETLRRARTLGYSVFCKLHSKRSPHRA----KGDEWRAELVDGLLGGEAA 703 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + + +G++ + R D + NR+ LA RMG+ + F Sbjct: 704 ALALRAFAQDAK-LGLLAAAGSRLRIGDPDV---MNNNRQDADRLARRMGLKLAPET-PF 758 Query: 294 FAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKANFRIS 350 AG+MFW RTEA P+ +L + F P++ + +DG HA+ER + V +A +R S Sbjct: 759 SAGSMFWGRTEAFAPLSDLTDAEIDFGPELGR-VDGTTAHAIERLTAAIVARAGYRAS 815 >gi|148927812|ref|ZP_01811237.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] gi|147886838|gb|EDK72383.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] Length = 498 Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 79/246 (32%), Positives = 121/246 (49%), Gaps = 25/246 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF-PAAR---IHI 172 ++A+VVH++Y +L EI +++ N+ FDL +T E A SE++ F P A I + Sbjct: 240 RLAVVVHIFYPELANEIYDVIKNIVEPFDLIITTPHEGAV--SELIDTFAPLASSVAIAL 297 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 EN GRDV PFL + + L YD V K+H KKS G W++ LF L G Sbjct: 298 SENRGRDVGPFLAVHRSGLLERYDAVLKLHSKKSTYSD----SGQQWQQSLFRQLCGNSQ 353 Query: 233 VVFKIIRTFDTHRD--IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF---Q 287 +V + + RD GM+G Y Y + G NR + L + T + Sbjct: 354 IVRRSVALL---RDGKTGMVGPHDY-----YLTHPHYWGANRPAVHKLLQSLTATPLKEE 405 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKAN 346 D L FFAGTMFW +A+ + ++ + FE + K DG + HA+ER F + + Sbjct: 406 DVPLRFFAGTMFWFAPKAIVALHDIPEALLNFESENGKQ-DGTLAHALERLFGIVPQLGG 464 Query: 347 FRISDV 352 + ++ + Sbjct: 465 YNVTSL 470 >gi|325915787|ref|ZP_08178089.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] gi|325538051|gb|EGD09745.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] Length = 695 Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 74/255 (29%), Positives = 118/255 (46%), Gaps = 21/255 (8%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMEN 175 + +VVH +Y+D+ EI + L+ L VT +T ++ + + A + EN Sbjct: 447 RACVVVHAWYLDVLDEILDALAPSVAMLRLIVTTDLTLVGQVRGRLQQHGIEAEVEGFEN 506 Query: 176 HGRDVLPFL-----ILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 GRD+LPFL +L E EQL V K+H KKS + +GD WRR + LLG Sbjct: 507 RGRDILPFLHIANRLLDEGEQL-----VVKLHTKKSTHRH----DGDAWRREMLAALLGG 557 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 G V I+ F +G+ + T +G N + + LA R G + Sbjct: 558 -GRVDAIVNAFVADPQLGLAAPAQH-----LLAVTDFIGGNADALDYLAVRTGTGTVTEH 611 Query: 291 LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRIS 350 F +G+MFW + +AL P+ + L +DG + HA+ER +V + RI+ Sbjct: 612 DRFASGSMFWAKLDALRPLLDAHLQPGDFEGEQGQIDGTLAHAIERFLGHAVLHSGHRIA 671 Query: 351 DVDCILGYRKSLSQN 365 +D ++G R+ + Sbjct: 672 TIDGLMGQREPAAST 686 >gi|16124886|ref|NP_419450.1| hypothetical protein CC_0633 [Caulobacter crescentus CB15] gi|221233606|ref|YP_002516042.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] gi|13421844|gb|AAK22618.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|51039815|tpg|DAA00361.1| TPA_exp: conserved hypothetical protein [Caulobacter vibrioides] gi|220962778|gb|ACL94134.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] Length = 818 Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 72/242 (29%), Positives = 114/242 (47%), Gaps = 12/242 (4%) Query: 111 QIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAAR 169 Q A +A ++HL+Y +L A L+ + DL +T+ T S + + FP A Sbjct: 584 QFAKRADAVTLLHLFYPELIDWFAERLAATADVLDLMITVPETWSEADLARARAAFPTAH 643 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 + I EN GRD+ PF+ L + Y CK+H K+S + +GD WR L LLG Sbjct: 644 LAIAENRGRDIRPFVETLRRARALGYSVFCKLHSKRSPHQA----KGDQWRTTLVEGLLG 699 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + + +G++ + R D + NR L+ MG+ + + Sbjct: 700 GEAAALALRAFAQDPK-LGLLAAAGARMRIGDPDV---MDNNRAEADRLSAHMGLKPRPE 755 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSR-YFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F AG+MFW RTEA P+ +L F P++ + +DG HA+ER + V++A +R Sbjct: 756 T-PFAAGSMFWGRTEAFAPLTDLSDDEIAFGPELGR-VDGTTAHAIERLTAAIVERAGYR 813 Query: 349 IS 350 S Sbjct: 814 AS 815 >gi|315654770|ref|ZP_07907675.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] gi|315490731|gb|EFU80351.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] Length = 680 Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 67/248 (27%), Positives = 124/248 (50%), Gaps = 16/248 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFPAAR---IH 171 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + + Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK-RKGYSWWEGD--LWRRWLFYDLL 228 +ENHGRD+ P + L+ L Y + K+H KKS R+ + EG W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G+ V KI+ F + +G++ + ++ G ++ + L R+ + + Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQFW------GGDQALTAELLRRLEMQLKP 224 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKANF 347 KL F AG+M+WVR + +++L LS FE + + +D HA+ER + +A Sbjct: 225 SKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQ-IDATTAHALERAIGILTTEAGL 283 Query: 348 RISDVDCI 355 ++ + + + Sbjct: 284 KLRETNQL 291 >gi|315657309|ref|ZP_07910191.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315491781|gb|EFU81390.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 680 Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 67/248 (27%), Positives = 124/248 (50%), Gaps = 16/248 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFPAAR---IH 171 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + + Sbjct: 51 RLAVVMHVYYSDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK-RKGYSWWEGD--LWRRWLFYDLL 228 +ENHGRD+ P + L+ L Y + K+H KKS R+ + EG W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDALL 170 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G+ V KI+ F + +G++ + ++ G ++ + L R+ + + Sbjct: 171 GSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQFW------GGDQALTAELLRRLEMQLKP 224 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKANF 347 KL F AG+M+WVR + +++L LS FE + + +D HA+ER + +A Sbjct: 225 SKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQ-IDATTAHALERAIGILTTEAGL 283 Query: 348 RISDVDCI 355 ++ + + + Sbjct: 284 KLRETNQL 291 >gi|188993121|ref|YP_001905131.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris str. B100] gi|189030067|sp|B0RVK2|WXCX_XANCB RecName: Full=Uncharacterized protein wxcX gi|167734881|emb|CAP53093.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris] Length = 695 Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 116/246 (47%), Gaps = 13/246 (5%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E+ + + + +T +T+ + I + A + EN GR Sbjct: 450 VVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFENRGR 509 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL + N V K+H KKS + +G+ WR + LLG P V I+ Sbjct: 510 DILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEMLTALLG-PQRVDAIV 564 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTM 298 F T +G+ + P T +G N + + L R G D F +G+M Sbjct: 565 NAFSTDPLVGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGSDAPDTNSLFASGSM 619 Query: 299 FWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 FW R EAL P+ + L + FE + +DG + HA+ER L+V + R++ V+ LG Sbjct: 620 FWARLEALRPLLDAHLHASEFESE-QGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTLG 678 Query: 358 YRKSLS 363 K+ S Sbjct: 679 ITKTPS 684 >gi|298346187|ref|YP_003718874.1| hypothetical protein HMPREF0573_11061 [Mobiluncus curtisii ATCC 43063] gi|304390053|ref|ZP_07372007.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|298236248|gb|ADI67380.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 43063] gi|304326535|gb|EFL93779.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 680 Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 67/248 (27%), Positives = 123/248 (49%), Gaps = 16/248 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFPAAR---IH 171 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + + Sbjct: 51 RLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLVVV 110 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK-RKGYSWWEGD--LWRRWLFYDLL 228 +ENHGRD+ P + L+ L Y + K+H KKS R+ + EG W+ LL Sbjct: 111 PVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRENHPDLEGSGAQWKDEFLDALL 170 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G+ V KI+ F +G++ + ++ G ++ + L R+ + + Sbjct: 171 GSKDSVEKIMSAFGADPWLGLVTAPGNIVGPQFW------GGDQALTAELLRRLEMQLKP 224 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAVERCFSLSVKKANF 347 KL F AG+M+WVR + +++L LS FE + + +D HA+ER + +A Sbjct: 225 SKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQ-IDATTAHALERAIGILTTEAGL 283 Query: 348 RISDVDCI 355 ++ + + + Sbjct: 284 KLRETNQL 291 >gi|269219069|ref|ZP_06162923.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] gi|269211216|gb|EEZ77556.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] Length = 687 Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 63/240 (26%), Positives = 118/240 (49%), Gaps = 9/240 (3%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIH 171 IA ++IA+V+H +Y DL E+ + L NL FDL VT + + + L+ + + Sbjct: 71 IADPSRIAVVIHCFYADLMPELFDRLRNLPTDFDLFVTNASGADVAVPKDLERMRHSVVV 130 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK-RKGYSWWEGD--LWRRWLFYDLL 228 +ENHGRD+ P + L+ + L YD + K+H KKS R+ ++ +G W+ DL+ Sbjct: 131 EVENHGRDIFPTVQLVNSGILDPYDLILKLHTKKSPWREEHADLDGSGAAWKDQFLSDLV 190 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G+ V +I+ F +G++ + ++ G ++ ++ L R+ ++ Sbjct: 191 GSREKVEEILNAFAADPTLGLVTAADSIVGKEFW------GGDQRIVEQLMLRIEMSIDP 244 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 +L+F +G+M+W R L ++ L+ + +D HA+ER + +A R Sbjct: 245 DELEFASGSMYWTRAFVLQGLRAFNLTSADFDEEKGQVDATTAHAIERIVGIVTDEAGLR 304 >gi|77747764|ref|NP_636021.2| hypothetical protein XCC0629 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761299|ref|YP_244667.2| hypothetical protein XC_3605 [Xanthomonas campestris pv. campestris str. 8004] Length = 546 Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 115/246 (46%), Gaps = 13/246 (5%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E+ + + + +T +T+ + I + A + EN GR Sbjct: 301 VVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFENRGR 360 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL + N V K+H KKS + +G+ WR + LLG P V I+ Sbjct: 361 DILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEMLTALLG-PQRVDAIV 415 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTM 298 F T G+ + P T +G N + + L R G D F +G+M Sbjct: 416 NAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGSDAPDTNSLFASGSM 470 Query: 299 FWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 FW R EAL P+ + L + FE + +DG + HA+ER L+V + R++ V+ LG Sbjct: 471 FWARLEALRPLLDAHLHASEFESE-QGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTLG 529 Query: 358 YRKSLS 363 K+ S Sbjct: 530 ITKTPS 535 >gi|21111631|gb|AAM39945.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66575237|gb|AAY50647.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 296 Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 115/246 (46%), Gaps = 13/246 (5%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E+ + + + +T +T+ + I + A + EN GR Sbjct: 51 VVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFENRGR 110 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL + N V K+H KKS + +G+ WR + LLG P V I+ Sbjct: 111 DILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEMLTALLG-PQRVDAIV 165 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTM 298 F T G+ + P T +G N + + L R G D F +G+M Sbjct: 166 NAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGSDAPDTNSLFASGSM 220 Query: 299 FWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 FW R EAL P+ + L + FE + +DG + HA+ER L+V + R++ V+ LG Sbjct: 221 FWARLEALRPLLDAHLHASEFESE-QGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTLG 279 Query: 358 YRKSLS 363 K+ S Sbjct: 280 ITKTPS 285 >gi|297538440|ref|YP_003674209.1| Rhamnan synthesis F [Methylotenera sp. 301] gi|297257787|gb|ADI29632.1| Rhamnan synthesis F [Methylotenera sp. 301] Length = 782 Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 73/238 (30%), Positives = 114/238 (47%), Gaps = 21/238 (8%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA-----ARIHI 172 +A++ HL+Y + + LSN+ +FD+++T TE K+ I K F + + Sbjct: 512 LAVICHLFYHQMCEDYKVYLSNIPFNFDIYITTDTEDK--KAYIEKSFSGWQRGKVEVRL 569 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P LI + S Y+Y+ IH K S YS WR ++ LLG+ Sbjct: 570 AVNQGRDIAPKLIACR-DIYSAYEYILHIHSKNSP---YSSIHTG-WRDYILDTLLGSQK 624 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG--KNREMICTLAGRMGITF-QDQ 289 V I F + ++G+I P + +G +N ++ LAGRMG + Sbjct: 625 TVSSIFEAFQLNSNLGIIA------PQHFKALKLDIGWDRNFKIAKKLAGRMGFDISRKA 678 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 +DF +G+MFW R+ AL P+ N LS P+ DG H++ER + +KA F Sbjct: 679 PIDFPSGSMFWARSAALLPLLNCSLSLQDFPREDGQKDGTTAHSIERLYFFICEKAGF 736 >gi|189030068|sp|P0C7J1|WXCX_XANCP RecName: Full=Uncharacterized protein wxcX Length = 695 Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 115/246 (46%), Gaps = 13/246 (5%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E+ + + + +T +T+ + I + A + EN GR Sbjct: 450 VVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFENRGR 509 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL + N V K+H KKS + +G+ WR + LLG P V I+ Sbjct: 510 DILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEMLTALLG-PQRVDAIV 564 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTM 298 F T G+ + P T +G N + + L R G D F +G+M Sbjct: 565 NAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGSDAPDTNSLFASGSM 619 Query: 299 FWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 FW R EAL P+ + L + FE + +DG + HA+ER L+V + R++ V+ LG Sbjct: 620 FWARLEALRPLLDAHLHASEFESE-QGQIDGTLAHAIERFVGLAVTHSGHRVTTVEQTLG 678 Query: 358 YRKSLS 363 K+ S Sbjct: 679 ITKTPS 684 >gi|325921211|ref|ZP_08183074.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] gi|325548310|gb|EGD19301.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] Length = 706 Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 67/250 (26%), Positives = 118/250 (47%), Gaps = 11/250 (4%) Query: 108 KVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFP 166 KV+ ++ +V+H +Y+D+ E+ + +++ +IS L +T +T ++ + + Sbjct: 431 KVVSELAPHRVCVVLHAWYLDVLDEMLDAVAHCAISPRLVITTDLTMVVEVRHRVQQRGM 490 Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 A + EN GRD+LPFL + V K+H KKS + +GD WR + Sbjct: 491 QAEVEGFENRGRDILPFLHVANRLLDEGVCLVVKLHTKKSTHRS----DGDTWRHEMLSA 546 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 LL AP I+ F + +G+ + P D+ +G N + + L R G Sbjct: 547 LL-APERADAIVNAFSSDPLLGLAAPDGHLLP--VADF---IGGNTDALDYLGARTGTET 600 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 ++ F +G+MFW R EAL P+ + L +DG + HA+ER +S ++ Sbjct: 601 AIEQGMFASGSMFWARLEALRPLLDAHLHPSEFETEQGQIDGTLAHAIERFMGISAIQSG 660 Query: 347 FRISDVDCIL 356 +RI+ + L Sbjct: 661 YRIATIGQAL 670 >gi|190572709|ref|YP_001970554.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] gi|190010631|emb|CAQ44240.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] Length = 707 Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 71/254 (27%), Positives = 111/254 (43%), Gaps = 11/254 (4%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARIH 171 A ++ IV+H +Y+D E+ + + + L +T E A ++S I A I Sbjct: 456 AAPSRPCIVIHAWYLDALPELLQAVKDSGLQARLVITTTGERQAQVQSIIDAEGLTAEIW 515 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + +NHGRDVLPFL + N V K+H K+S + GD WRR + LLG Sbjct: 516 VYDNHGRDVLPFLHAADRLLQQNESLVLKLHTKRSTHRD----NGDQWRREMVDALLGT- 570 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 + + IG++ + K DY +G N + + L +G+ Sbjct: 571 AQAAANLAHLLANPSIGLMAPAGHLL--KVADY---IGGNAQRMERLWALLGLDSAPGDG 625 Query: 292 DFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISD 351 F +G+MFWVR AL P+ + L +DG + HA+ER V A F ++D Sbjct: 626 QFASGSMFWVRLPALRPLLDAHLLPSMFDTEAGQIDGTLAHAIERATGAVVSAAGFTVAD 685 Query: 352 VDCILGYRKSLSQN 365 + G S + Sbjct: 686 TSEVEGAPPRASSS 699 >gi|32455988|ref|NP_861990.1| rb115 [Ruegeria sp. PR1b] gi|22726340|gb|AAN05136.1| RB115 [Ruegeria sp. PR1b] Length = 963 Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 68/257 (26%), Positives = 117/257 (45%), Gaps = 18/257 (7%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAA------ 168 K ++ + +HLYY+D+ E+ LL+ L ++F+L ++L + E++ +F A Sbjct: 154 KGRLVVQLHLYYVDMAAEMIALLARLPVTFELLLSLPETAVVADEEMISLFRAGLERLGA 213 Query: 169 -RIHIMENHGRDVLPFLILLETE--QLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 + + N GRDV P+++ +E L++ D V +H KKS Y W R+L + Sbjct: 214 ITLRRVPNRGRDVAPWMVSFRSELRALADRDLVLHLHSKKSPHGNYHVG----WGRYLGH 269 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 LLG+ V +++ F ++G++ + + +Y GK ++ L RMG+ Sbjct: 270 SLLGSTAVAAQMLGLFAEDPELGLVAPAYWPALRRAPNY----GKVGDLCAHLFRRMGLG 325 Query: 286 FQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 D DF AG+ F R L P L L P + G + HAVER + Sbjct: 326 EVDPICADFPAGSFFCARAAVLRPFLTLGLEARDFPAEAGQICGTLAHAVERLLGQVPAR 385 Query: 345 ANFRISDVDCILGYRKS 361 R V L + ++ Sbjct: 386 LGLRFDMVAVDLPFEEA 402 >gi|269978088|ref|ZP_06185038.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|306818459|ref|ZP_07452182.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307700705|ref|ZP_07637730.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] gi|269933597|gb|EEZ90181.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|304648632|gb|EFM45934.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307613700|gb|EFN92944.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] Length = 613 Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 74/247 (29%), Positives = 116/247 (46%), Gaps = 18/247 (7%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEIL 162 A V+ K +IA V H++Y D+ EI S L + +T T + I+ ++ Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344 Query: 163 KIFPAARIHIME-NHGRDVLPFLIL-LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + A + I+E N GRDV FL+ + + +D V KIH KKS + Y+ +L++ Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGRFDVVAKIHSKKSAQDAYN--AAELFK 402 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMIC 276 R LF +LL +PG ++ F T +GM+ +P +LG N++ Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMV------FPPAVSLGYPTLGHAWFANKKPAL 456 Query: 277 TLAGRMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 L R+GI F D G+MF+ R EAL P+ + P+ + DG + H + Sbjct: 457 ALCERLGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVI 516 Query: 335 ERCFSLS 341 ER FS S Sbjct: 517 ERIFSYS 523 >gi|225350704|ref|YP_002720664.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] gi|225216388|gb|ACN85121.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] Length = 342 Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 72/240 (30%), Positives = 109/240 (45%), Gaps = 14/240 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAAR---I 170 K KI I +HLYYID+ L + I FDL +T E+ I P + I Sbjct: 24 KLKIGIHIHLYYIDMMDMFIKYLKDSPIEFDLFITTSKEENKDICLNAFNKLPKLKNITI 83 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 I+EN GRD+ P+LI Q +NYD C +H KKS W + W +L +L+ Sbjct: 84 FIVENIGRDIAPWLIECNNIQ-NNYDLFCHLHTKKS----LHWESINEWGEYLIENLISE 138 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 + I+ F +IG+I Y Y Y Y + I L ++ I F+ + Sbjct: 139 EAIN-NILSNFILDNNIGIISPHIYYYLFPYILYIDK--DDMHHIKLLLNKLNINFEPKP 195 Query: 291 LDFF--AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 +F G+M W R + L P+ +L L P+ G I HA+ER + +++N++ Sbjct: 196 ENFVFPVGSMLWYRPKVLKPLFDLNLKYSDFPQEPIPKTGTIAHAIERIIGIICEQSNYK 255 >gi|298290915|ref|YP_003692854.1| Rhamnan synthesis F [Starkeya novella DSM 506] gi|296927426|gb|ADH88235.1| Rhamnan synthesis F [Starkeya novella DSM 506] Length = 633 Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 78/261 (29%), Positives = 116/261 (44%), Gaps = 22/261 (8%) Query: 101 WESPAMGK-VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS 159 W P G +A K+ + H +Y DL E+ L+ + DL +T T + K Sbjct: 378 WAVPVFGPPAAPVASPLKVGLHGHFFYPDLLPELLERLAANASRPDLFLTTDTPA---KV 434 Query: 160 EILKIFPAA-----RIHIMENHGRDVLPFLILL-ETEQLSNYDYVCKIHGKKSKRKGYSW 213 E L+ AA RI ++ N GRD+ PFL L + YD + +HGKK+K G Sbjct: 435 EQLRALTAAWPAKVRIDVVPNSGRDIGPFLTALRDVLTGGEYDVLLHLHGKKTK--GRRR 492 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTF-DTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR 272 GD WR +L+ +L+G + + + H +G++ YP + +N Sbjct: 493 AIGDPWRNFLWENLIGGDHPMLDAVLAYMAAHPQVGLV------YPEDT--HLLDWARNG 544 Query: 273 EMICTLAGRMGIT-FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 ++ L MG+T +DF G MF VR AL P+ L L P LDG + Sbjct: 545 RVVEELRRDMGLTEPMGTYVDFPVGNMFAVRPAALAPVLALDLKWSDYPVEPIPLDGTVL 604 Query: 332 HAVERCFSLSVKKANFRISDV 352 H +ER V+KA F + V Sbjct: 605 HGIERLLPTVVRKAGFTTAAV 625 >gi|285019449|ref|YP_003377160.1| hypothetical protein XALc_2689 [Xanthomonas albilineans GPE PC73] gi|283474667|emb|CBA17166.1| conserved hypothetical protein [Xanthomonas albilineans] Length = 686 Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 76/243 (31%), Positives = 114/243 (46%), Gaps = 20/243 (8%) Query: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK--IFPAARIHIMENHGR 178 +VH +Y ++ E+ N L+ ++ + L VT + AS L+ FP + ++EN GR Sbjct: 442 IVHAWYPNVLPELLNPLAASALPWRLLVTTSPDQASAVQAQLRDCSFPY-EVMVLENRGR 500 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL E D V K+H K+S GD WR L L GA +I+ Sbjct: 501 DILPFLHAGERLLQDGVDVVLKLHTKRSTHLH----NGDAWRSELLQRLAGA-DRAARIL 555 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD---FFA 295 F +G++ + P D+ G NR L R G T D LD F + Sbjct: 556 EAFAQDPMLGLVAPEGHLLP--LADF---WGGNRMAADYLLRRTGYT--DVCLDEAHFIS 608 Query: 296 GTMFWVRTEALDPIKNLRL-SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDC 354 G+MFWVR AL P+ + L FEP+ +DG + HA ER +L + +R++ Sbjct: 609 GSMFWVRLHALRPLLDSHLCPSEFEPE-QGQIDGTLAHAAERVTALLAQHRGYRVATAAE 667 Query: 355 ILG 357 ++G Sbjct: 668 LIG 670 >gi|227875198|ref|ZP_03993340.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] gi|227844103|gb|EEJ54270.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] Length = 613 Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 74/247 (29%), Positives = 116/247 (46%), Gaps = 18/247 (7%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEIL 162 A V+ K +IA V H++Y D+ EI S L + +T T + I+ ++ Sbjct: 285 AEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKKTQIEQQLQ 344 Query: 163 KIFPAARIHIME-NHGRDVLPFLIL-LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + A + I+E N GRDV FL+ + + +D V KIH KKS + Y+ +L++ Sbjct: 345 TMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGCFDVVAKIHSKKSAQDAYN--AAELFK 402 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMIC 276 R LF +LL +PG ++ F T +GM+ +P +LG N++ Sbjct: 403 RHLFENLLPSPGYTANLLHLFATEPYLGMV------FPPAVSLGYPTLGHAWFANKKPAL 456 Query: 277 TLAGRMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 L R+GI F D G+MF+ R EAL P+ + P+ + DG + H + Sbjct: 457 ALCERLGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSLAHVI 516 Query: 335 ERCFSLS 341 ER FS S Sbjct: 517 ERIFSYS 523 >gi|90425670|ref|YP_534040.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] gi|90107684|gb|ABD89721.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] Length = 846 Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 75/240 (31%), Positives = 112/240 (46%), Gaps = 16/240 (6%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIH 171 A + +IAI H YY DL L++ + S DL +T E A+ + L+ F Sbjct: 592 AARPRIAIHGHFYYPDLLESFLKLIAANASSVDLFLTTSGPEQAAQIRKSLRAFGIQNAD 651 Query: 172 I--MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 + + N GRD+ PFL + ++L +YD V HGK+SK + GD WR + + L+G Sbjct: 652 VWSVPNRGRDIGPFLKEM-PDKLGSYDIVGHFHGKRSKHVDSTV--GDQWRDFAWQHLIG 708 Query: 230 APGVVFKIIR-TFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ- 287 + +I F +G++ + Y +NR++ LA RM I Sbjct: 709 DAFPMIDVIADAFAEDAKLGLVFAED--------PYLNGWDENRDLAERLAQRMKIEAPL 760 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + DF GTMFW R AL P+ L L P +DG I HA+ER +V+K+ F Sbjct: 761 PEHFDFPIGTMFWARVAALQPLFQLNLDWNDYPHEPLPIDGTILHALERIVPFAVQKSGF 820 >gi|261868364|ref|YP_003256286.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] gi|3132260|dbj|BAA28137.1| unnamed protein product [Actinobacillus actinomycetemcomitans] gi|261413696|gb|ACX83067.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] Length = 632 Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 71/256 (27%), Positives = 124/256 (48%), Gaps = 25/256 (9%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE---ILK 163 KV ++ KI +V H+YY D+ EI + N+ S+DL +T E + ++ E ILK Sbjct: 285 SKVEKVRSDIKILVVAHIYYSDMLDEIISYTQNIPCSYDLLITTANEKSKLEIESNPILK 344 Query: 164 IFPAARIHIM---ENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLW 219 + A I++ +N GRD+ I + E +S YD+VC++H KKS + ++ + Sbjct: 345 MSGAKGINVKVVEQNRGRDMSSLFITCKQEIISERYDWVCRLHSKKSPQNSHNM--SIHF 402 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMI 275 + ++ ++L + K+I D ++ IG + P+ +LG NR++ Sbjct: 403 KEMMYLNILKDKAYISKVINYLDKNKSIG------FAMPSMVHIGHPTLGHAWFTNRDLA 456 Query: 276 CTLAGRMGITFQDQKLDFFA--GTMFWVRTEALDPI--KNLRLSRYFEPKVHKALDGEIE 331 +A R+GI + FA GTMFW R EAL + N + + + +H+ D + Sbjct: 457 IKIAERVGIKLPFDDISPFAAYGTMFWFRPEALKKLFEYNWKFEDFNKEPMHQ--DSSLA 514 Query: 332 HAVERCFSLSVKKANF 347 H +ER + A + Sbjct: 515 HILERLLVYAAHDAGY 530 >gi|222148479|ref|YP_002549436.1| hypothetical protein Avi_2007 [Agrobacterium vitis S4] gi|221735467|gb|ACM36430.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 513 Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 78/249 (31%), Positives = 123/249 (49%), Gaps = 30/249 (12%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILKIFPAAR- 169 +A++ + I VH +Y++L+ EIA+ L L++ F L VT+ ES A + +L F + Sbjct: 251 VALQLSLCIHVHCFYVELFNEIADRLQCLTLPFYLVVTVCNESDAKVVENLLVDFNQRQN 310 Query: 170 --IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 I ++EN GRD+ PFLI + D V +H KKS +W GD WRR+LF Sbjct: 311 THILVVENRGRDIAPFLI-DASPIWRKSDLVLHLHTKKSPH--ITW--GDNWRRYLFDQT 365 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYC--DYTCSLGKNREMICTLAGRMGIT 285 +G ++ II F D+GM+ YP +C + KN++ I +A ++ + Sbjct: 366 IGYEPLLKGIIDQFQDRDDMGMM------YPENFCMIKHFTEEEKNKDAIRYIAQKLRLE 419 Query: 286 FQDQKLD-FFAGTMFWVRTEAL------DPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 + L + AG+M + R +AL D ++NL F P+ LDG H +ER Sbjct: 420 CSFEALGAYAAGSMAFYRVKALASVLEYDALENL-----FGPE-QGQLDGTAAHVLERLL 473 Query: 339 SLSVKKANF 347 V+ F Sbjct: 474 PEMVRLNGF 482 >gi|302337197|ref|YP_003802403.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] gi|301634382|gb|ADK79809.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] Length = 1100 Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 89/309 (28%), Positives = 138/309 (44%), Gaps = 44/309 (14%) Query: 42 FWTLFYKRS-----KKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQ 96 F TLFY RS +K + + G R G + N +M+ER+ F + FP Sbjct: 128 FDTLFYLRSYPDVAEKGVNPLGHYIKIGWRKGNR----PNPFMVERDCTF----LPAFPL 179 Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESA 155 H S I +V H+Y+ DL +S++ FDL VT + E+ Sbjct: 180 QDHSALS--------------IVVVFHIYHEDLVGSCLQYISHIPYPFDLIVTTPLEENN 225 Query: 156 SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE 215 ++ ++P A I +N GRD+ PFL + + ++ YD CK+H KK YS Sbjct: 226 DAILQVKSLYPDAEIVRSKNAGRDIGPFLQVWD--RVLQYDLCCKVHTKKGN-SAYS--- 279 Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 ++WR +L V I+R F+ + + G+ Y Y LGKN+++ Sbjct: 280 -EIWRDLSLRGILETVDTVHGILRMFEQEDSLALAGAELL-----YGSYQFLLGKNKDLS 333 Query: 276 CTLAGRMGITFQDQKLD-FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 +L I + FF GTMFW+R + + NL+ ++ P DG+ EHA+ Sbjct: 334 NSLIKDYNIPVNSYSNNGFFMGTMFWMRVKKFIFLSNLKQLQF--PIEDGKNDGKYEHAL 391 Query: 335 ERCF-SLSV 342 ER SLS+ Sbjct: 392 ERLLGSLSL 400 >gi|323138318|ref|ZP_08073389.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322396401|gb|EFX98931.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 754 Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 63/243 (25%), Positives = 118/243 (48%), Gaps = 17/243 (6%) Query: 111 QIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHV-TLVTESASIKSEILKIFP--A 167 I + +A +VH +Y DL I L N+ + DL++ T E A I ++++ + + Sbjct: 364 DINMDKPVAAIVHAFYPDLLEHILGYLENIPCAVDLYISTDSAEKAEIIGKVVRNWSKGS 423 Query: 168 ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 + IMEN GRD+ P ++ + + +D +H K+S G + WR +L L Sbjct: 424 TDVRIMENRGRDIAPMIVGFR-DVFAKHDIFLHVHTKRSPHAGDLLYH---WRDYLLNTL 479 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL--GKNREMICTLAGRMGIT 285 G + ++ F+ + IG++ +P + + L G + ++ L R+G+ Sbjct: 480 FGTGDIARSVLSLFNDPK-IGVV------FPQHFFEVRRMLNWGFDYDLARNLLARVGVQ 532 Query: 286 F-QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 +D L+F +G+MFW RT+A+ P+ +L L P+ +DG + HA+ER + + Sbjct: 533 LNKDLVLEFPSGSMFWGRTDAIRPLLDLDLQFSDFPEEAGQIDGTLAHAIERTLLMVAES 592 Query: 345 ANF 347 + Sbjct: 593 KGY 595 >gi|317047360|ref|YP_004115008.1| family 2 glycosyl transferase [Pantoea sp. At-9b] gi|316948977|gb|ADU68452.1| glycosyl transferase family 2 [Pantoea sp. At-9b] Length = 1419 Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 71/235 (30%), Positives = 110/235 (46%), Gaps = 31/235 (13%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL------VTESASIKSEILKIFPAARIH 171 I + +HLYY+DL E L+ + FDL ++L V E +K + Sbjct: 599 IGVHLHLYYVDLADEFIKHLNTIPTGFDLFISLPRGKHNVEECERKFRSGIKTLKKLVVR 658 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR---KGYSWWEGDLWRRWLFYDLL 228 EN GRD+ PF++ E LS Y+ + IH KKS + KG WRR+L + L Sbjct: 659 ETENKGRDIYPFIVEFGAELLS-YELILHIHSKKSPQALSKG--------WRRFLLHYTL 709 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYT--CSLGKNREMICTLAGRMGITF 286 G + +I+ +FD +G++ +P + T + G NRE++ R+G ++ Sbjct: 710 GTESITTQILNSFDNDPKLGVL------FPAYFYGVTRQPNWGGNREIVKQQLARLGFSY 763 Query: 287 QDQKL-DFFAGTMFWVRTEALDPIKN--LRLSRYFEPKVHKALDGEIEHAVERCF 338 D+ AG+ FW R++AL P+ N RL + E DG + H ER F Sbjct: 764 DMTYCPDYPAGSFFWSRSDALRPLLNGEYRLEDFDEEAGQ--YDGTLAHGFERLF 816 >gi|220924211|ref|YP_002499513.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] gi|219948818|gb|ACL59210.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] Length = 1366 Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 68/256 (26%), Positives = 124/256 (48%), Gaps = 18/256 (7%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIK-SEILKIFPAAR- 169 + + ++A++ H++Y D E++ L+ + DL ++ TE + + L+ + + Sbjct: 697 LELPERVAVIAHVFYTDFCSELSAYLARIPTQADLFISTDTEDKRQQIAFALQSYNMGKL 756 Query: 170 -IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + +M N GRD+ P L+ + + ++Y+Y IH KKS WR +L +LL Sbjct: 757 TVRVMPNIGRDIAPMLVGFD-DVFNSYEYFLHIHSKKSPHDP----AFGSWREFLLENLL 811 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF-Q 287 G+ ++ I+ H+ G++ S+ + + + + G N E + L GR GI Sbjct: 812 GSEDIIRSILYLLHAHK-TGIVFSQHF----EPVRHLLNFGYNFETMKGLLGRCGIKISN 866 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 D L+F + + FW R+ AL P+ +L L +DG + HA+ER V+K+ F Sbjct: 867 DLVLEFPSSSFFWGRSSALKPLLDLNLDWSDFAAEAGQIDGTLAHAIERSVLYIVEKSGF 926 Query: 348 RISDVDCILGYRKSLS 363 R + V G+R L Sbjct: 927 RWAKV----GHRSKLP 938 >gi|160894491|ref|ZP_02075267.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] gi|156863802|gb|EDO57233.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] Length = 646 Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 70/262 (26%), Positives = 125/262 (47%), Gaps = 38/262 (14%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP 166 K+ +I K K+A+V+HLY+ DL + SN+ D+++T T++ K ILK+F Sbjct: 304 NKMDEILKKRKLALVMHLYFPDLVEDSFQWASNVPKETDVYIT--TDTVEKKEAILKVFK 361 Query: 167 A-----ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK---------GYS 212 + ++ N GRDV L+ ++ + + NYDY C +H KK+ + GY Sbjct: 362 NLPCNHLEVRVIVNRGRDVSSILVGVK-DVIQNYDYACFVHDKKTAQAKPGSVGDSFGYK 420 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY----CDYTCSL 268 W L+ + V +++TF+ + +G++ + Y ++ C+ Sbjct: 421 CWNNTLYNKEF----------VCNVLQTFEDNERLGILSPPEPNHGPFYQTLGNEWGCNF 470 Query: 269 GKNREMICTLAGRMGITF---QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKA 325 K+RE +A ++GIT +D++ GT FW R AL + + P+ Sbjct: 471 EKSRE----VADKLGITIPMSEDKEALAPYGTFFWFRPTALKVLFDHDWQYEEFPEEPNN 526 Query: 326 LDGEIEHAVERCFSLSVKKANF 347 DG I HA+ER + + V++A + Sbjct: 527 FDGTILHAIERLYPICVQQAGY 548 >gi|194364297|ref|YP_002026907.1| hypothetical protein Smal_0519 [Stenotrophomonas maltophilia R551-3] gi|194347101|gb|ACF50224.1| conserved hypothetical protein [Stenotrophomonas maltophilia R551-3] Length = 686 Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 64/245 (26%), Positives = 111/245 (45%), Gaps = 13/245 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARIHIM 173 ++ +V+H +++D E+ + + + + L +T ++ A ++S A I Sbjct: 437 PSRPCVVIHAWHLDALPELLSAVKDSGLPARLVITTTSDRQAQVQSITESHGLPAEIWAY 496 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 +NHGRD+LPFL + N V K+H K+S + GD WRR + LLG P Sbjct: 497 DNHGRDILPFLHAADRLLQQNESLVLKLHTKRSTHRD----NGDQWRREMVDALLG-PAQ 551 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + +G++ + DY +G N + + L ++G+ F Sbjct: 552 AAANLAHLQADPRLGLMAPAGHLL--NVADY---IGGNAQRMERLWAQLGLDGAPGDGQF 606 Query: 294 FAGTMFWVRTEALDPIKNLR-LSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 +G+MFWVR +AL P+ + L FE + + +DG + HA+ER A F + D Sbjct: 607 ASGSMFWVRLQALRPLLDAHLLPSMFEVEAGQ-IDGTLAHAIERATGAVATCAGFSVGDT 665 Query: 353 DCILG 357 + G Sbjct: 666 SQVHG 670 >gi|332995244|gb|AEF05299.1| hypothetical protein ambt_19030 [Alteromonas sp. SN2] Length = 638 Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 67/241 (27%), Positives = 117/241 (48%), Gaps = 17/241 (7%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA------A 168 + K+AIV+H++Y D + A + + D+ +T T+ +IK++ LK F Sbjct: 131 QQKVAIVLHIFYPDFVDKFAASVQRFPTNVDVFITAGTD--AIKNKALKTFNGLKNVQKV 188 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + + EN GR+ PFL+ +++L +YD +C +H KKS YS E W + LL Sbjct: 189 QAVLCENRGRNFGPFLVNF-SDELLDYDLMCHLHSKKSL---YSGREQTQWFDYQNNFLL 244 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAY-RYPNKYCDYTCSLGKNREMICTLAGRMGITFQ 287 VV I+R FD ++G+ ++ P +TC+ G +++ + + G+ Sbjct: 245 KDKHVVKSILRLFDEREELGIYYPTSFWMMPAWVNHWTCNKGISQDFV----DKWGLDIS 300 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 D +++ G MFW R +A+ P+ + P+ DG HA+ER L +K F Sbjct: 301 DDFVNYPVGGMFWARPKAIAPLLKEKFEYSDFPEEPLPNDGSWLHALERSIGLLAEKQGF 360 Query: 348 R 348 + Sbjct: 361 K 361 >gi|30024633|dbj|BAC75688.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 79/241 (32%), Positives = 114/241 (47%), Gaps = 38/241 (15%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E S+DL +T T+S K+EI ++ A A+I + Sbjct: 285 KVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFIT--TDSDDKKAEIEEVLSANSQEAQIFV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YD+V H KKSK + W G WR L D+L P Sbjct: 343 TGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSWREELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMIC----TLAGRMGIT 285 I+ + IG++ + +RY NK D N +I TL +MG+T Sbjct: 398 D--NILAQLQQNPKIGLVIADMPTFFRY-NKIVD-----AWNEHLIAPEMNTLWQKMGMT 449 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 +K+DF A GT W + +AL P+ +L L+ P+ I HA+ER Sbjct: 450 ---KKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPE-EPLPQNSILHAIERL 505 Query: 338 F 338 Sbjct: 506 L 506 >gi|30024644|dbj|BAC75698.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 79/241 (32%), Positives = 114/241 (47%), Gaps = 38/241 (15%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E S+DL +T T+S K+EI +I A A++ + Sbjct: 285 KVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFIT--TDSDDKKAEIEEILSANSQEAQVFV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YD+V H KKSK + W G WR L D+L P Sbjct: 343 TGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSWREELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMIC----TLAGRMGIT 285 I+ + IG++ + +RY NK D N +I TL +MG+T Sbjct: 398 D--NILAQLQQNPKIGLVIADMPTFFRY-NKIVD-----AWNEHLIAPEMNTLWQKMGMT 449 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 +K+DF A GT W + +AL P+ +L L+ P+ I HA+ER Sbjct: 450 ---KKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPE-EPLPQNSILHAIERL 505 Query: 338 F 338 Sbjct: 506 L 506 >gi|82703518|ref|YP_413084.1| glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] gi|82411583|gb|ABB75692.1| Glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] Length = 828 Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 70/236 (29%), Positives = 110/236 (46%), Gaps = 13/236 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIK-SEILKIFPA--ARIHIM 173 ++A+ +H+YY +L+ EI L ++ DL +++ TE + + +L +P I I+ Sbjct: 591 RVALHLHVYYSELFPEIMARLKVNNVRPDLFISVPTECTRNEVTGLLNDYPGKVVDIQIV 650 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L + L +YD + +H KK+ G W +L +LLG Sbjct: 651 PNRGRDIGPLLTAFGSVFLDDYDAIGHLHTKKTADLSDEMI-GKRWYTFLLENLLGGKRN 709 Query: 234 VFKII-RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD 292 + II IG++ +P+ + G N+ +LA ++G+ + Sbjct: 710 MADIILGRMTADPAIGIV------FPDD--PHVFDWGNNKAHADSLASKLGLGKLQENFV 761 Query: 293 FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F GTMFW RTEAL P+ L LS P DG I HA+ER L K + R Sbjct: 762 FPMGTMFWARTEALRPLFTLDLSWQDYPAEPLPYDGTILHALERLLPLIAAKQSTR 817 >gi|290580710|ref|YP_003485102.1| rhamnan synthesis protein F [Streptococcus mutans NN2025] gi|254997609|dbj|BAH88210.1| RgpFc protein [Streptococcus mutans NN2025] Length = 557 Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 79/237 (33%), Positives = 113/237 (47%), Gaps = 30/237 (12%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E S+DL +T T+S K+EI +I A A++ + Sbjct: 259 KVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFIT--TDSDDKKAEIEEILSANSQEAQVFV 316 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YD+V H KKSK + W G WR L D+L P Sbjct: 317 TGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSWREELI-DMLVKPA 371 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 I+ + IG++ + +RY NK D EM TL +MG+T + Sbjct: 372 D--NILAQLQQNPKIGLVIADMPTFFRY-NKIVDAWNEHLIAPEM-NTLWQKMGMT---K 424 Query: 290 KLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 K+DF A GT W + +AL P+ +L L+ P+ I HA+ER Sbjct: 425 KIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPE-EPLPQNSILHAIERLL 480 >gi|3399709|dbj|BAA32094.1| rgpFc [Streptococcus mutans] Length = 583 Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 79/241 (32%), Positives = 114/241 (47%), Gaps = 38/241 (15%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E S+DL +T T+S K+EI +I A A++ + Sbjct: 285 KVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFIT--TDSDDKKAEIEEILSANGQEAQVFV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YD+V H KKSK + W G WR L D+L P Sbjct: 343 TGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEADF--WAGQSWREELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMIC----TLAGRMGIT 285 I+ + IG++ + +RY NK D N +I TL +MG+T Sbjct: 398 D--NILAQLQQNPKIGLVIADMPTFFRY-NKIVD-----AWNEHLIAPEMNTLWQKMGMT 449 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 +K+DF A GT W + +AL P+ +L L+ P+ I HA+ER Sbjct: 450 ---KKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPE-EPLPQNSILHAIERL 505 Query: 338 F 338 Sbjct: 506 L 506 >gi|221634514|ref|YP_002523202.1| hypothetical protein RSKD131_4489 [Rhodobacter sphaeroides KD131] gi|221163387|gb|ACM04349.1| Hypothetical Protein RSKD131_4489 [Rhodobacter sphaeroides KD131] Length = 1042 Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 70/232 (30%), Positives = 105/232 (45%), Gaps = 27/232 (11%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARIHIMENHG 177 A ++H++++D+ ++ L +L S D VTL + + + FP A I +EN G Sbjct: 8 AAIIHVWHLDVLDDLTEALEHLHGSADQFVTLPSSFRQEQRDRVTAAFPKATIVEVENRG 67 Query: 178 RDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI 237 +D+ L++ L YD++CKIH K KG + E WRR L +LG+ V I Sbjct: 68 QDIGALFQLMQKVNLGRYDFICKIHTK----KGPNMPEE--WRRALLDGVLGSKRQVTHI 121 Query: 238 IRTFDTHRDIGMIGSR-------AYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 + +F + + G+R AY PN K E +L G + +D Sbjct: 122 VESFRADPKVMLAGARQLFVYGPAYLEPNA--------DKVAEDYASLIGDFDVRSED-- 171 Query: 291 LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV 342 F AGT FW+RT L + + F P + DG HA ER F L V Sbjct: 172 WGFIAGTCFWIRTSILQEMAACAVD--FLPADY-VTDGAPAHAAERMFGLCV 220 >gi|24379285|ref|NP_721240.1| RgpFc protein [Streptococcus mutans UA159] gi|24377204|gb|AAN58546.1|AE014924_6 RgpFc protein [Streptococcus mutans UA159] Length = 583 Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 79/241 (32%), Positives = 113/241 (46%), Gaps = 38/241 (15%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K A+ +H++Y+DL E S+DL +T T+S K+EI +I A A++ + Sbjct: 285 KAAVHLHVFYVDLLEEFLTAFKQFHFSYDLFIT--TDSDDKKAEIEEILSANSQEAQVFV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YD+V H KKSK + W G WR L D+L P Sbjct: 343 TGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEADF--WAGQSWREELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMIC----TLAGRMGIT 285 I+ + IG++ + +RY NK D N +I TL +MG+T Sbjct: 398 D--NILAQLQQNPKIGLVIADMPTFFRY-NKIVD-----AWNEHLIAPEMNTLWQKMGMT 449 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 +K+DF A GT W + +AL P+ +L L+ P+ I HA+ER Sbjct: 450 ---KKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPE-EPLPQNSILHAIERL 505 Query: 338 F 338 Sbjct: 506 L 506 >gi|302337198|ref|YP_003802404.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] gi|301634383|gb|ADK79810.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] Length = 1808 Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 65/240 (27%), Positives = 115/240 (47%), Gaps = 19/240 (7%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL---VTESASIKSEILKIFP---AARIH 171 I + +HL+YIDL E+ + L N+ + F L ++ V + IK + K P + Sbjct: 1019 IGVHLHLFYIDLAEELLSSLINIPVCFSLFISTSAGVKDQEYIKKIVNKKLPLCNECTVI 1078 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 EN GRD+ PF++ LS +D + H KKS +S D RR+L + +LG Sbjct: 1079 QTENRGRDIAPFIVEF-GNSLSQFDLILHFHSKKSL---HSDSLSDA-RRFLLHYILGNK 1133 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 + + + F + IGM+ + ++ + ++ + +MGI + + Sbjct: 1134 AITIQNLNMFFENGSIGMVAPPYHPSLRNMPNFGLQEYETKQFL----KKMGINYSGKCT 1189 Query: 292 DFFAGTMFWVRTEALDPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 DF AG+ FW R +A+ + N+R + + P+ +DG + H +ER + K+ NF+I Sbjct: 1190 DFPAGSFFWCRKDAIRQLLTSNIRWNSF--PEEKGQIDGTLAHVIERSLGIICKQNNFKI 1247 >gi|329944276|ref|ZP_08292535.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] gi|328531006|gb|EGF57862.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] Length = 636 Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/234 (28%), Positives = 106/234 (45%), Gaps = 7/234 (2%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHV-TLVTESASIKSEILKIFP- 166 V++ K KIA++ HLYY+DL + N+ D+ + T E K P Sbjct: 301 VVREPKKQKIALIAHLYYMDLVEPTLKYIRNMPEGIDIFLSTSSPEKVEQVEAACKGLPY 360 Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 + ++EN GRDV PFL+ + + + +YD VC H KK + Y + GD + F + Sbjct: 361 NIEVRLVENRGRDVGPFLVAWK-DVVHDYDVVCYTHDKKVTQL-YPYSVGDGFAYKCFEN 418 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 LL V +I TFD +G + + + + +T G N + L +G+ Sbjct: 419 LLPTRDFVKNVIATFDAEPRLGFLAPTPPNHADYFPVFTYGWGPNFDRTKALLRELGLDV 478 Query: 287 Q-DQKLDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 D + A G+MFW R +AL P+ + P +DG + HA+ER Sbjct: 479 PLDPTKEPIAPLGSMFWFRPQALKPLFDHDWQWEEFPPEPCPIDGTLMHAIERS 532 >gi|325067622|ref|ZP_08126295.1| hypothetical protein AoriK_07369 [Actinomyces oris K20] Length = 626 Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/238 (28%), Positives = 111/238 (46%), Gaps = 10/238 (4%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFP-AARIHIME 174 KIA++ HLYY+DL + ++ DL +T+ + E A + E K P + ++E Sbjct: 309 KIALIAHLYYMDLLEPTLAYVKSMPEGTDLILTVGSQEKAELVEEACKDLPYNVTVRLIE 368 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDV L+ + + + +YD VC H KK ++ K YS GD + F +LL Sbjct: 369 NRGRDVSALLVGCK-DIIHDYDLVCFTHDKKVTQVKPYSV--GDGFAIKCFENLLATRDF 425 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR---MGITFQDQK 290 V +I TFD +G++ + + + ++ G N E TL + + + + + Sbjct: 426 VKNVIATFDAEPRLGLLAPTPPNHGDYFPVFSMGWGPNFERTKTLLEKELNLSVPIDESR 485 Query: 291 LDFFA-GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 GTMFW R AL P+ + P +DG I HA+ER + + + + Sbjct: 486 APIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNNIDGTILHAIERAYGYVAQASGY 543 >gi|258654317|ref|YP_003203473.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] gi|258557542|gb|ACV80484.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] Length = 631 Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 78/252 (30%), Positives = 115/252 (45%), Gaps = 34/252 (13%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIME-- 174 +I ++ H++Y D+ E+ + N+ + FDL VT T SA+ K I A + +E Sbjct: 297 RICVLAHIFYEDMTDEMMGWIGNIPVPFDLVVT--TTSAAKKEAIESALEAYALKSVEVR 354 Query: 175 ----NHGRDVLPFLILLETEQLS-NYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 N GR FLI S YD V KIH KKS + G + G L++ +LL Sbjct: 355 LVESNRGRAESAFLIACRDVLTSGEYDLVLKIHSKKSPQNGANL--GQLFKHHSVDNLLS 412 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMICTLAGRMGI- 284 +PG V I+ F + +GM+ +P +LG NRE LA ++GI Sbjct: 413 SPGYVASILGMFQSQPSLGMV------FPPVVNIGFPTLGHSWFTNREAAHELADQLGIH 466 Query: 285 TFQDQKLDFFA-GTMFWVRTEALDPIKNLRLSRY------FEPKVHKALDGEIEHAVERC 337 T D+ GTMFW R E+L +L+R+ F + DG + H +ER Sbjct: 467 TIFDRTTPLAPNGTMFWARPESL-----AKLARHDFDYSQFAAEHEGWSDGMLGHVIERL 521 Query: 338 FSLSVKKANFRI 349 + +V A RI Sbjct: 522 YGYAVLDAGLRI 533 >gi|304309760|ref|YP_003809358.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] gi|301795493|emb|CBL43691.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] Length = 1315 Score = 84.7 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 68/244 (27%), Positives = 109/244 (44%), Gaps = 24/244 (9%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE-ILKIFPAAR---IH 171 A IAI +HLYY DL LS + FDL++++ + ++ E +++ P R I Sbjct: 443 ATIAIHIHLYYADLAPTFVQALSRMERPFDLYISIQVRANPVEIEAVVRKIPCLRGLDIR 502 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 N GRD+ PF+ + E L YD + +H KKS Y+ W ++ L +P Sbjct: 503 ATPNLGRDLYPFVCIF-GEALRKYDIIAHLHSKKSL---YNQGATAGWLEYILDSLFRSP 558 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNK-----YCDYTCSLGKNREMICTLAGRMGIT- 285 + +I+ G++ YP Y YT ++R + R G+T Sbjct: 559 EDIARILERLSDASQTGIV------YPQNFSGLPYMAYTWLANRSRAQ--QVQARFGLTS 610 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSR-YFEPKVHKALDGEIEHAVERCFSLSVKK 344 D+ AG+MFW R +A+ P +L+ FE + + DG + H +ER L + Sbjct: 611 LPSGYFDYPAGSMFWARADAIAPFFEAQLNEDDFENESGQT-DGTLAHTLERFLVLVPES 669 Query: 345 ANFR 348 FR Sbjct: 670 LGFR 673 >gi|310829395|ref|YP_003961752.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] gi|308741129|gb|ADO38789.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] Length = 627 Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 71/247 (28%), Positives = 118/247 (47%), Gaps = 16/247 (6%) Query: 111 QIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS-IKSEILKIFPAA- 168 +I + +IA + HLY+ DL E LS++ D+++T TE + E K F Sbjct: 306 KILCEKRIAAIFHLYFEDLIDETYRYLSSMPEEADIYITTDTEPKKKLIQEKFKDFSCRN 365 Query: 169 -RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFYD 226 ++ +++N GRDV L+ + + NYDYVC H KK ++ K YS G + F + Sbjct: 366 FKVILIQNRGRDVSALLVATKA-FIMNYDYVCFAHDKKVTQTKPYSI--GGAFAYKCFEN 422 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMI----GSRAYRYPNKYCDYTCSLGKNREMICTLAGRM 282 L V II F+ + +GM+ + YP ++ C+ + +I L ++ Sbjct: 423 TLQNKNFVLNIINAFEKNPRLGMLMPAPPNNGPYYPTLGNEWMCNYEVTKNLIDELGIKV 482 Query: 283 GITFQDQKLDFFAGTMFWVRTEALDPI--KNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 + + + GTMFW R +AL + KN S + P+ +DG + HA+ER + L Sbjct: 483 PMDPGKEPISPL-GTMFWFRPKALKVLFDKNWEYSDF--PEEPNKVDGTLLHAIERAYGL 539 Query: 341 SVKKANF 347 V+ F Sbjct: 540 IVQSEGF 546 >gi|209524107|ref|ZP_03272658.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] gi|209495482|gb|EDZ95786.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] Length = 2819 Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 74/250 (29%), Positives = 115/250 (46%), Gaps = 20/250 (8%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIKSEILKIFPAARIHIMEN 175 KIA+V+H YY +L E+ + L NLS +DL VT+ S+ S + K ++ I++N Sbjct: 1738 KIAVVLHAYYPELLPELFSKLDNLS-DYDLFVTIPENVVDSVTSALDKYTKNYQVSIVKN 1796 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 G D+LPFL ++ Y YVCKIH K+ G LWR L +LG + Sbjct: 1797 IGYDILPFLEVISELDTLGYKYVCKIHTKRDHPDF-----GSLWRECLLDAVLGDKNITE 1851 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFA 295 +II FD + + ++G A Y + K ++MI + + + FF Sbjct: 1852 QIITAFDNNPSLQIVGP-ALLYMSMLGTIYDGHEKMKKMIHDFMEPLNLI---EDWGFFG 1907 Query: 296 GTMFWVRTEALDPIKNLRLSRYFEPKVHKA--LDGEIEHAVERCFSL-------SVKKAN 346 G+MFW R L I + L + + + K+ G H VER L V + Sbjct: 1908 GSMFWSRITPLKYIADQILLKPIDWQASKSWLTTGFYYHIVERLLGLVSYINEGQVGLVD 1967 Query: 347 FRISDVDCIL 356 ++I D + I+ Sbjct: 1968 YKIIDTENIV 1977 >gi|71735705|ref|YP_273244.1| hypothetical protein PSPPH_0972 [Pseudomonas syringae pv. phaseolicola 1448A] gi|71556258|gb|AAZ35469.1| conserved hypothetical protein [Pseudomonas syringae pv. phaseolicola 1448A] Length = 1262 Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 74/244 (30%), Positives = 108/244 (44%), Gaps = 25/244 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-----SIKSEILKIFPAARIH 171 +I + +HLYY DL I+ L+N+ ++FDL ++ E I S+ + I Sbjct: 211 RIGVYLHLYYTDLLGAISKHLNNIPLAFDLFISTPHELDHKKLRKIVSDSVTNVKEISIK 270 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR-KGYSWWEGDLWRRWLFYDLLGA 230 + N GRD+ PF+I E L YD +C IH KKS+ KG S W D+ LLG+ Sbjct: 271 HVPNRGRDIAPFIIEFGNE-LQAYDAICHIHTKKSEHTKGLSDWGDDI-----LSSLLGS 324 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYP---NKYCDYTCSLGKNREMIC-TLAGRMGITF 286 V KI+ + G YP N Y +N E+ L+ + Sbjct: 325 REDVKKILTL--------LKGDAKIIYPEGQNYYMKDPTGWSENHEIAKHILSDHLETDI 376 Query: 287 QD-QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + K +F G+MFW R E + N+ L P+ DG + HA+ER +S A Sbjct: 377 SNFPKAEFPEGSMFWARQEGIQSFLNIPLDWEDFPEEPIPTDGTLAHALERIILISAYAA 436 Query: 346 NFRI 349 RI Sbjct: 437 PGRI 440 >gi|296876714|ref|ZP_06900762.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] gi|296432216|gb|EFH18015.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] Length = 582 Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 77/254 (30%), Positives = 120/254 (47%), Gaps = 26/254 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+ K I +I AR+ + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFRDFHFDYDLWIT--TDVEEKKQAIEQILSNRAQDARVVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L ++L P Sbjct: 343 TGNIGRDVLPMLLL--KEQLSKYDYVGHFHTKKSKEADF--WAGESWRKELI-EMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQ 287 +I+ + + +G+ + +RY + +L + L RMG T Sbjct: 398 D--QILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPE--MNKLWQRMGATKTID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 +K++ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FEKINTFVMSYGTFVWFKYDALKPLFDLNLTAADVP-AEPLPQNSILHAIERLLIYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|116628171|ref|YP_820790.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] gi|116101448|gb|ABJ66594.1| Lipopolysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] Length = 581 Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 80/254 (31%), Positives = 121/254 (47%), Gaps = 26/254 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + ++DL +T T+ K EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWIT--TDVEEKKQEIEQILSRRSQDATIVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L E+LS YDYV H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGM-IGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQ 287 +I+ + + +G+ IG +RY + +L + L RMG T Sbjct: 398 D--QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPE--MNKLWQRMGATKNID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FKNLNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVP-AEPLPQNSILHAIERLLVYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|55823377|ref|YP_141818.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] gi|55739362|gb|AAV63003.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] Length = 581 Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 80/254 (31%), Positives = 121/254 (47%), Gaps = 26/254 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + ++DL +T T+ K EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWIT--TDIEEKKQEIEQILSRRSQDATIVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L E+LS YDYV H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGM-IGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQ 287 +I+ + + +G+ IG +RY + +L + L RMG T Sbjct: 398 D--QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALISPE--MNKLWQRMGATKNID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FKNLNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVP-AEPLPQNSILHAIERLLVYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|291520004|emb|CBK75225.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 984 Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 70/245 (28%), Positives = 111/245 (45%), Gaps = 34/245 (13%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV--TESASIKSEI--LKIFPAARIHIM 173 +A+ +HL+Y+DL E + +N+ FDL+++ + + IKS + LK+ I + Sbjct: 329 VAVHLHLFYVDLLPEFVSYFANIPFRFDLYISCQEGADVSVIKSGVKELKMANKVVIRPL 388 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P + +++ +DY +H KKS YS E WR++ LLG+P Sbjct: 389 PNRGRDLAPLYVGF-ADEIRQHDYFLHVHSKKSL---YSGAEKGGWRQFSLELLLGSPEK 444 Query: 234 VFKIIRTF----------DTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA-GRM 282 V I F D H ++ MI AY + + G R++ G M Sbjct: 445 VNSIFNLFKNKNAGLVYPDIHEEVPMI---AYS-------WLANAGLGRKLFDEFELGEM 494 Query: 283 GITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV 342 F ++ AG+ FW RT+AL PI N P+ DG + HA+ER Sbjct: 495 PTVF-----NYPAGSFFWARTDALMPIFNRNYIYEDFPEEAGQTDGTLAHALERIIPFVS 549 Query: 343 KKANF 347 +K + Sbjct: 550 RKLGY 554 >gi|306831662|ref|ZP_07464819.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325978600|ref|YP_004288316.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] gi|304426087|gb|EFM29202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325178528|emb|CBZ48572.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] Length = 586 Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 69/248 (27%), Positives = 119/248 (47%), Gaps = 29/248 (11%) Query: 108 KVMQIAIKA-----KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSE 160 KV+++ K K+A+ +H +Y+DL E N N +DL +T TE+ A I+S Sbjct: 274 KVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESI 333 Query: 161 ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + K A++ + N GRD++P L L E+LS+YDY+ H KKS Y +W GD WR Sbjct: 334 LEKNGKIAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE--YPYWVGDSWR 389 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTL 278 L+ L+ + I+ + + ++G++ + + +Y +NR + + L Sbjct: 390 NELYQMLIQSAD---NILANLENNDNLGLVIADIPSF-FRYTKIVDPWNENRFADGMNEL 445 Query: 279 AGRMGITFQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 RM + ++++DF GT W + + L P+ +L L+ P I Sbjct: 446 WERMNL---ERQIDFNNLSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPS-EPIPQHTI 501 Query: 331 EHAVERCF 338 H++ER Sbjct: 502 LHSIERIL 509 >gi|322385732|ref|ZP_08059376.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] gi|321270470|gb|EFX53386.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] Length = 598 Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 72/232 (31%), Positives = 112/232 (48%), Gaps = 20/232 (8%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE--ILKIFPAARIHIME 174 KIA+ +H YY+DL + N ++DL +T +E ++ E +LK +I+I Sbjct: 288 KIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEKKKLEIEAVLLKRNQLGKIYITG 347 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRD++P L L E+L YDY+ H KKS Y +W GD WR LF D+L P + Sbjct: 348 NKGRDIIPMLKL--REELCTYDYIGHFHTKKSPE--YPYWVGDSWRNELF-DMLLKPADL 402 Query: 235 FKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQDQ 289 I+ + + + +G++ + +RY K D + K + + L RM I + Sbjct: 403 --IMASLENDKRLGLVIADIPTFFRY-TKIVD-PWNENKFADDMNILWERMDINRSIDFN 458 Query: 290 KLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 KL+ F GT W + +AL P+ +L L P I H++ER Sbjct: 459 KLNTFIMSYGTFIWFKYDALKPLFDLNLQDEDIPS-EPLPQHTILHSIERIL 509 >gi|227497960|ref|ZP_03928140.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] gi|226832618|gb|EEH65001.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] Length = 626 Score = 82.8 bits (203), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 68/260 (26%), Positives = 127/260 (48%), Gaps = 16/260 (6%) Query: 90 RIHHFPQLLHGWESPAMGKVMQIAIK---AKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R ++ L+ E + +A++ +KIA+V+H+Y++DL ++ + +++ DL Sbjct: 278 RTNNLADLVKNLELTYILPTQAVAVQPEESKIALVMHVYHMDLLPQLLHYAASMPAGCDL 337 Query: 147 HVTLVTESAS--IKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 T+ TE+ + ++ + ++EN GRDV L+ +L +YD VC IH K Sbjct: 338 IATVDTEAKAQQVREATAGLSLNVETILIENRGRDVAALLVGARP-RLLDYDLVCFIHDK 396 Query: 205 K-SKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCD 263 K ++ + S EG + + F ++L P V +I TF +G++ A + + + Sbjct: 397 KVTQIRPGSVGEG--FAKRCFENVLATPEFVCNVIATFQAEPRLGVLTPSAPHHGDYFPI 454 Query: 264 YTCSLGKN----REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFE 319 S G N +E++ + I + + F G++FW R +A+ P+ R RY + Sbjct: 455 SAFSWGPNDKNTKELLASFGLHAPIDPDKEAIAPF-GSVFWFRPQAIRPLLE-RKWRYDD 512 Query: 320 -PKVHKALDGEIEHAVERCF 338 P +DG I HA+ER + Sbjct: 513 FPAEPLPIDGTISHAIERVY 532 >gi|319946716|ref|ZP_08020950.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] gi|319746764|gb|EFV99023.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] Length = 581 Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 80/246 (32%), Positives = 117/246 (47%), Gaps = 22/246 (8%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEIL-KIFPAARIHIME 174 K+A+ +H++Y+DL E + ++DL +T V E EIL A + + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFQAFHFAYDLWITTDVEEKKQAIEEILSNRAQVATVVVTG 344 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRDVLP L+L EQLS+YDYV H KKSK + W G+ WR+ L D+L P Sbjct: 345 NIGRDVLPMLLL--KEQLSHYDYVGHFHTKKSKEADF--WAGESWRKELI-DMLVKPAD- 398 Query: 235 FKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQDQ 289 KI+ + + +G+ + +RY N+ + + EM L RMG T + Sbjct: 399 -KILANMEANPKVGITIADIPTFFRY-NRIVVAWNEVLISPEM-NKLWQRMGATKTIDFK 455 Query: 290 KLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KK 344 L+ F GT W + +AL P+ +L L P I HA+ER +K Sbjct: 456 NLNTFVMSYGTFVWFKYDALKPLFDLNLKAADVP-AEPLPQNSILHAIERLLVYIAWDQK 514 Query: 345 ANFRIS 350 +FRIS Sbjct: 515 YDFRIS 520 >gi|326772082|ref|ZP_08231367.1| rhamnan synthesis protein F [Actinomyces viscosus C505] gi|326638215|gb|EGE39116.1| rhamnan synthesis protein F [Actinomyces viscosus C505] Length = 652 Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 67/239 (28%), Positives = 112/239 (46%), Gaps = 12/239 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFP-AARIHIME 174 K+A++ HLYY+DL ++ D +T+ + E + E K P + ++E Sbjct: 335 KVALIAHLYYMDLLEPTLAYARSMPEGTDFILTVGSQEKVELVEEACKDLPYNVTVRLIE 394 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDV L+ + + +S+YD VC IH KK ++ Y+ EG + R F +LL Sbjct: 395 NRGRDVSALLVGCK-DIVSDYDLVCFIHDKKVTQLSPYTVGEG--FARKCFDNLLPTREF 451 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN---REMICTLAGRMGITFQDQK 290 V +I TFD+ +G++ + + + Y+ S G N +M+ + + K Sbjct: 452 VENVISTFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNLSVPLDAHK 511 Query: 291 LDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + A GTMFW R AL P+ + P +DG I HA+ER + + + + Sbjct: 512 -EVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGYVAQASGY 569 >gi|55821450|ref|YP_139892.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] gi|55737435|gb|AAV61077.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] Length = 594 Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 73/231 (31%), Positives = 106/231 (45%), Gaps = 18/231 (7%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFPAARIHIME 174 KIA+ +H YY+DL + N ++DL +T +E A I+S + K ARI I Sbjct: 288 KIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDSEDKKAEIQSILDKNGKVARIFITG 347 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRDV+P L L ++LS YDY+ H KKS Y +W GD WR LF L+ Sbjct: 348 NRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPE--YPYWVGDSWRNELFSMLIQPAD-- 401 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTLAGRM--GITFQDQK 290 II + +G++ + + +Y +NR E + L RM G K Sbjct: 402 -NIIANLERDDRLGLVIADIPSF-FRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFDK 459 Query: 291 LDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++ F GT W + +AL P+ +L L E I H++ER Sbjct: 460 MNTFIMSYGTFIWFKYDALKPLFDLDLQDE-EIPAEPIPQHTILHSIERIL 509 >gi|225868697|ref|YP_002744645.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] gi|225701973|emb|CAW99527.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] Length = 581 Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 81/252 (32%), Positives = 116/252 (46%), Gaps = 35/252 (13%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF----PAARIHI 172 KIA+ +H++Y+DL E S+ +DL +T T+S + K+EI +I +A I + Sbjct: 285 KIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLIT--TDSKAKKAEIKEILRESGASADILV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L E+LS YDY+ H KKSK + W G WR L D++ P Sbjct: 343 TGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEADF--WAGQSWRTELI-DMMVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGR----MGIT 285 D IG++ + +R+ NK D N +I + MG+T Sbjct: 398 DQILTALAADA---IGIVIADIPSFFRF-NKIVD-----AWNEHLIAPEMNQLWQAMGLT 448 Query: 286 --FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 Q +D F GT W + +AL P+ +L LS P I HA+ER Sbjct: 449 KRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIP-AEPLPQNSILHAIERLLIY 507 Query: 341 SV--KKANFRIS 350 + +FRIS Sbjct: 508 IAWDRHYDFRIS 519 >gi|225870347|ref|YP_002746294.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] gi|225699751|emb|CAW93520.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] Length = 581 Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 81/252 (32%), Positives = 116/252 (46%), Gaps = 35/252 (13%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF----PAARIHI 172 KIA+ +H++Y+DL E S+ +DL +T T+S + K+EI +I +A I + Sbjct: 285 KIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLIT--TDSKAKKAEIKEILRESGASADILV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L E+LS YDY+ H KKSK + W G WR L D++ P Sbjct: 343 TGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEADF--WAGQSWRTELI-DMMVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGR----MGIT 285 D IG++ + +R+ NK D N +I + MG+T Sbjct: 398 DQILTALAADA---IGIVIADIPSFFRF-NKIVD-----AWNEHLIAPEMNQLWQAMGLT 448 Query: 286 --FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 Q +D F GT W + +AL P+ +L LS P I HA+ER Sbjct: 449 KRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIP-AEPLSQNSILHAIERLLIY 507 Query: 341 SV--KKANFRIS 350 + +FRIS Sbjct: 508 IAWDRHYDFRIS 519 >gi|306833804|ref|ZP_07466929.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] gi|304423998|gb|EFM27139.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] Length = 586 Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 29/248 (11%) Query: 108 KVMQIAIKA-----KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSE 160 KV+++ K K+A+ +H +Y+DL E N N +DL +T TE+ A I+S Sbjct: 274 KVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESI 333 Query: 161 ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + K A++ + N GRD++P L L E+LS+YDY+ H KKS Y +W GD WR Sbjct: 334 LEKNGKTAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE--YPYWVGDSWR 389 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTL 278 L+ L+ + ++ + + ++G++ + + +Y +NR + + L Sbjct: 390 NELYQMLIQSAD---NVLANLENNDNLGLVIADIPSF-FRYTKIVDPWNENRFADGMNEL 445 Query: 279 AGRMGITFQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 RM + Q +DF GT W + + L P+ +L L+ P I Sbjct: 446 WERMNLGRQ---IDFNNLSTFIMSYGTFIWFKHDTLKPLFDLELTDDEIPS-EPIPQHTI 501 Query: 331 EHAVERCF 338 H++ER Sbjct: 502 LHSIERIL 509 >gi|288905572|ref|YP_003430794.1| polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] gi|288732298|emb|CBI13867.1| Putative polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] Length = 586 Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 71/250 (28%), Positives = 116/250 (46%), Gaps = 33/250 (13%) Query: 108 KVMQIAIKA-----KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSE 160 KV+++ K K+A+ +H +Y+DL E N N +DL +T TE+ A I+S Sbjct: 274 KVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDTEAKKAEIESI 333 Query: 161 ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + K A++ + N GRD++P L L E+LS+YDY+ H KKS Y +W GD WR Sbjct: 334 LEKNGKIAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE--YPYWVGDSWR 389 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA---YRYP--------NKYCDYTCSLG 269 L+ L+ + I+ + + ++G++ + +RY N++ D G Sbjct: 390 NELYQMLIQSAD---NILANLENNDNLGLVIADIPSFFRYTKIVDPWNENRFAD-----G 441 Query: 270 KNREMICTLAGRMGITFQDQKLDFFA-GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDG 328 N C R I F + + GT W + + L P+ +L L+ P Sbjct: 442 MNELWECMNLERQ-IDFNNLSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPS-EPIPQH 499 Query: 329 EIEHAVERCF 338 I H++ER Sbjct: 500 TILHSIERIL 509 >gi|23009067|ref|ZP_00050256.1| COG3754: Lipopolysaccharide biosynthesis protein [Magnetospirillum magnetotacticum MS-1] Length = 486 Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 66/227 (29%), Positives = 104/227 (45%), Gaps = 19/227 (8%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIH--- 171 A+I + H+++ DL + +N I FD V + T SAS I K F H Sbjct: 272 NARIGVFAHIFHTDLCEYVLKYTNN--IPFDTTVYVTTSSASKADFIRKTFGRLSKHRYE 329 Query: 172 --IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 I N GRD+ P L+ N DY +H KKS + D WR +LF LG Sbjct: 330 IVIAPNRGRDIAPMLVGYRN-AFQNCDYAVHVHTKKSLHYSSGF---DAWRDYLFEMNLG 385 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + ++ I+ + +IG + Y K + G N + I L G++ + Sbjct: 386 SAELITGIVNVL-SRSNIGAVAPDHYAPIAKLIQW----GGNIDAINGLLSFTGLSVASE 440 Query: 290 K-LDFFAGTMFWVRTEALDPIKNLRLSRY-FEPKVHKALDGEIEHAV 334 LDF +G+MFW + +AL + + L Y F+P++ + +DG + HA+ Sbjct: 441 NVLDFPSGSMFWFKPDALSKLMEIHLQSYHFDPELGQ-VDGTLAHAI 486 >gi|160936495|ref|ZP_02083863.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] gi|158440580|gb|EDP18318.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] Length = 674 Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 65/230 (28%), Positives = 104/230 (45%), Gaps = 15/230 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAAR-----IH 171 KIA+V HL+Y DL E L N+ + DL++T+ + K ++ F + R + Sbjct: 292 KIAVVAHLFYPDLMDETLRYLQNIQENIDLYITVA--NIETKYKVYNYFESIRRSNVKVL 349 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N GRD L+ E L Y+Y+C +H KK+ R G G + + + L + Sbjct: 350 LSGNRGRDAGSLLVACR-EYLMQYEYLCFVHDKKTTRGGGPVTVGKAFMYHAWENTLRSG 408 Query: 232 GVVFKIIRTFDTHRDIGMIGSR-----AYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 G V II+ F+ + +G++ Y ++TC K +E+ L+ + + Sbjct: 409 GFVSSIIKLFEKNDRLGILTPPVPALGGYLTELVGNEWTCCYQKTKELAEILS--LKVPM 466 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 QK F T FW R AL P+ P+ A DG + HA+ER Sbjct: 467 SPQKQPFALATAFWCRPAALKPLFEYPWRYEDFPEEPLASDGTLNHAIER 516 >gi|195977971|ref|YP_002123215.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195974676|gb|ACG62202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 581 Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 82/262 (31%), Positives = 119/262 (45%), Gaps = 35/262 (13%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF- 165 + + IA KIA+ +H++Y DL E S+ +DL +T T+S + K+EI +I Sbjct: 275 NQPISIAPSKKIAVHLHVFYADLLSEFLEAFSHFHFDYDLLIT--TDSKAKKAEIKEILR 332 Query: 166 ---PAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 +A I + N GRDVLP L L E+LS YDY+ H KKSK + W G WR Sbjct: 333 ESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEADF--WAGQSWRTE 388 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLA 279 L D++ P D IG++ + +R+ NK D N +I Sbjct: 389 LI-DMMVKPADQILTALAADA---IGIVIADIPSFFRF-NKIVD-----AWNEHLIAPEM 438 Query: 280 GR----MGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 + MG+T Q +D F GT W + +AL P+ +L L+ P I Sbjct: 439 NQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLGLNEADIP-AEPLPQNSI 497 Query: 331 EHAVERCFSLSV--KKANFRIS 350 HA+ER + +FRIS Sbjct: 498 LHAIERLLIYIAWDRHYDFRIS 519 >gi|281490695|ref|YP_003352675.1| bifunctional alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] gi|281374464|gb|ADA63985.1| Alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] Length = 589 Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 74/269 (27%), Positives = 123/269 (45%), Gaps = 21/269 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT- 152 +P L S + + + + +A+ +H+YY +L E + N S +DL++T T Sbjct: 265 YPDLNFLLPSKMITSLSKTVLHQTVAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTD 324 Query: 153 ESASIKSEILKIFPA-ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 E I E+LK A A++ NHGRD++PFL L E+L YD V H K+S + Sbjct: 325 EKEEIIKEMLKCKDARAKLVRTPNHGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAF 382 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSL 268 + G+ WR L L+ I+ F+ + +G++ + +R+ NK + Sbjct: 383 --FAGESWRTELISMLIEPAD---NIMAHFEQKQKLGIVIADIPSFFRF-NKIVNADNEN 436 Query: 269 GKNREMICTLAGRMGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVH 323 + ++ + RM + + DF GT FW +TE L+P+ NL + P Sbjct: 437 KQIAPIMNDIWKRMKMNKKVNFHDFNTFTMSYGTFFWAKTEVLEPLFNLEIMDREIPN-E 495 Query: 324 KALDGEIEHAVERCFSLSV--KKANFRIS 350 I HA+ER K+ +F+IS Sbjct: 496 PLPQNTILHAIERVLIYLAWDKEMDFKIS 524 >gi|308235695|ref|ZP_07666432.1| hypothetical protein GvagA14_05663 [Gardnerella vaginalis ATCC 14018] gi|311114292|ref|YP_003985513.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] gi|310945786|gb|ADP38490.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] Length = 637 Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 66/230 (28%), Positives = 114/230 (49%), Gaps = 8/230 (3%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFP-AARIHI 172 K K+A+ +HLYY+DL + + + ++ D+ +T+ + E+ I + ++ P + + Sbjct: 313 KPKVALCMHLYYMDLLDKSLHYIQSMPQGCDVILTVGSKENQQIVKQRVEHLPYDVDVRL 372 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 +EN GRDV FL+ + L YDYVC H KK + GD + F ++L + Sbjct: 373 IENRGRDVSAFLVGGGAD-LMKYDYVCFAHDKKVTQLSPRSI-GDGFAYKCFENILASKE 430 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT-LAGRMGITFQ-DQK 290 V +I F+TH +GM + + + +T + G N E L +GI+ D+ Sbjct: 431 YVQNVINLFETHPRLGMAMPTPPNHADYFPGFTYTWGPNFEGTKKFLEKTLGISVPLDEN 490 Query: 291 LDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 D A GTMFW RT+A+ + + + + P +DG + H +ER + Sbjct: 491 KDAIAPLGTMFWFRTKAMRGLLDRKWTYEDFPAEPLKIDGTLLHFIERAY 540 >gi|83582737|ref|YP_425043.1| glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] gi|83578053|gb|ABC24603.1| Glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] Length = 1236 Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 76/243 (31%), Positives = 114/243 (46%), Gaps = 27/243 (11%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSI---SFDLHVTLVTESASI----KSEILKIFPAA 168 K+ + H YY+DL I + L + I S DL +T E ++ K E K + Sbjct: 986 GKVLLHGHFYYVDL---IDDFLKKIIINDFSCDLIITTTDEDRAVFLRKKLEEYKNG-SV 1041 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + ++ N GRDV F L + S+YD V IHGKKS S G+ WR +L+ L+ Sbjct: 1042 EVRVVPNIGRDVGAFFTGLSDLKNSDYDVVGHIHGKKSIH--LSDGTGNKWRNFLWEHLI 1099 Query: 229 GAPGVVFKI-IRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG--KNREMICTLAGRMGIT 285 G I + + DIG++ + + G KN+E+ LA +MGI Sbjct: 1100 GGEKKAAAIAVSALIRNPDIGLV----------FAEEPFLFGWDKNKELANDLAKKMGIE 1149 Query: 286 FQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 + D+ GTMFW + +AL+PI +L L P + G + HA+ER +V+K Sbjct: 1150 KSLPRFFDWPIGTMFWAKRKALEPIFDLNLRWEDYPPEPIPVYGTMLHALERLLPFAVEK 1209 Query: 345 ANF 347 A F Sbjct: 1210 AGF 1212 >gi|222152862|ref|YP_002562039.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] gi|222113675|emb|CAR41606.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] Length = 585 Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 109/234 (46%), Gaps = 27/234 (11%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHIM 173 IA+ +H++Y+DL E + ++ FDL++T T+ + +SEI I +A+I + Sbjct: 288 IAVHLHVFYVDLLEEFLHAFTSFKFPFDLYIT--TDKSEKESEIKAILDSFRVSAKIVVT 345 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDVLP L L ++LS YDY+ H KKSK + W G+ WR L D+L P Sbjct: 346 GNIGRDVLPMLKL--KDELSQYDYIGHFHTKKSKEADF--WAGESWRNELI-DMLIKPAN 400 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR------MGITFQ 287 II F+ IG+I + P+ + N +I + + T Sbjct: 401 T--IINQFEDPA-IGIIIADI---PSFFRFNKIVTPLNEHLIAPEMNKLWEKMNLSKTID 454 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++ D F GT W + +AL P+ +L L PK I HAVER Sbjct: 455 FEQFDTFVMSYGTFVWFKYDALKPLFDLNLKDGDVPK-EPLPQNSILHAVERLL 507 >gi|289678438|ref|ZP_06499328.1| glycosyl transferase, group 1 [Pseudomonas syringae pv. syringae FF5] Length = 774 Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 61/245 (24%), Positives = 107/245 (43%), Gaps = 21/245 (8%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----- 167 A + +A+ +H++Y D + ++ L+N D+ +TL A + + + +F Sbjct: 262 AARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLA--DAKHQKKTIAVFSKHPRVK 319 Query: 168 -ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 ++ + N GR+ P L+ ++ L YD C +H KKS YS E W +L Sbjct: 320 NLKVRCVPNRGRNFGPLLVEF-SKDLMAYDLFCHLHSKKSL---YSGREQTQWADYLTEY 375 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMG 283 LL ++ +++ F H+D+G+ YP + + + N+ + Sbjct: 376 LLRDANIITRLLNAFADHKDLGLY------YPTTFWMMPSWVNHVTMNKSFMAAWHNEWQ 429 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 I D L + AG MFW R EAL + F P+ DG + HA+ER L + Sbjct: 430 IDPCDGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAE 489 Query: 344 KANFR 348 K ++ Sbjct: 490 KNGYK 494 >gi|330899783|gb|EGH31202.1| hypothetical protein PSYJA_20361 [Pseudomonas syringae pv. japonica str. M301072PT] Length = 626 Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 61/245 (24%), Positives = 107/245 (43%), Gaps = 21/245 (8%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----- 167 A + +A+ +H++Y D + ++ L+N D+ +TL A + + + +F Sbjct: 114 AARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLA--DAKHQKKTIAVFSKHPRVK 171 Query: 168 -ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 ++ + N GR+ P L+ ++ L YD C +H KKS YS E W +L Sbjct: 172 NLKVRCVPNRGRNFGPLLVEF-SKDLMAYDLFCHLHSKKSL---YSGREQTQWADYLTEY 227 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMG 283 LL ++ +++ F H+D+G+ YP + + + N+ + Sbjct: 228 LLRDANIITRLLNAFADHKDLGLY------YPTTFWMMPSWVNHVTMNKSFMAAWHNEWQ 281 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 I D L + AG MFW R EAL + F P+ DG + HA+ER L + Sbjct: 282 IAPCDGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSMLHALERVIGLLAE 341 Query: 344 KANFR 348 K ++ Sbjct: 342 KNGYK 346 >gi|171779906|ref|ZP_02920810.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281254|gb|EDT46689.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 592 Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 76/240 (31%), Positives = 116/240 (48%), Gaps = 39/240 (16%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 KIA+ +H++Y+DL + + N +DL +T T++A+ K EI I + A+I + Sbjct: 291 KIAVHLHVFYVDLLEDFLDAFENFHFVYDLFIT--TDNATKKQEIESILRSNGKDAQIFV 348 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L + LS+YDY+ H KKSK + W G+ WR L D+L P Sbjct: 349 TGNVGRDVLPMLKL--KDYLSDYDYIGHFHTKKSKEADF--WAGESWRNELI-DMLIKPA 403 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICT----LAGRMGIT 285 I+ FD + +G++ + +R+ NK D N +I L +MG+T Sbjct: 404 D--NILANFDNDK-LGIVIADIPTFFRF-NKIVD-----AWNEHLIAPAMNDLWQQMGMT 454 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 + +DF GT W + +AL P+ +L L+ P I HA+ER Sbjct: 455 ---KAIDFNNFHNFVMSYGTYVWFKYDALKPLFDLGLTDEDVP-AEPLPQNSILHAIERL 510 >gi|312133752|ref|YP_004001091.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773032|gb|ADQ02520.1| Hypothetical protein BBMN68_1493 [Bifidobacterium longum subsp. longum BBMN68] Length = 651 Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 74/236 (31%), Positives = 114/236 (48%), Gaps = 11/236 (4%) Query: 118 IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILKIFP-AARIHIMEN 175 +A+V HLYYIDL +S++ D+ +T+ +E A I E + P + ++EN Sbjct: 311 VALVFHLYYIDLLDSSLQYISSMPEGCDVIITVGSEEKACIVKERCEGMPYNIDVRVIEN 370 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 GRDV L+ + L NYD VC H KK ++ K S +G ++ F ++L + V Sbjct: 371 RGRDVSALLVGAGKDVL-NYDLVCFAHDKKVTQIKPLSVGDGFAYK--CFENILASKAYV 427 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ-DQKLDF 293 II F+ +G++ + N + +T S G N + L + T D+K + Sbjct: 428 ANIIDQFEREPHLGVLMPNPPEHGNYFPVFTLSWGDNFDGTVQLLRDIHKTVPLDKKKEV 487 Query: 294 FA--GTMFWVRTEAL-DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 A GTMFW R +AL D + N PK +DG I H +ER + V +AN Sbjct: 488 IAPLGTMFWFRPKALSDGLLNHNWQYSDFPKEPNKIDGTILHYIERAYCY-VAQAN 542 >gi|50913971|ref|YP_059943.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] gi|50903045|gb|AAT86760.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] Length = 581 Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 117/234 (50%), Gaps = 25/234 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD- 288 I+ F+T+ DIG+I + +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETN-DIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQID 452 Query: 289 -QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|21910063|ref|NP_664331.1| hypothetical protein SpyM3_0527 [Streptococcus pyogenes MGAS315] gi|28896239|ref|NP_802589.1| hypothetical protein SPs1327 [Streptococcus pyogenes SSI-1] gi|21904254|gb|AAM79134.1| putative protein [Streptococcus pyogenes MGAS315] gi|28811490|dbj|BAC64422.1| conserved hypothetical protein [Streptococcus pyogenes SSI-1] Length = 581 Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 117/234 (50%), Gaps = 25/234 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD- 288 I+ F+T+ DIG+I + +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETN-DIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQID 452 Query: 289 -QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|262282406|ref|ZP_06060174.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] gi|262261697|gb|EEY80395.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] Length = 582 Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 74/249 (29%), Positives = 114/249 (45%), Gaps = 28/249 (11%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 KIA+ +H++Y+DL E + + S+DL +T T+S K+EIL I A + + Sbjct: 284 KIAVHLHVFYVDLLAEFLHAFESFHFSYDLFIT--TDSEKKKNEILDILEGKQAKAEVLV 341 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YDY+ H KKSK Y W G+ WR+ L ++L P Sbjct: 342 TGNVGRDVLPMLKL--KRHLSQYDYIGHFHTKKSKEADY--WAGESWRKELI-NMLVHPA 396 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK-L 291 +I+ +G++ + P+ + + N +I ++ QK + Sbjct: 397 D--QIVSQLGQDDRLGLVIADI---PSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEV 451 Query: 292 DFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV- 342 DF GT W + +AL P+ +L L+ P I HA+ER Sbjct: 452 DFKQMNTFVMSYGTFVWFKYDALSPLFDLNLTEEDVPS-EPLPQNSILHAIERLLVYIAW 510 Query: 343 -KKANFRIS 350 K+ +F+IS Sbjct: 511 DKQYDFKIS 519 >gi|94988294|ref|YP_596395.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94992170|ref|YP_600269.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] gi|94541802|gb|ABF31851.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94545678|gb|ABF35725.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] Length = 581 Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 117/234 (50%), Gaps = 25/234 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD- 288 I+ F+T+ DIG+I + +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETN-DIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQID 452 Query: 289 -QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|94990172|ref|YP_598272.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] gi|94543680|gb|ABF33728.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] Length = 581 Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 73/234 (31%), Positives = 117/234 (50%), Gaps = 25/234 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD- 288 I+ F+T+ DIG+I + +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETN-DIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQID 452 Query: 289 -QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|322373386|ref|ZP_08047922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] gi|321278428|gb|EFX55497.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] Length = 594 Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 73/233 (31%), Positives = 104/233 (44%), Gaps = 22/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 KIA+ +H YY+DL + N ++DL +T T+S K EI I ARI I Sbjct: 288 KIAVHLHTYYVDLLDDFLRQFENFHFTYDLFLT--TDSEEKKKEIQSILDKHGKEARIFI 345 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDV+P L L ++LS YDY+ H KKS Y +W GD WR LF L+ Sbjct: 346 TGNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPE--YPYWVGDSWRNELFSMLIQPAD 401 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTLAGRM--GITFQD 288 II + +G++ + + +Y +NR E + L RM G Sbjct: 402 ---NIIANLEHDDRLGLVIADIPTF-FRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDF 457 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 K++ F GT W + + L P+ +L L E I H++ER Sbjct: 458 DKMNTFIMSYGTFIWFKYDTLKPLFDLDLQDE-EIPAEPIPQHTILHSIERIL 509 >gi|312867647|ref|ZP_07727853.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] gi|311096710|gb|EFQ54948.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] Length = 582 Score = 79.7 bits (195), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 77/254 (30%), Positives = 117/254 (46%), Gaps = 26/254 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E + +DL +T T+ K I KI A + + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFQAFHFVYDLWIT--TDVEEKKQTIEKILSNRAQDATVVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L ++L P Sbjct: 343 TGNIGRDVLPMLLL--KEQLSQYDYVGHFHTKKSKEADF--WAGESWRKELI-EMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQ 287 +I+ + + +G+ + +RY + +L + L RMG T Sbjct: 398 D--QILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPE--MNKLWERMGAAKTID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FKNLNTFVMSYGTFVWFKYDALKPLFDLNLTAANVP-AEPLPQNSILHAIERLLIYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|330882679|gb|EGH16828.1| hypothetical protein Pgy4_27710 [Pseudomonas syringae pv. glycinea str. race 4] Length = 608 Score = 79.3 bits (194), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 62/243 (25%), Positives = 108/243 (44%), Gaps = 17/243 (6%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS-EILKIFPAAR-- 169 A + +AI +H++Y D + ++ L+N I+ D+ +TL + K+ P + Sbjct: 96 AARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVKNL 155 Query: 170 -IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + + N GR+ P L+ ++ L YD C +H KKS YS E W +L LL Sbjct: 156 KVSCVPNRGRNFGPLLVEF-SKDLMAYDLFCHLHSKKSL---YSGREQTQWADYLTEYLL 211 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMGIT 285 ++ +++ F H+D+G+ YP + + + N+ + I Sbjct: 212 RDANIITRLLNAFVDHKDLGLY------YPTTFWMMPSWVNHVTMNKAFMNAWHNEWQID 265 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + L + AG MFW R EAL + + F P+ DG + HA+ER L +K Sbjct: 266 PCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKN 325 Query: 346 NFR 348 ++ Sbjct: 326 GYK 328 >gi|320330331|gb|EFW86314.1| hypothetical protein PsgRace4_09215 [Pseudomonas syringae pv. glycinea str. race 4] Length = 774 Score = 79.3 bits (194), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 62/243 (25%), Positives = 108/243 (44%), Gaps = 17/243 (6%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS-EILKIFPAAR-- 169 A + +AI +H++Y D + ++ L+N I+ D+ +TL + K+ P + Sbjct: 262 AARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVKNL 321 Query: 170 -IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + + N GR+ P L+ ++ L YD C +H KKS YS E W +L LL Sbjct: 322 KVSCVPNRGRNFGPLLVEF-SKDLMAYDLFCHLHSKKSL---YSGREQTQWADYLTEYLL 377 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMGIT 285 ++ +++ F H+D+G+ YP + + + N+ + I Sbjct: 378 RDANIITRLLNAFVDHKDLGLY------YPTTFWMMPSWVNHVTMNKAFMNAWHNEWQID 431 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + L + AG MFW R EAL + + F P+ DG + HA+ER L +K Sbjct: 432 PCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKN 491 Query: 346 NFR 348 ++ Sbjct: 492 GYK 494 >gi|322516362|ref|ZP_08069287.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] gi|322125095|gb|EFX96488.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] Length = 581 Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 77/257 (29%), Positives = 118/257 (45%), Gaps = 32/257 (12%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 K+A+ +H++Y+DL E + +DL +T T+ K I KI A + + Sbjct: 285 KVAVHLHVFYVDLLEEFLDAFQAFHFIYDLWIT--TDVEEKKQAIEKILSNRVQDATVVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L ++L P Sbjct: 343 TGNIGRDVLPMLLL--KEQLSRYDYVGHFHTKKSKEADF--WAGESWRKELI-EMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + I+ + + +G+ + +RY + +L + L RMG T + Sbjct: 398 DL--ILANMEANPKVGITIADIPTFFRYNRIVVAWNEALISPE--MNKLWQRMGAT---K 450 Query: 290 KLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLS 341 +DF + GT W + +AL P+ +L L+ P I HA+ER Sbjct: 451 TIDFKSLNTFVMSYGTFVWFKYDALKPLFDLNLTAADVP-AEPLPQNSILHAIERLLIYI 509 Query: 342 V--KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 510 AWDQKYDFRISQNPHVL 526 >gi|13476280|ref|NP_107850.1| hypothetical protein mlr7559 [Mesorhizobium loti MAFF303099] gi|14027041|dbj|BAB53995.1| mlr7559 [Mesorhizobium loti MAFF303099] Length = 644 Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 70/257 (27%), Positives = 112/257 (43%), Gaps = 32/257 (12%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI---LKIFPAARIHI 172 KIA+ H+YY D+ EI L N+ + +D T T + K+EI L P + I Sbjct: 297 GKIAVCAHIYYTDMLDEILGLTGNIPVPYDFIAT--TNTPEKKAEIETALANRPGVKNVI 354 Query: 173 M----ENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 + +N GRD+ I L + + YD VC++H KKS + S G+L++R + +L Sbjct: 355 VRVVEQNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQSSM--GNLFKRHMVDNL 412 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYR--YPNKYCDYTCSLGKNREMICTLAGRMGIT 285 L + G V ++ F + +G+ + YP + + K E L + + Sbjct: 413 LNSRGYVHNVLDMFHDNPSVGLAIPPIFHISYPTMGFSWFANKPKVEETARLL--NINVK 470 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPK--------VHKALDGEIEHAVERC 337 F + GTMFW R A L + FE K +DG HA+ER Sbjct: 471 FDENTPVAAYGTMFWFRPRA--------LRKMFEHKWKWEEFNAEPDHVDGGFAHALERL 522 Query: 338 FSLSVKKANFRISDVDC 354 + +V+ A + + C Sbjct: 523 IAYAVQNAGYTTQHIMC 539 >gi|320531350|ref|ZP_08032322.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] gi|320136441|gb|EFW28417.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] Length = 626 Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 64/247 (25%), Positives = 116/247 (46%), Gaps = 12/247 (4%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFP- 166 V++ + K+A++ HLY++DL ++ DL +T+ + E A + + P Sbjct: 301 VVREPKQQKVALIAHLYFMDLLDSTLAYARSMPEGTDLILTVGSQEKAELVERACQDLPY 360 Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFY 225 + ++EN GRDV L+ + + + +YD VC +H KK ++ Y+ EG + R F Sbjct: 361 NVDVRVIENRGRDVSALLVGCK-DIVDDYDLVCFMHDKKVTQLSPYTVGEG--FARKCFD 417 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN---REMICTLAGRM 282 +LL V ++ TFD+ +G++ + + + Y+ S G N +M+ + Sbjct: 418 NLLPTREFVENVVATFDSEPRLGLLSPTPPNHADYFPIYSYSWGPNFDRTKMLLEKELNL 477 Query: 283 GITFQDQKLDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 + K + A GTMFW R AL P+ + P +DG I HA+ER + Sbjct: 478 NVPLDAHK-EVIAPLGTMFWFRPAALKPLFDHDWQWEDFPPEPNDIDGTILHAIERAYGY 536 Query: 341 SVKKANF 347 + + + Sbjct: 537 VAQASGY 543 >gi|157151529|ref|YP_001450315.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] gi|157076323|gb|ABV11006.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] Length = 582 Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 73/249 (29%), Positives = 114/249 (45%), Gaps = 28/249 (11%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 KIA+ +H++Y+DL E + + S+DL +T T+S K+EIL I A + + Sbjct: 284 KIAVHLHVFYVDLLAEFLHAFESFHFSYDLFIT--TDSEKKKNEILGILEGKQAKAEVFV 341 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L LS YDY+ H KKSK Y W G+ WR+ L ++L P Sbjct: 342 TGNVGRDVLPMLKL--KRHLSQYDYIGHFHTKKSKEADY--WAGESWRKELI-NMLVHPA 396 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK-L 291 +I+ +G++ + P+ + + N +I ++ QK + Sbjct: 397 D--QIVSQLGQDDCLGLVIADI---PSFFRFNRIVVAWNEALISPEMNKLWERMNCQKEV 451 Query: 292 DFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV- 342 DF GT W + +AL P+ +L ++ P I HA+ER Sbjct: 452 DFKQMNTFVMSYGTFVWFKYDALSPLFDLNMTEEDVPS-EPLPQNSILHAIERLLVYIAW 510 Query: 343 -KKANFRIS 350 K+ +F+IS Sbjct: 511 DKQYDFKIS 519 >gi|15672189|ref|NP_266363.1| polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] gi|12723062|gb|AAK04305.1|AE006258_8 polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] Length = 589 Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 73/269 (27%), Positives = 121/269 (44%), Gaps = 21/269 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT- 152 +P L S + + + + +A+ +H+YY +L E + N S +DL++T T Sbjct: 265 YPDLNFLLPSKMITPLSKTVLHQTVAVHLHVYYPELLEEFLDAFKNFSFDYDLYLTTNTD 324 Query: 153 ESASIKSEILKIFPA-ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 E I E+LK A A++ NHGRD++PFL L E+L YD V H K+S + Sbjct: 325 EKEEIIKEMLKCKDAKAKLVRTPNHGRDIVPFLAL--KEELKKYDIVGHFHTKRSLEAAF 382 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSL 268 + G+ WR L L+ I+ F+ + +G++ + +R+ NK + Sbjct: 383 --FAGESWRTELISMLIEPAD---NIMAHFEQKQKLGIVIADIPSFFRF-NKIVNADNEN 436 Query: 269 GKNREMICTLAGRMGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVH 323 + ++ + RM + + DF GT FW + E L+P+ NL + P Sbjct: 437 KQIAPIMNDIWKRMKMNKKVNFHDFNTFTMSYGTFFWAKIEVLEPLFNLEIMDREIPN-E 495 Query: 324 KALDGEIEHAVERCFSLSV--KKANFRIS 350 I HA+ER K+ +F IS Sbjct: 496 PLPQNTILHAIERVLIYLAWDKEMDFNIS 524 >gi|163853098|ref|YP_001641141.1| lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] gi|163664703|gb|ABY32070.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] Length = 916 Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 65/230 (28%), Positives = 107/230 (46%), Gaps = 21/230 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP-----AARIH 171 K A +VH +Y +L EI L + D++V+ T+ + +I+ + + Sbjct: 250 KAAAIVHGFYPELMEEILIYLGKSNFPIDIYVS--TDDSKKAEQIISMGKKYHNGQLDVR 307 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 I+ N GRD+ P L ++ NY+ IH KKS G WR +LF +L+G+ Sbjct: 308 IISNRGRDIGPMLTGF-SDVFDNYEAFLHIHTKKSPHGGDGLSS---WRDYLFKNLIGSA 363 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL--GKNREMICTLAGRMGITF-QD 288 ++ + T R++G + YP L G N + + +L R+G+ +D Sbjct: 364 EIIDSNLHILGT-RNVGFV------YPQHLYALRGILNWGYNFDTVSSLLRRVGVRLSKD 416 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 L+F +G+MFW RT AL + +L L +DG + HA+ER F Sbjct: 417 MVLEFPSGSMFWARTAALHGLLSLDLKLEDFDNEAGQVDGTLGHAIERSF 466 >gi|325276923|ref|ZP_08142610.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] gi|324097938|gb|EGB96097.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] Length = 758 Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 71/261 (27%), Positives = 109/261 (41%), Gaps = 17/261 (6%) Query: 95 PQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES 154 P+ +E ++ A KIAI +H+YY D A L DL +T+ ES Sbjct: 228 PKWTPKFEINPASELTPKASNQKIAICLHIYYDDYIERFAEALYTFPTEVDLLITIANES 287 Query: 155 ASIKS--EILKIFPAARIHI--MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 ++ KI ++ I + N GR+ P L+ E L+ YD +C +H KKS Sbjct: 288 FRDRAYQTFSKIQAVKKVTIKSVPNRGRNFGPLLVEFAQELLT-YDLLCHLHSKKSL--- 343 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCS 267 YS E W +L LL VV +++ F + G+ YP + + Sbjct: 344 YSGREQTQWADYLSEYLLNDCSVVKRVLNAFSDNPQFGVY------YPTTFWMMPSWVNH 397 Query: 268 LGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALD 327 + N+ + L +G D L + AG MFW R +AL I N + P D Sbjct: 398 VTMNKPHMRNLQTALGFGHFDDFLSYPAGGMFWARPKALVDILNKTYTYDDFPNEPLPND 457 Query: 328 GEIEHAVERCFSLSVKKANFR 348 G + HA+ER +K ++ Sbjct: 458 GSMLHALERVIGPVCEKNGYQ 478 >gi|331086190|ref|ZP_08335272.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406349|gb|EGG85863.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 592 Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 64/239 (26%), Positives = 112/239 (46%), Gaps = 12/239 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF---PAAR--IH 171 KIA+V+HLY+ DL E + +S + D+++T TE K+ I K+F P + + Sbjct: 257 KIALVMHLYFEDLLEESYHYVSAMPEKADIYLTTDTEKK--KAAIEKVFAKLPCNKLEVR 314 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 +++N GRDV L+ ++ + + +YD VC H KK+ + G + F + L Sbjct: 315 VIKNRGRDVSSLLVGVK-DVIMDYDLVCFAHDKKTAQVKPGTI-GASFAYKCFENTLSNK 372 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 V +I TF + +G++ + + G N + LA ++G+T Sbjct: 373 AYVGNVINTFVNNPRMGLLCPPEPNHSTFFTTIGFEWGPNFNITRDLAKKLGLTVPISVA 432 Query: 292 DFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 GTMFW R +A+ P+ N P +DG + HA+ER + V+++ + Sbjct: 433 SPPVAPLGTMFWFRPKAMKPLYNKDWKYEDFPAEPNKIDGTLLHAIERIYPFIVQESGY 491 >gi|329116186|ref|ZP_08244903.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] gi|326906591|gb|EGE53505.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] Length = 589 Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 29/240 (12%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIF----PAA 168 I K+AI +H +Y+DL E + N +DL +T T+ K++I + A Sbjct: 287 VINKKVAIHLHTFYVDLLQEFLSAFENFHFDYDLFIT--TDIEEKKTQIENVLNENNQKA 344 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + + N GRDVLP L+L E+LS YDY+ H KKSK + W G+ WR+ L +L Sbjct: 345 EVFVTGNIGRDVLPMLLL--KEKLSVYDYIGHFHTKKSKEADF--WAGESWRKELI-KML 399 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN--REMICTLAGRMGITF 286 P I+ T + ++ +G++ + Y +Y + +N + L +MG+T Sbjct: 400 VLPAD--SILATLEKNK-VGIVIADMPTY-FRYNKIVTAWNENLIAPEMNELWKKMGLT- 454 Query: 287 QDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 + +DF GT W + +AL P+ +L L+ P I HA+ER Sbjct: 455 --KSIDFNHLHTFVMSYGTFVWFKYDALKPLFDLNLTVEDVP-AEPLPQNSILHAIERLL 511 >gi|225352528|ref|ZP_03743551.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156722|gb|EEG70116.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 648 Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 67/246 (27%), Positives = 114/246 (46%), Gaps = 13/246 (5%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFP- 166 + I +IA+++HLYY+DL + ++ D T+ + E+A+I E K P Sbjct: 302 IAPIPTNKRIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENATIVRERCKDLPY 361 Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFY 225 + +++N GRDV L+ + L YDYVC H KK ++ YS GD + F Sbjct: 362 NVDVRVIQNRGRDVSALLVGAGKDCLQ-YDYVCFAHDKKVTQLSPYSI--GDGFSYKCFE 418 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNRE-----MICTLAG 280 ++LG+ +V II F+ G++ + + + ++ G N E + TL Sbjct: 419 NVLGSKALVSNIINHFENDPHAGVLAPAPPNHADYFGNFASLWGPNYEGTKKMLEETLQV 478 Query: 281 RMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 ++ + + + GTMFW R +AL ++ P +DG + H VER + Sbjct: 479 KVPLDKSKEPIAPM-GTMFWFRPKALQQFFDIDWKYEDFPPEPNKIDGSMLHFVERAYGY 537 Query: 341 SVKKAN 346 V +AN Sbjct: 538 -VPQAN 542 >gi|312866008|ref|ZP_07726229.1| rhamnan synthesis protein F [Streptococcus downei F0415] gi|311098412|gb|EFQ56635.1| rhamnan synthesis protein F [Streptococcus downei F0415] Length = 584 Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 65/244 (26%), Positives = 114/244 (46%), Gaps = 24/244 (9%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP 166 + ++ ++K+A+ +H++Y+DL E + ++DL +T T+ ++EI I Sbjct: 276 AEAEELPAESKVAVHLHVFYVDLLQEFLDAFKTFHFAYDLFIT--TDKEEKRAEIQAILE 333 Query: 167 A----ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 A+I + N GRDVLP L L +QL YDY+ H KKSK Y W G WR+ Sbjct: 334 QNQVLAQIFVTGNIGRDVLPMLKL--KDQLKGYDYIGHFHTKKSKEADY--WAGQSWRQE 389 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSL--GKNREMICT 277 L L+ +I+ + +G++ + +R+ + +L + E+ Sbjct: 390 LIAMLVKPAN---QILAQMAKNDRLGIVIADMPSFFRFNKIVVAWNENLIAPEMEELWEK 446 Query: 278 LAGRMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 ++ + I F + +D F GT W + +AL P+ +L L+ + P I HA+ Sbjct: 447 MSLKKSIDF--KAMDTFVMSYGTYAWFKYDALSPLFDLDLTDEYVP-AEPLPQNSILHAI 503 Query: 335 ERCF 338 ER Sbjct: 504 ERLL 507 >gi|116511036|ref|YP_808252.1| lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. cremoris SK11] gi|116106690|gb|ABJ71830.1| Lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. cremoris SK11] Length = 588 Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 81/290 (27%), Positives = 133/290 (45%), Gaps = 34/290 (11%) Query: 94 FPQLLHGWESPAMGKVMQIAIKA--KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 +P + E + +++ K KI I +H +Y+DL E N ++DL++T Sbjct: 274 YPDREYMLEEKTLKFTKEVSAKTNLKIGIHLHAFYLDLIPEYLNYFDKYVQNYDLYITTD 333 Query: 152 TESASIKSEILKIFPAARIH---IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 TE EILK +P +I + N GRDVLP++ + +E +++YD H KKSK Sbjct: 334 TEEKY--EEILKNYPLPQIKKVIVTGNKGRDVLPWMQV--SELMTDYDLCGHFHTKKSKD 389 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 W G+ WRR + Y LL +F + F+ + +G+I + + + Y + Sbjct: 390 N--DWIVGESWRRDIEYSLLEPAQAIF---QEFEKNPKLGLIIADVPSFFEHF--YGPTY 442 Query: 269 GKNREMICTLAGR-MGITFQDQK----LDFFA---GTMFWVRTEALDPIKNLRLSRYF-- 318 R++ + I F++ K D + GTM W R +AL+ + N+ + Sbjct: 443 ITERDIWPDMQEIWQKIDFENSKELKQKDSYVMSYGTMIWYRPQALNNLLNVNIQADVPE 502 Query: 319 EPKVHKALDGEIEHAVERCFSLSVKKAN---FRISDVDCILGYRKSLSQN 365 EP + + I HA ER + V AN FRIS + G+ + S N Sbjct: 503 EPLPYNS----ILHAFERLL-VYVSWANGYDFRISQIQTNNGFVANFSAN 547 >gi|154509526|ref|ZP_02045168.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] gi|153799160|gb|EDN81580.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] Length = 620 Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 73/254 (28%), Positives = 113/254 (44%), Gaps = 18/254 (7%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFP 166 + A KI V H++Y D+ EI + LS L + L T E A I++ + Sbjct: 290 TLDAAASLKILAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEARAQERGV 349 Query: 167 AARIHIM-ENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + ++ N GRD+ FL+ S YD V KIH KKS + Y+ + L++ L+ Sbjct: 350 DADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAAQ--LFKEHLY 407 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMICTLAG 280 +LL + V I+ F H +GM+ + P + Y ++G NR A Sbjct: 408 DNLLASSDHVASILAEFAAHPGLGMVIA-----PMPHMGYP-TMGHAWFANRAPARDFAK 461 Query: 281 RMGIT--FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++GIT F D + G+MF R EAL + L P+ DG + H +ER Sbjct: 462 KVGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLL 521 Query: 339 SLSVKKANFRISDV 352 S +V + + V Sbjct: 522 SYAVLSRGYYVRPV 535 >gi|323135560|ref|ZP_08070643.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322398651|gb|EFY01170.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 812 Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 65/241 (26%), Positives = 113/241 (46%), Gaps = 17/241 (7%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFPAAR-- 169 A + IA +VH +Y ++ + L N++ DL + T E ++ + +P R Sbjct: 142 ARERPIAAIVHGFYPEIAPLVLEKLKNVTGPVDLFFSTDTQEKKHALEDVCRDWPKGRVE 201 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 I I N GRD+ + ++YD +H K+S G + WR +LF +LLG Sbjct: 202 IRICPNRGRDIAAKFFGFR-DVYADYDLFIHLHTKRSPHGGAALAR---WRDYLFDNLLG 257 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL--GKNREMICTLAGRMGITF- 286 +P +V I+ FD + IG++ +P + L G + + L RMG+ Sbjct: 258 SPEIVNSILSLFDDPK-IGVV------FPQHLFELRGILNWGYDYDHARALMKRMGVEID 310 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 ++ L+F +G+MFW R+ A P+ +L + P+ +DG + HA+ER + + Sbjct: 311 KNLVLEFPSGSMFWGRSAAFRPLLDLDIDFDDFPQEGGQVDGTLAHAIERSLLMIAESRG 370 Query: 347 F 347 F Sbjct: 371 F 371 >gi|320325880|gb|EFW81940.1| hypothetical protein PsgB076_04646 [Pseudomonas syringae pv. glycinea str. B076] Length = 774 Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 61/243 (25%), Positives = 107/243 (44%), Gaps = 17/243 (6%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS-EILKIFPAAR-- 169 A + +AI +H++Y D + ++ L+N I+ D+ +TL + K+ P + Sbjct: 262 AARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNHKKKTIATFGQHPRVKNL 321 Query: 170 -IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + + N R+ P L+ ++ L YD C +H KKS YS E W +L LL Sbjct: 322 KVSCVPNRERNFGPLLVEF-SKDLMAYDLFCHLHSKKSL---YSGREQTQWADYLTEYLL 377 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMGIT 285 ++ +++ F H+D+G+ YP + + + N+ + I Sbjct: 378 RDANIITRLLNAFVDHKDLGLY------YPTTFWMMPSWVNHVTMNKAFMNAWHNEWQID 431 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + L + AG MFW R EAL + + F P+ DG + HA+ER L +K Sbjct: 432 PCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSMLHALERVIGLLAEKN 491 Query: 346 NFR 348 ++ Sbjct: 492 GYK 494 >gi|293189412|ref|ZP_06608132.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] gi|292821502|gb|EFF80441.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] Length = 620 Score = 75.9 bits (185), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 72/254 (28%), Positives = 113/254 (44%), Gaps = 18/254 (7%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEILKIFP 166 + A K+ V H++Y D+ EI + LS L + L T E A I++ + Sbjct: 290 TLDAAASLKVLAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEAHAQERGV 349 Query: 167 AARIHIM-ENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + ++ N GRD+ FL+ S YD V KIH KKS + Y+ + L++ L+ Sbjct: 350 DADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYNAAQ--LFKEHLY 407 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMICTLAG 280 +LL + V I+ F H +GM+ + P + Y ++G NR A Sbjct: 408 DNLLASSDHVASILAKFAAHPGLGMVIA-----PMPHMGYP-TMGHAWFANRAPARDFAK 461 Query: 281 RMGIT--FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++GIT F D + G+MF R EAL + L P+ DG + H +ER Sbjct: 462 KVGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLL 521 Query: 339 SLSVKKANFRISDV 352 S +V + + V Sbjct: 522 SYAVLSRGYYVRPV 535 >gi|94994091|ref|YP_602189.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] gi|94547599|gb|ABF37645.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] Length = 581 Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 71/233 (30%), Positives = 113/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSVFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER F Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLF 505 >gi|71903253|ref|YP_280056.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] gi|71802348|gb|AAX71701.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] Length = 581 Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 69/233 (29%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD 398 Query: 233 VVFKIIRTFDTHRDIGMIGS--RAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 + + T D I I S R + N + ++ + +EM+ +L +M + Q Sbjct: 399 SILSVFETDDIGIIIADIPSFFRFNKIVNAWNEHLIA----QEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER F Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLF 505 >gi|312133751|ref|YP_004001090.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773029|gb|ADQ02517.1| Hypothetical protein BBMN68_1492 [Bifidobacterium longum subsp. longum BBMN68] Length = 641 Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 72/241 (29%), Positives = 110/241 (45%), Gaps = 11/241 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILK 163 + I K ++A+V+HLYY+D+ +I ++ D+ +T+ +E A I E + Sbjct: 298 SQDNAQPIPQKFRVALVLHLYYMDILDQILRYARSMPEGCDVIITVGSEEKACIVKERCE 357 Query: 164 IFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 P + ++EN GRDV L+ + L NYD VC H KK R+ GD + + Sbjct: 358 GMPYNIDVRVIENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKV-RQLRPETIGDGFAKK 415 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM 282 F + L + V II F + +G+ A + + + Y S G N L + Sbjct: 416 CFENTLASKAYVANIINLFADNPRLGVAMPSAPNHADYFYSYAFSWGPNYRGTKDLLDGL 475 Query: 283 GITFQ-DQKLDFFA--GTMFWVRTEALDPI--KNLRLSRYFEPKVHKALDGEIEHAVERC 337 GI D A GTMFW R +AL + K+ S F P+ + A DG H VER Sbjct: 476 GIKVPLSPHADVIAPLGTMFWFRPKALHGLIDKSWEYSD-FPPEPNPA-DGSFLHFVERA 533 Query: 338 F 338 + Sbjct: 534 Y 534 >gi|15674835|ref|NP_269009.1| hypothetical protein SPy_0792 [Streptococcus pyogenes M1 GAS] gi|71910421|ref|YP_281971.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] gi|13621968|gb|AAK33730.1| conserved hypothetical protein - possibly involved in cell wall localization and side chain formation of rhamnose-glucose polysaccharide [Streptococcus pyogenes M1 GAS] gi|71853203|gb|AAZ51226.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] Length = 581 Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 71/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E N + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFENWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|320095829|ref|ZP_08027469.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] gi|319977239|gb|EFW08942.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] Length = 619 Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 68/248 (27%), Positives = 112/248 (45%), Gaps = 18/248 (7%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES--ASIKSEIL 162 A + + A ++ + H++Y D+ EI + LS L + L T E A+I+ + Sbjct: 286 AAPEAREKAASLRVVAIAHIFYADMADEIIDRLSVLPDGWRLVATTADEERKAAIEETMA 345 Query: 163 KIFPAARIHIM-ENHGRDVLPFLI-LLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + ++ ++ N GRD+ FL+ + +YD V KIH KKS + + L++ Sbjct: 346 RRGAVGQVRVVASNRGRDISAFLVDCSDVLAGDDYDVVVKIHSKKSVQDEAN--AAQLFK 403 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNREMIC 276 L+ +LL + V I+ F H +GM + P + Y ++G NR Sbjct: 404 DHLYENLLDSKDHVANILAEFADHPGLGMALA-----PMPHMGYP-TMGHAWFANRPPAR 457 Query: 277 TLAGRMGIT--FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 LA R+GIT F D + G+MF R AL P+ L+ P DG + H + Sbjct: 458 ELAKRIGITVPFDDHQPLAPYGSMFIARPRALRPLVEAGLTHDDFPPEGGYQDGSLAHVI 517 Query: 335 ERCFSLSV 342 ER + +V Sbjct: 518 ERLLAYAV 525 >gi|332535404|ref|ZP_08411194.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis ANT/505] gi|332035169|gb|EGI71680.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis ANT/505] Length = 672 Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 61/239 (25%), Positives = 104/239 (43%), Gaps = 17/239 (7%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS----EILKIFPAARIHI 172 K+A+ H++Y + L+ + D+ V++ +E + K+ + + + Sbjct: 164 KLAMCFHVFYGEFIDYYCGALAKFTQQVDVFVSVASEELAKKAIHDFKACSKVNKVVVKV 223 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 + NHGR+ P L+ ++ L NYD C +H KKS G + + W +L LL P Sbjct: 224 VPNHGRNFGPMLVEFASD-LQNYDLFCHMHSKKSLYSGRAQTQ---WADYLGEYLLNDPH 279 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKY---CDYTCSLGKNREMICTLAGRMGITFQDQ 289 V+ +++ F+ + G+ YP + D+ KN+ + I +D Sbjct: 280 VIKQVLNHFNDNPKSGLY------YPTSFWMMPDWVNHWLKNKPAAQKFTKKWNIELKDD 333 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 L + AG MFW R EAL + N P DG HA+ER L V+K ++ Sbjct: 334 FLAYPAGGMFWARPEALKQLLNKEYKYDDFPGEPLPNDGSQLHALERMLGLLVEKNGYK 392 >gi|78184210|ref|YP_376645.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] gi|78168504|gb|ABB25601.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 1161 Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 64/252 (25%), Positives = 115/252 (45%), Gaps = 34/252 (13%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP 166 GK++ + K + +H++Y +L IA+ + + + D+H++ ++ S +EI K Sbjct: 192 GKLISSIGQKKFGVFLHIFYPELAPIIADYIRKIPVKIDIHISTTHDAISGLTEIFKGLE 251 Query: 167 AA---RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY--SWWEGDLWRR 221 + ++ N GRDV PF++ E++ YDY+ K+H KKS W+E L Sbjct: 252 NSLNVQVKSFPNIGRDVAPFIVGFR-EEIPKYDYILKLHSKKSPHSNALSGWFEHCL--- 307 Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG-----------K 270 +L+G+ V + I+ + DI ++ YP + +Y SLG Sbjct: 308 ---DNLIGSIDVFYTNIQELNK-EDISIV------YPVE--NYALSLGIKHDSCWGHEDG 355 Query: 271 NREMICTLAGRMGITFQDQKLDFF--AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDG 328 N TL ++G+ ++ +F G MFW + + L PI + L +DG Sbjct: 356 NYNKAKTLLKKLGLEQINRNSEFLFPTGNMFWCKPDILKPILDWDLKFEDFDNEGGQIDG 415 Query: 329 EIEHAVERCFSL 340 + H++ER L Sbjct: 416 TLAHSIERLIGL 427 >gi|319939379|ref|ZP_08013739.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] gi|319811365|gb|EFW07660.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] Length = 587 Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 78/255 (30%), Positives = 117/255 (45%), Gaps = 41/255 (16%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHI 172 KI + +H++Y+DL + N ++DL +T T++ + K EI I A I + Sbjct: 291 KIGVHLHVFYVDLLEDFLKAFENFHFAYDLFIT--TDNDTKKLEIEAILNQNHKNAHIFV 348 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L L + LS YDY+ H KKSK + W G+ WR L D+L P Sbjct: 349 TGNIGRDVLPMLKL--KKYLSTYDYIGHFHTKKSKEADF--WAGESWRNELI-DMLIKPA 403 Query: 233 VVFKIIRTFDTHRDIGMIGSRA---YRYPNKYCDYTCSLGKNREMICT----LAGRMGIT 285 I+ F+ + +G++ S +RY NK D N +I L +M +T Sbjct: 404 D--NILANFENDK-LGLVISDIPTFFRY-NKIVD-----AWNEHLIAPEMNDLWYKMKMT 454 Query: 286 FQDQKLDFFA--------GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 + +DF GT W + +AL P+ +L L+ P + I HA+ER Sbjct: 455 ---KPIDFNTFHTFVMSYGTFIWFKYDALKPLFDLDLTDKDVP-IEPLPQNSILHAIERL 510 Query: 338 FSLSV--KKANFRIS 350 + +FRIS Sbjct: 511 IVYVAWNEHYDFRIS 525 >gi|19745874|ref|NP_607010.1| hypothetical protein spyM18_0853 [Streptococcus pyogenes MGAS8232] gi|19748025|gb|AAL97509.1| conserved hypothetical protein [Streptococcus pyogenes MGAS8232] Length = 581 Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|139474025|ref|YP_001128741.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] gi|134272272|emb|CAM30524.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] Length = 581 Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|56808559|ref|ZP_00366292.1| COG3754: Lipopolysaccharide biosynthesis protein [Streptococcus pyogenes M49 591] Length = 581 Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|306827605|ref|ZP_07460885.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] gi|304430168|gb|EFM33197.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] Length = 581 Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|119026520|ref|YP_910365.1| hypothetical protein BAD_1502 [Bifidobacterium adolescentis ATCC 15703] gi|118766104|dbj|BAF40283.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703] Length = 647 Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 61/231 (26%), Positives = 110/231 (47%), Gaps = 12/231 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFP-AARIHIME 174 +IA+++HLYY+DL + ++ D T+ +E +A + E K P + +++ Sbjct: 309 RIALIMHLYYMDLLDKTLEYAKSMPEGCDFIFTVGSEENAKLVRERCKGLPYNVDVRVIQ 368 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKK-SKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDV LI + L YDYVC H KK ++ YS +G ++ F ++LG+ + Sbjct: 369 NRGRDVSALLIGAGKDCLK-YDYVCFAHDKKVTQLSPYSIGDGFAYK--CFENILGSKAL 425 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNRE-----MICTLAGRMGITFQD 288 V II F+ G++ + + + + ++ G N E + TL ++ + Sbjct: 426 VSNIINHFEQDPHAGLLAPTSPNHADYFGNFASLWGPNFEGTKKMLEETLGVKVPLNPYK 485 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 + + GTMFW R +AL + ++ P +DG + H +ER + Sbjct: 486 EPIAPL-GTMFWFRPKALHQLFDIDWKYEDFPPEPNKIDGSMLHFIERAYG 535 >gi|209559162|ref|YP_002285634.1| RgpFc protein [Streptococcus pyogenes NZ131] gi|209540363|gb|ACI60939.1| RgpFc protein [Streptococcus pyogenes NZ131] Length = 581 Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 70/233 (30%), Positives = 112/233 (48%), Gaps = 23/233 (9%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP----AARIHI 172 K+A+ +H++Y+DL E + + +DL +T T+S + EI +I A I + Sbjct: 285 KVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFIT--TDSDIKRKEIKEILQRKGKTADIRV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L D+L P Sbjct: 343 TGNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELI-DMLVKPA 397 Query: 233 VVFKIIRTFDTHRDIGMIGSRA--YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD-- 288 I+ F+T +I +R+ NK + +EM+ +L +M + Q Sbjct: 398 D--SILSAFETDDIGIIIADIPSFFRF-NKIVNAWNEHLIAQEMM-SLWRKMDVKKQIDF 453 Query: 289 QKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 454 QAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPS-EPLPQNSILHAIERLL 505 >gi|158422520|ref|YP_001523812.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] gi|158329409|dbj|BAF86894.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] Length = 661 Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 77/262 (29%), Positives = 112/262 (42%), Gaps = 28/262 (10%) Query: 100 GWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIK 158 G PA V ++ +A VVH YY DL + L++ L + L VT E A Sbjct: 381 GARRPAAAPVTPRPVRTGSLASVVHGYYEDL---LPGLIAGLDPAH-LFVTTPPEKAEAV 436 Query: 159 SEILKIFPAARIHI-MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 +L A +EN GRDV PFL LL + YD V K+H K+S +G EG Sbjct: 437 RAVLARAAPAARLRVVENRGRDVRPFLSLLPELEAEGYDLVLKVHTKRSPHQGK---EGS 493 Query: 218 LWRRWLFYDLLGAPGVVFKIIRT------FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 W + L G + K+ R+ F+ H +G++G+ + Y S G N Sbjct: 494 DWLQRLS-------GPLLKLARSERLAPVFEAHPQMGLLGAAGHVLDGAL--YAGSAG-N 543 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNL-RLSRYFEPKVHKALDGEI 330 + LA +G T + AGTMF R P++ L F+ + DG + Sbjct: 544 AAWMRRLAAELG-TGAPLTSPYVAGTMFVARLGIFAPLRGASELLDLFDTDMGLK-DGTL 601 Query: 331 EHAVERCFSLSVKKANFRISDV 352 HA ER F + +A + +V Sbjct: 602 AHAFERFFGVLAAEAGLSVGEV 623 >gi|116071634|ref|ZP_01468902.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] gi|116065257|gb|EAU71015.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] Length = 934 Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 60/230 (26%), Positives = 108/230 (46%), Gaps = 17/230 (7%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARI---- 170 + ++AI +H+YY + E L+ L L +T T ++ K I++I A + Sbjct: 43 ECQVAIYLHIYYPESLHEFLEYLTVLPSQIRLVIT--TTTSEKKELIIEILERALLINRL 100 Query: 171 ---HIM-ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 H+ EN GRD+ F+ + + +L YD VCK+H KKS G G W R+L Sbjct: 101 DLCHVYHENKGRDIGAFINIYD--ELIKYDVVCKLHAKKSPHLGEF---GKSWFRYLIRS 155 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 +G + I+ +DIG++ +++ N + D+ + ++ I + Sbjct: 156 TIGNQSAIENIVNILYHSKDIGILAPTSFQGTNNH-DWASNFDISQS-ISDHIFNSELDI 213 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 +KL + + T+FW + EAL+ + + F P+ +DG H++ER Sbjct: 214 NKEKLRYPSATVFWFKPEALNQQQFRSIQPDFFPEEPIPIDGTTAHSLER 263 >gi|13476282|ref|NP_107852.1| hypothetical protein mlr7561 [Mesorhizobium loti MAFF303099] gi|14027043|dbj|BAB53997.1| mlr7561 [Mesorhizobium loti MAFF303099] Length = 609 Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 63/239 (26%), Positives = 108/239 (45%), Gaps = 15/239 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPA--ARIHI 172 +IA++ H+Y++D+ EI N+ +DL VT + A I+ I K A A + + Sbjct: 298 RIAVLAHVYHLDMIDEILGYAENVPKGYDLIVTTDNADKQALIQQAIAKATNASNAVVLV 357 Query: 173 MENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N GRD L+ L + YD +C++H K+S + G G+L++ F +LL P Sbjct: 358 VRNDGRDTSALLVGCRDYVLEDRYDLICRVHSKRSPQDGP---RGELFKLHTFENLLHTP 414 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQDQ 289 G V ++ F + +G++ Y S N+ + LA ++G+ D Sbjct: 415 GYVSNLLELFANNPALGLVMPPLVHI--GYPTIGNSWAGNKANVAKLARQLGLIVHLDDS 472 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLS-RYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 G M+W R AL + R + F ++ DG + HA+ER + A + Sbjct: 473 TPVAPYGGMYWFRPAALRKLFEERWNWNDFANMDYR--DGSLVHAIERIIAYVAIDAGY 529 >gi|260890973|ref|ZP_05902236.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] gi|260859000|gb|EEX73500.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] Length = 319 Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 65/244 (26%), Positives = 115/244 (47%), Gaps = 12/244 (4%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI---FPAA 168 I +K K+ ++ H+Y+ DL E + + ++ + DL +T + K E F Sbjct: 2 IYLKYKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNI 61 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR-KGYSWWEGDLWRRWLFYDL 227 + ++EN GRDV L+ + + + NYDYVC +H KK+ + K YS +G +R + + Sbjct: 62 EVRVIENRGRDVSSLLVGAK-DAVMNYDYVCFMHDKKTAQLKPYSSGQG--FRYKCYENN 118 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY----CDYTCSLGKNREMICTLAGRMG 283 L V +I TF + +GM+ + N + +++ + K ++I L + Sbjct: 119 LATKKYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVD 178 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + + + GTMFW R AL + + P+ DG I HAVER + +V+ Sbjct: 179 FHWNLEPISPL-GTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFAVQ 237 Query: 344 KANF 347 A + Sbjct: 238 DAGY 241 >gi|13476281|ref|NP_107851.1| hypothetical protein mlr7560 [Mesorhizobium loti MAFF303099] gi|14027042|dbj|BAB53996.1| mlr7560 [Mesorhizobium loti MAFF303099] Length = 637 Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 70/261 (26%), Positives = 114/261 (43%), Gaps = 40/261 (15%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI---LKIFPAAR--- 169 KIA+ H+YY D+ EI L N+ + +D T T++ K+EI L P + Sbjct: 297 GKIAVCAHIYYTDMLEEILALTGNIPVPYDFIAT--TDTPDKKAEIEATLAKRPGVKNVI 354 Query: 170 IHIME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 + ++E N GRD+ I L + + YD VC++H KKS + S +L++R + +L Sbjct: 355 VRVVEKNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQAS--RSNLFKRHMLENL 412 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK----NREMICTLAGRMG 283 L G V ++ F + +G+ P + Y ++G NR + A + Sbjct: 413 LNTRGYVHNVLDMFHDNPSVGLAVP-----PVVHISYP-TMGHAWFFNRPKVEETARLLN 466 Query: 284 ITFQ-DQKLDFFA-GTMFWVRTEALDPIKNLRLSRYFEPK--------VHKALDGEIEHA 333 I + D A GTMFW R A L + FE K +DG + H Sbjct: 467 IKVKFDHDTPVAAYGTMFWFRPRA--------LRKMFEHKWKWEDFNAEPNHVDGGLAHV 518 Query: 334 VERCFSLSVKKANFRISDVDC 354 +ER + + + A + + C Sbjct: 519 LERLIAYAAQDAGYTTRHIMC 539 >gi|33862360|ref|NP_893920.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] gi|33640473|emb|CAE20262.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] Length = 738 Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 68/262 (25%), Positives = 106/262 (40%), Gaps = 32/262 (12%) Query: 101 WESPAM---GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL---------HV 148 W P + + Q + IA+ VH++Y +L I N L+ I DL H Sbjct: 482 WNVPMITPASSLQQQDSETTIALHVHVHYPELLDTILNALNYNKIRPDLFLSCTNHENHS 541 Query: 149 TLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 + +SA + I N GRD+ P L + E + Y+ +H KKS Sbjct: 542 EIQCKSAGANCTLKSII------TTPNRGRDIGPLLTEIGKELDTKYEIYGHLHTKKSAL 595 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVF--KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTC 266 +G WR +L +L+G + +I+ + +G++ + D TC Sbjct: 596 --LPGKQGCSWRDFLISNLVGMQDIAMADRIVTALKKNPKLGLVFAD---------DPTC 644 Query: 267 -SLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKA 325 NR+ LA ++ + + DF GTMFW + AL + NL L P+ Sbjct: 645 VGWSGNRKHADILANKLNLGPLPRCFDFPVGTMFWAKKGALTELYNLNLGWEDYPQEPLG 704 Query: 326 LDGEIEHAVERCFSLSVKKANF 347 DG I HA+ER + K F Sbjct: 705 YDGTILHAIERLLPIIAAKQGF 726 >gi|227546966|ref|ZP_03977015.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212567|gb|EEI80455.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 631 Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 63/236 (26%), Positives = 105/236 (44%), Gaps = 9/236 (3%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILKIFPAA-RIHIME 174 KIA+ +H+YY+DL + + ++ D+ +T+ +E+ A E K FP + ++E Sbjct: 310 KIALAIHVYYMDLLESTFHYIQSMPEGCDIIITVGSEANAETVREYCKQFPYNFDVRVIE 369 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRDV L+ E L YDYVC H KK + GD + F ++L + V Sbjct: 370 NRGRDVSALLVGC-GEDLFQYDYVCFAHDKKVTQLSPQSI-GDGFAYKCFENILASKEYV 427 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN----REMICTLAGRMGITFQDQK 290 +I F+ + +G+ + + + YT G N ++ + D++ Sbjct: 428 SNVIDLFERNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQTLNMHVPLNADKE 487 Query: 291 LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 GTMFW R EA + + P +DG + H +ER + V +AN Sbjct: 488 PVAPMGTMFWFRPEAFRGLLDHGWEYTDFPPEPNKVDGTLLHFIERAYGY-VPQAN 542 >gi|262038042|ref|ZP_06011449.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] gi|261747934|gb|EEY35366.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] Length = 629 Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 72/259 (27%), Positives = 125/259 (48%), Gaps = 27/259 (10%) Query: 111 QIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAAR- 169 +I K+ + H+Y+ DL E N+ D+ +T T+ K +I KIF + Sbjct: 298 KIENSPKVGLFFHIYFEDLIEECYRYALNMPEYADIFIT--TDKEEKKEKIEKIFSKMKN 355 Query: 170 ---IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 I +++N GRDV FLI E++ YDY C H KK+K+ +G+ ++ F + Sbjct: 356 KIDIKVIQNRGRDVSAFLIP-NKEEILKYDYACFAHDKKTKQLQ-PEIKGEDFKFRCFEN 413 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK--------NREMICTL 278 +LG+ +V II F + +G++ + PN + ++ +LG+ N E C L Sbjct: 414 ILGSKELVENIIGLFIENPRLGLLSPPS---PN-HAEFYGNLGREWGHSGNDNYEETCNL 469 Query: 279 AGRMGITFQ-DQKLDFFA--GTMFWVRTEALDPIKNLRLSRYFE--PKVHKALDGEIEHA 333 + I D A GT+FW R ++L+ + L+ +E PK +DG + HA Sbjct: 470 LKELVIEVNVDISKAPVAPYGTIFWFRPKSLEKL--LKKGWKYEDFPKEPNKVDGTLLHA 527 Query: 334 VERCFSLSVKKANFRISDV 352 +ER + V+ A + +++ Sbjct: 528 IERVYPFVVQGAGYYSANI 546 >gi|261367011|ref|ZP_05979894.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] gi|282571129|gb|EFB76664.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] Length = 646 Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 60/254 (23%), Positives = 113/254 (44%), Gaps = 23/254 (9%) Query: 108 KVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA 167 + ++ + +IA+ +HLY++D+ + + D+ V+ T S K +I + F Sbjct: 300 QAEELCAQRRIALAMHLYFMDMLEQSVAFAAKFPPQTDVFVS--TNSEEKKEQIEQAFSG 357 Query: 168 ARIH-----IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 ++H ++EN GRDV FL L L NYDY C +H KK+ + G + + Sbjct: 358 QKLHSVTVMVVENRGRDVGAFLCDL-APHLRNYDYACFMHDKKAIQTK----PGSVGASF 412 Query: 223 LFY---DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG------KNRE 273 + ++ V ++ F+ +G++ Y Y CS G ++ Sbjct: 413 GYVCNENVCKNAAHVLNVLCEFENDPYLGILCP-PYPTHGLYFMNMCSGGWGPNFENTKK 471 Query: 274 MICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 ++ L + I+ ++ + F G++FW R +AL+P+ P+ DG I HA Sbjct: 472 LLKELGLDVPISGEESPIAPF-GSVFWFRPKALEPLFAHGWQHTDFPQEPLPQDGTISHA 530 Query: 334 VERCFSLSVKKANF 347 +ER + + A + Sbjct: 531 IERVYPFVAQAAGY 544 >gi|241668058|ref|ZP_04755636.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 756 Score = 69.7 bits (169), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 63/239 (26%), Positives = 106/239 (44%), Gaps = 15/239 (6%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHI 172 I K AI +HL+YIDL E L +DL++T++ + ++ E AA + I Sbjct: 518 ISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAANVEI 577 Query: 173 --MENHGRDVLPFLILLETEQLS-NYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++N GRD+ P + L+ + L+ Y+ V H KK+ + GD WR +L +L+G Sbjct: 578 VAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLNNLIG 635 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + I IG++ Y +G+N+ + L +G+ + Sbjct: 636 DNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLEKICE 687 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F G MFW R +A+ I +L + + DG HA+ER V+K ++ Sbjct: 688 TPLFPLGNMFWARVDAIRDIFSLNEDMILQEE-PLPRDGSYMHALERIIPNIVEKNGYK 745 >gi|167627488|ref|YP_001677988.1| group 1 glycosyl transferase [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167597489|gb|ABZ87487.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 763 Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 63/239 (26%), Positives = 106/239 (44%), Gaps = 15/239 (6%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHI 172 I K AI +HL+YIDL E L +DL++T++ + ++ E AA + I Sbjct: 525 ISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAANVEI 584 Query: 173 --MENHGRDVLPFLILLETEQLS-NYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++N GRD+ P + L+ + L+ Y+ V H KK+ + GD WR +L +L+G Sbjct: 585 VAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLNNLIG 642 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + I IG++ Y +G+N+ + L +G+ + Sbjct: 643 DNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLEKICE 694 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F G MFW R +A+ I +L + + DG HA+ER V+K ++ Sbjct: 695 TPLFPLGNMFWARVDAIRDIFSLNEDMILQEEPLPR-DGSYMHALERIIPNIVEKNGYK 752 >gi|254876593|ref|ZP_05249303.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254842614|gb|EET21028.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 765 Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 63/239 (26%), Positives = 106/239 (44%), Gaps = 15/239 (6%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHI 172 I K AI +HL+YIDL E L +DL++T++ + ++ E AA + I Sbjct: 527 ISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAANVEI 586 Query: 173 --MENHGRDVLPFLILLETEQLS-NYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++N GRD+ P + L+ + L+ Y+ V H KK+ + GD WR +L +L+G Sbjct: 587 VAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLNNLIG 644 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 + I IG++ Y +G+N+ + L +G+ + Sbjct: 645 DNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLEKICE 696 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F G MFW R +A+ I +L + + DG HA+ER V+K ++ Sbjct: 697 TPLFPLGNMFWARVDAIRDIFSLNEDMILQEE-PLPRDGSYMHALERIIPNIVEKNGYK 754 >gi|116071143|ref|ZP_01468412.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] gi|116066548|gb|EAU72305.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] Length = 1161 Score = 69.3 bits (168), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 65/259 (25%), Positives = 111/259 (42%), Gaps = 40/259 (15%) Query: 103 SPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEIL 162 SP K+ Q + K + +H++Y +L IA+ L+ + + D++++ + E+ Sbjct: 188 SPHPQKLTQAIEQKKFGVFLHIFYPELAKTIADYLAKIPVKIDIYISTTEKEV---DELA 244 Query: 163 KIF------PAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY--SWW 214 K F ++ N GRDV PF++ E++ YD++ K+H KKS W+ Sbjct: 245 KTFRRLDNSEHVQVKSFSNTGRDVAPFVVGFR-EEILKYDFILKLHSKKSPHSDALSGWF 303 Query: 215 EGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----- 269 E L +L+G+ V + I M A YP + +Y SLG Sbjct: 304 EHCL------DNLIGSKDVFYTNIFEL-------MNNETAIIYPVE--NYALSLGIKHDS 348 Query: 270 ------KNREMICTLAGRMGITF--QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPK 321 N + L ++ + +D K F GTMFW ++ L PI + L + Sbjct: 349 CWGHEDGNYDKAKPLLDKLNLKHIDRDSKFLFPTGTMFWCKSYILQPILDWNLGFHDFDN 408 Query: 322 VHKALDGEIEHAVERCFSL 340 +DG + H++ER L Sbjct: 409 EGGQIDGTLAHSIERLIGL 427 >gi|311063512|ref|YP_003970237.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] gi|310865831|gb|ADP35200.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] Length = 631 Score = 69.3 bits (168), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 64/236 (27%), Positives = 103/236 (43%), Gaps = 9/236 (3%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILKIFPAA-RIHIME 174 KIA+ +H+YY+DL + ++ D+ +T+ +E+ A I E K FP + ++E Sbjct: 310 KIALAIHVYYMDLLESTFRYIQSMPEGCDIIITVGSEANAEIVREYCKQFPYRFDVRVIE 369 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRDV L+ E L YDYVC H KK + GD + + ++L + V Sbjct: 370 NRGRDVSSLLVGC-GEDLFQYDYVCFAHDKKVTQLSPQSI-GDGFAYKCYENILASKEYV 427 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN---REMICTLAGRMGITFQDQKL 291 +I F+ + +G+ + + + YT G N + M + K Sbjct: 428 SNVIDLFEKNPRLGIAMPTPPNHASYFPGYTFPWGPNFPGTKDFLEQTLNMHVPLNANKE 487 Query: 292 DFFA-GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 GTMFW R EA + + P +DG + H +ER + V +AN Sbjct: 488 PVAPMGTMFWFRPEAFRGLLDHGWKYEDFPPEPNKVDGTLLHFIERAYGY-VPQAN 542 >gi|88808074|ref|ZP_01123585.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] gi|88788113|gb|EAR19269.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] Length = 512 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 66/238 (27%), Positives = 109/238 (45%), Gaps = 33/238 (13%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKI---FPAARIH 171 KI +V+H YY + I L ++ FDL VT+ + + +K + K+ I Sbjct: 50 KILVVIHAYYPESLATIFPSLRHMPCHFDLVVTVCSCGDKEVVKEYLEKVDLPIDVLDIK 109 Query: 172 IMENHGRDVLPFLILLETEQLSN--YDYVCKIHGKKS------KRKGYSWWEGDLWRRWL 223 ++ N GRD+LPF+ +++ +L N YD+V K+H K+S K G W EG L Sbjct: 110 VLTNLGRDLLPFVQVIKGLKLQNKAYDFVLKLHTKRSVASSKGKEFGGKWLEGSL----- 164 Query: 224 FYDLLGAPG----VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA 279 +LLG+P ++ ++++T + +I +R+ C N I L Sbjct: 165 -SNLLGSPENVKYILLELLQTTNCALVSPLISLDVFRF--------CKWKNNLAPISHLL 215 Query: 280 GRMGITFQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 R G+ + + F AG+MFWV +A I + P+ + +G HA ER Sbjct: 216 DRFGVRESPEDFICFPAGSMFWVDFKAAVLIASCFEESRVPPEPLPS-NGSYLHAFER 272 >gi|125623094|ref|YP_001031577.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|124491902|emb|CAL96823.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|300069842|gb|ADJ59242.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris NZ9000] Length = 588 Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 69/263 (26%), Positives = 120/263 (45%), Gaps = 28/263 (10%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIH---IM 173 KIAI +H +Y+DL E + ++DL +T T++ +I+K +P +I + Sbjct: 299 KIAIHLHAFYLDLIPEYLDYFDKYVQNYDLFIT--TDTKDKYEQIIKSYPLNQIKKVLVT 356 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDVLP++ + +E +++YD H KKSK W G+ WRR + Y LL Sbjct: 357 GNKGRDVLPWMEI--SELMADYDLCGHFHTKKSKDN--DWIVGESWRRDIEYSLLKPAQA 412 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTC----SLGKNREMI---CTLAGRMGITF 286 +F + F+ + +G++ + + + T + + E I G+ Sbjct: 413 IF---QEFEKNPKLGLMIADVPSFFEHFYGPTYITERDIWPDMEEIWKKINFENPRGLKQ 469 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYF--EPKVHKALDGEIEHAVERCFSLS--V 342 +D + + GTM W R +AL+ + + + EP + + I HA ER + Sbjct: 470 KDSYVMSY-GTMIWYRPQALNNLLKVDIEAAVPEEPLPYNS----ILHAFERLLVYTSWA 524 Query: 343 KKANFRISDVDCILGYRKSLSQN 365 +FRIS + G+ + S N Sbjct: 525 NGYDFRISQIQTNNGFVANFSAN 547 >gi|78213552|ref|YP_382331.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] gi|78198011|gb|ABB35776.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] Length = 1162 Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 62/249 (24%), Positives = 112/249 (44%), Gaps = 34/249 (13%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKSEILKIFPAARIHI 172 I K+ I +H++Y +L IA L N+ S D+ ++ +S A+++ ++ +I + Sbjct: 201 INKKVGIFLHIFYPELGETIAAYLKNIPCSIDVFISTREDSVAALEKIFARVENTQKIEV 260 Query: 173 --MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 N GRDV PF++ +Q+ NYDY+ K+H KKS + G W +L+G+ Sbjct: 261 RHFSNIGRDVAPFIVGFR-DQILNYDYILKLHSKKSPHS--NALSG--WFLHCLDNLIGS 315 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG---------------KNREMI 275 + ++ + ++G++ YP + +Y SLG K R + Sbjct: 316 EAITATNLKALQS-PEVGIV------YPIE--NYALSLGIQHDSCWGHEDGNYAKARPFL 366 Query: 276 CTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335 R ++ + F GTMFW + L I + L+ + +DG I H++E Sbjct: 367 NRYNLRQ--IKRESQFQFPTGTMFWCKPAVLQSILDWGLNWNNFDEEGGQIDGTIAHSIE 424 Query: 336 RCFSLSVKK 344 R +S + Sbjct: 425 RLIGISTTE 433 >gi|146279467|ref|YP_001169625.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] gi|145557708|gb|ABP72320.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] Length = 823 Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 60/223 (26%), Positives = 91/223 (40%), Gaps = 19/223 (8%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARIHIME 174 A+ VH +Y D A++ S+ + + + T++ SEI A + ++ Sbjct: 591 ALHVHAFYTDDLA--ADVRSHRAFRLARRIVITTDNERKASEIRTRMGAEGLYPEVILVP 648 Query: 175 NHGRDVLPFLILLETEQLSNYDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+LPF+ L + D + C +H KKS S GD+WR +L LLG Sbjct: 649 NRGRDILPFMQLFLPGGPAGKDEIWCHLHQKKSLATSDS---GDVWRAFLLRILLGDDAG 705 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + + G + A + Y +R ++ A R+ D L F Sbjct: 706 LSDAV---------GHLRDPAVGLVAPFDPYHVPWDASRALLPRFAPRLPGPLPDNPLLF 756 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 G MFWVR + + +L Y P A DG H VER Sbjct: 757 PVGNMFWVRAGVVRAMNDLFGPSYPWPNEPIANDGTEFHLVER 799 >gi|78184217|ref|YP_376652.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] gi|78168511|gb|ABB25608.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 519 Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 68/245 (27%), Positives = 109/245 (44%), Gaps = 19/245 (7%) Query: 118 IAIVVHLYYIDLWIEIANLLSN-----LSISFDLHVTLVTESASIKSEILKI--FPAARI 170 +A+++H +Y D+ +I L + + DL+V+ + + L+ F R+ Sbjct: 268 LALMIHGFYPDVLDDILLKLPSFCAGMVGTQLDLYVSTSMDQIDQVEKKLRDLDFACVRL 327 Query: 171 HIMENHGRDVLPFLI-LLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 +EN GRDV PFL+ LL + + + K+H KKS + G D W R L LL Sbjct: 328 FGVENRGRDVAPFLLHLLPAVAAAGHHFFVKLHTKKSLQFGIDGL--DKWSRHLIESLLS 385 Query: 230 APGVVFKIIR-TFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 A G+ + IR F D+G + P +L KN+ + L I + Sbjct: 386 AAGL--EAIRYQFLDDEDLGCLCPSGTLLP-----LAIALFKNKTHLHHLLSHSEINGRW 438 Query: 289 QKLD-FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + F AG+MF R EA + + S DG HA+ER SL VK++ + Sbjct: 439 ALMQTFVAGSMFAGRVEAFRSLLDQGFSLDDFELEGGQFDGTFAHALERLISLEVKRSGW 498 Query: 348 RISDV 352 +I ++ Sbjct: 499 QIKEM 503 >gi|260434430|ref|ZP_05788400.1| glycosyltransferase [Synechococcus sp. WH 8109] gi|260412304|gb|EEX05600.1| glycosyltransferase [Synechococcus sp. WH 8109] Length = 772 Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 61/243 (25%), Positives = 101/243 (41%), Gaps = 23/243 (9%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA---SIKSEILKIFPAARI 170 I K+ + +H++Y +L EI +S I +++++ ++ +IK+ +I Sbjct: 532 IDEKVGLHIHVHYPELLDEILKAISMNKIRPEIYISCTNQAIRDLAIKNINEHGLILKKI 591 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKS----KRKGYSWWEGDLWRRWLFYD 226 + N GRD+ P L L E Y IH KKS + + YSW R +L + Sbjct: 592 ILTPNRGRDIGPLLTCLGQELDEKYRIYGHIHTKKSIHIARHQSYSW------RTFLIEN 645 Query: 227 LLGAPG--VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 L+G ++ II + IG+ P +Y + LA ++ I Sbjct: 646 LIGNEENHMMDCIISAMIKDKTIGLAFPSDPHCPGWDANYRQA--------KLLAEKLNI 697 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 + +F GTMFW R AL P+ +L L P DG + H++ER + Sbjct: 698 KSLTNEFNFPIGTMFWARKNALSPLYSLNLGWDDYPSEPIGYDGTLLHSIERLIPFVAES 757 Query: 345 ANF 347 F Sbjct: 758 QGF 760 >gi|301632931|ref|XP_002945533.1| PREDICTED: o-antigen export system ATP-binding protein rfbB-like, partial [Xenopus (Silurana) tropicalis] Length = 367 Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 48/161 (29%), Positives = 69/161 (42%), Gaps = 30/161 (18%) Query: 192 LSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 + Y + ++H K+S G+ WR L+ L G+ V I+ TF+TH +GM+ Sbjct: 201 IPRYALILRLHSKRSLH--IPGQVGEEWRALLYTSLAGSRQRVNAIVDTFNTHPKLGMLC 258 Query: 252 SRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF-QDQKLDFFAGTMFWVRTEALD--- 307 + Y D G N + +C L GIT DQ +DF G+MFW R +AL Sbjct: 259 PAVI---DHYAD-CLHFGGNYKRMCALLQPHGITLPPDQPIDFPMGSMFWCRPQALSVWL 314 Query: 308 ----------PIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 P +L R DG + HA+ER F Sbjct: 315 EPGFTFDDFTPTNDLDTDR----------DGTLAHALERLF 345 >gi|297182567|gb|ADI18727.1| lipopolysaccharide biosynthesis protein [uncultured Rhizobiales bacterium HF4000_32B18] Length = 887 Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 51/191 (26%), Positives = 80/191 (41%), Gaps = 25/191 (13%) Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 A ++ N GRDVLPFL L + + N + C +H KKS G + G +WR +L Sbjct: 697 AIETRVVANRGRDVLPFLELFDGSEDDNALW-CHVHLKKSVGLGPTS-PGAVWRAFLMRI 754 Query: 227 LLGAP---GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LLG P +IR + G++G+ + Y +R ++ L R+ Sbjct: 755 LLGGPERLSTALALIRAPEA----GLVGA--------FDPYVMGWTGSRRLLAPLQARLD 802 Query: 284 --------ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335 D L F G MFWV+ ++ ++ L + Y P DG + H +E Sbjct: 803 GWEADGGRRPLPDHPLLFPVGDMFWVKAGVVNAMRRLFGADYPWPGEPLPGDGTVYHLIE 862 Query: 336 RCFSLSVKKAN 346 R + + A Sbjct: 863 RLWPTAAALAG 873 >gi|260890969|ref|ZP_05902232.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] gi|260859295|gb|EEX73795.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] Length = 709 Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 62/239 (25%), Positives = 109/239 (45%), Gaps = 12/239 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK---IFPAARIHIM 173 K+ ++ H+Y+ DL E + + ++ + DL +T + K E F + ++ Sbjct: 152 KVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNIEVRVI 211 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR-KGYSWWEGDLWRRWLFYDLLGAPG 232 EN GRDV L+ + + + NYDYVC +H KK+ + K YS D++ + L Sbjct: 212 ENRGRDVSSLLVGAK-DAVMNYDYVCFMHDKKTAQLKPYSSL-NDVYINYC-KGTLATKK 268 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKY----CDYTCSLGKNREMICTLAGRMGITFQD 288 V +I TF + +GM+ + N + +++ + K ++I L + + Sbjct: 269 YVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDFHWNL 328 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + + GTMFW R AL + + P+ DG I HAVER + V+ A + Sbjct: 329 EPISPL-GTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFVVQDAGY 386 >gi|160936497|ref|ZP_02083865.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] gi|158440582|gb|EDP18320.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] Length = 373 Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 53/219 (24%), Positives = 104/219 (47%), Gaps = 18/219 (8%) Query: 122 VHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVL 181 +++ I +I++ + SN I+F + + + I ++L + EN GRD+ Sbjct: 12 LYIEQIPKYIDVCFVTSNPKIAFKVK-KYINNTKKINYKVL---------VKENRGRDMA 61 Query: 182 PFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTF 241 L+ + + Y+Y+C +H KKS + G +G + ++ +L+G+ G++ I+R Sbjct: 62 ALLVTCH-DFIMEYEYLCFVHDKKSLQMGNDN-DGCKFMELIWKNLIGSTGLIENILRYL 119 Query: 242 DTHRDIG-MIGSRAY--RYPNKYCD-YTCSLGKNREMICTLAGRMGITFQDQKLDFFAGT 297 +RD+G M+ Y Y + + +TC+ + L + + ++ + + G Sbjct: 120 GNNRDVGLMVPPIPYWGNYIGVFINPWTCNYDNVINLGNQLKLKKNVCYEKEYVTI--GG 177 Query: 298 MFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 FW RT AL P+ + + A+DG I HA+ER Sbjct: 178 AFWCRTNALKPLFEYKWKLEDFCQEPMAVDGTISHAIER 216 >gi|332561589|ref|ZP_08415902.1| glycosyltransferase [Rhodobacter sphaeroides WS8N] gi|332274091|gb|EGJ19409.1| glycosyltransferase [Rhodobacter sphaeroides WS8N] Length = 821 Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 54/223 (24%), Positives = 95/223 (42%), Gaps = 15/223 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIME 174 + ++ VH +Y D + + + + VT ++ + I++ + + A + + Sbjct: 587 RFSLHVHAFYTDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRP 646 Query: 175 NHGRDVLPFLILLETEQLSNYDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+LPFL L + D + C +H KKS S GD+WR +L LLG Sbjct: 647 NRGRDILPFLQLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDMWRAFLLRILLGDEAS 703 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + T + +G++ + Y +R ++ +A R+ D L F Sbjct: 704 LSDAA-THLRNPGVGLVA--------PFDPYFIPWDASRALLPRVAPRLPGPLPDNPLLF 754 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 G MF+VR+ + + +L + Y P DG H +ER Sbjct: 755 PVGNMFFVRSAVVRAMNDLFGAGYPWPNEPIPNDGTEFHLIER 797 >gi|221218294|ref|YP_002524321.1| glycosyltransferase [Rhodobacter sphaeroides KD131] gi|221163321|gb|ACM04287.1| glycosyltransferase [Rhodobacter sphaeroides KD131] Length = 821 Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 54/223 (24%), Positives = 95/223 (42%), Gaps = 15/223 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIME 174 + ++ VH +Y D + + + + VT ++ + I++ + + A + + Sbjct: 587 RFSLHVHAFYTDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRP 646 Query: 175 NHGRDVLPFLILLETEQLSNYDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+LPFL L + D + C +H KKS S GD+WR +L LLG Sbjct: 647 NRGRDILPFLQLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEAS 703 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + T + +G++ + Y +R ++ +A R+ D L F Sbjct: 704 LSDAA-THLRNPGVGLVA--------PFDPYFIPWDASRALLPRVAPRLPGPLPDNPLLF 754 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 G MF+VR+ + + +L + Y P DG H +ER Sbjct: 755 PVGNMFFVRSRVVRAMNDLFGAGYPWPNEPIPNDGTEFHLIER 797 >gi|14090418|gb|AAK53494.1| putative methyltransferase [Xanthomonas campestris pv. campestris] Length = 212 Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 19/159 (11%) Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENHGR 178 +V+H +Y+D+ E+ + + + +T +T+ + I + A + EN GR Sbjct: 51 VVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRRGIQAEVEGFENRGR 110 Query: 179 DVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKII 238 D+LPFL + N V K+H KKS + +G+ WR + LLG P V I+ Sbjct: 111 DILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEMLTALLG-PQRVDAIV 165 Query: 239 RTFDTHRDIGM-------------IGSRAYRYPNKYCDY 264 F T +G+ IG A R YC + Sbjct: 166 NAFSTDPLVGLAAPEDHLLPVTEFIGGNAERTGLSYCSH 204 >gi|189440434|ref|YP_001955515.1| lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|317482688|ref|ZP_07941702.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] gi|189428869|gb|ACD99017.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|316915934|gb|EFV37342.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] Length = 666 Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 15/248 (6%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P+ Q + + A + H+Y++DL + + +++L DL++T + E ++ Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350 Query: 164 ---IFPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSWWEG 216 I A + N GRDV L+ LS YD + H KKS + G+ E Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410 Query: 217 DLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS--LGKNREM 274 + L + LG+ V I+ F + +G + + Y Y +T G N E+ Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQV-TPPPPYHALYFAHTIPHDWGANYEI 469 Query: 275 ICT-LAGRMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLS-RYFEPKVHKALDGE 329 L R+GI A G+ +W R EAL P+ F P+ DG Sbjct: 470 TKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGT 529 Query: 330 IEHAVERC 337 I HA+ER Sbjct: 530 ISHAIERA 537 >gi|322690050|ref|YP_004209784.1| hypothetical protein BLIF_1872 [Bifidobacterium longum subsp. infantis 157F] gi|320461386|dbj|BAJ72006.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 672 Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 15/248 (6%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P+ Q + + A + H+Y++DL + + +++L DL++T + E ++ Sbjct: 291 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 350 Query: 164 ---IFPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSWWEG 216 I A + N GRDV L+ LS YD + H KKS + G+ E Sbjct: 351 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 410 Query: 217 DLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS--LGKNREM 274 + L + LG+ V I+ F + +G + + Y Y +T G N E+ Sbjct: 411 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQV-TPPPPYHALYFAHTIPHDWGANYEI 469 Query: 275 ICT-LAGRMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLS-RYFEPKVHKALDGE 329 L R+GI A G+ +W R EAL P+ F P+ DG Sbjct: 470 TKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGT 529 Query: 330 IEHAVERC 337 I HA+ER Sbjct: 530 ISHAIERA 537 >gi|291516581|emb|CBK70197.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum subsp. longum F8] Length = 688 Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 15/248 (6%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P+ Q + + A + H+Y++DL + + +++L DL++T + E ++ Sbjct: 313 PSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIREYMQ 372 Query: 164 ---IFPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSWWEG 216 I A + N GRDV L+ LS YD + H KKS + G+ E Sbjct: 373 QHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHGTES 432 Query: 217 DLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS--LGKNREM 274 + L + LG+ V I+ F + +G + + Y Y +T G N E+ Sbjct: 433 QGFAYKLMENTLGSEAYVKNILTLFAENPRLGQV-TPPPPYHALYFAHTIPHDWGANYEI 491 Query: 275 ICT-LAGRMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLS-RYFEPKVHKALDGE 329 L R+GI A G+ +W R EAL P+ F P+ DG Sbjct: 492 TKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGEDGT 551 Query: 330 IEHAVERC 337 I HA+ER Sbjct: 552 ISHAIERA 559 >gi|50982351|gb|AAT91804.1| hypothetical protein [Yersinia enterocolitica] Length = 358 Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 59/230 (25%), Positives = 96/230 (41%), Gaps = 32/230 (13%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARI--HIME 174 K I+VH +Y EI N L + D + + + +I + +I R+ IM Sbjct: 128 KKLIIVHAFYQREAEEIFNRLVAFT---DYDIVITSPYNNIICKAKEILGQERVIGFIMP 184 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK--RKGYSWWEGDLWRRWLFYDLLGAPG 232 N+GRD+LPFLI L+ + Y+Y K+H K+S+ +W+ +L D L Sbjct: 185 NYGRDILPFLICLQLIVIEKYEYFVKVHTKRSQHLNDNGAWFNNNL-------DYLVG-- 235 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS----LGKNREMICTLAGRMGITFQD 288 + + G+ + P Y +Y + N + L + + + Sbjct: 236 ---------NKNATDGLFSIMSDDEPQIYGEYILPIQDHIANNIHWLTYLLEKEPASVE- 285 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 F GTMF L I++L+L + K + LDG HA+ER F Sbjct: 286 --ASFIPGTMFIGNRAFLVLIRDLQLHLFQIEKENGQLDGCCVHAIERYF 333 >gi|77404644|ref|YP_345218.1| glycosyltransferase [Rhodobacter sphaeroides 2.4.1] gi|77390294|gb|ABA81477.1| possible glycosyltransferase [Rhodobacter sphaeroides 2.4.1] Length = 793 Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 48/207 (23%), Positives = 88/207 (42%), Gaps = 15/207 (7%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIME 174 + ++ VH +Y D + + + + VT ++ + I++ + + A + + Sbjct: 587 RFSLHVHAFYTDDLAQDVRRHAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRP 646 Query: 175 NHGRDVLPFLILLETEQLSNYDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+LPFL L + D + C +H KKS S GD+WR +L LLG Sbjct: 647 NRGRDILPFLQLFLPGGAAGEDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEAS 703 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 + + +G++ + Y +R ++ +A R+ D L F Sbjct: 704 LSDAATNL-RNPGVGLVA--------PFDPYFIPWDASRALLPRVAPRLPGPLPDNPLLF 754 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEP 320 G MF+VR+ + + +L + Y P Sbjct: 755 PVGNMFFVRSAVVRAMNDLFGAGYPGP 781 >gi|219670466|ref|YP_002460901.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] gi|219540726|gb|ACL22465.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] Length = 606 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 61/252 (24%), Positives = 98/252 (38%), Gaps = 12/252 (4%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEIL 162 P+ V + + K+ + H+YY DL + + ++ D+ +T E I E + Sbjct: 279 PSDYVVKPLKRQPKVVVCFHVYYEDLLDSCFHYMQSIPQFADIVITTPKKELVGIIEEKI 338 Query: 163 KIFP--AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK--RKGYSWWEGDL 218 K + I ++ GR FL+ + + + +YDY C +H KKS R G E L Sbjct: 339 KSYELNNTTIKVINARGRAESAFLVATK-DFILDYDYACIVHDKKSSFLRPGCVGVEFGL 397 Query: 219 WRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTL 278 LL V I+ F+ + IG + + N Y G N + Sbjct: 398 QN---LDALLATSAYVENILSIFEDNPRIGALEPVHLLHANFRDLYGGEWGANYKGTEEF 454 Query: 279 AGRMGITF---QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335 R GI D G MFW R + I ++ P+ LDG + H +E Sbjct: 455 LKRAGIDLLISPDVPPLAPMGAMFWFRPICMKRILDMEWEYEDFPEEPLPLDGSLIHIIE 514 Query: 336 RCFSLSVKKANF 347 R + V+ A + Sbjct: 515 RAYPFIVQDAGY 526 >gi|291520449|emb|CBK75670.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 486 Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust. Identities = 45/186 (24%), Positives = 82/186 (44%), Gaps = 29/186 (15%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI-----FPAARIH 171 K+A+V HLYY++++ + L+ + D+ +T T S K I+++ + Sbjct: 291 KVAVVAHLYYVEMFELCMDYLAKVPYGIDIIIT--TNSDDKKQNIIEVASEKGVKLTEVI 348 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + EN GR++ L+ + L Y Y C +H KKS K + G +R L+ L + Sbjct: 349 VAENRGRELAALLVGC-GKFLLKYKYFCFVHDKKSSAKEH-LSVGLAFRDILWDSSLYSE 406 Query: 232 GVVFKIIRTFDTHRDIGM-------IGSRAYRYPN-------------KYCDYTCSLGKN 271 G + II F+ + +G+ GS Y +P+ K + + ++ Sbjct: 407 GYIRNIIDMFEQNECMGLAVPPTVYCGSYFYPFPDYWVGNYEKTIELSKILNINVDIAED 466 Query: 272 REMICT 277 R + CT Sbjct: 467 RPIPCT 472 >gi|310286583|ref|YP_003937841.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] gi|309250519|gb|ADO52267.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] Length = 662 Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust. Identities = 59/254 (23%), Positives = 98/254 (38%), Gaps = 27/254 (10%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P + + + + A + H+Y++DL + +S L DL++T + + + Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351 Query: 164 I----FPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSWWE 215 P I ++ N GRDV L+ LS YD + H KKS + G+ E Sbjct: 352 SHGVNHPVTFISVV-NRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTE 410 Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR--------AYRYPNKYCDYTCS 267 + L + L + V I+ F +G + A+ P+ D+ + Sbjct: 411 SQGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPH---DWGAN 467 Query: 268 LGKNREMICTLAGRMGI--TFQDQKLDFFA-GTMFWVRTEALDPIKNLRLS-RYFEPKVH 323 +E+ L R I K A G+ +W R EAL P+ F P+ Sbjct: 468 FEITKEL---LEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGE 524 Query: 324 KALDGEIEHAVERC 337 DG + HA+ER Sbjct: 525 MGEDGTVSHAIERA 538 >gi|224284010|ref|ZP_03647332.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum NCIMB 41171] gi|313141164|ref|ZP_07803357.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313133674|gb|EFR51291.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 662 Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust. Identities = 59/254 (23%), Positives = 98/254 (38%), Gaps = 27/254 (10%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P + + + + A + H+Y++DL + +S L DL++T + + + Sbjct: 292 PTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRDYMA 351 Query: 164 I----FPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSWWE 215 P I ++ N GRDV L+ LS YD + H KKS + G+ E Sbjct: 352 SHGVNHPVTFISVV-NRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHHGTE 410 Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR--------AYRYPNKYCDYTCS 267 + L + L + V I+ F +G + A+ P+ D+ + Sbjct: 411 SQGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPH---DWGAN 467 Query: 268 LGKNREMICTLAGRMGI--TFQDQKLDFFA-GTMFWVRTEALDPIKNLRLS-RYFEPKVH 323 +E+ L R I K A G+ +W R EAL P+ F P+ Sbjct: 468 FEITKEL---LEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGE 524 Query: 324 KALDGEIEHAVERC 337 DG + HA+ER Sbjct: 525 MGEDGTVSHAIERA 538 >gi|291520445|emb|CBK75666.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 109 Score = 42.0 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 5/96 (5%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS--IKSEILKIFPAARIHIM- 173 + A+ +L++ DL+ E SNL D++V TE I I K+ + ++ Sbjct: 15 RYAVFAYLFFDDLFEESLRYFSNLPNYVDIYVATNTEEKVDVINGYIPKMLFRHNVKVLL 74 Query: 174 -ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 N GRDV L+LL+ SNYD +C +H K R Sbjct: 75 HNNKGRDVSALLVLLK-RYYSNYDVICFVHDKNHLR 109 >gi|148238469|ref|YP_001223856.1| sulfotransferase [Synechococcus sp. WH 7803] gi|147847008|emb|CAK22559.1| Possible sulfotransferase [Synechococcus sp. WH 7803] Length = 476 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 40/154 (25%), Positives = 69/154 (44%), Gaps = 12/154 (7%) Query: 190 EQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGM 249 ++L +D V H K++ + G+ WR+ L P + +T + G+ Sbjct: 5 DRLKEFDLVVHCHTKRTPHAPDGF--GESWRQSLLQCTFPNPDRC-QEFQTLLHKPEAGL 61 Query: 250 IGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ-DQKLDFFAGTMFWVRTEALDP 308 I +P+++ + + G N L MG T + D L F AG+ FW R ++L Sbjct: 62 I----MPWPHRFVAHNVNWGSNFTQTRALMNLMGHTIRRDTLLAFPAGSFFWARVDSL-- 115 Query: 309 IKNLRLSRYFEPKVHKAL--DGEIEHAVERCFSL 340 + L L+ +E + L DG + H++ERC L Sbjct: 116 LALLDLTLRWEDFAAEPLPGDGRLAHSLERCLGL 149 >gi|73539274|ref|YP_299641.1| sensor histidine kinase [Ralstonia eutropha JMP134] gi|72122611|gb|AAZ64797.1| ATP-binding region, ATPase-like:MASE1:Histidine kinase, dimerisation and phosphoacceptor region [Ralstonia eutropha JMP134] Length = 556 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 31/114 (27%), Positives = 51/114 (44%), Gaps = 11/114 (9%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT---ESASIKSEILKIFPAARIHI 172 A A++ Y + I I ++L+N+S+ LH+ +V +A+ + + AAR HI Sbjct: 50 ALAALLTMGYVVAPGIAIGSVLANVSVGVPLHIAVVIGLGATAAALAATWLLTRAARFHI 109 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCK--------IHGKKSKRKGYSWWEGDL 218 + RDVL FLI+ + V + G + R WW GD+ Sbjct: 110 ALDRIRDVLAFLIVAAIFSTAVSALVGATVLRGGGLLAGSQYGRAMLEWWLGDM 163 >gi|308813905|ref|XP_003084258.1| conserved domain protein (ISS) [Ostreococcus tauri] gi|116056142|emb|CAL58323.1| conserved domain protein (ISS) [Ostreococcus tauri] Length = 684 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 55/276 (19%), Positives = 99/276 (35%), Gaps = 41/276 (14%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI 164 ++G +Q+A ++ + + ++ ++ L +S V E A L+ Sbjct: 72 SVGFFLQLAQSVNDGLLKSCFDNLMKVQSDSVQLQLYLSLTPTVANAPEVAYFTERFLRN 131 Query: 165 FPAARIHIMENHGRDVLPFLILLET--EQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 R+ +++ G D+ FL L +L + Y+ K+H K +W Sbjct: 132 EKNIRVVHVKDEGYDIGAFLKQLHRFRHELQVHQYILKVHSKSDP----------IWLER 181 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAY-----------------RYPNKYCDYT 265 L G+ V I++ F+T + ++ +Y NK T Sbjct: 182 AVESLCGSEHQVKSILKAFETQSTLDIVSPMGSTFSATTSKDAVFPHLKRKYFNKVDLAT 241 Query: 266 CSLGKNREMICTLAGRMGIT----FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPK 321 K + L ++G+ F+ AGTMFW R L L R FE Sbjct: 242 AFDDKTMHTMERLCAQLGLEACPYFEKYLASITAGTMFWARNSRL---YTEHLPRLFESI 298 Query: 322 VHK-----ALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + + IEHA+ER + I D+ Sbjct: 299 RNELSQDYSNNNRIEHALERLIPTLSRLNGRMIGDI 334 >gi|326784138|ref|YP_004324531.1| hypothetical protein Syn1_160 [Prochlorococcus phage Syn1] gi|310004870|gb|ADO99261.1| hypothetical protein Syn1_160 [Prochlorococcus phage Syn1] Length = 398 Score = 37.4 bits (85), Expect = 4.0, Method: Compositional matrix adjust. Identities = 29/109 (26%), Positives = 44/109 (40%), Gaps = 18/109 (16%) Query: 36 GGFFFWFWTLFYKRSKKLCYDEN---------------YVVAYGSRSGKKFFAQSNLYMM 80 G FF ++F K K+CY E + + Y S+ G K Q +L Sbjct: 67 GRFFVGTKSVFNKTEPKICYSEKAIDKLYEGDLAEKLKFALRYFSQLGIKGVIQGDLMFT 126 Query: 81 ERELH---FDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYY 126 + L DG+R+H F + P + + A AKI +V H +Y Sbjct: 127 DSTLKTETVDGERLHTFRPNTITYAIPVDHPIGKAAGAAKIGVVFHTHY 175 >gi|258597730|ref|XP_001348427.2| conserved Plasmodium membrane protein, unknown function [Plasmodium falciparum 3D7] gi|255528816|gb|AAN36866.2| conserved Plasmodium membrane protein, unknown function [Plasmodium falciparum 3D7] Length = 526 Score = 37.4 bits (85), Expect = 4.0, Method: Compositional matrix adjust. Identities = 27/134 (20%), Positives = 66/134 (49%), Gaps = 9/134 (6%) Query: 42 FWTLFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW 101 F+++FY ++ + YG+ F+ S Y++++ + IH F ++ Sbjct: 111 FYSIFYTKTV-------FSKDYGNNIYDIFYVASISYILQKRFNMRKSSIHDFQHCMNT- 162 Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI 161 E+ + +++Q I I ++ Y + ++I + N S+SF++ V+++T + + Sbjct: 163 ETKNIIELIQKGIILNIIPLIFSYVLSFIVDILMTIIN-SVSFNIIVSILTHPFDVILSL 221 Query: 162 LKIFPAARIHIMEN 175 KI+ + I+++ N Sbjct: 222 PKIYNLSNIYVITN 235 >gi|328772459|gb|EGF82497.1| hypothetical protein BATDEDRAFT_23048 [Batrachochytrium dendrobatidis JAM81] Length = 1000 Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust. Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 4/100 (4%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 W++ +I + +V H +Y+D W E NL + S F L VT+ AS+K Sbjct: 823 WQAYTAKNSDEITLSVGDWVVAHHHYLDGWAEGVNLTTRQSGLFPLPVTIPRGGASLKIA 882 Query: 161 ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCK 200 ++ A + H E GRD+L +L + + Y+ + Sbjct: 883 VIN---ATKSH-FEIFGRDILTGALLAYPQHIQITHYLSR 918 Searching..................................................done Results from round 2 >gi|254780923|ref|YP_003065336.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] gi|254040600|gb|ACT57396.1| hypothetical protein CLIBASIA_04110 [Candidatus Liberibacter asiaticus str. psy62] Length = 365 Score = 514 bits (1324), Expect = e-144, Method: Composition-based stats. Identities = 365/365 (100%), Positives = 365/365 (100%) Query: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV 60 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV Sbjct: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYV 60 Query: 61 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI 120 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI Sbjct: 61 VAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAI 120 Query: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV 180 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV Sbjct: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDV 180 Query: 181 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT 240 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT Sbjct: 181 LPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT 240 Query: 241 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW 300 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW Sbjct: 241 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW 300 Query: 301 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK 360 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK Sbjct: 301 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK 360 Query: 361 SLSQN 365 SLSQN Sbjct: 361 SLSQN 365 >gi|315122651|ref|YP_004063140.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496053|gb|ADR52652.1| hypothetical protein CKC_04515 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 405 Score = 445 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 222/366 (60%), Positives = 282/366 (77%), Gaps = 7/366 (1%) Query: 1 MPVSGCSKGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLF-YKRSKKLCYDENY 59 +PVSGC KGY LF S ++WNF S+H I KV+ + GFFFW +LF +KR + L YDEN Sbjct: 38 LPVSGCRKGYLLFVSRRENWNFDSNHLVIRKVSFFLGFFFWIRSLFLFKRYQTLRYDENR 97 Query: 60 VVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIA 119 ++AYGSR GKKFFA SN M+ R + FDG++IH FP+LLHGW+SP+ K+ + I++++A Sbjct: 98 IIAYGSRIGKKFFACSNKDMLARGVPFDGEKIHRFPRLLHGWDSPSSEKIASVKIQSRVA 157 Query: 120 IVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRD 179 IVVH+YY DLW EIANLLS L+ SFDLH+TLVTE ASIKSEILK FP A I++MEN+GRD Sbjct: 158 IVVHIYYADLWAEIANLLSGLNFSFDLHITLVTEIASIKSEILKRFPNAHIYVMENYGRD 217 Query: 180 VLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIR 239 + FL LLE +L +YDYVCKIHGKKSKR G+ WW+GDLWRRWLF+DLLGAPG+ +II+ Sbjct: 218 IRSFLKLLEGGKLDSYDYVCKIHGKKSKRNGHVWWDGDLWRRWLFFDLLGAPGIALEIIK 277 Query: 240 TFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMF 299 TF+ + IGMIGSR YRY D SLG NRE +C +A +MG++F+D K+DFF GTMF Sbjct: 278 TFEKYPKIGMIGSRTYRY-----DQKISLGNNREFVCAIANKMGVSFEDTKIDFFGGTMF 332 Query: 300 WVRTEALDPIKNLRLSRYFEPKVHK-ALDGEIEHAVERCFSLSVKKANFRISDVDCILGY 358 WVR +ALDPIKNL L++YF+ KV LDG +EHA+ERCFS+SV+KANF ++ VDC+ Sbjct: 333 WVRPQALDPIKNLALTQYFKSKVDMVGLDGCLEHAIERCFSISVEKANFDLAYVDCLSEE 392 Query: 359 RKSLSQ 364 + S Sbjct: 393 SDNKSS 398 >gi|315122628|ref|YP_004063117.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496030|gb|ADR52629.1| hypothetical protein CKC_04400 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 399 Score = 393 bits (1011), Expect = e-107, Method: Composition-based stats. Identities = 145/354 (40%), Positives = 212/354 (59%), Gaps = 4/354 (1%) Query: 9 GYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWT-LFYKRSKKLCYDENYVVAYGSRS 67 GY++ S K S+ E+V + FW + L + + +L + + YGSR Sbjct: 45 GYYMLWSLSKEQKITSEDVFFEEVTTFKACLFWLRSFLTFSKYSQLSFPSCRIFFYGSRK 104 Query: 68 GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWES-PAMGKVMQIAIKAKIAIVVHLYY 126 KK F + N +M + FDG++ + +L GW++ ++ +I I +KIAIVVH YY Sbjct: 105 DKKAFFRLNRFMSNSRMPFDGKKFLYIKELFEGWKNLSSLDNKGKIKINSKIAIVVHCYY 164 Query: 127 IDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 D W EI++LL L+ FDL +T V ++ + ++LK FP+AR+++MEN GRDVLPFL L Sbjct: 165 QDTWDEISHLLLRLNFDFDLFITTVKKNKDFEQDVLKNFPSARLYVMENKGRDVLPFLCL 224 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 LE +YDY+CKIHGKKS R+ Y +EG LWRRW+F+DLLG + +II F+ + Sbjct: 225 LELGVFYDYDYLCKIHGKKSARRNYHPFEGILWRRWIFFDLLGFSDIALRIINKFEQNPS 284 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 IGMIGS +R KY + K + + LA R+ ++ LDFF GTMFWVR + L Sbjct: 285 IGMIGSGRFRRYKKY-SFFKKRSKVYKRVVDLARRIDFPVEELDLDFFNGTMFWVRPKCL 343 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRK 360 +P++N+ L+ FE + DG +EHAVER F LSV++A F + VDC+ Y + Sbjct: 344 EPLRNIHLTGEFE-EECNLEDGALEHAVERFFPLSVQRAGFSLESVDCVAEYDQ 396 >gi|254780201|ref|YP_003064614.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62] gi|254039878|gb|ACT56674.1| hypothetical protein CLIBASIA_00430 [Candidatus Liberibacter asiaticus str. psy62] Length = 394 Score = 380 bits (977), Expect = e-103, Method: Composition-based stats. Identities = 144/356 (40%), Positives = 211/356 (59%), Gaps = 4/356 (1%) Query: 9 GYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWT-LFYKRSKKLCYDENYVVAYGSRS 67 GY++ S S + ++++++ F FW + L + + KL + + YGSR Sbjct: 40 GYYVLWSFSPKQRITSKDVHFQELSIFESFIFWLRSFLAFSKYSKLSFPSCRIFFYGSRK 99 Query: 68 GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW-ESPAMGKVMQIAIKAKIAIVVHLYY 126 +K F + N +M + FD ++ + +L GW + P+ K + IK+KIAIVVH YY Sbjct: 100 EQKAFLRLNRFMSNSRMPFDSEKFLYVKELFEGWNDRPSSPKKSGLTIKSKIAIVVHCYY 159 Query: 127 IDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 D WIEI+++L L+ FDL VT+V + + ++LK FP+A++++MEN GRDV PFL L Sbjct: 160 QDTWIEISHILLRLNFDFDLFVTVVEANKDFEQDVLKYFPSAQLYVMENKGRDVRPFLYL 219 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 LE YDY+CKIHGKKS+R+GY EG +WRRWLF+DLLG + +II TF+ + Sbjct: 220 LELGVFDRYDYLCKIHGKKSQREGYHPIEGIIWRRWLFFDLLGFSDIAIRIINTFEQNPC 279 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 +GMIGSR YR ++ + + + LA R G + LDFF GTMFWV+ + L Sbjct: 280 LGMIGSRRYRRYKRWSFFAKR-SEVYRRVIDLAKRAGFPTKRLHLDFFNGTMFWVKPKCL 338 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKSL 362 +P++NL L FE + DG +EHAVER F+ SV+ F I VDC+ Y + L Sbjct: 339 EPLRNLHLIGEFE-EERNLKDGALEHAVERFFACSVRYTEFSIESVDCVAEYERLL 393 >gi|13474020|ref|NP_105588.1| hypothetical protein mll4799 [Mesorhizobium loti MAFF303099] gi|14024772|dbj|BAB51374.1| mll4799 [Mesorhizobium loti MAFF303099] Length = 386 Score = 297 bits (761), Expect = 2e-78, Method: Composition-based stats. Identities = 109/308 (35%), Positives = 156/308 (50%), Gaps = 18/308 (5%) Query: 56 DENYVVA--YGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIA 113 E +A GS +G F S+ HF+G + P+ L Sbjct: 91 PEGRELARALGSLAGWGFGTASDNLQKTLSRHFNGPQAEA-PERLP-------------T 136 Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIM 173 ++ KIA+ +HL+Y DLW E LL F L +TL A++ + FP A I + Sbjct: 137 VEPKIAVALHLHYPDLWPEFEALLEATGRQFQLFLTLTRPDAALAQRVQARFPGAEITVY 196 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 EN GRDV PF+ LL + +D +CK+HGKKS + G G++WR+ +DL+G+ GV Sbjct: 197 ENRGRDVGPFIQLLREGKFDPFDLICKLHGKKSGQSGPRMVLGEIWRQVSAFDLIGSRGV 256 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 V +II F+ D MIGSR +R PN++ + G+NR M L MG+ +LDF Sbjct: 257 VDRIIANFERSPDTQMIGSRRFRLPNEWKGEKSAWGENRAMALNLLETMGMP-SSSRLDF 315 Query: 294 FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 FAGTMFWVR AL+P++ L L P+ DG ++HA+ER + K SD D Sbjct: 316 FAGTMFWVRRGALEPLRRLDLPLAAFPEETGQQDGTLQHALERVLGMICTKIGV-ASDGD 374 Query: 354 CILGYRKS 361 R++ Sbjct: 375 MTTDSREA 382 >gi|320531350|ref|ZP_08032322.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] gi|320136441|gb|EFW28417.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] Length = 626 Score = 291 bits (745), Expect = 1e-76, Method: Composition-based stats. Identities = 63/267 (23%), Positives = 119/267 (44%), Gaps = 11/267 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + V++ + K+A++ HLY++DL ++ DL Sbjct: 279 RSMNMADLVKNLQLTYVLPTQAVVREPKQQKVALIAHLYFMDLLDSTLAYARSMPEGTDL 338 Query: 147 HVTLVT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ + E A + + P + ++EN GRDV L+ + + + +YD VC +H K Sbjct: 339 ILTVGSQEKAELVERACQDLPYNVDVRVIENRGRDVSALLVGCK-DIVDDYDLVCFMHDK 397 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + + G+ + R F +LL V ++ TFD+ +G++ + + + Y Sbjct: 398 KVTQLS-PYTVGEGFARKCFDNLLPTREFVENVVATFDSEPRLGLLSPTPPNHADYFPIY 456 Query: 265 TCSLGKNREMICTLAGR---MGITFQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEP 320 + S G N + L + + + K + GTMFW R AL P+ + P Sbjct: 457 SYSWGPNFDRTKMLLEKELNLNVPLDAHKEVIAPLGTMFWFRPAALKPLFDHDWQWEDFP 516 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANF 347 +DG I HA+ER + + + + Sbjct: 517 PEPNDIDGTILHAIERAYGYVAQASGY 543 >gi|329944276|ref|ZP_08292535.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] gi|328531006|gb|EGF57862.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] Length = 636 Score = 289 bits (740), Expect = 5e-76, Method: Composition-based stats. Identities = 66/266 (24%), Positives = 113/266 (42%), Gaps = 10/266 (3%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + V++ K KIA++ HLYY+DL + N+ D+ Sbjct: 279 RSMNMADLVKNLQLTYVLPTQAVVREPKKQKIALIAHLYYMDLVEPTLKYIRNMPEGIDI 338 Query: 147 HVTLVT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 ++ + E K P + ++EN GRDV PFL+ + + + +YD VC H K Sbjct: 339 FLSTSSPEKVEQVEAACKGLPYNIEVRLVENRGRDVGPFLVAWK-DVVHDYDVVCYTHDK 397 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + Y + GD + F +LL V +I TFD +G + + + + + Sbjct: 398 KVTQL-YPYSVGDGFAYKCFENLLPTRDFVKNVIATFDAEPRLGFLAPTPPNHADYFPVF 456 Query: 265 TCSLGKNREMICTLAGRMGITFQDQKLD---FFAGTMFWVRTEALDPIKNLRLSRYFEPK 321 T G N + L +G+ G+MFW R +AL P+ + P Sbjct: 457 TYGWGPNFDRTKALLRELGLDVPLDPTKEPIAPLGSMFWFRPQALKPLFDHDWQWEEFPP 516 Query: 322 VHKALDGEIEHAVERCFSLSVKKANF 347 +DG + HA+ER + + + Sbjct: 517 EPCPIDGTLMHAIERSHGYVAQGSGY 542 >gi|326772082|ref|ZP_08231367.1| rhamnan synthesis protein F [Actinomyces viscosus C505] gi|326638215|gb|EGE39116.1| rhamnan synthesis protein F [Actinomyces viscosus C505] Length = 652 Score = 288 bits (738), Expect = 7e-76, Method: Composition-based stats. Identities = 66/267 (24%), Positives = 116/267 (43%), Gaps = 11/267 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + V + K+A++ HLYY+DL ++ D Sbjct: 305 RSMNMADLVKNLQLTYVLPTQAVAREPKPQKVALIAHLYYMDLLEPTLAYARSMPEGTDF 364 Query: 147 HVTLVT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ + E + E K P + ++EN GRDV L+ + + +S+YD VC IH K Sbjct: 365 ILTVGSQEKVELVEEACKDLPYNVTVRLIENRGRDVSALLVGCK-DIVSDYDLVCFIHDK 423 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + + G+ + R F +LL V +I TFD+ +G++ + + + Y Sbjct: 424 KVTQLS-PYTVGEGFARKCFDNLLPTREFVENVISTFDSEPRLGLLSPTPPNHADYFPIY 482 Query: 265 TCSLGKNREMICTLAGR-MGITFQDQKLD---FFAGTMFWVRTEALDPIKNLRLSRYFEP 320 + S G N + L + + ++ GTMFW R AL P+ + P Sbjct: 483 SYSWGPNFDRTKMLLEKELNLSVPLDAHKEVIAPLGTMFWFRPAALKPLFDHDWQWEDFP 542 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANF 347 +DG I HA+ER + + + + Sbjct: 543 PEPNDIDGTILHAIERAYGYVAQASGY 569 >gi|221634566|ref|YP_002523254.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] gi|221163439|gb|ACM04401.1| Lipopolysaccharide biosynthesis protein-like protein [Rhodobacter sphaeroides KD131] Length = 755 Score = 287 bits (735), Expect = 2e-75, Method: Composition-based stats. Identities = 93/325 (28%), Positives = 138/325 (42%), Gaps = 32/325 (9%) Query: 53 LCYDENYVVAYGSRSGKK----FFAQSNLYMME--------------RELHFDGQRIHHF 94 L Y E + +AYG R G + F Q+ L R H + + Sbjct: 54 LKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKEL 113 Query: 95 PQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-- 151 P+++ + + A A+ VH+YY DLW E A L L I FDL+VTL Sbjct: 114 PEVVALPARGMPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYR 173 Query: 152 -TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E+ ++ EI FP A + M N GRD+LPF+ LL Y VCK H KKS + Sbjct: 174 GEETDALAQEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ 233 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 +GDLWR+ L +L G+ K + F + G + Y T G Sbjct: 234 ----DGDLWRKHLIEGILPETGLEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGS 283 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 N E L R+ I + L F AG+++WV+ L +++L+L +DG + Sbjct: 284 NVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTL 343 Query: 331 EHAVERCFSLSVKKANFRISDVDCI 355 HA+ER +A ++ + Sbjct: 344 AHAIERVLGYLTARAGQKVLQTSEL 368 >gi|126464825|ref|YP_001041801.1| lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] gi|126106640|gb|ABN79165.1| Lipopolysaccharide biosynthesis protein-like [Rhodobacter sphaeroides ATCC 17029] Length = 751 Score = 287 bits (735), Expect = 2e-75, Method: Composition-based stats. Identities = 93/325 (28%), Positives = 138/325 (42%), Gaps = 32/325 (9%) Query: 53 LCYDENYVVAYGSRSGKK----FFAQSNLYMME--------------RELHFDGQRIHHF 94 L Y E + +AYG R G + F Q+ L R H + + Sbjct: 50 LKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKEL 109 Query: 95 PQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-- 151 P+++ + + A A+ VH+YY DLW E A L L I FDL+VTL Sbjct: 110 PEVVALPARGMPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYR 169 Query: 152 -TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E+ ++ EI FP A + M N GRD+LPF+ LL Y VCK H KKS + Sbjct: 170 GEETDALAEEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ 229 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 +GDLWR+ L +L G+ K + F + G + Y T G Sbjct: 230 ----DGDLWRKHLIEGILPETGLEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGS 279 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 N E L R+ I + L F AG+++WV+ L +++L+L +DG + Sbjct: 280 NVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDIEEGQVDGTL 339 Query: 331 EHAVERCFSLSVKKANFRISDVDCI 355 HA+ER +A ++ + Sbjct: 340 AHAIERVLGYLTARAGQKVLQTSEL 364 >gi|125654691|ref|YP_001033885.1| hypothetical protein RSP_3918 [Rhodobacter sphaeroides 2.4.1] gi|77386351|gb|ABA81780.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 751 Score = 287 bits (735), Expect = 2e-75, Method: Composition-based stats. Identities = 93/325 (28%), Positives = 138/325 (42%), Gaps = 32/325 (9%) Query: 53 LCYDENYVVAYGSRSGKK----FFAQSNLYMME--------------RELHFDGQRIHHF 94 L Y E + +AYG R G + F Q+ L R H + + Sbjct: 50 LKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKEL 109 Query: 95 PQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-- 151 P+++ + + A A+ VH+YY DLW E A L L I FDL+VTL Sbjct: 110 PEVVALPARGMPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYR 169 Query: 152 -TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E+ ++ EI FP A + M N GRD+LPF+ LL Y VCK H KKS + Sbjct: 170 GEETDALAEEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ 229 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 +GDLWR+ L +L G+ K + F + G + Y T G Sbjct: 230 ----DGDLWRKHLIEGILPETGLEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGS 279 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 N E L R+ I + L F AG+++WV+ L +++L+L +DG + Sbjct: 280 NVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTL 339 Query: 331 EHAVERCFSLSVKKANFRISDVDCI 355 HA+ER +A ++ + Sbjct: 340 AHAIERVLGYLTARAGQKVLQTSEL 364 >gi|332561391|ref|ZP_08415706.1| hypothetical protein RSWS8N_20164 [Rhodobacter sphaeroides WS8N] gi|332274190|gb|EGJ19507.1| hypothetical protein RSWS8N_20164 [Rhodobacter sphaeroides WS8N] Length = 751 Score = 287 bits (735), Expect = 2e-75, Method: Composition-based stats. Identities = 93/325 (28%), Positives = 138/325 (42%), Gaps = 32/325 (9%) Query: 53 LCYDENYVVAYGSRSGKK----FFAQSNLYMME--------------RELHFDGQRIHHF 94 L Y E + +AYG R G + F Q+ L R H + + Sbjct: 50 LKYPEKHYIAYGERLGYRPNPDFSPQAYLRYHPDVAEAGVPPFLHYVRVGHAEQRLTKEL 109 Query: 95 PQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-- 151 P+++ + + A A+ VH+YY DLW E A L L I FDL+VTL Sbjct: 110 PEVVALPARGMPQVRFEHGRQTAPYAVAVHVYYPDLWPEFAARLRRLRIPFDLYVTLTYR 169 Query: 152 -TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E+ ++ EI FP A + M N GRD+LPF+ LL Y VCK H KKS + Sbjct: 170 GEETDALAEEIRADFPGAFVTPMPNRGRDILPFVTLLNAGAFDGYRAVCKFHTKKSPHRQ 229 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 +GDLWR+ L +L G+ K + F + G + Y T G Sbjct: 230 ----DGDLWRKHLIEGILPETGLEEK-LEAFVEAPEAGFWVADGQHYTG-----TQWWGS 279 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 N E L R+ I + L F AG+++WV+ L +++L+L +DG + Sbjct: 280 NVEATRHLLQRIEIPLDREALSFPAGSIYWVKPLVLGLLRSLQLRLEDFDLEEGQVDGTL 339 Query: 331 EHAVERCFSLSVKKANFRISDVDCI 355 HA+ER +A ++ + Sbjct: 340 AHAIERVLGYLTARAGQKVLQTSEL 364 >gi|325067622|ref|ZP_08126295.1| hypothetical protein AoriK_07369 [Actinomyces oris K20] Length = 626 Score = 286 bits (732), Expect = 4e-75, Method: Composition-based stats. Identities = 66/267 (24%), Positives = 114/267 (42%), Gaps = 11/267 (4%) Query: 90 RIHHFPQLLHGWESPAMGKVMQI---AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + KIA++ HLYY+DL + ++ DL Sbjct: 279 RSMNMADLVKNLHLTYVLPTQAVVHEPKPQKIALIAHLYYMDLLEPTLAYVKSMPEGTDL 338 Query: 147 HVTLVT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ + E A + E K P + ++EN GRDV L+ + + + +YD VC H K Sbjct: 339 ILTVGSQEKAELVEEACKDLPYNVTVRLIENRGRDVSALLVGCK-DIIHDYDLVCFTHDK 397 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + + GD + F +LL V +I TFD +G++ + + + + Sbjct: 398 KVTQVK-PYSVGDGFAIKCFENLLATRDFVKNVIATFDAEPRLGLLAPTPPNHGDYFPVF 456 Query: 265 TCSLGKNREMICTLAGR---MGITFQDQKLD-FFAGTMFWVRTEALDPIKNLRLSRYFEP 320 + G N E TL + + + + + GTMFW R AL P+ + P Sbjct: 457 SMGWGPNFERTKTLLEKELNLSVPIDESRAPIAPLGTMFWFRPAALKPLFDHDWQWEDFP 516 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANF 347 +DG I HA+ER + + + + Sbjct: 517 PEPNNIDGTILHAIERAYGYVAQASGY 543 >gi|227546966|ref|ZP_03977015.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212567|gb|EEI80455.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 631 Score = 280 bits (716), Expect = 2e-73, Method: Composition-based stats. Identities = 61/272 (22%), Positives = 111/272 (40%), Gaps = 11/272 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + + KIA+ +H+YY+DL + + ++ D+ Sbjct: 280 RTMNVADLMKNLHLDYVLPQSLSVPLPEGKKIALAIHVYYMDLLESTFHYIQSMPEGCDI 339 Query: 147 HVTLVTE-SASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ +E +A E K FP + ++EN GRDV L+ + YDYVC H K Sbjct: 340 IITVGSEANAETVREYCKQFPYNFDVRVIENRGRDVSALLVGCGEDLF-QYDYVCFAHDK 398 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + GD + F ++L + V +I F+ + +G+ + + + Y Sbjct: 399 KVTQLSPQSI-GDGFAYKCFENILASKEYVSNVIDLFERNPRLGIAMPTPPNHASYFPGY 457 Query: 265 TCSLGKNREMICTLAGR---MGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEP 320 T G N + M + D++ GTMFW R EA + + P Sbjct: 458 TFPWGPNFPGTKDFLEQTLNMHVPLNADKEPVAPMGTMFWFRPEAFRGLLDHGWEYTDFP 517 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 +DG + H +ER + + + + + Sbjct: 518 PEPNKVDGTLLHFIERAYGYVPQANGYYPAYI 549 >gi|331086190|ref|ZP_08335272.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406349|gb|EGG85863.1| hypothetical protein HMPREF0987_01575 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 592 Score = 279 bits (715), Expect = 4e-73, Method: Composition-based stats. Identities = 60/246 (24%), Positives = 110/246 (44%), Gaps = 8/246 (3%) Query: 109 VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASI-KSEILKIFP- 166 + + + KIA+V+HLY+ DL E + +S + D+++T TE ++ P Sbjct: 249 LQKQTTENKIALVMHLYFEDLLEESYHYVSAMPEKADIYLTTDTEKKKAAIEKVFAKLPC 308 Query: 167 -AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 + +++N GRDV L+ ++ + + +YD VC H KK+ + G + F Sbjct: 309 NKLEVRVIKNRGRDVSSLLVGVK-DVIMDYDLVCFAHDKKTAQVKPGTI-GASFAYKCFE 366 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 + L V +I TF + +G++ + + G N + LA ++G+T Sbjct: 367 NTLSNKAYVGNVINTFVNNPRMGLLCPPEPNHSTFFTTIGFEWGPNFNITRDLAKKLGLT 426 Query: 286 FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV 342 GTMFW R +A+ P+ N P +DG + HA+ER + V Sbjct: 427 VPISVASPPVAPLGTMFWFRPKAMKPLYNKDWKYEDFPAEPNKIDGTLLHAIERIYPFIV 486 Query: 343 KKANFR 348 +++ + Sbjct: 487 QESGYY 492 >gi|311063512|ref|YP_003970237.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] gi|310865831|gb|ADP35200.1| lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum PRL2010] Length = 631 Score = 279 bits (715), Expect = 4e-73, Method: Composition-based stats. Identities = 60/272 (22%), Positives = 111/272 (40%), Gaps = 11/272 (4%) Query: 90 RIHHFPQLLHGWESP---AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + + KIA+ +H+YY+DL + ++ D+ Sbjct: 280 RTMNVADLMKNLHLDYVLSQSLSVPLPEGKKIALAIHVYYMDLLESTFRYIQSMPEGCDI 339 Query: 147 HVTLVTE-SASIKSEILKIFPAA-RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ +E +A I E K FP + ++EN GRDV L+ + YDYVC H K Sbjct: 340 IITVGSEANAEIVREYCKQFPYRFDVRVIENRGRDVSSLLVGCGEDLF-QYDYVCFAHDK 398 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + GD + + ++L + V +I F+ + +G+ + + + Y Sbjct: 399 KVTQLSPQSI-GDGFAYKCYENILASKEYVSNVIDLFEKNPRLGIAMPTPPNHASYFPGY 457 Query: 265 TCSLGKNREMICTLAGR---MGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEP 320 T G N + M + +++ GTMFW R EA + + P Sbjct: 458 TFPWGPNFPGTKDFLEQTLNMHVPLNANKEPVAPMGTMFWFRPEAFRGLLDHGWKYEDFP 517 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 +DG + H +ER + + + + + Sbjct: 518 PEPNKVDGTLLHFIERAYGYVPQANGYYPAYI 549 >gi|289662624|ref|ZP_06484205.1| hypothetical protein XcampvN_05932 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 945 Score = 277 bits (710), Expect = 1e-72, Method: Composition-based stats. Identities = 75/258 (29%), Positives = 127/258 (49%), Gaps = 12/258 (4%) Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE- 160 ++P V ++ K+ ++VH++Y DL E A L + + +DL V+++ +A ++ Sbjct: 69 QAPFGRPVPSAQLQLKVGVMVHVFYPDLIDEFAQSLQQMPVGYDLLVSVMDNAAEAQARD 128 Query: 161 ---ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 L+ I I+ N GRD+ P L+ EQ+ D V +H KKS Y+ E Sbjct: 129 RFSKLQQIEKLDIRIVPNRGRDIAPLLVTFR-EQILALDVVGHLHTKKSL---YTGSEQG 184 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT 277 WRR+L L+G+ + + F +GM+ +Y + + + N E+ T Sbjct: 185 QWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY---ERVPLWAHTWLSNFEVCRT 241 Query: 278 LAGRMGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 LA R+G + +DF AG+MFW + +AL P+ L L P+ H +DG + HA+ER Sbjct: 242 LAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLELKDFPEEHGQIDGTLHHAMER 301 Query: 337 CFSLSVKKANFRISDVDC 354 F V+ ++RI + Sbjct: 302 MFVAVVRHQHYRIGILPQ 319 >gi|312133751|ref|YP_004001090.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773029|gb|ADQ02517.1| Hypothetical protein BBMN68_1492 [Bifidobacterium longum subsp. longum BBMN68] Length = 641 Score = 277 bits (709), Expect = 2e-72, Method: Composition-based stats. Identities = 64/253 (25%), Positives = 105/253 (41%), Gaps = 7/253 (2%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS-IKSEILK 163 + I K ++A+V+HLYY+D+ +I ++ D+ +T+ +E + I E + Sbjct: 298 SQDNAQPIPQKFRVALVLHLYYMDILDQILRYARSMPEGCDVIITVGSEEKACIVKERCE 357 Query: 164 IFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 P + ++EN GRDV L+ + L NYD VC H KK ++ GD + + Sbjct: 358 GMPYNIDVRVIENRGRDVSALLVGAGKDVL-NYDLVCFAHDKKVRQL-RPETIGDGFAKK 415 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM 282 F + L + V II F + +G+ A + + + Y S G N L + Sbjct: 416 CFENTLASKAYVANIINLFADNPRLGVAMPSAPNHADYFYSYAFSWGPNYRGTKDLLDGL 475 Query: 283 GITFQDQKLD---FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 GI GTMFW R +AL + + P DG H VER + Sbjct: 476 GIKVPLSPHADVIAPLGTMFWFRPKALHGLIDKSWEYSDFPPEPNPADGSFLHFVERAYC 535 Query: 340 LSVKKANFRISDV 352 + + + Sbjct: 536 YVAQSNGYYSEYI 548 >gi|218455303|gb|AAX19606.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 277 bits (709), Expect = 2e-72, Method: Composition-based stats. Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 19/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYID 128 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++ID Sbjct: 270 RAMIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHID 329 Query: 129 LWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 L I + N+ + +D+ V++ +++ + A + I N GRDV PF+ L Sbjct: 330 LIDAICAYMRNVIVDYDVFVSVKSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS + G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYRDG----GGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + D + H +ER F L V++A F + Sbjct: 501 SALRARSIGLSEFDPEAGQRDATLAHVIERLFVLWVEQAGFFAATT 546 >gi|289668432|ref|ZP_06489507.1| hypothetical protein XcampmN_08015 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 945 Score = 277 bits (709), Expect = 2e-72, Method: Composition-based stats. Identities = 75/258 (29%), Positives = 127/258 (49%), Gaps = 12/258 (4%) Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE- 160 ++P V ++ K+ ++VH++Y DL E A L + + +DL V+++ +A ++ Sbjct: 69 QAPFGRPVPSAQLQVKVGVMVHVFYPDLIDEFAQSLQQMPVGYDLLVSVMDNAAEAQARD 128 Query: 161 ---ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 L+ I I+ N GRD+ P L+ EQ+ D V +H KKS Y+ E Sbjct: 129 RFSKLQQIEKLDIRIVPNRGRDIAPLLVTFR-EQILALDVVGHLHTKKSL---YTGSEQG 184 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT 277 WRR+L L+G+ + + F +GM+ +Y + + + N E+ T Sbjct: 185 QWRRYLVSSLMGSAERIAWQLGMFQAEPRLGMLYPESY---ERVPLWAHTWLSNFEVCRT 241 Query: 278 LAGRMGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 LA R+G + +DF AG+MFW + +AL P+ L L P+ H +DG + HA+ER Sbjct: 242 LAQRLGFDINASEYIDFPAGSMFWAKVDALRPLYALNLELKDFPEEHGQIDGTLHHAMER 301 Query: 337 CFSLSVKKANFRISDVDC 354 F V+ ++RI + Sbjct: 302 MFVAVVRHQHYRIGILPQ 319 >gi|218455307|gb|AAX19610.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455309|gb|AAX19612.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 276 bits (707), Expect = 4e-72, Method: Composition-based stats. Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 19/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYID 128 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++ID Sbjct: 270 RAMIEREVARMRQTRKNIMPIVTGGDSPASDELERYGVGAIDAESLSSRFAIVLHLFHID 329 Query: 129 LWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 L I + N+ + +D+ V++ +++ + A + I N GRDV PF+ L Sbjct: 330 LIDAICAYMRNVIVDYDVFVSVKSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + D + H +ER F L V++A F + Sbjct: 501 SALRARSIGLSEFDPEAGQRDATLAHVIERLFVLWVEQAGFFAATT 546 >gi|166713474|ref|ZP_02244681.1| hypothetical protein Xoryp_19045 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 568 Score = 276 bits (707), Expect = 4e-72, Method: Composition-based stats. Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 19/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYID 128 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++ID Sbjct: 270 RAMIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHID 329 Query: 129 LWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 L I + N+ + +D+ V++ +++ + A + I N GRDV PF+ L Sbjct: 330 LIDAICAYMRNVIVDYDVFVSVKSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + D + H +ER F L V++A F + Sbjct: 501 SALRARSIGLSEFDPEAGQRDATLAHVIERLFVLWVEQAGFFAATT 546 >gi|218455296|gb|AAV67426.2| glycosyltransferase [Xanthomonas oryzae pv. oryzicola] gi|218455299|gb|AAX19602.2| WxocB [Xanthomonas oryzae pv. oryzicola] gi|218455301|gb|AAX19604.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 276 bits (706), Expect = 4e-72, Method: Composition-based stats. Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 19/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYID 128 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++ID Sbjct: 270 RAMIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHID 329 Query: 129 LWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 L I + N+ + +D+ V++ +++ + A + I N GRDV PF+ L Sbjct: 330 LIDAICAYMRNVIVDYDVFVSVKSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCKIH KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDDLMKALLGSSFDVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + D + H +ER F L V++A F + Sbjct: 501 SALRARSIGLSEFDPEAGQRDATLAHVIERLFVLWVEQAGFFAATT 546 >gi|225352528|ref|ZP_03743551.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156722|gb|EEG70116.1| hypothetical protein BIFPSEUDO_04151 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 648 Score = 275 bits (705), Expect = 5e-72, Method: Composition-based stats. Identities = 62/273 (22%), Positives = 112/273 (41%), Gaps = 13/273 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + + I +IA+++HLYY+DL + ++ D Sbjct: 280 RTMNLADLVRNLHLDFVLPSNTIAPIPTNKRIALIMHLYYMDLLDKTLEYAKSMPEGCDF 339 Query: 147 HVTL-VTESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 T+ E+A+I E K P + +++N GRDV L+ + L YDYVC H K Sbjct: 340 IFTVGSEENATIVRERCKDLPYNVDVRVIQNRGRDVSALLVGAGKDCL-QYDYVCFAHDK 398 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + GD + F ++LG+ +V II F+ G++ + + + ++ Sbjct: 399 KVTQLSPYSI-GDGFSYKCFENVLGSKALVSNIINHFENDPHAGVLAPAPPNHADYFGNF 457 Query: 265 TCSLGKNREMICTLAG-----RMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFE 319 G N E + ++ + + + GTMFW R +AL ++ Sbjct: 458 ASLWGPNYEGTKKMLEETLQVKVPLDKSKEPIA-PMGTMFWFRPKALQQFFDIDWKYEDF 516 Query: 320 PKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 P +DG + H VER + + + + Sbjct: 517 PPEPNKIDGSMLHFVERAYGYVPQANGYYTGYI 549 >gi|218455305|gb|AAX19608.2| WxocB [Xanthomonas oryzae pv. oryzicola] Length = 568 Score = 275 bits (703), Expect = 9e-72, Method: Composition-based stats. Identities = 87/286 (30%), Positives = 134/286 (46%), Gaps = 19/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPA--------MGKVMQIAIKAKIAIVVHLYYID 128 M+ERE+ Q + ++ G +SPA +G + ++ ++ AIV+HL++ID Sbjct: 270 RAMIEREVARMRQTRKNIMPIVTGDDSPASDELERYGVGAIDAESLSSRFAIVLHLFHID 329 Query: 129 LWIEIANLLSNLSISFDLHVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 L I + N+ + +D+ V++ +++ + A + I N GRDV PF+ L Sbjct: 330 LIDAICAYMRNVIVDYDVFVSVKSISDRRMAVRYFQEHKIRASVFIHPNIGRDVGPFISL 389 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L T L YD VCK+H KKS G WR L LLG+ V +++R FD H Sbjct: 390 LNTGLLDRYDAVCKVHSKKSVYHDG----GGQWRDDLMKALLGSSFNVLRVLRAFDDHPA 445 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G++G + Y G N E + LA GI + +L FFAGTMFW R AL Sbjct: 446 CGIVGPE-----SAYLSNARFWGGNEERLRVLAAETGIEEKRIRLGFFAGTMFWFRPAAL 500 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ + D + H +ER F L V++A F + Sbjct: 501 SALRARSIGLSEFDPEAGQRDATLAHVIERLFVLWVEQAGFFAATT 546 >gi|308235695|ref|ZP_07666432.1| hypothetical protein GvagA14_05663 [Gardnerella vaginalis ATCC 14018] gi|311114292|ref|YP_003985513.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] gi|310945786|gb|ADP38490.1| rhamnan synthesis protein F [Gardnerella vaginalis ATCC 14019] Length = 637 Score = 274 bits (700), Expect = 2e-71, Method: Composition-based stats. Identities = 64/268 (23%), Positives = 123/268 (45%), Gaps = 9/268 (3%) Query: 91 IHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL 150 + + H E+ + + K K+A+ +HLYY+DL + + + ++ D+ +T+ Sbjct: 290 LRYILPQNHVVENSSTA-TSESTAKPKVALCMHLYYMDLLDKSLHYIQSMPQGCDVILTV 348 Query: 151 VT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 + E+ I + ++ P + ++EN GRDV FL+ + L YDYVC H KK + Sbjct: 349 GSKENQQIVKQRVEHLPYDVDVRLIENRGRDVSAFLVGGGAD-LMKYDYVCFAHDKKVTQ 407 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 GD + F ++L + V +I F+TH +GM + + + +T + Sbjct: 408 LSPRSI-GDGFAYKCFENILASKEYVQNVINLFETHPRLGMAMPTPPNHADYFPGFTYTW 466 Query: 269 GKNREMICTLAGR-MGITFQ---DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHK 324 G N E + +GI+ ++ GTMFW RT+A+ + + + + P Sbjct: 467 GPNFEGTKKFLEKTLGISVPLDENKDAIAPLGTMFWFRTKAMRGLLDRKWTYEDFPAEPL 526 Query: 325 ALDGEIEHAVERCFSLSVKKANFRISDV 352 +DG + H +ER + + + + + Sbjct: 527 KIDGTLLHFIERAYGYVPQYNGYYDAYI 554 >gi|119026520|ref|YP_910365.1| hypothetical protein BAD_1502 [Bifidobacterium adolescentis ATCC 15703] gi|118766104|dbj|BAF40283.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703] Length = 647 Score = 273 bits (698), Expect = 3e-71, Method: Composition-based stats. Identities = 63/272 (23%), Positives = 111/272 (40%), Gaps = 11/272 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + L+ + I +IA+++HLYY+DL + ++ D Sbjct: 279 RTMNLADLVKNLHLDFVMPSNITTPIPEGKRIALIMHLYYMDLLDKTLEYAKSMPEGCDF 338 Query: 147 HVTLVTE-SASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 T+ +E +A + E K P + +++N GRDV LI + L YDYVC H K Sbjct: 339 IFTVGSEENAKLVRERCKGLPYNVDVRVIQNRGRDVSALLIGAGKDCL-KYDYVCFAHDK 397 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + GD + F ++LG+ +V II F+ G++ + + + + ++ Sbjct: 398 KVTQLSPYSI-GDGFAYKCFENILGSKALVSNIINHFEQDPHAGLLAPTSPNHADYFGNF 456 Query: 265 TCSLGKNREMICTLAGR-MGITFQDQK---LDFFAGTMFWVRTEALDPIKNLRLSRYFEP 320 G N E + +G+ GTMFW R +AL + ++ P Sbjct: 457 ASLWGPNFEGTKKMLEETLGVKVPLNPYKEPIAPLGTMFWFRPKALHQLFDIDWKYEDFP 516 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 +DG + H +ER + + + V Sbjct: 517 PEPNKIDGSMLHFIERAYGYLPQANGYYTGFV 548 >gi|258591058|emb|CBE67353.1| protein of unknown function [NC10 bacterium 'Dutch sediment'] Length = 1460 Score = 272 bits (696), Expect = 6e-71, Method: Composition-based stats. Identities = 68/240 (28%), Positives = 118/240 (49%), Gaps = 12/240 (5%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIKSEILKIFPAAR--- 169 I ++IA+ H YY DL E+A+ L N+ +FDL V++ + A + + P AR Sbjct: 589 ISSRIAVHAHAYYPDLTKELASYLKNMPFAFDLFVSVSNDEARDVCRQAFAGLPQARRVI 648 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 + ++ N GRD+ P + +L+ YDY+C +H KKS Y+ + D W +L L+G Sbjct: 649 VDVVANRGRDIAPM-VCHFGGRLATYDYICHLHTKKSM---YAQGKMDGWLEYLLRQLMG 704 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI-TFQD 288 + V +I F + G+I + Y Y + + N+ + + +MGI + Sbjct: 705 SEDQVRRIFSMFQSDPRAGIIYPQNYEYLPY---WGNTWLSNKALGAQMCRQMGITDVPE 761 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 D+ AG+MFW R+EA+ + + + P+ DG + H +ER L + A ++ Sbjct: 762 GYFDYPAGSMFWARSEAIRNLFSADIRLTDFPEEAGQTDGSLAHCIERLLVLVARHAGYK 821 >gi|21111631|gb|AAM39945.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66575237|gb|AAY50647.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 296 Score = 272 bits (696), Expect = 7e-71, Method: Composition-based stats. Identities = 71/260 (27%), Positives = 111/260 (42%), Gaps = 11/260 (4%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKI 164 V +V+H +Y+D+ E+ + + + +T T+ + I + Sbjct: 37 APDVATEICSPSACVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRR 96 Query: 165 FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + EN GRD+LPFL + N V K+H KKS + +G+ WR + Sbjct: 97 GIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEML 152 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 LLG P V I+ F T G+ + P T +G N + + L R G Sbjct: 153 TALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGS 206 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 D F +G+MFW R EAL P+ + L +DG + HA+ER L+V Sbjct: 207 DAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTH 266 Query: 345 ANFRISDVDCILGYRKSLSQ 364 + R++ V+ LG K+ S Sbjct: 267 SGHRVTTVEQTLGITKTPSA 286 >gi|310829395|ref|YP_003961752.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] gi|308741129|gb|ADO38789.1| hypothetical protein ELI_3842 [Eubacterium limosum KIST612] Length = 627 Score = 268 bits (685), Expect = 1e-69, Method: Composition-based stats. Identities = 71/280 (25%), Positives = 110/280 (39%), Gaps = 8/280 (2%) Query: 75 SNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIA 134 +NL QL + S +I + +IA + HLY+ DL E Sbjct: 270 TNLIWQNILRTMHQADFKRALQLNYILSSEFSYDSKKILCEKRIAAIFHLYFEDLIDETY 329 Query: 135 NLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAA--RIHIMENHGRDVLPFLILLETEQ 191 LS++ D+++T TE + E K F ++ +++N GRDV L+ + Sbjct: 330 RYLSSMPEEADIYITTDTEPKKKLIQEKFKDFSCRNFKVILIQNRGRDVSALLVATKA-F 388 Query: 192 LSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 + NYDYVC H KK + G + F + L V II F+ + +GM+ Sbjct: 389 IMNYDYVCFAHDKKVTQTKPYSI-GGAFAYKCFENTLQNKNFVLNIINAFEKNPRLGMLM 447 Query: 252 SRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK---LDFFAGTMFWVRTEALDP 308 Y N E+ L +GI GTMFW R +AL Sbjct: 448 PAPPNNGPYYPTLGNEWMCNYEVTKNLIDELGIKVPMDPGKEPISPLGTMFWFRPKALKV 507 Query: 309 IKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 + + P+ +DG + HA+ER + L V+ F Sbjct: 508 LFDKNWEYSDFPEEPNKVDGTLLHAIERAYGLIVQSEGFY 547 >gi|160894491|ref|ZP_02075267.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] gi|156863802|gb|EDO57233.1| hypothetical protein CLOL250_02043 [Clostridium sp. L2-50] Length = 646 Score = 267 bits (683), Expect = 2e-69, Method: Composition-based stats. Identities = 66/249 (26%), Positives = 116/249 (46%), Gaps = 8/249 (3%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKI 164 K+ +I K K+A+V+HLY+ DL + SN+ D+++T T E ++ K Sbjct: 303 QNKMDEILKKRKLALVMHLYFPDLVEDSFQWASNVPKETDVYITTDTVEKKEAILKVFKN 362 Query: 165 FPA--ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 P + ++ N GRDV L+ ++ + + NYDY C +H KK+ + GD + Sbjct: 363 LPCNHLEVRVIVNRGRDVSSILVGVK-DVIQNYDYACFVHDKKTAQAK-PGSVGDSFGYK 420 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM 282 + + L V +++TF+ + +G++ + Y G N E +A ++ Sbjct: 421 CWNNTLYNKEFVCNVLQTFEDNERLGILSPPEPNHGPFYQTLGNEWGCNFEKSREVADKL 480 Query: 283 GITFQ---DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 GIT D++ GT FW R AL + + P+ DG I HA+ER + Sbjct: 481 GITIPMSEDKEALAPYGTFFWFRPTALKVLFDHDWQYEEFPEEPNNFDGTILHAIERLYP 540 Query: 340 LSVKKANFR 348 + V++A + Sbjct: 541 ICVQQAGYY 549 >gi|21109952|gb|AAM38419.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 296 Score = 265 bits (679), Expect = 6e-69, Method: Composition-based stats. Identities = 69/247 (27%), Positives = 116/247 (46%), Gaps = 11/247 (4%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENHG 177 +V+H +Y+D+ E + +++ +S L VT T ++ + + A++ EN G Sbjct: 50 CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFENRG 109 Query: 178 RDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI 237 RD+LPFL + V K+H KKS + +GD WRR +F LL P I Sbjct: 110 RDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDAWRREMFSALL-TPQHADAI 164 Query: 238 IRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGT 297 +R F +G+ + P T +G N + + LA R G D+ F +G+ Sbjct: 165 MRGFTDDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAIDEHSVFASGS 219 Query: 298 MFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 MFWV+ EAL P+ + L +DG + HA+ER +++V ++ +D +LG Sbjct: 220 MFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATIDQLLG 279 Query: 358 YRKSLSQ 364 + + Sbjct: 280 IPQPTAS 286 >gi|260890973|ref|ZP_05902236.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] gi|260859000|gb|EEX73500.1| conserved hypothetical protein [Leptotrichia hofstadii F0254] Length = 319 Score = 265 bits (677), Expect = 9e-69, Method: Composition-based stats. Identities = 62/243 (25%), Positives = 107/243 (44%), Gaps = 8/243 (3%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI---FPAA 168 I +K K+ ++ H+Y+ DL E + + ++ + DL +T + K E F Sbjct: 2 IYLKYKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNI 61 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + ++EN GRDV L+ + + + NYDYVC +H KK+ + + G +R + + L Sbjct: 62 EVRVIENRGRDVSSLLVGAK-DAVMNYDYVCFMHDKKTAQLK-PYSSGQGFRYKCYENNL 119 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 V +I TF + +GM+ + N + N + L ++G+ Sbjct: 120 ATKKYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDF 179 Query: 289 Q---KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + GTMFW R AL + + P+ DG I HAVER + +V+ A Sbjct: 180 HWNLEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFAVQDA 239 Query: 346 NFR 348 + Sbjct: 240 GYY 242 >gi|77747764|ref|NP_636021.2| hypothetical protein XCC0629 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761299|ref|YP_244667.2| hypothetical protein XC_3605 [Xanthomonas campestris pv. campestris str. 8004] Length = 546 Score = 265 bits (677), Expect = 9e-69, Method: Composition-based stats. Identities = 71/260 (27%), Positives = 111/260 (42%), Gaps = 11/260 (4%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKI 164 V +V+H +Y+D+ E+ + + + +T T+ + I + Sbjct: 287 APDVATEICSPSACVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRR 346 Query: 165 FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + EN GRD+LPFL + N V K+H KKS + +G+ WR + Sbjct: 347 GIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEML 402 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 LLG P V I+ F T G+ + P T +G N + + L R G Sbjct: 403 TALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGS 456 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 D F +G+MFW R EAL P+ + L +DG + HA+ER L+V Sbjct: 457 DAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTH 516 Query: 345 ANFRISDVDCILGYRKSLSQ 364 + R++ V+ LG K+ S Sbjct: 517 SGHRVTTVEQTLGITKTPSA 536 >gi|312133752|ref|YP_004001091.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773032|gb|ADQ02520.1| Hypothetical protein BBMN68_1493 [Bifidobacterium longum subsp. longum BBMN68] Length = 651 Score = 265 bits (677), Expect = 1e-68, Method: Composition-based stats. Identities = 69/272 (25%), Positives = 110/272 (40%), Gaps = 11/272 (4%) Query: 90 RIHHFPQLLHGWESPAMGK---VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R + ++ + +A+V HLYYIDL +S++ D+ Sbjct: 280 RTINLADVVKTLHLNYVLPQDHTSYEPNPKHVALVFHLYYIDLLDSSLQYISSMPEGCDV 339 Query: 147 HVTLVTESAS-IKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 +T+ +E + I E + P + ++EN GRDV L+ + L NYD VC H K Sbjct: 340 IITVGSEEKACIVKERCEGMPYNIDVRVIENRGRDVSALLVGAGKDVL-NYDLVCFAHDK 398 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + GD + F ++L + V II F+ +G++ + N + + Sbjct: 399 KVTQIK-PLSVGDGFAYKCFENILASKAYVANIIDQFEREPHLGVLMPNPPEHGNYFPVF 457 Query: 265 TCSLGKNREMICTLAGRMGITFQDQKLD---FFAGTMFWVRTEAL-DPIKNLRLSRYFEP 320 T S G N + L + T K GTMFW R +AL D + N P Sbjct: 458 TLSWGDNFDGTVQLLRDIHKTVPLDKKKEVIAPLGTMFWFRPKALSDGLLNHNWQYSDFP 517 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 K +DG I H +ER + + + V Sbjct: 518 KEPNKIDGTILHYIERAYCYVAQANGYYSGYV 549 >gi|84501312|ref|ZP_00999517.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] gi|84390603|gb|EAQ03091.1| hypothetical protein OB2597_13143 [Oceanicola batsensis HTCC2597] Length = 741 Score = 264 bits (676), Expect = 1e-68, Method: Composition-based stats. Identities = 86/328 (26%), Positives = 129/328 (39%), Gaps = 31/328 (9%) Query: 55 YDENYVVAYGSRSG--------KKFFAQSNLYMMERELH----------FDGQRIHHFPQ 96 + + V +G R G + N + E L +G+ P Sbjct: 49 FPMRHYVLWGERMGLEPHPDFSPGAYLGLNPDVAEAGLPPLHHFLTLGRGEGRGTRAQPV 108 Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TE 153 + + +A+ AI +HLYY DLW E + L L +SFDL+VTL E Sbjct: 109 ESLPPIRTTIPRFDPRRPRARFAIHLHLYYPDLWPEFSERLDRLDLSFDLYVTLTWRGPE 168 Query: 154 SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSW 213 + + I + P A++ + N GRD+LPFL LL Y+ +CK+HGKKS + Sbjct: 169 TEWLADIIREAHPRAQVFPVANRGRDILPFLRLLNAGAFDGYEAICKLHGKKSPHRD--- 225 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNRE 273 +GD WRR L +L + + F D + + RY G NR Sbjct: 226 -DGDAWRRHLVDGVLPGKALWTS-LSAFLADEDAALWVADGQRY-----SVRKWWGSNRA 278 Query: 274 MICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 L R+ + D DF AG+M+W++ L I+ L L+ DG + HA Sbjct: 279 RTDALLRRVELDRSDTDFDFPAGSMYWMKPLLLGMIRALDLTEDLFEPESGQTDGTLAHA 338 Query: 334 VERCFSLSVKKANFRISDVDCILGYRKS 361 ER K A + + Sbjct: 339 FERAIGALAKAAGQEVRQTSELGRAVAP 366 >gi|188993121|ref|YP_001905131.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris str. B100] gi|189030067|sp|B0RVK2|WXCX_XANCB RecName: Full=Uncharacterized protein wxcX gi|167734881|emb|CAP53093.1| conserved protein involved in carbohydrate biosynthesis [Xanthomonas campestris pv. campestris] Length = 695 Score = 264 bits (674), Expect = 2e-68, Method: Composition-based stats. Identities = 71/260 (27%), Positives = 112/260 (43%), Gaps = 11/260 (4%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKI 164 V +V+H +Y+D+ E+ + + + +T T+ + I + Sbjct: 436 APDVATEICSPSACVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRR 495 Query: 165 FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + EN GRD+LPFL + N V K+H KKS + +G+ WR + Sbjct: 496 GIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEML 551 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 LLG P V I+ F T +G+ + P T +G N + + L R G Sbjct: 552 TALLG-PQRVDAIVNAFSTDPLVGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGS 605 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 D F +G+MFW R EAL P+ + L +DG + HA+ER L+V Sbjct: 606 DAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTH 665 Query: 345 ANFRISDVDCILGYRKSLSQ 364 + R++ V+ LG K+ S Sbjct: 666 SGHRVTTVEQTLGITKTPSA 685 >gi|227497960|ref|ZP_03928140.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] gi|226832618|gb|EEH65001.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] Length = 626 Score = 264 bits (674), Expect = 2e-68, Method: Composition-based stats. Identities = 63/267 (23%), Positives = 119/267 (44%), Gaps = 10/267 (3%) Query: 90 RIHHFPQLLHGWESPAMGKVMQIAIKA---KIAIVVHLYYIDLWIEIANLLSNLSISFDL 146 R ++ L+ E + +A++ KIA+V+H+Y++DL ++ + +++ DL Sbjct: 278 RTNNLADLVKNLELTYILPTQAVAVQPEESKIALVMHVYHMDLLPQLLHYAASMPAGCDL 337 Query: 147 HVTLVTESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 T+ TE+ ++ + ++EN GRDV L+ +L +YD VC IH K Sbjct: 338 IATVDTEAKAQQVREATAGLSLNVETILIENRGRDVAALLVGARP-RLLDYDLVCFIHDK 396 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 K + G+ + + F ++L P V +I TF +G++ A + + + Sbjct: 397 KVTQI-RPGSVGEGFAKRCFENVLATPEFVCNVIATFQAEPRLGVLTPSAPHHGDYFPIS 455 Query: 265 TCSLGKNREMICTLAGRMGITFQ---DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPK 321 S G N + L G+ D++ G++FW R +A+ P+ + P Sbjct: 456 AFSWGPNDKNTKELLASFGLHAPIDPDKEAIAPFGSVFWFRPQAIRPLLERKWRYDDFPA 515 Query: 322 VHKALDGEIEHAVERCFSLSVKKANFR 348 +DG I HA+ER + + + Sbjct: 516 EPLPIDGTISHAIERVYCYMAQARGYY 542 >gi|323138318|ref|ZP_08073389.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322396401|gb|EFX98931.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 754 Score = 263 bits (673), Expect = 3e-68, Method: Composition-based stats. Identities = 70/308 (22%), Positives = 128/308 (41%), Gaps = 20/308 (6%) Query: 56 DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP------QLLHGWESPAMGKV 109 + V +G G+ + Y L G P E P Sbjct: 304 PLVHYVKFGRAEGRDATPPTPDYTYTYALSDPGPIPDVVPARRPINDHDFALEVPFHY-A 362 Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFPA- 167 I + +A +VH +Y DL I L N+ + DL+++ + E A I ++++ + Sbjct: 363 PDINMDKPVAAIVHAFYPDLLEHILGYLENIPCAVDLYISTDSAEKAEIIGKVVRNWSKG 422 Query: 168 -ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 + IMEN GRD+ P ++ + + +D +H K+S G + WR +L Sbjct: 423 STDVRIMENRGRDIAPMIVGFR-DVFAKHDIFLHVHTKRSPHAGDLLYH---WRDYLLNT 478 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 L G + ++ F IG++ + + + + G + ++ L R+G+ Sbjct: 479 LFGTGDIARSVLSLF-NDPKIGVVFPQHFFEVRR----MLNWGFDYDLARNLLARVGVQL 533 Query: 287 Q-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 D L+F +G+MFW RT+A+ P+ +L L P+ +DG + HA+ER + + Sbjct: 534 NKDLVLEFPSGSMFWGRTDAIRPLLDLDLQFSDFPEEAGQIDGTLAHAIERTLLMVAESK 593 Query: 346 NFRISDVD 353 + V Sbjct: 594 GYEWFKVA 601 >gi|90425670|ref|YP_534040.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] gi|90107684|gb|ABD89721.1| glycosyl transferase, group 1 [Rhodopseudomonas palustris BisB18] Length = 846 Score = 263 bits (672), Expect = 3e-68, Method: Composition-based stats. Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 18/273 (6%) Query: 85 HFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISF 144 HF + H P + A + +IAI H YY DL L++ + S Sbjct: 566 HFLRNGRPE-GRWTHRVIRPEPRR-ESNAARPRIAIHGHFYYPDLLESFLKLIAANASSV 623 Query: 145 DLHVTLV-TESASIKSEILKIF--PAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKI 201 DL +T E A+ + L+ F A + + N GRD+ PFL + ++L +YD V Sbjct: 624 DLFLTTSGPEQAAQIRKSLRAFGIQNADVWSVPNRGRDIGPFLKEMP-DKLGSYDIVGHF 682 Query: 202 HGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIR-TFDTHRDIGMIGSRAYRYPNK 260 HGK+SK GD WR + + L+G + +I F +G++ + Sbjct: 683 HGKRSKHVD--STVGDQWRDFAWQHLIGDAFPMIDVIADAFAEDAKLGLVFAEDP----- 735 Query: 261 YCDYTCSLGKNREMICTLAGRMGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFE 319 Y +NR++ LA RM I + DF GTMFW R AL P+ L L Sbjct: 736 ---YLNGWDENRDLAERLAQRMKIEAPLPEHFDFPIGTMFWARVAALQPLFQLNLDWNDY 792 Query: 320 PKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 P +DG I HA+ER +V+K+ F + Sbjct: 793 PHEPLPIDGTILHALERIVPFAVQKSGFEYATT 825 >gi|190572676|ref|YP_001970521.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] gi|190010598|emb|CAQ44207.1| putative glycosyltransferase, fusion protein [Stenotrophomonas maltophilia K279a] Length = 566 Score = 263 bits (672), Expect = 3e-68, Method: Composition-based stats. Identities = 85/285 (29%), Positives = 123/285 (43%), Gaps = 20/285 (7%) Query: 79 MMERELHFDGQRIHHFPQLL---------HGWESPAMGKVMQIAIKAKIAIVVHLYYIDL 129 +E E+ + LL + +G + +K++ AIV+HLY++DL Sbjct: 269 QVEEEVARMRATRVNAMPLLAQSSDGISQEELQRYGVGAIPADKLKSRFAIVLHLYHLDL 328 Query: 130 WIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAARIHIMENHGRDVLPFLILL 187 I + N+ + DL V++ + + + A + + N GRDV PF+ LL Sbjct: 329 IESIQGYMKNMIVDHDLFVSVKSVADRRVAVRFFEERKVRAFVFVHPNIGRDVGPFVSLL 388 Query: 188 ETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDI 247 T L YD VCKIH KKS G WR L LLG+ V KI+R F Sbjct: 389 NTGLLDRYDAVCKIHSKKSVYHDG----GGQWRDELMKSLLGSSHTVLKILRAFRHDSSC 444 Query: 248 GMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALD 307 G++G Y G N E + LA GI +L FFAGTMFW R AL Sbjct: 445 GIVGPEH-----AYVSNARFWGGNEERLRRLAAETGIDDARIRLGFFAGTMFWFRPAALY 499 Query: 308 PIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 ++ L+ LD + H +ER F L V++A + + Sbjct: 500 ALRERALALSEFDPEAGQLDATLAHVIERLFVLWVEQAGYFAATT 544 >gi|189030068|sp|P0C7J1|WXCX_XANCP RecName: Full=Uncharacterized protein wxcX Length = 695 Score = 263 bits (672), Expect = 4e-68, Method: Composition-based stats. Identities = 71/260 (27%), Positives = 111/260 (42%), Gaps = 11/260 (4%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKI 164 V +V+H +Y+D+ E+ + + + +T T+ + I + Sbjct: 436 APDVATEICSPSACVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRR 495 Query: 165 FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + EN GRD+LPFL + N V K+H KKS + +G+ WR + Sbjct: 496 GIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEML 551 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 LLG P V I+ F T G+ + P T +G N + + L R G Sbjct: 552 TALLG-PQRVDAIVNAFSTDPLAGLAAPEDHLLP-----VTEFIGGNADALDYLTVRTGS 605 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 D F +G+MFW R EAL P+ + L +DG + HA+ER L+V Sbjct: 606 DAPDTNSLFASGSMFWARLEALRPLLDAHLHASEFESEQGQIDGTLAHAIERFVGLAVTH 665 Query: 345 ANFRISDVDCILGYRKSLSQ 364 + R++ V+ LG K+ S Sbjct: 666 SGHRVTTVEQTLGITKTPSA 685 >gi|122879048|ref|YP_199439.6| hypothetical protein XOO0800 [Xanthomonas oryzae pv. oryzae KACC10331] Length = 546 Score = 263 bits (672), Expect = 4e-68, Method: Composition-based stats. Identities = 70/261 (26%), Positives = 123/261 (47%), Gaps = 11/261 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 +V+ + + +V+H +Y+D+ E + L++ +S L VT T ++ + + Sbjct: 286 HTAEVVTRSGQHDACVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQ 345 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 A++ EN GRD+LPFL + V K+H KKS + +GD WRR + Sbjct: 346 RGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDTWRRDM 401 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP V I+R F +G++ + P T +G N + + L R G Sbjct: 402 LSGLL-APQHVAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTG 455 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V Sbjct: 456 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 515 Query: 344 KANFRISDVDCILGYRKSLSQ 364 + R++ ++ +LG + + Sbjct: 516 HSGQRVATIEQLLGIPQPATA 536 >gi|260890969|ref|ZP_05902232.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] gi|260859295|gb|EEX73795.1| O-antigen export system ATP-binding protein RfbB [Leptotrichia hofstadii F0254] Length = 709 Score = 262 bits (671), Expect = 4e-68, Method: Composition-based stats. Identities = 59/241 (24%), Positives = 102/241 (42%), Gaps = 8/241 (3%) Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKI---FPAARI 170 + K+ ++ H+Y+ DL E + + ++ + DL +T + K E F + Sbjct: 149 SEDKVLLIFHIYFEDLLDESIHYMKSMPETSDLLITTPRKELKEKIEEKVRGLNFRNIEV 208 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 ++EN GRDV L+ + + + NYDYVC +H KK+ + D++ + L Sbjct: 209 RVIENRGRDVSSLLVGAK-DAVMNYDYVCFMHDKKTAQLKPYSSLNDVYINYC-KGTLAT 266 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ- 289 V +I TF + +GM+ + N + N + L ++G+ Sbjct: 267 KKYVKNLIGTFKENPRLGMLMPPPPNHGNFFHIIGNEWSSNFKKTEKLIKKLGLNVDFHW 326 Query: 290 --KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + GTMFW R AL + + P+ DG I HAVER + V+ A + Sbjct: 327 NLEPISPLGTMFWFRPRALKKLFDYGWEYSDFPEEPNEHDGTILHAVERVYGFVVQDAGY 386 Query: 348 R 348 Sbjct: 387 Y 387 >gi|166713445|ref|ZP_02244652.1| hypothetical protein Xoryp_18900 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 695 Score = 261 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 71/261 (27%), Positives = 123/261 (47%), Gaps = 11/261 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 +V+ + + +V+H +Y+D+ E + L++ +S L VT T ++ + + Sbjct: 435 HTAEVVTRSGQHDACVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQ 494 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 A++ EN GRD+LPFL + V K+H KKS + +GD WRR + Sbjct: 495 RGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDTWRRDM 550 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP V I+R F +G++ + P T +G N + + L R G Sbjct: 551 LSGLL-APQHVAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFIGGNADALDYLTVRTG 604 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664 Query: 344 KANFRISDVDCILGYRKSLSQ 364 + R++ ++ +LG K + Sbjct: 665 HSGQRVATIEQLLGIPKPATA 685 >gi|325921211|ref|ZP_08183074.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] gi|325548310|gb|EGD19301.1| lipopolysaccharide biosynthesis protein [Xanthomonas gardneri ATCC 19865] Length = 706 Score = 261 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 66/255 (25%), Positives = 115/255 (45%), Gaps = 11/255 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 KV+ ++ +V+H +Y+D+ E+ + +++ +IS L +T T ++ + + Sbjct: 428 GPSKVVSELAPHRVCVVLHAWYLDVLDEMLDAVAHCAISPRLVITTDLTMVVEVRHRVQQ 487 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 A + EN GRD+LPFL + V K+H KKS + +GD WR + Sbjct: 488 RGMQAEVEGFENRGRDILPFLHVANRLLDEGVCLVVKLHTKKSTHRS----DGDTWRHEM 543 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP I+ F + +G+ + P +G N + + L R G Sbjct: 544 LSALL-APERADAIVNAFSSDPLLGLAAPDGHLLP-----VADFIGGNTDALDYLGARTG 597 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 ++ F +G+MFW R EAL P+ + L +DG + HA+ER +S Sbjct: 598 TETAIEQGMFASGSMFWARLEALRPLLDAHLHPSEFETEQGQIDGTLAHAIERFMGISAI 657 Query: 344 KANFRISDVDCILGY 358 ++ +RI+ + L Sbjct: 658 QSGYRIATIGQALEI 672 >gi|310816773|ref|YP_003964737.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] gi|308755508|gb|ADO43437.1| lipopolysaccharide biosynthesis protein-like protein [Ketogulonicigenium vulgare Y25] Length = 726 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 76/268 (28%), Positives = 121/268 (45%), Gaps = 22/268 (8%) Query: 89 QRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHV 148 + L E+PA + I + +HLYY +L A L+ + + L+V Sbjct: 446 ETRKRHLVLQPRREAPAPARP--------IGVFLHLYYQELAPVFAKRLAQIPLPLSLYV 497 Query: 149 TLVTESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 + T A ++I + P A++ ++ N GRD+ P L + +++D V +HGKKS Sbjct: 498 STDT--AEKAAQIERALPQAQVRVLPNRGRDIFPKLYGF-GDAYADHDIVLHLHGKKSLH 554 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 D W + LLG P V +I+ FD+ +G++ +R Sbjct: 555 SSML----DEWLSHILDCLLGDPADVNRILSLFDSVPRLGIVMPVVHRS----VLNAAHW 606 Query: 269 GKNREMICTLAGRMGIT---FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKA 325 G NR++ LA RMG+ ++ L F AG+MFW RT AL PI +L L P Sbjct: 607 GFNRDIGAELAYRMGMATPLPENDALQFPAGSMFWARTAALQPILDLALEASHFPPEAGQ 666 Query: 326 LDGEIEHAVERCFSLSVKKANFRISDVD 353 +DG + HAVER + + + + V Sbjct: 667 VDGTLAHAVERMLGVVCRAGGYYMLPVA 694 >gi|84622385|ref|YP_449757.1| hypothetical protein XOO_0728 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188578640|ref|YP_001915569.1| hypothetical protein PXO_03177 [Xanthomonas oryzae pv. oryzae PXO99A] gi|84366325|dbj|BAE67483.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188523092|gb|ACD61037.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 695 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 70/261 (26%), Positives = 123/261 (47%), Gaps = 11/261 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 +V+ + + +V+H +Y+D+ E + L++ +S L VT T ++ + + Sbjct: 435 HTAEVVTRSGQHDACVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQ 494 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 A++ EN GRD+LPFL + V K+H KKS + +GD WRR + Sbjct: 495 RGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDTWRRDM 550 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP V I+R F +G++ + P T +G N + + L R G Sbjct: 551 LSGLL-APQHVAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTG 604 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V Sbjct: 605 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 664 Query: 344 KANFRISDVDCILGYRKSLSQ 364 + R++ ++ +LG + + Sbjct: 665 HSGQRVATIEQLLGIPQPATA 685 >gi|58425017|gb|AAW74054.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 727 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 70/261 (26%), Positives = 123/261 (47%), Gaps = 11/261 (4%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 +V+ + + +V+H +Y+D+ E + L++ +S L VT T ++ + + Sbjct: 467 HTAEVVTRSGQHDACVVLHAWYLDVLDEALDALAHCGLSLRLVVTTDITMVTQVRQCLQQ 526 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 A++ EN GRD+LPFL + V K+H KKS + +GD WRR + Sbjct: 527 RGLQAQVEGFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDTWRRDM 582 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 LL AP V I+R F +G++ + P T +G N + + L R G Sbjct: 583 LSGLL-APQHVAAIVRGFAEDPLLGLVAPAQHLLP-----VTDFMGGNADALDYLTVRTG 636 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V Sbjct: 637 TDAINAHSLFASGSMFWVKLEALRPLLDAHLHPSEFESEQGQIDGTLAHAIERFLAVAVA 696 Query: 344 KANFRISDVDCILGYRKSLSQ 364 + R++ ++ +LG + + Sbjct: 697 HSGQRVATIEQLLGIPQPATA 717 >gi|77748730|ref|NP_643883.2| hypothetical protein XAC3576 [Xanthomonas axonopodis pv. citri str. 306] Length = 546 Score = 260 bits (664), Expect = 3e-67, Method: Composition-based stats. Identities = 69/247 (27%), Positives = 116/247 (46%), Gaps = 11/247 (4%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENHG 177 +V+H +Y+D+ E + +++ +S L VT T ++ + + A++ EN G Sbjct: 300 CVVLHAWYLDVLDEALDAIADCGLSLRLVVTTDITMVEQVRQRLQQRGVQAQVDGFENRG 359 Query: 178 RDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI 237 RD+LPFL + V K+H KKS + +GD WRR +F LL P I Sbjct: 360 RDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDAWRREMFSALL-TPQHADAI 414 Query: 238 IRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGT 297 +R F +G+ + P T +G N + + LA R G D+ F +G+ Sbjct: 415 MRGFTDDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAIDEHSVFASGS 469 Query: 298 MFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 MFWV+ EAL P+ + L +DG + HA+ER +++V ++ +D +LG Sbjct: 470 MFWVKLEALRPLLDANLHPSEFENEQGQIDGTLAHAIERFLAVAVSHCGHHVATIDQLLG 529 Query: 358 YRKSLSQ 364 + + Sbjct: 530 IPQPTAS 536 >gi|325928558|ref|ZP_08189746.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] gi|325541097|gb|EGD12651.1| Lipopolysaccharide biosynthesis protein/putative glycosyl transferase [Xanthomonas perforans 91-118] Length = 695 Score = 258 bits (660), Expect = 1e-66, Method: Composition-based stats. Identities = 69/253 (27%), Positives = 120/253 (47%), Gaps = 11/253 (4%) Query: 113 AIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIH 171 + + + +V+H +Y+D+ E +++ +S L +T T ++ + + A++ Sbjct: 443 SDQRDVCVVLHAWYLDVLDEALEAIAHCGLSLRLVITTDITMVEQVRQRLQQRGVQAQVE 502 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 EN GRD+LPFL + V K+H KKS + +GD WRR +F LL AP Sbjct: 503 GFENRGRDILPFLRVANRLLDEGEQVVLKLHTKKSTHRE----DGDTWRREMFSALL-AP 557 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 V I+R F +G+ + P T +G N + + LA R G ++ Sbjct: 558 QHVDAIMRGFADDPLLGLAAPAQHLLP-----VTDFIGGNADALDYLAVRTGTDAINEHS 612 Query: 292 DFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISD 351 F +G+MFWV+ EAL P+ + L +DG + HA+ER +++V ++ Sbjct: 613 MFASGSMFWVKLEALRPLLDAHLHPSEFEDEQGQIDGTLAHAIERFLAVAVGHCGHHVAT 672 Query: 352 VDCILGYRKSLSQ 364 V+ +LG + + Sbjct: 673 VEQLLGIAQPTAT 685 >gi|82703518|ref|YP_413084.1| glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] gi|82411583|gb|ABB75692.1| Glycosyl transferase, group 1 [Nitrosospira multiformis ATCC 25196] Length = 828 Score = 258 bits (659), Expect = 1e-66, Method: Composition-based stats. Identities = 68/245 (27%), Positives = 109/245 (44%), Gaps = 13/245 (5%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILK 163 + ++ ++A+ +H+YY +L+ EI L ++ DL +++ TE + + + +L Sbjct: 579 SEEAARPLSSSIRVALHLHVYYSELFPEIMARLKVNNVRPDLFISVPTECTRNEVTGLLN 638 Query: 164 IFPA--ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRR 221 +P I I+ N GRD+ P L + L +YD + +H KK+ G W Sbjct: 639 DYPGKVVDIQIVPNRGRDIGPLLTAFGSVFLDDYDAIGHLHTKKTADLSDEMI-GKRWYT 697 Query: 222 WLFYDLLGAPGVVFKII-RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 +L +LLG + II IG++ + G N+ +LA Sbjct: 698 FLLENLLGGKRNMADIILGRMTADPAIGIVFPDDP--------HVFDWGNNKAHADSLAS 749 Query: 281 RMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL 340 ++G+ + F GTMFW RTEAL P+ L LS P DG I HA+ER L Sbjct: 750 KLGLGKLQENFVFPMGTMFWARTEALRPLFTLDLSWQDYPAEPLPYDGTILHALERLLPL 809 Query: 341 SVKKA 345 K Sbjct: 810 IAAKQ 814 >gi|3399709|dbj|BAA32094.1| rgpFc [Streptococcus mutans] Length = 583 Score = 257 bits (656), Expect = 2e-66, Method: Composition-based stats. Identities = 71/276 (25%), Positives = 119/276 (43%), Gaps = 21/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + K ++ +K K+A+ +H++Y+DL E S+DL +T Sbjct: 260 NYPDFSYLLGHKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTD 319 Query: 152 TE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ A I+ + A++ + N GRDVLP L L LS YD+V H KKSK Sbjct: 320 SDDKKAEIEEILSANGQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L L+ I+ + IG++ + + +Y + Sbjct: 378 DF--WAGQSWREELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTF-FRYNKIVDAWN 431 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + TL +MG+T + F GT W + +AL P+ +L L+ P+ Sbjct: 432 EHLIAPEMNTLWQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEE 491 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 492 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPVDL 526 >gi|290580710|ref|YP_003485102.1| rhamnan synthesis protein F [Streptococcus mutans NN2025] gi|254997609|dbj|BAH88210.1| RgpFc protein [Streptococcus mutans NN2025] Length = 557 Score = 256 bits (654), Expect = 5e-66, Method: Composition-based stats. Identities = 71/276 (25%), Positives = 119/276 (43%), Gaps = 21/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + K ++ +K K+A+ +H++Y+DL E S+DL +T Sbjct: 234 NYPDFSYLLGHKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTD 293 Query: 152 TE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ A I+ + A++ + N GRDVLP L L LS YD+V H KKSK Sbjct: 294 SDDKKAEIEEILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSAYDFVGHFHTKKSKEA 351 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L L+ I+ + IG++ + + +Y + Sbjct: 352 DF--WAGQSWREELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTF-FRYNKIVDAWN 405 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + TL +MG+T + F GT W + +AL P+ +L L+ P+ Sbjct: 406 EHLIAPEMNTLWQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEE 465 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 466 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPVDL 500 >gi|325915787|ref|ZP_08178089.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] gi|325538051|gb|EGD09745.1| Putative glycosyltransferase [Xanthomonas vesicatoria ATCC 35937] Length = 695 Score = 256 bits (654), Expect = 5e-66, Method: Composition-based stats. Identities = 69/257 (26%), Positives = 113/257 (43%), Gaps = 11/257 (4%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAA 168 + + +VVH +Y+D+ EI + L+ L VT T ++ + + A Sbjct: 440 VPAPRTYRACVVVHAWYLDVLDEILDALAPSVAMLRLIVTTDLTLVGQVRGRLQQHGIEA 499 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 + EN GRD+LPFL + V K+H KKS + +GD WRR + LL Sbjct: 500 EVEGFENRGRDILPFLHIANRLLDEGEQLVVKLHTKKSTHRH----DGDAWRREMLAALL 555 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G G V I+ F +G+ + T +G N + + LA R G Sbjct: 556 GG-GRVDAIVNAFVADPQLGLAAPAQHLLA-----VTDFIGGNADALDYLAVRTGTGTVT 609 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 + F +G+MFW + +AL P+ + L +DG + HA+ER +V + R Sbjct: 610 EHDRFASGSMFWAKLDALRPLLDAHLQPGDFEGEQGQIDGTLAHAIERFLGHAVLHSGHR 669 Query: 349 ISDVDCILGYRKSLSQN 365 I+ +D ++G R+ + Sbjct: 670 IATIDGLMGQREPAAST 686 >gi|30024644|dbj|BAC75698.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 255 bits (653), Expect = 5e-66, Method: Composition-based stats. Identities = 71/276 (25%), Positives = 119/276 (43%), Gaps = 21/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + K ++ +K K+A+ +H++Y+DL E S+DL +T Sbjct: 260 NYPDFSYLLGHKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTD 319 Query: 152 TE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ A I+ + A++ + N GRDVLP L L LS YD+V H KKSK Sbjct: 320 SDDKKAEIEEILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L L+ I+ + IG++ + + +Y + Sbjct: 378 DF--WAGQSWREELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTF-FRYNKIVDAWN 431 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + TL +MG+T + F GT W + +AL P+ +L L+ P+ Sbjct: 432 EHLIAPEMNTLWQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEE 491 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 492 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPVDL 526 >gi|30024633|dbj|BAC75688.1| rhamnosyltransferase [Streptococcus mutans] Length = 583 Score = 255 bits (652), Expect = 8e-66, Method: Composition-based stats. Identities = 72/276 (26%), Positives = 119/276 (43%), Gaps = 21/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + K ++ +K K+A+ +H++Y+DL E S+DL +T Sbjct: 260 NYPDFSYLLGHKYVKKRERVDLKNQKVAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTD 319 Query: 152 TE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ A I+ + A+I + N GRDVLP L L LS YD+V H KKSK Sbjct: 320 SDDKKAEIEEVLSANSQEAQIFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L L+ I+ + IG++ + + +Y + Sbjct: 378 DF--WAGQSWREELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTF-FRYNKIVDAWN 431 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + TL +MG+T + F GT W + +AL P+ +L L+ P+ Sbjct: 432 EHLIAPEMNTLWQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEE 491 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 492 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPVDL 526 >gi|289678438|ref|ZP_06499328.1| glycosyl transferase, group 1 [Pseudomonas syringae pv. syringae FF5] Length = 774 Score = 255 bits (651), Expect = 9e-66, Method: Composition-based stats. Identities = 66/317 (20%), Positives = 118/317 (37%), Gaps = 35/317 (11%) Query: 41 WFWTLFYKRSKKLCY-----DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP 95 +F T Y R Y + + G + G+ + A + Sbjct: 204 YFDTETYHRMNTDVYHAQISPLYHYLISGRQEGRTYHA-------------------NIK 244 Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 + + A + +A+ +H++Y D + ++ L+N D+ +TL Sbjct: 245 KWQPNSVLDIQATLKPEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLADAKH 304 Query: 156 SIKSEIL----KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 K+ + ++ + N GR+ P L+ + L YD C +H KKS Y Sbjct: 305 QKKTIAVFSKHPRVKNLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSL---Y 360 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 S E W +L LL ++ +++ F H+D+G+ + + + N Sbjct: 361 SGREQTQWADYLTEYLLRDANIITRLLNAFADHKDLGLYYPTTFWMM---PSWVNHVTMN 417 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 + + I D L + AG MFW R EAL + F P+ DG + Sbjct: 418 KSFMAAWHNEWQIDPCDGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDGSML 477 Query: 332 HAVERCFSLSVKKANFR 348 HA+ER L +K ++ Sbjct: 478 HALERVIGLLAEKNGYK 494 >gi|24379285|ref|NP_721240.1| RgpFc protein [Streptococcus mutans UA159] gi|24377204|gb|AAN58546.1|AE014924_6 RgpFc protein [Streptococcus mutans UA159] Length = 583 Score = 254 bits (649), Expect = 2e-65, Method: Composition-based stats. Identities = 71/276 (25%), Positives = 118/276 (42%), Gaps = 21/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + K ++ +K K A+ +H++Y+DL E S+DL +T Sbjct: 260 NYPDFSYLLGHKYVKKRERVDLKNQKAAVHLHVFYVDLLEEFLTAFKQFHFSYDLFITTD 319 Query: 152 TE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ A I+ + A++ + N GRDVLP L L LS YD+V H KKSK Sbjct: 320 SDDKKAEIEEILSANSQEAQVFVTGNIGRDVLPMLKL--KNYLSTYDFVGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L L+ I+ + IG++ + + +Y + Sbjct: 378 DF--WAGQSWREELIDMLVKPAD---NILAQLQQNPKIGLVIADMPTF-FRYNKIVDAWN 431 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + TL +MG+T + F GT W + +AL P+ +L L+ P+ Sbjct: 432 EHLIAPEMNTLWQKMGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPLFDLNLTDDDVPEE 491 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 492 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPVDL 526 >gi|259414984|ref|ZP_05738907.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] gi|259349435|gb|EEW61182.1| glycosyl transferase, group 1 [Silicibacter sp. TrichCH4B] Length = 680 Score = 254 bits (648), Expect = 2e-65, Method: Composition-based stats. Identities = 79/298 (26%), Positives = 128/298 (42%), Gaps = 14/298 (4%) Query: 70 KFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDL 129 K + + + H H + + A + I + A+V+HLYY DL Sbjct: 27 KSGQRPFEHFLRAGRHEQRVTREHSATIAESGSAVAPLRGAGINQNLQ-AVVIHLYYTDL 85 Query: 130 WIEIANLLSNLSISFDLHVTLVT---ESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 W E + L + +FDL+VTL E+ ++ I + +P AR+ ++ N GRD+ PFL L Sbjct: 86 WDEFRDRLRSARFTFDLYVTLTEQGPETEETRARIAEDWPEARVLVLPNRGRDIYPFLHL 145 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L L +Y VCK+H KKS + +GD+WR L +L G +++ F D Sbjct: 146 LNAGWLDHYRAVCKLHSKKSPHRQ----DGDVWRTHLTEGIL-PEGETAELLERFLAAED 200 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 G+ + Y G N E L R+ + LDF AG+++W++ L Sbjct: 201 CGLWVADGQHYEGARW-----WGSNLERCRNLLARLELAASADTLDFPAGSIYWLKPAIL 255 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKSLSQ 364 D ++ L L DG + HA+ER + +I + + + S Sbjct: 256 DMLRGLALGFDDFDIEQGQTDGTLAHALERALGMICAAGGLQILQTTQLRDLQPAPSS 313 >gi|33862360|ref|NP_893920.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] gi|33640473|emb|CAE20262.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9313] Length = 738 Score = 253 bits (646), Expect = 3e-65, Method: Composition-based stats. Identities = 65/289 (22%), Positives = 108/289 (37%), Gaps = 20/289 (6%) Query: 70 KFFAQSNLYMMERELHFDGQRIHHFPQLLHG--WESPAMGK---VMQIAIKAKIAIVVHL 124 F + + IH+ + W P + + Q + IA+ VH+ Sbjct: 449 GFHPGIYKEQILEDAPCPDPLIHYLKENQPKGEWNVPMITPASSLQQQDSETTIALHVHV 508 Query: 125 YYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAAR---IHIMENHGRDVL 181 +Y +L I N L+ I DL ++ + + I N GRD+ Sbjct: 509 HYPELLDTILNALNYNKIRPDLFLSCTNHENHSEIQCKSAGANCTLKSIITTPNRGRDIG 568 Query: 182 PFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV--FKIIR 239 P L + E + Y+ +H KKS +G WR +L +L+G + +I+ Sbjct: 569 PLLTEIGKELDTKYEIYGHLHTKKSALL--PGKQGCSWRDFLISNLVGMQDIAMADRIVT 626 Query: 240 TFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMF 299 + +G++ + NR+ LA ++ + + DF GTMF Sbjct: 627 ALKKNPKLGLVFADDPTCVG--------WSGNRKHADILANKLNLGPLPRCFDFPVGTMF 678 Query: 300 WVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 W + AL + NL L P+ DG I HA+ER + K F Sbjct: 679 WAKKGALTELYNLNLGWEDYPQEPLGYDGTILHAIERLLPIIAAKQGFT 727 >gi|83582737|ref|YP_425043.1| glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] gi|83578053|gb|ABC24603.1| Glycosyl transferase, group 1 [Rhodospirillum rubrum ATCC 11170] Length = 1236 Score = 253 bits (646), Expect = 4e-65, Method: Composition-based stats. Identities = 78/278 (28%), Positives = 120/278 (43%), Gaps = 24/278 (8%) Query: 80 MERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSN 139 +ER G H L SP G K+ + H YY+DL + + Sbjct: 959 IERGSP-KGNWTHPVIDLETSLPSPIEG--------GKVLLHGHFYYVDLIDDFLKKIII 1009 Query: 140 LSISFDLHVTLVTESASI-KSEILKIFPA--ARIHIMENHGRDVLPFLILLETEQLSNYD 196 S DL +T E ++ + L+ + + ++ N GRDV F L + S+YD Sbjct: 1010 NDFSCDLIITTTDEDRAVFLRKKLEEYKNGSVEVRVVPNIGRDVGAFFTGLSDLKNSDYD 1069 Query: 197 YVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI-IRTFDTHRDIGMIGSRAY 255 V IHGKKS S G+ WR +L+ L+G I + + DIG++ + Sbjct: 1070 VVGHIHGKKSIHL--SDGTGNKWRNFLWEHLIGGEKKAAAIAVSALIRNPDIGLVFAEEP 1127 Query: 256 RYPNKYCDYTCSLGKNREMICTLAGRMGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRL 314 + KN+E+ LA +MGI + D+ GTMFW + +AL+PI +L L Sbjct: 1128 --------FLFGWDKNKELANDLAKKMGIEKSLPRFFDWPIGTMFWAKRKALEPIFDLNL 1179 Query: 315 SRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 P + G + HA+ER +V+KA F + Sbjct: 1180 RWEDYPPEPIPVYGTMLHALERLLPFAVEKAGFSFATT 1217 >gi|320330331|gb|EFW86314.1| hypothetical protein PsgRace4_09215 [Pseudomonas syringae pv. glycinea str. race 4] Length = 774 Score = 252 bits (644), Expect = 7e-65, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 121/317 (38%), Gaps = 35/317 (11%) Query: 41 WFWTLFYKRSKKLCY-----DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP 95 +F T Y R Y + + G + G+ + A + Sbjct: 204 YFDTETYHRMNTDVYHAQISPLYHYLISGRQEGRTYHA-------------------NVK 244 Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 + + A + +AI +H++Y D + ++ L+N I+ D+ +TL + Sbjct: 245 KWQPNNVLDVQTTLKPEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNH 304 Query: 156 SIKSEIL----KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 K+ ++ + N GR+ P L+ + L YD C +H KKS Y Sbjct: 305 KKKTIATFGQHPRVKNLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSL---Y 360 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 S E W +L LL ++ +++ F H+D+G+ + + + N Sbjct: 361 SGREQTQWADYLTEYLLRDANIITRLLNAFVDHKDLGLYYPTTFWMM---PSWVNHVTMN 417 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 + + I + L + AG MFW R EAL + + F P+ DG + Sbjct: 418 KAFMNAWHNEWQIDPCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSML 477 Query: 332 HAVERCFSLSVKKANFR 348 HA+ER L +K ++ Sbjct: 478 HALERVIGLLAEKNGYK 494 >gi|330899783|gb|EGH31202.1| hypothetical protein PSYJA_20361 [Pseudomonas syringae pv. japonica str. M301072PT] Length = 626 Score = 252 bits (643), Expect = 7e-65, Method: Composition-based stats. Identities = 58/260 (22%), Positives = 104/260 (40%), Gaps = 11/260 (4%) Query: 93 HFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 + + + A + +A+ +H++Y D + ++ L+N D+ +TL Sbjct: 94 NIKKWQPNSVLDVQATLKPEAARLNVAVCLHIFYEDYIEKFSHALANFPTQVDVFITLAD 153 Query: 153 ESASIKSEIL----KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 K+ + ++ + N GR+ P L+ + L YD C +H KKS Sbjct: 154 AKHQKKTIAVFSKHPRVKNLKVRCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSL- 211 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 YS E W +L LL ++ +++ F H+D+G+ + + + Sbjct: 212 --YSGREQTQWADYLTEYLLRDANIITRLLNAFADHKDLGLYYPTTFWMM---PSWVNHV 266 Query: 269 GKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDG 328 N+ + I D L + AG MFW R EAL + F P+ DG Sbjct: 267 TMNKSFMAAWHNEWQIAPCDGFLSYPAGGMFWARPEALKDMLEKEYDYDFFPQEPLPNDG 326 Query: 329 EIEHAVERCFSLSVKKANFR 348 + HA+ER L +K ++ Sbjct: 327 SMLHALERVIGLLAEKNGYK 346 >gi|220924211|ref|YP_002499513.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] gi|219948818|gb|ACL59210.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium nodulans ORS 2060] Length = 1366 Score = 251 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 73/309 (23%), Positives = 134/309 (43%), Gaps = 23/309 (7%) Query: 64 GSRSGKKFFAQSNLYMMERE--LHFDGQRIHHFP--QLLHGWESPAMGKVM-QIAIKAKI 118 G + + ++ + + G P ++ E P + + ++ Sbjct: 644 GEPARWRPVPSPKQRFVQNGALMTYYGAVTSVRPADEVDLALEVPFSSIGDVGLELPERV 703 Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIKSEILKIFP--AARIHIMEN 175 A++ H++Y D E++ L+ + DL ++ TE + L+ + + +M N Sbjct: 704 AVIAHVFYTDFCSELSAYLARIPTQADLFISTDTEDKRQQIAFALQSYNMGKLTVRVMPN 763 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 GRD+ P L+ + + ++Y+Y IH KKS WR +L +LLG+ ++ Sbjct: 764 IGRDIAPMLVGFD-DVFNSYEYFLHIHSKKSPHDP----AFGSWREFLLENLLGSEDIIR 818 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF-QDQKLDFF 294 I+ H+ G++ S+ + + + G N E + L GR GI D L+F Sbjct: 819 SILYLLHAHKT-GIVFSQHFEP----VRHLLNFGYNFETMKGLLGRCGIKISNDLVLEFP 873 Query: 295 AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDC 354 + + FW R+ AL P+ +L L +DG + HA+ER V+K+ FR + V Sbjct: 874 SSSFFWGRSSALKPLLDLNLDWSDFAAEAGQIDGTLAHAIERSVLYIVEKSGFRWAKV-- 931 Query: 355 ILGYRKSLS 363 G+R L Sbjct: 932 --GHRSKLP 938 >gi|171779906|ref|ZP_02920810.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281254|gb|EDT46689.1| hypothetical protein STRINF_01693 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 592 Score = 251 bits (641), Expect = 1e-64, Method: Composition-based stats. Identities = 71/276 (25%), Positives = 121/276 (43%), Gaps = 22/276 (7%) Query: 93 HFPQLLHGWESPAMGKVMQIA-IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 +FP + + +V ++ KIA+ +H++Y+DL + + N +DL +T Sbjct: 266 NFPDFKYLLARKYVKEVPAVSLADKKIAVHLHVFYVDLLEDFLDAFENFHFVYDLFITTD 325 Query: 152 --TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 T+ I+S + A+I + N GRDVLP L L + LS+YDY+ H KKSK Sbjct: 326 NATKKQEIESILRSNGKDAQIFVTGNVGRDVLPMLKL--KDYLSDYDYIGHFHTKKSKEA 383 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G+ WR L L+ I+ FD +G++ + + ++ + Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFDND-KLGIVIADIPTF-FRFNKIVDAWN 436 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + L +MG+T +F GT W + +AL P+ +L L+ P Sbjct: 437 EHLIAPAMNDLWQQMGMTKAIDFNNFHNFVMSYGTYVWFKYDALKPLFDLGLTDEDVPAE 496 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS + Sbjct: 497 PLP-QNSILHAIERLLIYIAWNEHYDFRISKNPIDM 531 >gi|330882679|gb|EGH16828.1| hypothetical protein Pgy4_27710 [Pseudomonas syringae pv. glycinea str. race 4] Length = 608 Score = 250 bits (640), Expect = 2e-64, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 121/317 (38%), Gaps = 35/317 (11%) Query: 41 WFWTLFYKRSKKLCY-----DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP 95 +F T Y R Y + + G + G+ + A + Sbjct: 38 YFDTETYHRMNTDVYHAQISPLYHYLISGRQEGRTYHA-------------------NVK 78 Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 + + A + +AI +H++Y D + ++ L+N I+ D+ +TL + Sbjct: 79 KWQPNNVLDVQTTLKPEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNH 138 Query: 156 SIKSEIL----KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 K+ ++ + N GR+ P L+ + L YD C +H KKS Y Sbjct: 139 KKKTIATFGQHPRVKNLKVSCVPNRGRNFGPLLVEFSKD-LMAYDLFCHLHSKKSL---Y 194 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 S E W +L LL ++ +++ F H+D+G+ + + + N Sbjct: 195 SGREQTQWADYLTEYLLRDANIITRLLNAFVDHKDLGLYYPTTFWMM---PSWVNHVTMN 251 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 + + I + L + AG MFW R EAL + + F P+ DG + Sbjct: 252 KAFMNAWHNEWQIDPCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSML 311 Query: 332 HAVERCFSLSVKKANFR 348 HA+ER L +K ++ Sbjct: 312 HALERVIGLLAEKNGYK 328 >gi|325276923|ref|ZP_08142610.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] gi|324097938|gb|EGB96097.1| hypothetical protein G1E_25356 [Pseudomonas sp. TJI-51] Length = 758 Score = 250 bits (638), Expect = 3e-64, Method: Composition-based stats. Identities = 67/259 (25%), Positives = 105/259 (40%), Gaps = 11/259 (4%) Query: 94 FPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE 153 P+ +E ++ A KIAI +H+YY D A L DL +T+ E Sbjct: 227 MPKWTPKFEINPASELTPKASNQKIAICLHIYYDDYIERFAEALYTFPTEVDLLITIANE 286 Query: 154 SASIKSE----ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 S ++ ++ I + N GR+ P L+ E L+ YD +C +H KKS Sbjct: 287 SFRDRAYQTFSKIQAVKKVTIKSVPNRGRNFGPLLVEFAQELLT-YDLLCHLHSKKSL-- 343 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 YS E W +L LL VV +++ F + G+ + + + Sbjct: 344 -YSGREQTQWADYLSEYLLNDCSVVKRVLNAFSDNPQFGVYYPTTFWMM---PSWVNHVT 399 Query: 270 KNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGE 329 N+ + L +G D L + AG MFW R +AL I N + P DG Sbjct: 400 MNKPHMRNLQTALGFGHFDDFLSYPAGGMFWARPKALVDILNKTYTYDDFPNEPLPNDGS 459 Query: 330 IEHAVERCFSLSVKKANFR 348 + HA+ER +K ++ Sbjct: 460 MLHALERVIGPVCEKNGYQ 478 >gi|261367011|ref|ZP_05979894.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] gi|282571129|gb|EFB76664.1| putative polysaccharide biosynthesis protein [Subdoligranulum variabile DSM 15176] Length = 646 Score = 249 bits (637), Expect = 5e-64, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 113/286 (39%), Gaps = 18/286 (6%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANL 136 M E + ++ + + ++ + +IA+ +HLY++D+ + Sbjct: 274 RTMHPHEFTRNLGLTRVIQPVVQNAK-----QAEELCAQRRIALAMHLYFMDMLEQSVAF 328 Query: 137 LSNLSISFDLHVTLVTESASIKSEILKIFPA-----ARIHIMENHGRDVLPFLILLETEQ 191 + D+ V+ +E K +I + F + ++EN GRDV FL L Sbjct: 329 AAKFPPQTDVFVSTNSE--EKKEQIEQAFSGQKLHSVTVMVVENRGRDVGAFLCDLAP-H 385 Query: 192 LSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 L NYDY C +H KK+ + G + ++ V ++ F+ +G++ Sbjct: 386 LRNYDYACFMHDKKAIQTK-PGSVGASFGYVCNENVCKNAAHVLNVLCEFENDPYLGILC 444 Query: 252 SRAYRYPNKYCDY-TCSLGKNREMICTLAGRMGITFQ---DQKLDFFAGTMFWVRTEALD 307 + + + + G N E L +G+ ++ G++FW R +AL+ Sbjct: 445 PPYPTHGLYFMNMCSGGWGPNFENTKKLLKELGLDVPISGEESPIAPFGSVFWFRPKALE 504 Query: 308 PIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 P+ P+ DG I HA+ER + + A + + V Sbjct: 505 PLFAHGWQHTDFPQEPLPQDGTISHAIERVYPFVAQAAGYYPAVVM 550 >gi|297538440|ref|YP_003674209.1| Rhamnan synthesis F [Methylotenera sp. 301] gi|297257787|gb|ADI29632.1| Rhamnan synthesis F [Methylotenera sp. 301] Length = 782 Score = 249 bits (636), Expect = 5e-64, Method: Composition-based stats. Identities = 72/264 (27%), Positives = 118/264 (44%), Gaps = 17/264 (6%) Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 ++ + P +V +A++ HL+Y + + LSN+ +FD+++T TE Sbjct: 490 KIEYAMLVPFSYQVESPQNNPSLAVICHLFYHQMCEDYKVYLSNIPFNFDIYITTDTE-- 547 Query: 156 SIKSEILKIFP-----AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 K+ I K F + + N GRD+ P LI + S Y+Y+ IH K S Sbjct: 548 DKKAYIEKSFSGWQRGKVEVRLAVNQGRDIAPKLIACR-DIYSAYEYILHIHSKNSPYSS 606 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 WR ++ LLG+ V I F + ++G+I + ++ + Sbjct: 607 IH----TGWRDYILDTLLGSQKTVSSIFEAFQLNSNLGIIAPQHFKALKLD----IGWDR 658 Query: 271 NREMICTLAGRMGITFQDQ-KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGE 329 N ++ LAGRMG + +DF +G+MFW R+ AL P+ N LS P+ DG Sbjct: 659 NFKIAKKLAGRMGFDISRKAPIDFPSGSMFWARSAALLPLLNCSLSLQDFPREDGQKDGT 718 Query: 330 IEHAVERCFSLSVKKANFRISDVD 353 H++ER + +KA F V Sbjct: 719 TAHSIERLYFFICEKAGFSWIKVA 742 >gi|312866008|ref|ZP_07726229.1| rhamnan synthesis protein F [Streptococcus downei F0415] gi|311098412|gb|EFQ56635.1| rhamnan synthesis protein F [Streptococcus downei F0415] Length = 584 Score = 248 bits (634), Expect = 9e-64, Method: Composition-based stats. Identities = 68/277 (24%), Positives = 120/277 (43%), Gaps = 26/277 (9%) Query: 95 PQLLHGWESPA--MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P L + + ++ ++K+A+ +H++Y+DL E + ++DL +T Sbjct: 262 PDLPYLLARKYLEQAEAEELPAESKVAVHLHVFYVDLLQEFLDAFKTFHFAYDLFITT-- 319 Query: 153 ESASIKSEILKIFPA----ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 + ++EI I A+I + N GRDVLP L L +QL YDY+ H KKSK Sbjct: 320 DKEEKRAEIQAILEQNQVLAQIFVTGNIGRDVLPMLKL--KDQLKGYDYIGHFHTKKSKE 377 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 Y W G WR+ L L+ +I+ + +G++ + + ++ + Sbjct: 378 ADY--WAGQSWRQELIAMLVKPAN---QILAQMAKNDRLGIVIADMPSF-FRFNKIVVAW 431 Query: 269 GKN--REMICTLAGRMGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPK 321 +N + L +M + + +D F GT W + +AL P+ +L L+ + P Sbjct: 432 NENLIAPEMEELWEKMSLKKSIDFKAMDTFVMSYGTYAWFKYDALSPLFDLDLTDEYVPA 491 Query: 322 VHKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER K ++RIS + Sbjct: 492 EPLP-QNSILHAIERLLIYIAWDKHYDYRISPSPITM 527 >gi|317047360|ref|YP_004115008.1| family 2 glycosyl transferase [Pantoea sp. At-9b] gi|316948977|gb|ADU68452.1| glycosyl transferase family 2 [Pantoea sp. At-9b] Length = 1419 Score = 248 bits (634), Expect = 1e-63, Method: Composition-based stats. Identities = 72/316 (22%), Positives = 125/316 (39%), Gaps = 22/316 (6%) Query: 41 WFWTLFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHG 100 W+ + + + + YG G++ + M+E G + Sbjct: 527 WYLEAYPDVATSGMDPFEHFLLYGHAEGRQ--PSATPRMLEAPFFRYGPSEYGAKGRPLL 584 Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV------TES 154 ++P + I + +HLYY+DL E L+ + FDL ++L E Sbjct: 585 IDAPVQ---LNEGFARTIGVHLHLYYVDLADEFIKHLNTIPTGFDLFISLPRGKHNVEEC 641 Query: 155 ASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWW 214 +K + EN GRD+ PF++ E LS Y+ + IH KKS + Sbjct: 642 ERKFRSGIKTLKKLVVRETENKGRDIYPFIVEFGAELLS-YELILHIHSKKSPQA----- 695 Query: 215 EGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREM 274 WRR+L + LG + +I+ +FD +G++ + + + G NRE+ Sbjct: 696 LSKGWRRFLLHYTLGTESITTQILNSFDNDPKLGVLFPAYFYGVTRQP----NWGGNREI 751 Query: 275 ICTLAGRMGITFQDQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 + R+G ++ D+ AG+ FW R++AL P+ N + DG + H Sbjct: 752 VKQQLARLGFSYDMTYCPDYPAGSFFWSRSDALRPLLNGEYRLEDFDEEAGQYDGTLAHG 811 Query: 334 VERCFSLSVKKANFRI 349 ER F N+ Sbjct: 812 FERLFGTIPLLQNYST 827 >gi|296876714|ref|ZP_06900762.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] gi|296432216|gb|EFH18015.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus parasanguinis ATCC 15912] Length = 582 Score = 248 bits (634), Expect = 1e-63, Method: Composition-based stats. Identities = 74/257 (28%), Positives = 117/257 (45%), Gaps = 28/257 (10%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARI 170 K+A+ +H++Y+DL E + + +DL +T + K I +I AR+ Sbjct: 283 DKKVAVHLHVFYVDLLEEFLDAFRDFHFDYDLWITT--DVEEKKQAIEQILSNRAQDARV 340 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 + N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L L+ Sbjct: 341 VVTGNIGRDVLPMLLL--KEQLSKYDYVGHFHTKKSKEADF--WAGESWRKELIEMLVKP 396 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREM----ICTLAGRMGI-- 284 +I+ + + +G+ + + +Y + N + + L RMG Sbjct: 397 AD---QILANMEANPKVGITIADIPTF-FRYNRIVVAW--NEALISPEMNKLWQRMGATK 450 Query: 285 TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLS 341 T +K++ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 451 TIDFEKINTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLP-QNSILHAIERLLIYI 509 Query: 342 V--KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 510 AWDQKYDFRISQNPHVL 526 >gi|154509526|ref|ZP_02045168.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] gi|153799160|gb|EDN81580.1| hypothetical protein ACTODO_02058 [Actinomyces odontolyticus ATCC 17982] Length = 620 Score = 248 bits (633), Expect = 1e-63, Method: Composition-based stats. Identities = 70/256 (27%), Positives = 107/256 (41%), Gaps = 10/256 (3%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS--IKSEIL 162 A + A KI V H++Y D+ EI + LS L + L T E I++ Sbjct: 286 ADQATLDAAASLKILAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEARAQ 345 Query: 163 KIFPAARIHIME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWR 220 + A + ++ N GRD+ FL+ S YD V KIH KKS + Y+ L++ Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYN--AAQLFK 403 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 L+ +LL + V I+ F H +GM+ + Y + NR A Sbjct: 404 EHLYDNLLASSDHVASILAEFAAHPGLGMVIAPMPHMG--YPTMGHAWFANRAPARDFAK 461 Query: 281 RMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++GI F D + G+MF R EAL + L P+ DG + H +ER Sbjct: 462 KVGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLL 521 Query: 339 SLSVKKANFRISDVDC 354 S +V + + V Sbjct: 522 SYAVLSRGYYVRPVMT 537 >gi|320095829|ref|ZP_08027469.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] gi|319977239|gb|EFW08942.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 178 str. F0338] Length = 619 Score = 247 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 66/256 (25%), Positives = 108/256 (42%), Gaps = 10/256 (3%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEIL 162 A + + A ++ + H++Y D+ EI + LS L + L T E A+I+ + Sbjct: 286 AAPEAREKAASLRVVAIAHIFYADMADEIIDRLSVLPDGWRLVATTADEERKAAIEETMA 345 Query: 163 KIFPAARIHIM-ENHGRDVLPFLILLETEQL-SNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + ++ ++ N GRD+ FL+ +YD V KIH KKS + + L++ Sbjct: 346 RRGAVGQVRVVASNRGRDISAFLVDCSDVLAGDDYDVVVKIHSKKSVQDEAN--AAQLFK 403 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 L+ +LL + V I+ F H +GM + Y + NR LA Sbjct: 404 DHLYENLLDSKDHVANILAEFADHPGLGMALAPMPHMG--YPTMGHAWFANRPPARELAK 461 Query: 281 RMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 R+GI F D + G+MF R AL P+ L+ P DG + H +ER Sbjct: 462 RIGITVPFDDHQPLAPYGSMFIARPRALRPLVEAGLTHDDFPPEGGYQDGSLAHVIERLL 521 Query: 339 SLSVKKANFRISDVDC 354 + +V + V Sbjct: 522 AYAVLSEGYYARPVMT 537 >gi|194364297|ref|YP_002026907.1| hypothetical protein Smal_0519 [Stenotrophomonas maltophilia R551-3] gi|194347101|gb|ACF50224.1| conserved hypothetical protein [Stenotrophomonas maltophilia R551-3] Length = 686 Score = 247 bits (632), Expect = 2e-63, Method: Composition-based stats. Identities = 59/265 (22%), Positives = 107/265 (40%), Gaps = 11/265 (4%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 ++ ++ +V+H +++D E+ + + + + L +T ++ + Sbjct: 423 LQATRRAMTPSQPAPSRPCVVIHAWHLDALPELLSAVKDSGLPARLVITTTSDRQAQVQS 482 Query: 161 I-LKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 I A I +NHGRD+LPFL + N V K+H K+S + GD W Sbjct: 483 ITESHGLPAEIWAYDNHGRDILPFLHAADRLLQQNESLVLKLHTKRSTHRD----NGDQW 538 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA 279 RR + LLG P + +G++ + + +G N + + L Sbjct: 539 RREMVDALLG-PAQAAANLAHLQADPRLGLMAPAGHLL-----NVADYIGGNAQRMERLW 592 Query: 280 GRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 ++G+ F +G+MFWVR +AL P+ + L +DG + HA+ER Sbjct: 593 AQLGLDGAPGDGQFASGSMFWVRLQALRPLLDAHLLPSMFEVEAGQIDGTLAHAIERATG 652 Query: 340 LSVKKANFRISDVDCILGYRKSLSQ 364 A F + D + G + Sbjct: 653 AVATCAGFSVGDTSQVHGAPPQPAS 677 >gi|55821450|ref|YP_139892.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] gi|55737435|gb|AAV61077.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMG 18311] Length = 594 Score = 247 bits (632), Expect = 2e-63, Method: Composition-based stats. Identities = 78/272 (28%), Positives = 119/272 (43%), Gaps = 21/272 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + K Q KIA+ +H YY+DL + N ++DL +T + Sbjct: 264 LPTPPYLLDRKVIEKSSQTYSDTKKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDS 323 Query: 153 E--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E A I+S + K ARI I N GRDV+P L L ++LS YDY+ H KKS Sbjct: 324 EDKKAEIQSILDKNGKVARIFITGNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPE-- 379 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y +W GD WR LF L+ II + +G++ + + +Y + Sbjct: 380 YPYWVGDSWRNELFSMLIQPAD---NIIANLERDDRLGLVIADIPSF-FRYTKIVDPWNE 435 Query: 271 NR--EMICTLAGRM--GITFQDQKLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVH 323 NR E + L RM G K++ F GT W + +AL P+ +L L P Sbjct: 436 NRFAEGMNDLWERMDLGRDIDFDKMNTFIMSYGTFIWFKYDALKPLFDLDLQDEEIPAEP 495 Query: 324 KALDGEIEHAVERCFSL--SVKKANFRISDVD 353 I H++ER ++ ++ I+ D Sbjct: 496 IP-QHTILHSIERILVYLAWARRYDYAIAKND 526 >gi|225350704|ref|YP_002720664.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] gi|225216388|gb|ACN85121.1| putative glycosyl transferase, group 1 [Brachyspira hyodysenteriae WA1] Length = 342 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 72/245 (29%), Positives = 109/245 (44%), Gaps = 14/245 (5%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFP-- 166 + K KI I +HLYYID+ L + I FDL +T E+ I P Sbjct: 19 TEEIKKLKIGIHIHLYYIDMMDMFIKYLKDSPIEFDLFITTSKEENKDICLNAFNKLPKL 78 Query: 167 -AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 I I+EN GRD+ P+LI Q +NYD C +H KKS W + W +L Sbjct: 79 KNITIFIVENIGRDIAPWLIECNNIQ-NNYDLFCHLHTKKSLH----WESINEWGEYLIE 133 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 +L+ + + I+ F +IG+I Y Y Y Y + I L ++ I Sbjct: 134 NLI-SEEAINNILSNFILDNNIGIISPHIYYYLFPYILYIDK--DDMHHIKLLLNKLNIN 190 Query: 286 FQDQK--LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 F+ + F G+M W R + L P+ +L L P+ G I HA+ER + + Sbjct: 191 FEPKPENFVFPVGSMLWYRPKVLKPLFDLNLKYSDFPQEPIPKTGTIAHAIERIIGIICE 250 Query: 344 KANFR 348 ++N++ Sbjct: 251 QSNYK 255 >gi|55823377|ref|YP_141818.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] gi|55739362|gb|AAV63003.1| polysaccharide biosynthesis protein [Streptococcus thermophilus CNRZ1066] Length = 581 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 72/254 (28%), Positives = 115/254 (45%), Gaps = 22/254 (8%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV--TESASIKSEILKIFPAARIHI 172 K+A+ +H++Y+DL E + + ++DL +T + I+ + + A I + Sbjct: 283 DKKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDIEEKKQEIEQILSRRSQDATIVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L E+LS YDYV H KKSK + W G+ WR+ L L+ Sbjct: 343 TGNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD 398 Query: 233 VVFKIIRTFDTHRDIGMI---GSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQ 287 +I+ + + +G+ +RY + +L + L RMG T Sbjct: 399 ---QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALIS--PEMNKLWQRMGATKNID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FKNLNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLP-QNSILHAIERLLVYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|319946716|ref|ZP_08020950.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] gi|319746764|gb|EFV99023.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus australis ATCC 700641] Length = 581 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 73/255 (28%), Positives = 114/255 (44%), Gaps = 24/255 (9%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA----ARI 170 K+A+ +H++Y+DL E + ++DL +T + K I +I A + Sbjct: 283 DKKVAVHLHVFYVDLLEEFLDAFQAFHFAYDLWITT--DVEEKKQAIEEILSNRAQVATV 340 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 + N GRDVLP L+L EQLS+YDYV H KKSK + W G+ WR+ L L+ Sbjct: 341 VVTGNIGRDVLPMLLL--KEQLSHYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKP 396 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTLAGRMGI--TF 286 KI+ + + +G+ + + +Y + + + L RMG T Sbjct: 397 AD---KILANMEANPKVGITIADIPTF-FRYNRIVVAWNEVLISPEMNKLWQRMGATKTI 452 Query: 287 QDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV- 342 + L+ F GT W + +AL P+ +L L P I HA+ER Sbjct: 453 DFKNLNTFVMSYGTFVWFKYDALKPLFDLNLKAADVPAEPLP-QNSILHAIERLLVYIAW 511 Query: 343 -KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 512 DQKYDFRISQNPHVL 526 >gi|320325880|gb|EFW81940.1| hypothetical protein PsgB076_04646 [Pseudomonas syringae pv. glycinea str. B076] Length = 774 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. Identities = 66/317 (20%), Positives = 120/317 (37%), Gaps = 35/317 (11%) Query: 41 WFWTLFYKRSKKLCY-----DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP 95 +F T Y R Y + + G + G+ + A + Sbjct: 204 YFDTETYHRMNTDVYHAQISPLYHYLISGRQEGRTYHA-------------------NVK 244 Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 + + A + +AI +H++Y D + ++ L+N I+ D+ +TL + Sbjct: 245 KWQPNNVLDVQTTLKPEAARLNVAICLHIFYEDYIEKFSHALANFPIAVDVFITLAAPNH 304 Query: 156 SIKSEIL----KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 K+ ++ + N R+ P L+ + L YD C +H KKS Y Sbjct: 305 KKKTIATFGQHPRVKNLKVSCVPNRERNFGPLLVEFSKD-LMAYDLFCHLHSKKSL---Y 360 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 S E W +L LL ++ +++ F H+D+G+ + + + N Sbjct: 361 SGREQTQWADYLTEYLLRDANIITRLLNAFVDHKDLGLYYPTTFWMM---PSWVNHVTMN 417 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 + + I + L + AG MFW R EAL + + F P+ DG + Sbjct: 418 KAFMNAWHNEWQIDPCEGFLSYPAGGMFWARPEALKEMLEKEYTYDFFPQEPLPNDGSML 477 Query: 332 HAVERCFSLSVKKANFR 348 HA+ER L +K ++ Sbjct: 478 HALERVIGLLAEKNGYK 494 >gi|13476280|ref|NP_107850.1| hypothetical protein mlr7559 [Mesorhizobium loti MAFF303099] gi|14027041|dbj|BAB53995.1| mlr7559 [Mesorhizobium loti MAFF303099] Length = 644 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. Identities = 65/256 (25%), Positives = 109/256 (42%), Gaps = 12/256 (4%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKIFPAAR---IH 171 KIA+ H+YY D+ EI L N+ + +D T T E + L P + + Sbjct: 297 GKIAVCAHIYYTDMLDEILGLTGNIPVPYDFIATTNTPEKKAEIETALANRPGVKNVIVR 356 Query: 172 IME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++E N GRD+ I L + + YD VC++H KKS + S G+L++R + +LL Sbjct: 357 VVEQNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQSSM--GNLFKRHMVDNLLN 414 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQ 287 + G V ++ F + +G+ + Y S N+ + A + I F Sbjct: 415 SRGYVHNVLDMFHDNPSVGLAIPPIFHI--SYPTMGFSWFANKPKVEETARLLNINVKFD 472 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 + GTMFW R AL + + +DG HA+ER + +V+ A + Sbjct: 473 ENTPVAAYGTMFWFRPRALRKMFEHKWKWEEFNAEPDHVDGGFAHALERLIAYAVQNAGY 532 Query: 348 RISDVDCILGYRKSLS 363 + C + + Sbjct: 533 TTQHIMCAHQAEHNYT 548 >gi|83950907|ref|ZP_00959640.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] gi|83838806|gb|EAP78102.1| hypothetical protein ISM_07395 [Roseovarius nubinhibens ISM] Length = 752 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. Identities = 87/350 (24%), Positives = 138/350 (39%), Gaps = 42/350 (12%) Query: 42 FWTLFYKRSKKLCYDEN------YVVAYGSRSGKKF--------FAQSNLYMMERELH-- 85 F FY+R K + + V +G + G + + + N + + Sbjct: 32 FDVRFYRREHKDLHWSCRLWARHHYVTWGEKMGLRPRADFAPEEYLRLNPDVAGSGIPPL 91 Query: 86 -----------FDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIA 134 GQ + L P + KA+ A+ H+YY DLW E A Sbjct: 92 MHYLTSGQGEGRSGQPRYATRPL-PSCPLPRLRF-DPGRPKARFALHAHIYYPDLWPEFA 149 Query: 135 NLLSNLSISFDLHVTLV---TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQ 191 + DL++TL E+ + EI + FP A + + N GRD+LPFL+L Sbjct: 150 TRFDEIGDGIDLYITLTWRGEETRWLADEITERFPRAFVTPVPNRGRDILPFLLLANAGA 209 Query: 192 LSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 YD +CKIH KKS + +GD WRR L +L A G+ + ++ F Sbjct: 210 FDGYDALCKIHTKKSPHRD----DGDQWRRHLIDGVLPATGLQER-LQHFLADDAAAFWV 264 Query: 252 SRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKN 311 + Y + G NR+ + R+ + L F AG+++W++ L IK Sbjct: 265 ADGQAYAARDW-----WGINRDKTAAVLRRVELDPLLDALRFPAGSIYWMKPLMLGMIKA 319 Query: 312 LRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKS 361 L L +DG + HAVER A I + ++ R++ Sbjct: 320 LDLDAPMFEPEKGQVDGTLAHAVERAIGGLALAAGQEIRETAALMRPRRA 369 >gi|116628171|ref|YP_820790.1| polysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] gi|116101448|gb|ABJ66594.1| Lipopolysaccharide biosynthesis protein [Streptococcus thermophilus LMD-9] Length = 581 Score = 246 bits (629), Expect = 4e-63, Method: Composition-based stats. Identities = 73/254 (28%), Positives = 115/254 (45%), Gaps = 22/254 (8%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA--SIKSEILKIFPAARIHI 172 K+A+ +H++Y+DL E + + ++DL +T E I+ + + A I + Sbjct: 283 DKKVAVHLHVFYVDLLEEFLDAFQDFHFAYDLWITTDVEEKKQEIEQILSRRSQDATIVV 342 Query: 173 MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 N GRDVLP L+L E+LS YDYV H KKSK + W G+ WR+ L L+ Sbjct: 343 TGNIGRDVLPMLLL--KEKLSRYDYVGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD 398 Query: 233 VVFKIIRTFDTHRDIGMI---GSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQ 287 +I+ + + +G+ +RY + +L + L RMG T Sbjct: 399 ---QILANMEANPKVGITIGDIPTYFRYNRIVVAWNEALIS--PEMNKLWQRMGATKNID 453 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 454 FKNLNTFVMSYGTFVWFKYDALKPLFDLNLTVSDVPAEPLP-QNSILHAIERLLVYIAWD 512 Query: 343 KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 513 QKYDFRISQNPHVL 526 >gi|160936495|ref|ZP_02083863.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] gi|158440580|gb|EDP18318.1| hypothetical protein CLOBOL_01386 [Clostridium bolteae ATCC BAA-613] Length = 674 Score = 246 bits (628), Expect = 4e-63, Method: Composition-based stats. Identities = 64/249 (25%), Positives = 108/249 (43%), Gaps = 15/249 (6%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP-----AAR 169 KIA+V HL+Y DL E L N+ + DL++T+ K ++ F + Sbjct: 290 NKKIAVVAHLFYPDLMDETLRYLQNIQENIDLYITVAN--IETKYKVYNYFESIRRSNVK 347 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 + + N GRD L+ E L Y+Y+C +H KK+ R G G + + + L Sbjct: 348 VLLSGNRGRDAGSLLVACR-EYLMQYEYLCFVHDKKTTRGGGPVTVGKAFMYHAWENTLR 406 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-----CDYTCSLGKNREMICTLAGRMGI 284 + G V II+ F+ + +G++ ++TC K +E+ L+ ++ + Sbjct: 407 SGGFVSSIIKLFEKNDRLGILTPPVPALGGYLTELVGNEWTCCYQKTKELAEILSLKVPM 466 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 + Q Q F T FW R AL P+ P+ A DG + HA+ER + Sbjct: 467 SPQKQP--FALATAFWCRPAALKPLFEYPWRYEDFPEEPLASDGTLNHAIERIIIYVAQS 524 Query: 345 ANFRISDVD 353 + + V+ Sbjct: 525 EGYYTAMVE 533 >gi|269219069|ref|ZP_06162923.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] gi|269211216|gb|EEZ77556.1| glycosyl transferase, group 2 family [Actinomyces sp. oral taxon 848 str. F0332] Length = 687 Score = 245 bits (627), Expect = 6e-63, Method: Composition-based stats. Identities = 70/296 (23%), Positives = 130/296 (43%), Gaps = 17/296 (5%) Query: 64 GSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVH 123 GS G + ++ E + G++ FP W + IA ++IA+V+H Sbjct: 31 GSVKGYRSQHPAD--FSEYVMRQGGRQDSGFPD---KWR---IDPQFDIADPSRIAVVIH 82 Query: 124 LYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPF 183 +Y DL E+ + L NL FDL VT + + + L+ + + +ENHGRD+ P Sbjct: 83 CFYADLMPELFDRLRNLPTDFDLFVTNASGADVAVPKDLERMRHSVVVEVENHGRDIFPT 142 Query: 184 LILLETEQLSNYDYVCKIHGKKSK---RKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRT 240 + L+ + L YD + K+H KKS G W+ DL+G+ V +I+ Sbjct: 143 VQLVNSGILDPYDLILKLHTKKSPWREEHADLDGSGAAWKDQFLSDLVGSREKVEEILNA 202 Query: 241 FDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFW 300 F +G++ + ++ G ++ ++ L R+ ++ +L+F +G+M+W Sbjct: 203 FAADPTLGLVTAADSIVGKEF------WGGDQRIVEQLMLRIEMSIDPDELEFASGSMYW 256 Query: 301 VRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCIL 356 R L ++ L+ + +D HA+ER + +A R + + Sbjct: 257 TRAFVLQGLRAFNLTSADFDEEKGQVDATTAHAIERIVGIVTDEAGLRTVEAPRLA 312 >gi|306831662|ref|ZP_07464819.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325978600|ref|YP_004288316.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] gi|304426087|gb|EFM29202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325178528|emb|CBZ48572.1| rhamnosyltransferase [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] Length = 586 Score = 245 bits (626), Expect = 8e-63, Method: Composition-based stats. Identities = 69/264 (26%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + V + K+A+ +H +Y+DL E N N +DL +T T Sbjct: 264 LPTPPYLLDRKVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDT 323 Query: 153 E--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E A I+S + K A++ + N GRD++P L L E+LS+YDY+ H KKS Sbjct: 324 EAKKAEIESILEKNGKIAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE-- 379 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y +W GD WR L+ L+ + I+ + + ++G++ + + +Y + Sbjct: 380 YPYWVGDSWRNELYQMLIQSAD---NILANLENNDNLGLVIADIPSF-FRYTKIVDPWNE 435 Query: 271 NR--EMICTLAGRMGIT--FQDQKLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVH 323 NR + + L RM + L F GT W + + L P+ +L L+ P Sbjct: 436 NRFADGMNELWERMNLERQIDFNNLSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEP 495 Query: 324 KALDGEIEHAVERCFSLSVKKANF 347 I H++ER N+ Sbjct: 496 IP-QHTILHSIERILVYLAWANNY 518 >gi|322516362|ref|ZP_08069287.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] gi|322125095|gb|EFX96488.1| rhamnosyltransferase [Streptococcus vestibularis ATCC 49124] Length = 581 Score = 245 bits (626), Expect = 9e-63, Method: Composition-based stats. Identities = 74/260 (28%), Positives = 114/260 (43%), Gaps = 28/260 (10%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA---- 167 K+A+ +H++Y+DL E + +DL +T + K I KI Sbjct: 280 ENFDRKVAVHLHVFYVDLLEEFLDAFQAFHFIYDLWITT--DVEEKKQAIEKILSNRVQD 337 Query: 168 ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 A + + N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L L Sbjct: 338 ATVVVTGNIGRDVLPMLLL--KEQLSRYDYVGHFHTKKSKEADF--WAGESWRKELIEML 393 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREM----ICTLAGRMG 283 + + I+ + + +G+ + + +Y + N + + L RMG Sbjct: 394 VKPADL---ILANMEANPKVGITIADIPTF-FRYNRIVVAW--NEALISPEMNKLWQRMG 447 Query: 284 I--TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 T + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 448 ATKTIDFKSLNTFVMSYGTFVWFKYDALKPLFDLNLTAADVPAEPLP-QNSILHAIERLL 506 Query: 339 SLSV--KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 507 IYIAWDQKYDFRISQNPHVL 526 >gi|163853098|ref|YP_001641141.1| lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] gi|163664703|gb|ABY32070.1| Lipopolysaccharide biosynthesis protein-like protein [Methylobacterium extorquens PA1] Length = 916 Score = 245 bits (625), Expect = 9e-63, Method: Composition-based stats. Identities = 72/303 (23%), Positives = 132/303 (43%), Gaps = 17/303 (5%) Query: 56 DENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFP--QLLHGWESPAMGKVMQIA 113 + + YG + G K A+ Y + ++ +G P + + + P ++ Sbjct: 189 PFIHYIRYGRKKGYK-GAREEYYYSNQLVYPNGVVPSRRPRNENDYAFSIPLPERLRSHP 247 Query: 114 IKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIF--PAARI 170 K K A +VH +Y +L EI L + D++V+ ++ A + K + + Sbjct: 248 YK-KAAAIVHGFYPELMEEILIYLGKSNFPIDIYVSTDDSKKAEQIISMGKKYHNGQLDV 306 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGA 230 I+ N GRD+ P L ++ NY+ IH KKS G WR +LF +L+G+ Sbjct: 307 RIISNRGRDIGPMLTGF-SDVFDNYEAFLHIHTKKSPHGGDGL---SSWRDYLFKNLIGS 362 Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF-QDQ 289 ++ + T ++G + + + G N + + +L R+G+ +D Sbjct: 363 AEIIDSNLHILGTR-NVGFVYPQHL----YALRGILNWGYNFDTVSSLLRRVGVRLSKDM 417 Query: 290 KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRI 349 L+F +G+MFW RT AL + +L L +DG + HA+ER F + + + Sbjct: 418 VLEFPSGSMFWARTAALHGLLSLDLKLEDFDNEAGQVDGTLGHAIERSFLYFAETSGYSW 477 Query: 350 SDV 352 + V Sbjct: 478 AKV 480 >gi|293189412|ref|ZP_06608132.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] gi|292821502|gb|EFF80441.1| rhamnan synthesis protein F [Actinomyces odontolyticus F0309] Length = 620 Score = 245 bits (625), Expect = 1e-62, Method: Composition-based stats. Identities = 69/256 (26%), Positives = 107/256 (41%), Gaps = 10/256 (3%) Query: 105 AMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS--IKSEIL 162 A + A K+ V H++Y D+ EI + LS L + L T E I++ Sbjct: 286 ADQATLDAAASLKVLAVAHIFYADMADEILDRLSVLPAGYHLVATTSNEENKALIEAHAQ 345 Query: 163 KIFPAARIHIME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWR 220 + A + ++ N GRD+ FL+ S YD V KIH KKS + Y+ L++ Sbjct: 346 ERGVDADVRVVSSNRGRDIGAFLVDCNDVLTSGEYDIVVKIHSKKSVQDDYN--AAQLFK 403 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 L+ +LL + V I+ F H +GM+ + Y + NR A Sbjct: 404 EHLYDNLLASSDHVASILAKFAAHPGLGMVIAPMPHMG--YPTMGHAWFANRAPARDFAK 461 Query: 281 RMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 ++GI F D + G+MF R EAL + L P+ DG + H +ER Sbjct: 462 KVGITVPFDDHQPLAPYGSMFIARPEALSLLTGAGLVPEDFPEEGGYKDGSLAHVIERLL 521 Query: 339 SLSVKKANFRISDVDC 354 S +V + + V Sbjct: 522 SYAVLSRGYYVRPVMT 537 >gi|262038042|ref|ZP_06011449.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] gi|261747934|gb|EEY35366.1| lipopolysaccharide biosynthesis protein [Leptotrichia goodfellowii F0264] Length = 629 Score = 245 bits (625), Expect = 1e-62, Method: Composition-based stats. Identities = 64/262 (24%), Positives = 113/262 (43%), Gaps = 11/262 (4%) Query: 100 GWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKS 159 + +I K+ + H+Y+ DL E N+ D+ +T E K Sbjct: 287 NYNLSENQVTKKIENSPKVGLFFHIYFEDLIEECYRYALNMPEYADIFITTDKEEKKEKI 346 Query: 160 EIL--KIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 E + K+ I +++N GRDV FLI E++ YDY C H KK+K+ +G+ Sbjct: 347 EKIFSKMKNKIDIKVIQNRGRDVSAFLIP-NKEEILKYDYACFAHDKKTKQLQ-PEIKGE 404 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG----KNRE 273 ++ F ++LG+ +V II F + +G++ + + Y + G N E Sbjct: 405 DFKFRCFENILGSKELVENIIGLFIENPRLGLLSPPSPNHAEFYGNLGREWGHSGNDNYE 464 Query: 274 MICTLAGRMGITFQ---DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 C L + I + GT+FW R ++L+ + PK +DG + Sbjct: 465 ETCNLLKELVIEVNVDISKAPVAPYGTIFWFRPKSLEKLLKKGWKYEDFPKEPNKVDGTL 524 Query: 331 EHAVERCFSLSVKKANFRISDV 352 HA+ER + V+ A + +++ Sbjct: 525 LHAIERVYPFVVQGAGYYSANI 546 >gi|329116186|ref|ZP_08244903.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] gi|326906591|gb|EGE53505.1| rhamnan synthesis protein F [Streptococcus parauberis NCFD 2020] Length = 589 Score = 244 bits (624), Expect = 1e-62, Method: Composition-based stats. Identities = 74/278 (26%), Positives = 121/278 (43%), Gaps = 26/278 (9%) Query: 93 HFPQLLHGWESPAMGK-VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 ++P + + + + I K+AI +H +Y+DL E + N +DL +T Sbjct: 266 NYPDFKYLLANKYLKSDLPSEVINKKVAIHLHTFYVDLLQEFLSAFENFHFDYDLFITT- 324 Query: 152 TESASIKSEILKIFP----AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK 207 + K++I + A + + N GRDVLP L+L E+LS YDY+ H KKSK Sbjct: 325 -DIEEKKTQIENVLNENNQKAEVFVTGNIGRDVLPMLLL--KEKLSVYDYIGHFHTKKSK 381 Query: 208 RKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS 267 + W G+ WR+ L L+ I+ T + + +G++ + Y +Y + Sbjct: 382 EADF--WAGESWRKELIKMLVLPAD---SILATLEKN-KVGIVIADMPTY-FRYNKIVTA 434 Query: 268 LGKN--REMICTLAGRMGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEP 320 +N + L +MG+T L F GT W + +AL P+ +L L+ P Sbjct: 435 WNENLIAPEMNELWKKMGLTKSIDFNHLHTFVMSYGTFVWFKYDALKPLFDLNLTVEDVP 494 Query: 321 KVHKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS L Sbjct: 495 AEPLP-QNSILHAIERLLIYIAWNQHYDFRISKNPVDL 531 >gi|116071143|ref|ZP_01468412.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] gi|116066548|gb|EAU72305.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. BL107] Length = 1161 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 59/311 (18%), Positives = 119/311 (38%), Gaps = 32/311 (10%) Query: 46 FYKRS----KKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW 101 ++ + + + +G + G ++HF + + + Sbjct: 141 YFSKYPDIKESGLNPIIHFFKFGIQEG---------RFSMDDIHFMRKTAN-----IKKV 186 Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI 161 SP K+ Q + K + +H++Y +L IA+ L+ + + D++++ + ++ Sbjct: 187 SSPHPQKLTQAIEQKKFGVFLHIFYPELAKTIADYLAKIPVKIDIYISTTEKEVDELAKT 246 Query: 162 LKIFPA---ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDL 218 + ++ N GRDV PF++ E L YD++ K+H KKS Sbjct: 247 FRRLDNSEHVQVKSFSNTGRDVAPFVVGFREEIL-KYDFILKLHSKKSPHSDALSG---- 301 Query: 219 WRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG---KNREMI 275 W +L+G+ V + I + +I + + G N + Sbjct: 302 WFEHCLDNLIGSKDVFYTNIFELMNNETA-IIYPVENYALSLGIKHDSCWGHEDGNYDKA 360 Query: 276 CTLAGRMGITF--QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 L ++ + +D K F GTMFW ++ L PI + L + +DG + H+ Sbjct: 361 KPLLDKLNLKHIDRDSKFLFPTGTMFWCKSYILQPILDWNLGFHDFDNEGGQIDGTLAHS 420 Query: 334 VERCFSLSVKK 344 +ER L + Sbjct: 421 IERLIGLCCTE 431 >gi|319939379|ref|ZP_08013739.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] gi|319811365|gb|EFW07660.1| RgpFc protein [Streptococcus anginosus 1_2_62CV] Length = 587 Score = 244 bits (622), Expect = 2e-62, Method: Composition-based stats. Identities = 67/272 (24%), Positives = 114/272 (41%), Gaps = 22/272 (8%) Query: 93 HFPQLLHGWESPAMGKVMQIAI-KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 +FP + + ++ KI + +H++Y+DL + N ++DL +T Sbjct: 266 NFPDFKYLLARKYIQTTAPTSLSNKKIGVHLHVFYVDLLEDFLKAFENFHFAYDLFITTD 325 Query: 152 TESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 ++ I++ + + A I + N GRDVLP L L + LS YDY+ H KKSK Sbjct: 326 NDTKKLEIEAILNQNHKNAHIFVTGNIGRDVLPMLKL--KKYLSTYDYIGHFHTKKSKEA 383 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G+ WR L L+ I+ F+ +G++ S + +Y + Sbjct: 384 DF--WAGESWRNELIDMLIKPAD---NILANFEND-KLGLVISDIPTF-FRYNKIVDAWN 436 Query: 270 KN--REMICTLAGRMGITFQDQKLDF-----FAGTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + L +M +T F GT W + +AL P+ +L L+ P Sbjct: 437 EHLIAPEMNDLWYKMKMTKPIDFNTFHTFVMSYGTFIWFKYDALKPLFDLDLTDKDVPIE 496 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDV 352 I HA+ER + +FRIS Sbjct: 497 PLP-QNSILHAIERLIVYVAWNEHYDFRISKN 527 >gi|260434430|ref|ZP_05788400.1| glycosyltransferase [Synechococcus sp. WH 8109] gi|260412304|gb|EEX05600.1| glycosyltransferase [Synechococcus sp. WH 8109] Length = 772 Score = 243 bits (621), Expect = 3e-62, Method: Composition-based stats. Identities = 60/263 (22%), Positives = 100/263 (38%), Gaps = 17/263 (6%) Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA 155 P + + I K+ + +H++Y +L EI +S I +++++ ++ Sbjct: 516 PWNTKLIIPTEDVL--MNIDEKVGLHIHVHYPELLDEILKAISMNKIRPEIYISCTNQAI 573 Query: 156 ---SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYS 212 +IK+ +I + N GRD+ P L L E Y IH KKS Sbjct: 574 RDLAIKNINEHGLILKKIILTPNRGRDIGPLLTCLGQELDEKYRIYGHIHTKKSIHIARH 633 Query: 213 WWEGDLWRRWLFYDLLGAPG--VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 WR +L +L+G ++ II + IG+ P Sbjct: 634 QSY--SWRTFLIENLIGNEENHMMDCIISAMIKDKTIGLAFPSDPHCPG--------WDA 683 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 N LA ++ I + +F GTMFW R AL P+ +L L P DG + Sbjct: 684 NYRQAKLLAEKLNIKSLTNEFNFPIGTMFWARKNALSPLYSLNLGWDDYPSEPIGYDGTL 743 Query: 331 EHAVERCFSLSVKKANFRISDVD 353 H++ER + F + + Sbjct: 744 LHSIERLIPFVAESQGFSYTMTN 766 >gi|291520004|emb|CBK75225.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 984 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 63/236 (26%), Positives = 107/236 (45%), Gaps = 12/236 (5%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEI--LKIFPAARIH 171 +A+ +HL+Y+DL E + +N+ FDL+++ + + IKS + LK+ I Sbjct: 327 PSVAVHLHLFYVDLLPEFVSYFANIPFRFDLYISCQEGADVSVIKSGVKELKMANKVVIR 386 Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N GRD+ P + +++ +DY +H KKS YS E WR++ LLG+P Sbjct: 387 PLPNRGRDLAPLYVGF-ADEIRQHDYFLHVHSKKSL---YSGAEKGGWRQFSLELLLGSP 442 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKL 291 V I F +++ G++ + + S N + L + Sbjct: 443 EKVNSIFNLF-KNKNAGLVYPDIH---EEVPMIAYSWLANAGLGRKLFDEFELGEMPTVF 498 Query: 292 DFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 ++ AG+ FW RT+AL PI N P+ DG + HA+ER +K + Sbjct: 499 NYPAGSFFWARTDALMPIFNRNYIYEDFPEEAGQTDGTLAHALERIIPFVSRKLGY 554 >gi|78184210|ref|YP_376645.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] gi|78168504|gb|ABB25601.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 1161 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 60/312 (19%), Positives = 122/312 (39%), Gaps = 32/312 (10%) Query: 46 FYKRSKKLC----YDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW 101 ++ + + + + +G + ++HF + + Sbjct: 141 YFSKYPDIKESEINPMVHFLKFGIQES---------RYSMDDIHFMRKTAG-----IKKA 186 Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI 161 GK++ + K + +H++Y +L IA+ + + + D+H++ ++ S +EI Sbjct: 187 TYSNDGKLISSIGQKKFGVFLHIFYPELAPIIADYIRKIPVKIDIHISTTHDAISGLTEI 246 Query: 162 LKIFPA---ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDL 218 K ++ N GRDV PF++ E++ YDY+ K+H KKS Sbjct: 247 FKGLENSLNVQVKSFPNIGRDVAPFIVGFR-EEIPKYDYILKLHSKKSPHSNALSG---- 301 Query: 219 WRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG---KNREMI 275 W +L+G+ V + I+ + DI ++ + + G N Sbjct: 302 WFEHCLDNLIGSIDVFYTNIQELNK-EDISIVYPVENYALSLGIKHDSCWGHEDGNYNKA 360 Query: 276 CTLAGRMGITF--QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 TL ++G+ ++ + F G MFW + + L PI + L +DG + H+ Sbjct: 361 KTLLKKLGLEQINRNSEFLFPTGNMFWCKPDILKPILDWDLKFEDFDNEGGQIDGTLAHS 420 Query: 334 VERCFSLSVKKA 345 +ER L + Sbjct: 421 IERLIGLCCTEY 432 >gi|322373386|ref|ZP_08047922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] gi|321278428|gb|EFX55497.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sp. C150] Length = 594 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 73/249 (29%), Positives = 111/249 (44%), Gaps = 20/249 (8%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 KIA+ +H YY+DL + N ++DL +T +E I+S + K ARI I Sbjct: 287 KKIAVHLHTYYVDLLDDFLRQFENFHFTYDLFLTTDSEEKKKEIQSILDKHGKEARIFIT 346 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRDV+P L L ++LS YDY+ H KKS Y +W GD WR LF L+ Sbjct: 347 GNRGRDVIPMLKL--KDELSAYDYIGHFHTKKSPE--YPYWVGDSWRNELFSMLIQPAD- 401 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR--EMICTLAGRM--GITFQDQ 289 II + +G++ + + +Y +NR E + L RM G Sbjct: 402 --NIIANLEHDDRLGLVIADIPTF-FRYTKIVDPWNENRFAEGMNDLWERMDLGRDIDFD 458 Query: 290 KLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSL--SVKK 344 K++ F GT W + + L P+ +L L P I H++ER ++ Sbjct: 459 KMNTFIMSYGTFIWFKYDTLKPLFDLDLQDEEIPAEPIP-QHTILHSIERILVYLAWARR 517 Query: 345 ANFRISDVD 353 ++ I+ D Sbjct: 518 YDYAIAKND 526 >gi|299133415|ref|ZP_07026610.1| Rhamnan synthesis F [Afipia sp. 1NLS2] gi|298593552|gb|EFI53752.1| Rhamnan synthesis F [Afipia sp. 1NLS2] Length = 408 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 88/242 (36%), Positives = 133/242 (54%), Gaps = 5/242 (2%) Query: 104 PAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILK 163 P K +Q+ + I+VHL+Y D+W + L NL+ F L VTL + + + Sbjct: 153 PGAPKPLQLNGRIATGIIVHLHYCDVWPDFEKRLRNLTCPFSLIVTLNESNPDFAARVAG 212 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 FP A++ + N GRDV PF+ LL L +++ +CK+HGKK+ G G++WRR L Sbjct: 213 QFPNAKVLVYPNRGRDVGPFIQLLREGHLDDFELICKLHGKKTVSLGPRMIFGEIWRRLL 272 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 DL+G+ +V I++ F + +G++GS +R +Y + +N + LA R+G Sbjct: 273 LNDLVGSDELVRAILQRFISQPGLGLVGSSHFR-----GNYLGTWPRNAALTLELAKRLG 327 Query: 284 ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 + KLDFFAGTMFWVR E LD +K+L LS+ P DG ++HA+ER F Sbjct: 328 CPEERFKLDFFAGTMFWVRRELLDLLKSLNLSQDDFPVEAGQTDGTLQHALERIFGALPG 387 Query: 344 KA 345 A Sbjct: 388 LA 389 >gi|306833804|ref|ZP_07466929.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] gi|304423998|gb|EFM27139.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus bovis ATCC 700338] Length = 586 Score = 242 bits (619), Expect = 5e-62, Method: Composition-based stats. Identities = 68/264 (25%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + V + K+A+ +H +Y+DL E N N +DL +T T Sbjct: 264 LPTPPYLLDRKVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDT 323 Query: 153 E--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E A I+S + K A++ + N GRD++P L L E+LS+YDY+ H KKS Sbjct: 324 EAKKAEIESILEKNGKTAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE-- 379 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y +W GD WR L+ L+ + ++ + + ++G++ + + +Y + Sbjct: 380 YPYWVGDSWRNELYQMLIQSAD---NVLANLENNDNLGLVIADIPSF-FRYTKIVDPWNE 435 Query: 271 NR--EMICTLAGRMGI--TFQDQKLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVH 323 NR + + L RM + L F GT W + + L P+ +L L+ P Sbjct: 436 NRFADGMNELWERMNLGRQIDFNNLSTFIMSYGTFIWFKHDTLKPLFDLELTDDEIPSEP 495 Query: 324 KALDGEIEHAVERCFSLSVKKANF 347 I H++ER N+ Sbjct: 496 IP-QHTILHSIERILVYLAWANNY 518 >gi|312867647|ref|ZP_07727853.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] gi|311096710|gb|EFQ54948.1| rhamnan synthesis protein F [Streptococcus parasanguinis F0405] Length = 582 Score = 242 bits (619), Expect = 6e-62, Method: Composition-based stats. Identities = 74/262 (28%), Positives = 115/262 (43%), Gaps = 28/262 (10%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA-- 167 + K+A+ +H++Y+DL E + +DL +T + K I KI Sbjct: 278 LAENFDRKVAVHLHVFYVDLLEEFLDAFQAFHFVYDLWITT--DVEEKKQTIEKILSNRA 335 Query: 168 --ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 A + + N GRDVLP L+L EQLS YDYV H KKSK + W G+ WR+ L Sbjct: 336 QDATVVVTGNIGRDVLPMLLL--KEQLSQYDYVGHFHTKKSKEADF--WAGESWRKELIE 391 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREM----ICTLAGR 281 L+ +I+ + + +G+ + + +Y + N + + L R Sbjct: 392 MLVKPAD---QILANMEANPKVGITIADIPTF-FRYNRIVVAW--NEALISPEMNKLWER 445 Query: 282 MGI--TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 MG T + L+ F GT W + +AL P+ +L L+ P I HA+ER Sbjct: 446 MGAAKTIDFKNLNTFVMSYGTFVWFKYDALKPLFDLNLTAANVPAEPLP-QNSILHAIER 504 Query: 337 CFSLSV--KKANFRISDVDCIL 356 +K +FRIS +L Sbjct: 505 LLIYIAWDQKYDFRISQNPHVL 526 >gi|13476281|ref|NP_107851.1| hypothetical protein mlr7560 [Mesorhizobium loti MAFF303099] gi|14027042|dbj|BAB53996.1| mlr7560 [Mesorhizobium loti MAFF303099] Length = 637 Score = 242 bits (618), Expect = 7e-62, Method: Composition-based stats. Identities = 61/247 (24%), Positives = 104/247 (42%), Gaps = 12/247 (4%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKI--FPAARIH 171 KIA+ H+YY D+ EI L N+ + +D T T + A I++ + K + Sbjct: 297 GKIAVCAHIYYTDMLEEILALTGNIPVPYDFIATTDTPDKKAEIEATLAKRPGVKNVIVR 356 Query: 172 IME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++E N GRD+ I L + + YD VC++H KKS + S +L++R + +LL Sbjct: 357 VVEKNRGRDMSSLFISLRDLLVDDRYDLVCRLHTKKSPQVQASRS--NLFKRHMLENLLN 414 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 G V ++ F + +G+ Y + NR + A + I + Sbjct: 415 TRGYVHNVLDMFHDNPSVGLAVPPVVHI--SYPTMGHAWFFNRPKVEETARLLNIKVKFD 472 Query: 290 K--LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 GTMFW R AL + + +DG + H +ER + + + A + Sbjct: 473 HDTPVAAYGTMFWFRPRALRKMFEHKWKWEDFNAEPNHVDGGLAHVLERLIAYAAQDAGY 532 Query: 348 RISDVDC 354 + C Sbjct: 533 TTRHIMC 539 >gi|288905572|ref|YP_003430794.1| polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] gi|288732298|emb|CBI13867.1| Putative polysaccharide biosynthesis protein (RgpF) [Streptococcus gallolyticus UCN34] Length = 586 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 68/264 (25%), Positives = 115/264 (43%), Gaps = 19/264 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + V + K+A+ +H +Y+DL E N N +DL +T T Sbjct: 264 LPTPPYLLDRKVLKVVEKEYSNTKKVAVHLHTFYVDLLEEFLNQFENFHFDYDLFLTTDT 323 Query: 153 E--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E A I+S + K A++ + N GRD++P L L E+LS+YDY+ H KKS Sbjct: 324 EAKKAEIESILEKNGKIAQVFLTGNRGRDIIPMLKL--KEELSSYDYIGHFHTKKSPE-- 379 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y +W GD WR L+ L+ + I+ + + ++G++ + + +Y + Sbjct: 380 YPYWVGDSWRNELYQMLIQSAD---NILANLENNDNLGLVIADIPSF-FRYTKIVDPWNE 435 Query: 271 NR--EMICTLAGRMGIT--FQDQKLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVH 323 NR + + L M + L F GT W + + L P+ +L L+ P Sbjct: 436 NRFADGMNELWECMNLERQIDFNNLSTFIMSYGTFIWFKRDTLKPLFDLELTDDEIPSEP 495 Query: 324 KALDGEIEHAVERCFSLSVKKANF 347 I H++ER N+ Sbjct: 496 IP-QHTILHSIERILVYLAWANNY 518 >gi|219670466|ref|YP_002460901.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] gi|219540726|gb|ACL22465.1| Rhamnan synthesis F [Desulfitobacterium hafniense DCB-2] Length = 606 Score = 240 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 99/266 (37%), Gaps = 8/266 (3%) Query: 100 GWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIK 158 + P+ V + + K+ + H+YY DL + + ++ D+ +T E I Sbjct: 275 NYFLPSDYVVKPLKRQPKVVVCFHVYYEDLLDSCFHYMQSIPQFADIVITTPKKELVGII 334 Query: 159 SEILKIFP--AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEG 216 E +K + I ++ GR FL+ + + + +YDY C +H KKS G Sbjct: 335 EEKIKSYELNNTTIKVINARGRAESAFLVATK-DFILDYDYACIVHDKKSSFL-RPGCVG 392 Query: 217 DLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMIC 276 + LL V I+ F+ + IG + + N Y G N + Sbjct: 393 VEFGLQNLDALLATSAYVENILSIFEDNPRIGALEPVHLLHANFRDLYGGEWGANYKGTE 452 Query: 277 TLAGRMGITF---QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 R GI D G MFW R + I ++ P+ LDG + H Sbjct: 453 EFLKRAGIDLLISPDVPPLAPMGAMFWFRPICMKRILDMEWEYEDFPEEPLPLDGSLIHI 512 Query: 334 VERCFSLSVKKANFRISDVDCILGYR 359 +ER + V+ A + V I Sbjct: 513 IERAYPFIVQDAGYLTGWVSTIEDAE 538 >gi|322385732|ref|ZP_08059376.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] gi|321270470|gb|EFX53386.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus cristatus ATCC 51100] Length = 598 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 73/272 (26%), Positives = 121/272 (44%), Gaps = 21/272 (7%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + + Q KIA+ +H YY+DL + N ++DL +T + Sbjct: 264 LPTPPYLLDRKVIEESSQTYSDTKKIAVHLHTYYVDLLEDFLKQFENFHFTYDLFLTTDS 323 Query: 153 ESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E I++ +LK +I+I N GRD++P L L E+L YDY+ H KKS Sbjct: 324 EKKKLEIEAVLLKRNQLGKIYITGNKGRDIIPMLKL--REELCTYDYIGHFHTKKSPE-- 379 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y +W GD WR LF LL + I+ + + + +G++ + + +Y + Sbjct: 380 YPYWVGDSWRNELFDMLLKPADL---IMASLENDKRLGLVIADIPTF-FRYTKIVDPWNE 435 Query: 271 NR--EMICTLAGRMGIT--FQDQKLDFF---AGTMFWVRTEALDPIKNLRLSRYFEPKVH 323 N+ + + L RM I KL+ F GT W + +AL P+ +L L P Sbjct: 436 NKFADDMNILWERMDINRSIDFNKLNTFIMSYGTFIWFKYDALKPLFDLNLQDEDIPSEP 495 Query: 324 KALDGEIEHAVERCFSLSV--KKANFRISDVD 353 I H++ER ++ ++ IS D Sbjct: 496 LP-QHTILHSIERILVYLAWSQRFDYAISKND 526 >gi|291516581|emb|CBK70197.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum subsp. longum F8] Length = 688 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 99/261 (37%), Gaps = 13/261 (4%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 + P+ Q + + A + H+Y++DL + + +++L DL++T + E Sbjct: 310 YVLPSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIRE 369 Query: 161 ILKIFP---AARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSW 213 ++ A + N GRDV L+ LS YD + H KKS + G+ Sbjct: 370 YMQQHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHG 429 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-CDYTCSLGKNR 272 E + L + LG+ V I+ F + +G + + + G N Sbjct: 430 TESQGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANY 489 Query: 273 EMICTLAG-RMGITFQDQKLDF---FAGTMFWVRTEALDPIKNLRLSRYFE-PKVHKALD 327 E+ L R+GI G+ +W R EAL P+ P+ D Sbjct: 490 EITKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGED 549 Query: 328 GEIEHAVERCFSLSVKKANFR 348 G I HA+ER + + Sbjct: 550 GTISHAIERANGYICQSRGYY 570 >gi|325928537|ref|ZP_08189725.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] gi|325541076|gb|EGD12630.1| Lipopolysaccharide biosynthesis protein [Xanthomonas perforans 91-118] Length = 1415 Score = 240 bits (612), Expect = 4e-61, Method: Composition-based stats. Identities = 71/264 (26%), Positives = 122/264 (46%), Gaps = 18/264 (6%) Query: 95 PQLLHGWE-----SPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT 149 P +G+ A+ + + ++A+V+H +Y ++ E+ L + + + L ++ Sbjct: 1141 PDRKYGYAYLEATRRALRRPVMPRTPERVAVVIHAFYPEILPEMLKELQSWDVPYFLIIS 1200 Query: 150 LVTESASIKSEILKIFPA-ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 V + A L A + + EN GRD+LPFL +++ + V K+H K+S Sbjct: 1201 TVADKADEVRGYLADLSVVADVRVFENRGRDILPFLEIMK-DLRGRESLVLKLHTKRSLH 1259 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 + +G+ WRR + LL AP V +I F +G+ + T Sbjct: 1260 RQ----DGESWRRDMLEKLL-APKVASEIFAAFREQERLGLAAPEGHIL-----SMTTYW 1309 Query: 269 GKNREMICTLAGRMGITF-QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALD 327 G N + + L+ +M + F AG+MF+VR EA+D I +L L R +D Sbjct: 1310 GANADTVHRLSKQMHVDPVNPVTAMFAAGSMFYVRPEAIDSIMDLDLRREDFEPEAGQVD 1369 Query: 328 GEIEHAVERCFSLSVKKANFRISD 351 G + HA+ERCFSL+V + I+ Sbjct: 1370 GTLAHAIERCFSLAVCSTGYYIAS 1393 >gi|322690050|ref|YP_004209784.1| hypothetical protein BLIF_1872 [Bifidobacterium longum subsp. infantis 157F] gi|320461386|dbj|BAJ72006.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 672 Score = 239 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 99/261 (37%), Gaps = 13/261 (4%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 + P+ Q + + A + H+Y++DL + + +++L DL++T + E Sbjct: 288 YVLPSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIRE 347 Query: 161 ILKIFP---AARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSW 213 ++ A + N GRDV L+ LS YD + H KKS + G+ Sbjct: 348 YMQQHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHG 407 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-CDYTCSLGKNR 272 E + L + LG+ V I+ F + +G + + + G N Sbjct: 408 TESQGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANY 467 Query: 273 EMICTLAG-RMGITFQDQKLDF---FAGTMFWVRTEALDPIKNLRLSRYFE-PKVHKALD 327 E+ L R+GI G+ +W R EAL P+ P+ D Sbjct: 468 EITKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGED 527 Query: 328 GEIEHAVERCFSLSVKKANFR 348 G I HA+ER + + Sbjct: 528 GTISHAIERANGYICQSRGYY 548 >gi|189440434|ref|YP_001955515.1| lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|317482688|ref|ZP_07941702.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] gi|189428869|gb|ACD99017.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium longum DJO10A] gi|316915934|gb|EFV37342.1| rhamnan synthesis protein F [Bifidobacterium sp. 12_1_47BFAA] Length = 666 Score = 239 bits (611), Expect = 5e-61, Method: Composition-based stats. Identities = 58/261 (22%), Positives = 99/261 (37%), Gaps = 13/261 (4%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 + P+ Q + + A + H+Y++DL + + +++L DL++T + E Sbjct: 288 YVLPSQAINPQTHDRPRSAFIYHVYFMDLLEDTCHYIASLPEETDLYITSTEDKIPQIRE 347 Query: 161 ILKIFP---AARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYSW 213 ++ A + N GRDV L+ LS YD + H KKS + G+ Sbjct: 348 YMQQHGISHQATFIPVINRGRDVSALLVAACPVVLSGKYDVIGFAHDKKSSQNQENGHHG 407 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-CDYTCSLGKNR 272 E + L + LG+ V I+ F + +G + + + G N Sbjct: 408 TESQGFAYKLMENTLGSEAYVKNILTLFAENPRLGQVTPPPPYHALYFAHTIPHDWGANY 467 Query: 273 EMICTLAG-RMGITFQDQKLDF---FAGTMFWVRTEALDPIKNLRLSRYFE-PKVHKALD 327 E+ L R+GI G+ +W R EAL P+ P+ D Sbjct: 468 EITKELLEDRLGIHVPLSPTKPTASAMGSCYWFRVEALKPLFEYGWKYEDFLPEGQMGED 527 Query: 328 GEIEHAVERCFSLSVKKANFR 348 G I HA+ER + + Sbjct: 528 GTISHAIERANGYICQSRGYY 548 >gi|302337198|ref|YP_003802404.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] gi|301634383|gb|ADK79810.1| Rhamnan synthesis F [Spirochaeta smaragdinae DSM 11293] Length = 1808 Score = 239 bits (609), Expect = 6e-61, Method: Composition-based stats. Identities = 69/305 (22%), Positives = 124/305 (40%), Gaps = 28/305 (9%) Query: 55 YDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAI 114 + + YG +K + + E H F + +G P + +I + Sbjct: 961 NPLFHFIKYGYSENRKMVHPGD---INSEFHR------SFGKSEYGVTGPVLYYDREIQL 1011 Query: 115 KAK----IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV---TESASIKSEILKIFP- 166 + I + +HL+YIDL E+ + L N+ + F L ++ + IK + K P Sbjct: 1012 NPRFNLSIGVHLHLFYIDLAEELLSSLINIPVCFSLFISTSAGVKDQEYIKKIVNKKLPL 1071 Query: 167 --AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 + EN GRD+ PF++ LS +D + H KKS RR+L Sbjct: 1072 CNECTVIQTENRGRDIAPFIVEFGN-SLSQFDLILHFHSKKSLHSDSLSDA----RRFLL 1126 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI 284 + +LG + + + F + IGM+ + ++ + +MGI Sbjct: 1127 HYILGNKAITIQNLNMFFENGSIGMVAPPYHPSLRNMPNFGLQEYE----TKQFLKKMGI 1182 Query: 285 TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 + + DF AG+ FW R +A+ + + P+ +DG + H +ER + K+ Sbjct: 1183 NYSGKCTDFPAGSFFWCRKDAIRQLLTSNIRWNSFPEEKGQIDGTLAHVIERSLGIICKQ 1242 Query: 345 ANFRI 349 NF+I Sbjct: 1243 NNFKI 1247 >gi|94988294|ref|YP_596395.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94992170|ref|YP_600269.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] gi|94541802|gb|ABF31851.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS9429] gi|94545678|gb|ABF35725.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS2096] Length = 581 Score = 239 bits (609), Expect = 7e-61, Method: Composition-based stats. Identities = 67/247 (27%), Positives = 112/247 (45%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T+ DIG+I + + ++ + N +I L +M + Sbjct: 399 --SILSAFETN-DIGIIIADIPSF-FRFNKIVNAW--NEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|94990172|ref|YP_598272.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] gi|94543680|gb|ABF33728.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10270] Length = 581 Score = 239 bits (609), Expect = 7e-61, Method: Composition-based stats. Identities = 67/247 (27%), Positives = 112/247 (45%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T+ DIG+I + + ++ + N +I L +M + Sbjct: 399 --SILSAFETN-DIGIIIADIPSF-FRFNKIVNAW--NEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|332535404|ref|ZP_08411194.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis ANT/505] gi|332035169|gb|EGI71680.1| glycosyl transferase, group 1 [Pseudoalteromonas haloplanktis ANT/505] Length = 672 Score = 239 bits (609), Expect = 7e-61, Method: Composition-based stats. Identities = 62/268 (23%), Positives = 106/268 (39%), Gaps = 11/268 (4%) Query: 85 HFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISF 144 H + P + K+A+ H++Y + L+ + Sbjct: 132 HGKKEGRLALPAKAKWYPKAIASSANGEPTTLKLAMCFHVFYGEFIDYYCGALAKFTQQV 191 Query: 145 DLHVTLVTESASIKS----EILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCK 200 D+ V++ +E + K+ + + ++ NHGR+ P L+ ++ L NYD C Sbjct: 192 DVFVSVASEELAKKAIHDFKACSKVNKVVVKVVPNHGRNFGPMLVEFASD-LQNYDLFCH 250 Query: 201 IHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNK 260 +H KKS YS W +L LL P V+ +++ F+ + G+ ++ Sbjct: 251 MHSKKSL---YSGRAQTQWADYLGEYLLNDPHVIKQVLNHFNDNPKSGLYYPTSFWMM-- 305 Query: 261 YCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEP 320 D+ KN+ + I +D L + AG MFW R EAL + N P Sbjct: 306 -PDWVNHWLKNKPAAQKFTKKWNIELKDDFLAYPAGGMFWARPEALKQLLNKEYKYDDFP 364 Query: 321 KVHKALDGEIEHAVERCFSLSVKKANFR 348 DG HA+ER L V+K ++ Sbjct: 365 GEPLPNDGSQLHALERMLGLLVEKNGYK 392 >gi|21910063|ref|NP_664331.1| hypothetical protein SpyM3_0527 [Streptococcus pyogenes MGAS315] gi|28896239|ref|NP_802589.1| hypothetical protein SPs1327 [Streptococcus pyogenes SSI-1] gi|21904254|gb|AAM79134.1| putative protein [Streptococcus pyogenes MGAS315] gi|28811490|dbj|BAC64422.1| conserved hypothetical protein [Streptococcus pyogenes SSI-1] Length = 581 Score = 239 bits (609), Expect = 7e-61, Method: Composition-based stats. Identities = 67/247 (27%), Positives = 112/247 (45%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T+ DIG+I + + ++ + N +I L +M + Sbjct: 399 --SILSAFETN-DIGIIIADIPSF-FRFNKIVNAW--NEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|50913971|ref|YP_059943.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] gi|50903045|gb|AAT86760.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10394] Length = 581 Score = 239 bits (609), Expect = 8e-61, Method: Composition-based stats. Identities = 67/247 (27%), Positives = 112/247 (45%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T+ DIG+I + + ++ + N +I L +M + Sbjct: 399 --SILSAFETN-DIGIIIADIPSF-FRFNKIVNAW--NEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|78213552|ref|YP_382331.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] gi|78198011|gb|ABB35776.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9605] Length = 1162 Score = 238 bits (608), Expect = 9e-61, Method: Composition-based stats. Identities = 57/302 (18%), Positives = 113/302 (37%), Gaps = 28/302 (9%) Query: 51 KKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVM 110 + + +G + +HF + + + + + Sbjct: 152 ESGLNPMVHCFKFG---------MNEHRYSMDNIHFMRKMADIKKTVTDDFNA-----IK 197 Query: 111 QIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPA--- 167 + I K+ I +H++Y +L IA L N+ S D+ ++ +S + +I Sbjct: 198 EGLINKKVGIFLHIFYPELGETIAAYLKNIPCSIDVFISTREDSVAALEKIFARVENTQK 257 Query: 168 ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 + N GRDV PF++ +Q+ NYDY+ K+H KKS W +L Sbjct: 258 IEVRHFSNIGRDVAPFIVGFR-DQILNYDYILKLHSKKSPHSNALSG----WFLHCLDNL 312 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG---KNREMICTLAGRMGI 284 +G+ + ++ + ++G++ + + G N R + Sbjct: 313 IGSEAITATNLKALQS-PEVGIVYPIENYALSLGIQHDSCWGHEDGNYAKARPFLNRYNL 371 Query: 285 T--FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV 342 ++ + F GTMFW + L I + L+ + +DG I H++ER +S Sbjct: 372 RQIKRESQFQFPTGTMFWCKPAVLQSILDWGLNWNNFDEEGGQIDGTIAHSIERLIGIST 431 Query: 343 KK 344 + Sbjct: 432 TE 433 >gi|225868697|ref|YP_002744645.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] gi|225701973|emb|CAW99527.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. zooepidemicus] Length = 581 Score = 238 bits (608), Expect = 1e-60, Method: Composition-based stats. Identities = 75/276 (27%), Positives = 121/276 (43%), Gaps = 22/276 (7%) Query: 93 HFPQLLHGWESPAMGK-VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 H P + + + + I KIA+ +H++Y+DL E S+ +DL +T Sbjct: 260 HLPDAKYLLAHKYLPEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTD 319 Query: 152 T--ESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 + + A IK + + +A I + N GRDVLP L L E+LS YDY+ H KKSK Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L ++ +I+ IG++ + + ++ + Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAAD-AIGIVIADIPSF-FRFNKIVDAWN 430 Query: 270 KN--REMICTLAGRMGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + L MG+T Q +D F GT W + +AL P+ +L LS P Sbjct: 431 EHLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAE 490 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS + +L Sbjct: 491 PLP-QNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525 >gi|332995244|gb|AEF05299.1| hypothetical protein ambt_19030 [Alteromonas sp. SN2] Length = 638 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 65/280 (23%), Positives = 119/280 (42%), Gaps = 15/280 (5%) Query: 73 AQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIE 132 ++ R H ++ + + + + K + K+AIV+H++Y D + Sbjct: 93 PLTHFMYHGRHEHRASSKVMN--RWFPSTDL--VAKETSSWKQQKVAIVLHIFYPDFVDK 148 Query: 133 IANLLSNLSISFDLHVTLVTESASIKSEI----LKIFPAARIHIMENHGRDVLPFLILLE 188 A + + D+ +T T++ K+ LK + + EN GR+ PFL+ Sbjct: 149 FAASVQRFPTNVDVFITAGTDAIKNKALKTFNGLKNVQKVQAVLCENRGRNFGPFLVNF- 207 Query: 189 TEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIG 248 +++L +YD +C +H KKS YS E W + LL VV I+R FD ++G Sbjct: 208 SDELLDYDLMCHLHSKKSL---YSGREQTQWFDYQNNFLLKDKHVVKSILRLFDEREELG 264 Query: 249 MIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDP 308 + ++ + N+ + + G+ D +++ G MFW R +A+ P Sbjct: 265 IYYPTSFWMM---PAWVNHWTCNKGISQDFVDKWGLDISDDFVNYPVGGMFWARPKAIAP 321 Query: 309 IKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 + + P+ DG HA+ER L +K F+ Sbjct: 322 LLKEKFEYSDFPEEPLPNDGSWLHALERSIGLLAEKQGFK 361 >gi|261868364|ref|YP_003256286.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] gi|3132260|dbj|BAA28137.1| unnamed protein product [Actinobacillus actinomycetemcomitans] gi|261413696|gb|ACX83067.1| lipopolysaccharide biosynthesis protein [Aggregatibacter actinomycetemcomitans D11S-1] Length = 632 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 67/250 (26%), Positives = 114/250 (45%), Gaps = 13/250 (5%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE---ILK 163 KV ++ KI +V H+YY D+ EI + N+ S+DL +T E + ++ E ILK Sbjct: 285 SKVEKVRSDIKILVVAHIYYSDMLDEIISYTQNIPCSYDLLITTANEKSKLEIESNPILK 344 Query: 164 IFPA--ARIHIME-NHGRDVLPFLILLETEQLS-NYDYVCKIHGKKSKRKGYSWWEGDLW 219 + A + ++E N GRD+ I + E +S YD+VC++H KKS + ++ + Sbjct: 345 MSGAKGINVKVVEQNRGRDMSSLFITCKQEIISERYDWVCRLHSKKSPQNSHNMSIH--F 402 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA 279 + ++ ++L + K+I D ++ IG + + NR++ +A Sbjct: 403 KEMMYLNILKDKAYISKVINYLDKNKSIGFAMPSMVHIGHPTLGHA--WFTNRDLAIKIA 460 Query: 280 GRMGITFQDQKLDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 R+GI + FA GTMFW R EAL + K D + H +ER Sbjct: 461 ERVGIKLPFDDISPFAAYGTMFWFRPEALKKLFEYNWKFEDFNKEPMHQDSSLAHILERL 520 Query: 338 FSLSVKKANF 347 + A + Sbjct: 521 LVYAAHDAGY 530 >gi|285019449|ref|YP_003377160.1| hypothetical protein XALc_2689 [Xanthomonas albilineans GPE PC73] gi|283474667|emb|CBA17166.1| conserved hypothetical protein [Xanthomonas albilineans] Length = 686 Score = 237 bits (606), Expect = 2e-60, Method: Composition-based stats. Identities = 76/325 (23%), Positives = 121/325 (37%), Gaps = 28/325 (8%) Query: 41 WFWTLFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHG 100 W + R + E +V + + A + D R P + Sbjct: 376 WLRATIHTRLQ-GRRAEQRLVFVNAWNEWAEGAVLEPDTRLGHAYLDATRRALSPARVRE 434 Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 +P +VH +Y ++ E+ N L+ ++ + L VT + AS Sbjct: 435 ATAPHHA-------------IVHAWYPNVLPELLNPLAASALPWRLLVTTSPDQASAVQA 481 Query: 161 ILKI--FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDL 218 L+ FP + ++EN GRD+LPFL E D V K+H K+S GD Sbjct: 482 QLRDCSFP-YEVMVLENRGRDILPFLHAGERLLQDGVDVVLKLHTKRSTHLH----NGDA 536 Query: 219 WRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTL 278 WR L L G +I+ F +G++ + P G NR L Sbjct: 537 WRSELLQRLAG-ADRAARILEAFAQDPMLGLVAPEGHLLPL-----ADFWGGNRMAADYL 590 Query: 279 AGRMG-ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 R G + F +G+MFWVR AL P+ + L +DG + HA ER Sbjct: 591 LRRTGYTDVCLDEAHFISGSMFWVRLHALRPLLDSHLCPSEFEPEQGQIDGTLAHAAERV 650 Query: 338 FSLSVKKANFRISDVDCILGYRKSL 362 +L + +R++ ++G Sbjct: 651 TALLAQHRGYRVATAAELIGAPSPP 675 >gi|262282406|ref|ZP_06060174.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] gi|262261697|gb|EEY80395.1| rhamnosyltransferase [Streptococcus sp. 2_1_36FAA] Length = 582 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 70/274 (25%), Positives = 115/274 (41%), Gaps = 25/274 (9%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + I+I KIA+ +H++Y+DL E + + S+DL +T + Sbjct: 260 LPDYPYLLGRKILDLSLPISIPDKKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDS 319 Query: 153 ESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E I + A + + N GRDVLP L L LS YDY+ H KKSK Sbjct: 320 EKKKNEILDILEGKQAKAEVLVTGNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEAD 377 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y W G+ WR+ L L+ +I+ +G++ + P+ + + Sbjct: 378 Y--WAGESWRKELINMLVHPAD---QIVSQLGQDDRLGLVIADI---PSFFRFNRIVVAW 429 Query: 271 NREM----ICTLAGRMGI--TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPK 321 N + + L RM ++++ F GT W + +AL P+ +L L+ P Sbjct: 430 NEALISPEMNKLWERMNCQKEVDFKQMNTFVMSYGTFVWFKYDALSPLFDLNLTEEDVPS 489 Query: 322 VHKALDGEIEHAVERCFSLSV--KKANFRISDVD 353 I HA+ER K+ +F+IS Sbjct: 490 EPLP-QNSILHAIERLLVYIAWDKQYDFKISYNQ 522 >gi|298290915|ref|YP_003692854.1| Rhamnan synthesis F [Starkeya novella DSM 506] gi|296927426|gb|ADH88235.1| Rhamnan synthesis F [Starkeya novella DSM 506] Length = 633 Score = 237 bits (604), Expect = 3e-60, Method: Composition-based stats. Identities = 72/258 (27%), Positives = 110/258 (42%), Gaps = 16/258 (6%) Query: 101 WESPAMGK-VMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA-SIK 158 W P G +A K+ + H +Y DL E+ L+ + DL +T T + Sbjct: 378 WAVPVFGPPAAPVASPLKVGLHGHFFYPDLLPELLERLAANASRPDLFLTTDTPAKVEQL 437 Query: 159 SEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEG 216 + +P RI ++ N GRD+ PFL L YD + +HGKK+K G G Sbjct: 438 RALTAAWPAKVRIDVVPNSGRDIGPFLTALRDVLTGGEYDVLLHLHGKKTK--GRRRAIG 495 Query: 217 DLWRRWLFYDLLGAPGVVFK-IIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 D WR +L+ +L+G + ++ H +G++ + +N ++ Sbjct: 496 DPWRNFLWENLIGGDHPMLDAVLAYMAAHPQVGLVYPE--------DTHLLDWARNGRVV 547 Query: 276 CTLAGRMGITFQ-DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 L MG+T +DF G MF VR AL P+ L L P LDG + H + Sbjct: 548 EELRRDMGLTEPMGTYVDFPVGNMFAVRPAALAPVLALDLKWSDYPVEPIPLDGTVLHGI 607 Query: 335 ERCFSLSVKKANFRISDV 352 ER V+KA F + V Sbjct: 608 ERLLPTVVRKAGFTTAAV 625 >gi|195977971|ref|YP_002123215.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195974676|gb|ACG62202.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 581 Score = 236 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 75/276 (27%), Positives = 121/276 (43%), Gaps = 22/276 (7%) Query: 93 HFPQLLHGWESPAMG-KVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 H P + + + + IA KIA+ +H++Y DL E S+ +DL +T Sbjct: 260 HLPDAKYLLAHKYLSNQPISIAPSKKIAVHLHVFYADLLSEFLEAFSHFHFDYDLLITTD 319 Query: 152 T--ESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK 209 + + A IK + + +A I + N GRDVLP L L E+LS YDY+ H KKSK Sbjct: 320 SKAKKAEIKEILRESGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEA 377 Query: 210 GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLG 269 + W G WR L ++ +I+ IG++ + + ++ + Sbjct: 378 DF--WAGQSWRTELIDMMVKPAD---QILTALAAD-AIGIVIADIPSF-FRFNKIVDAWN 430 Query: 270 KN--REMICTLAGRMGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + L MG+T Q +D F GT W + +AL P+ +L L+ P Sbjct: 431 EHLIAPEMNQLWQAMGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLGLNEADIPAE 490 Query: 323 HKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 I HA+ER + +FRIS + +L Sbjct: 491 PLP-QNSILHAIERLLIYIAWDRHYDFRISRNEKLL 525 >gi|222152862|ref|YP_002562039.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] gi|222113675|emb|CAR41606.1| rhamnan synthesis protein F family protein [Streptococcus uberis 0140J] Length = 585 Score = 236 bits (603), Expect = 4e-60, Method: Composition-based stats. Identities = 78/275 (28%), Positives = 123/275 (44%), Gaps = 24/275 (8%) Query: 93 HFPQLLHGWESPAMGKVMQIAIK---AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT 149 H+P + + K + +K IA+ +H++Y+DL E + ++ FDL++T Sbjct: 260 HYPDAPYLLSQKYLEKQEESDLKVSEHSIAVHLHVFYVDLLEEFLHAFTSFKFPFDLYIT 319 Query: 150 LV-TESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK 207 +E S IL F +A+I + N GRDVLP L L ++LS YDY+ H KKSK Sbjct: 320 TDKSEKESEIKAILDSFRVSAKIVVTGNIGRDVLPMLKL--KDELSQYDYIGHFHTKKSK 377 Query: 208 RKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS 267 + W G+ WR L L+ II F+ IG+I + + ++ Sbjct: 378 EADF--WAGESWRNELIDMLIKPANT---IINQFE-DPAIGIIIADIPSF-FRFNKIVTP 430 Query: 268 LGKN--REMICTLAGRMGI--TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEP 320 L ++ + L +M + T ++ D F GT W + +AL P+ +L L P Sbjct: 431 LNEHLIAPEMNKLWEKMNLSKTIDFEQFDTFVMSYGTFVWFKYDALKPLFDLNLKDGDVP 490 Query: 321 KVHKALDGEIEHAVERCFSLSV--KKANFRISDVD 353 K I HAVER +FRI+ + Sbjct: 491 KEPLP-QNSILHAVERLLIYIAWDSHFDFRIAKNN 524 >gi|298346187|ref|YP_003718874.1| hypothetical protein HMPREF0573_11061 [Mobiluncus curtisii ATCC 43063] gi|304390053|ref|ZP_07372007.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|298236248|gb|ADI67380.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 43063] gi|304326535|gb|EFL93779.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 680 Score = 236 bits (603), Expect = 4e-60, Method: Composition-based stats. Identities = 63/250 (25%), Positives = 116/250 (46%), Gaps = 14/250 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAAR--- 169 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + Sbjct: 49 DCRLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLV 108 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK---GYSWWEGDLWRRWLFYD 226 + +ENHGRD+ P + L+ L Y + K+H KKS + G W+ Sbjct: 109 VVPVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRENHPDLEGSGAQWKDEFLDA 168 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 LLG+ V KI+ F +G++ + ++ G ++ + L R+ + Sbjct: 169 LLGSKDSVEKIMSAFGADPWLGLVTAPGNIVGPQF------WGGDQALTAELLRRLEMQL 222 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 + KL F AG+M+WVR + +++L LS +D HA+ER + +A Sbjct: 223 KPSKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAG 282 Query: 347 FRISDVDCIL 356 ++ + + + Sbjct: 283 LKLRETNQLA 292 >gi|315654770|ref|ZP_07907675.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] gi|315490731|gb|EFU80351.1| group 2 glycosyl transferase [Mobiluncus curtisii ATCC 51333] Length = 680 Score = 236 bits (602), Expect = 5e-60, Method: Composition-based stats. Identities = 63/250 (25%), Positives = 117/250 (46%), Gaps = 14/250 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAAR--- 169 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + Sbjct: 49 DCRLAVVMHVYYPDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLV 108 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSW---WEGDLWRRWLFYD 226 + +ENHGRD+ P + L+ L Y + K+H KKS + G W+ Sbjct: 109 VVPVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDA 168 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 LLG+ V KI+ F + +G++ + ++ G ++ + L R+ + Sbjct: 169 LLGSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF------WGGDQALTAELLRRLEMQL 222 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 + KL F AG+M+WVR + +++L LS +D HA+ER + +A Sbjct: 223 KPSKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAG 282 Query: 347 FRISDVDCIL 356 ++ + + + Sbjct: 283 LKLRETNQLA 292 >gi|225870347|ref|YP_002746294.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] gi|225699751|emb|CAW93520.1| rhamnan synthesis protein F family protein [Streptococcus equi subsp. equi 4047] Length = 581 Score = 236 bits (602), Expect = 5e-60, Method: Composition-based stats. Identities = 73/262 (27%), Positives = 118/262 (45%), Gaps = 21/262 (8%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILK 163 + + I KIA+ +H++Y+DL E S+ +DL +T + + A IK + + Sbjct: 274 PEQPISIDQSKKIAVHLHVFYVDLLSEFLEAFSHFHFDYDLLITTDSKAKKAEIKEILRE 333 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWL 223 +A I + N GRDVLP L L E+LS YDY+ H KKSK + W G WR L Sbjct: 334 SGASADILVTGNIGRDVLPMLTL--KERLSQYDYIGHFHTKKSKEADF--WAGQSWRTEL 389 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN--REMICTLAGR 281 ++ +I+ IG++ + + ++ + ++ + L Sbjct: 390 IDMMVKPAD---QILTALAAD-AIGIVIADIPSF-FRFNKIVDAWNEHLIAPEMNQLWQA 444 Query: 282 MGIT--FQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 MG+T Q +D F GT W + +AL P+ +L LS P + I HA+ER Sbjct: 445 MGLTKRIDFQTMDTFVMSYGTFVWFKYDALKPLFDLDLSEADIPAEPLS-QNSILHAIER 503 Query: 337 CFSLSV--KKANFRISDVDCIL 356 + +FRIS + +L Sbjct: 504 LLIYIAWDRHYDFRISRNEKLL 525 >gi|134297301|ref|YP_001121036.1| lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] gi|134140458|gb|ABO56201.1| Lipopolysaccharide biosynthesis protein-like protein [Burkholderia vietnamiensis G4] Length = 1231 Score = 234 bits (597), Expect = 2e-59, Method: Composition-based stats. Identities = 81/322 (25%), Positives = 133/322 (41%), Gaps = 12/322 (3%) Query: 36 GGFFFWFWTLFYKRSKKLCYDENYVVAYGSRS-GKKFFAQSNLYMMERELHFDGQRIHHF 94 + W F E V + + + + LH + + Sbjct: 915 ESYKSWLRRCFDFVLSNNKQSERLVFINAWNEWAEGTYLEPDRRYGYAYLHATADLLRQY 974 Query: 95 PQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE- 153 ES + + K + A+V HLYY DL E+ +L+ +++ D +T+ Sbjct: 975 YNSEDLDESIKINN-QRFVKKNENALVAHLYYFDLLPELLSLIER-NVNLDAFITIPVHF 1032 Query: 154 SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSW 213 S EIL + ++N GRD+LPFL + + +Y + K+H KKS ++ Sbjct: 1033 SREQVGEILASLDNVYVLRVQNRGRDILPFLNIYPIIKSYSYANLVKVHSKKSPQRA--- 1089 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNRE 273 +G L R+ +LL P +V ++R +T IG+I DY + NR+ Sbjct: 1090 -DGALLRKRALLELL-DPSIVPGVLRALNTDPKIGLIAPSNSLCSLSNSDYLIN---NRK 1144 Query: 274 MICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 + R+G+ +F AG+MFW R +AL + +L L + LDG + HA Sbjct: 1145 QLNYCLSRLGLVDSSLNFEFIAGSMFWARVDALRMLSDLSLREEDFEEELGQLDGTLAHA 1204 Query: 334 VERCFSLSVKKANFRISDVDCI 355 +ER F K +R VD I Sbjct: 1205 IERLFCFLGKHVGYRTLPVDQI 1226 >gi|148927812|ref|ZP_01811237.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] gi|147886838|gb|EDK72383.1| Lipopolysaccharide biosynthesis protein-like protein [candidate division TM7 genomosp. GTL1] Length = 498 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 68/242 (28%), Positives = 112/242 (46%), Gaps = 15/242 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA--SIKSEILKIFPAARIHIME 174 ++A+VVH++Y +L EI +++ N+ FDL +T E A + + + I + E Sbjct: 240 RLAVVVHIFYPELANEIYDVIKNIVEPFDLIITTPHEGAVSELIDTFAPLASSVAIALSE 299 Query: 175 NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVV 234 N GRDV PFL + + L YD V K+H KKS G W++ LF L G +V Sbjct: 300 NRGRDVGPFLAVHRSGLLERYDAVLKLHSKKSTYSD----SGQQWQQSLFRQLCGNSQIV 355 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM---GITFQDQKL 291 + + GM+G Y Y + G NR + L + + +D L Sbjct: 356 RRSVALL-RDGKTGMVGPHDY-----YLTHPHYWGANRPAVHKLLQSLTATPLKEEDVPL 409 Query: 292 DFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISD 351 FFAGTMFW +A+ + ++ + + DG + HA+ER F + + + ++ Sbjct: 410 RFFAGTMFWFAPKAIVALHDIPEALLNFESENGKQDGTLAHALERLFGIVPQLGGYNVTS 469 Query: 352 VD 353 + Sbjct: 470 LA 471 >gi|315657309|ref|ZP_07910191.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315491781|gb|EFU81390.1| group 2 glycosyl transferase [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 680 Score = 234 bits (596), Expect = 2e-59, Method: Composition-based stats. Identities = 63/250 (25%), Positives = 117/250 (46%), Gaps = 14/250 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAAR--- 169 ++A+V+H+YY DL EI LSN+ + FD+ +T + + + +I + P + Sbjct: 49 DCRLAVVMHVYYSDLVTEIVQRLSNIPVEFDMFITNASGADLPLLPQQIHERLPLLKHLV 108 Query: 170 IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSW---WEGDLWRRWLFYD 226 + +ENHGRD+ P + L+ L Y + K+H KKS + G W+ Sbjct: 109 VVPVENHGRDIFPLVQLVNFGALDPYQLILKVHTKKSAWRESHPDLEGSGAQWKDEFLDA 168 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 LLG+ V KI+ F + +G++ + ++ G ++ + L R+ + Sbjct: 169 LLGSKDSVEKIMSAFGSDPWLGLVTAPGNIVGPQF------WGGDQALTAELLRRLEMQL 222 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 + KL F AG+M+WVR + +++L LS +D HA+ER + +A Sbjct: 223 KPSKLKFAAGSMYWVRGFVIQGLRSLGLSATDFETEAGQIDATTAHALERAIGILTTEAG 282 Query: 347 FRISDVDCIL 356 ++ + + + Sbjct: 283 LKLRETNQLA 292 >gi|190572709|ref|YP_001970554.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] gi|190010631|emb|CAQ44240.1| putative glycosyltransferase protein [Stenotrophomonas maltophilia K279a] Length = 707 Score = 233 bits (595), Expect = 3e-59, Method: Composition-based stats. Identities = 68/257 (26%), Positives = 108/257 (42%), Gaps = 11/257 (4%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAA 168 A ++ IV+H +Y+D E+ + + + L +T E A ++S I A Sbjct: 453 PSQAAPSRPCIVIHAWYLDALPELLQAVKDSGLQARLVITTTGERQAQVQSIIDAEGLTA 512 Query: 169 RIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLL 228 I + +NHGRDVLPFL + N V K+H K+S + GD WRR + LL Sbjct: 513 EIWVYDNHGRDVLPFLHAADRLLQQNESLVLKLHTKRSTHRD----NGDQWRREMVDALL 568 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G + + IG++ + +G N + + L +G+ Sbjct: 569 GTAQAAANLAHL-LANPSIGLMAPAGHLL-----KVADYIGGNAQRMERLWALLGLDSAP 622 Query: 289 QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 F +G+MFWVR AL P+ + L +DG + HA+ER V A F Sbjct: 623 GDGQFASGSMFWVRLPALRPLLDAHLLPSMFDTEAGQIDGTLAHAIERATGAVVSAAGFT 682 Query: 349 ISDVDCILGYRKSLSQN 365 ++D + G S + Sbjct: 683 VADTSEVEGAPPRASSS 699 >gi|302337197|ref|YP_003802403.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] gi|301634382|gb|ADK79809.1| glycosyl transferase family 2 [Spirochaeta smaragdinae DSM 11293] Length = 1100 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 84/305 (27%), Positives = 128/305 (41%), Gaps = 43/305 (14%) Query: 42 FWTLFYKRS-----KKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQ 96 F TLFY RS +K + + G R G + N +M+ER+ F + FP Sbjct: 128 FDTLFYLRSYPDVAEKGVNPLGHYIKIGWRKGNR----PNPFMVERDCTF----LPAFPL 179 Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESA 155 H S I +V H+Y+ DL +S++ FDL VT E+ Sbjct: 180 QDHSALS--------------IVVVFHIYHEDLVGSCLQYISHIPYPFDLIVTTPLEENN 225 Query: 156 SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE 215 ++ ++P A I +N GRD+ PFL + +++ YD CK+H KK + Sbjct: 226 DAILQVKSLYPDAEIVRSKNAGRDIGPFLQVW--DRVLQYDLCCKVHTKK-----GNSAY 278 Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 ++WR +L V I+R F+ + + G+ Y Y LGKN+++ Sbjct: 279 SEIWRDLSLRGILETVDTVHGILRMFEQEDSLALAGAELL-----YGSYQFLLGKNKDLS 333 Query: 276 CTLAGRMGITFQD-QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 +L I FF GTMFW+R + I L + P DG+ EHA+ Sbjct: 334 NSLIKDYNIPVNSYSNNGFFMGTMFWMRVK--KFIFLSNLKQLQFPIEDGKNDGKYEHAL 391 Query: 335 ERCFS 339 ER Sbjct: 392 ERLLG 396 >gi|157151529|ref|YP_001450315.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] gi|157076323|gb|ABV11006.1| rhamnosyltransferase [Streptococcus gordonii str. Challis substr. CH1] Length = 582 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 68/274 (24%), Positives = 115/274 (41%), Gaps = 25/274 (9%) Query: 94 FPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT 152 P + + + I++ KIA+ +H++Y+DL E + + S+DL +T + Sbjct: 260 LPDYPYLLGRKILDLSLPISLPDKKIAVHLHVFYVDLLAEFLHAFESFHFSYDLFITTDS 319 Query: 153 ESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG 210 E I + A + + N GRDVLP L L LS YDY+ H KKSK Sbjct: 320 EKKKNEILGILEGKQAKAEVFVTGNVGRDVLPMLKLKR--HLSQYDYIGHFHTKKSKEAD 377 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 Y W G+ WR+ L L+ +I+ +G++ + P+ + + Sbjct: 378 Y--WAGESWRKELINMLVHPAD---QIVSQLGQDDCLGLVIADI---PSFFRFNRIVVAW 429 Query: 271 NREM----ICTLAGRMGI--TFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPK 321 N + + L RM ++++ F GT W + +AL P+ +L ++ P Sbjct: 430 NEALISPEMNKLWERMNCQKEVDFKQMNTFVMSYGTFVWFKYDALSPLFDLNMTEEDVPS 489 Query: 322 VHKALDGEIEHAVERCFSLSV--KKANFRISDVD 353 I HA+ER K+ +F+IS Sbjct: 490 EPLP-QNSILHAIERLLVYIAWDKQYDFKISYNQ 522 >gi|13476282|ref|NP_107852.1| hypothetical protein mlr7561 [Mesorhizobium loti MAFF303099] gi|14027043|dbj|BAB53997.1| mlr7561 [Mesorhizobium loti MAFF303099] Length = 609 Score = 232 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 62/247 (25%), Positives = 105/247 (42%), Gaps = 13/247 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFP--AARI 170 +IA++ H+Y++D+ EI N+ +DL VT + A I+ I K A + Sbjct: 296 DCRIAVLAHVYHLDMIDEILGYAENVPKGYDLIVTTDNADKQALIQQAIAKATNASNAVV 355 Query: 171 HIMENHGRDVLPFLILLETEQL-SNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++ N GRD L+ L YD +C++H K+S + G G+L++ F +LL Sbjct: 356 LVVRNDGRDTSALLVGCRDYVLEDRYDLICRVHSKRSPQDGPR---GELFKLHTFENLLH 412 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGI--TFQ 287 PG V ++ F + +G++ Y S N+ + LA ++G+ Sbjct: 413 TPGYVSNLLELFANNPALGLVMP--PLVHIGYPTIGNSWAGNKANVAKLARQLGLIVHLD 470 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 D G M+W R AL + R + + DG + HA+ER + A + Sbjct: 471 DSTPVAPYGGMYWFRPAALRKLFEERWNWNDFANMD-YRDGSLVHAIERIIAYVAIDAGY 529 Query: 348 RISDVDC 354 V Sbjct: 530 TFRHVMT 536 >gi|323135560|ref|ZP_08070643.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] gi|322398651|gb|EFY01170.1| Rhamnan synthesis F [Methylocystis sp. ATCC 49242] Length = 812 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 64/262 (24%), Positives = 113/262 (43%), Gaps = 14/262 (5%) Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ES 154 ++ E+P + A + IA +VH +Y ++ + L N++ DL + T E Sbjct: 126 EMDFALETP-FAFAPKPARERPIAAIVHGFYPEIAPLVLEKLKNVTGPVDLFFSTDTQEK 184 Query: 155 ASIKSEILKIFPA--ARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYS 212 ++ + +P I I N GRD+ + ++YD +H K+S G + Sbjct: 185 KHALEDVCRDWPKGRVEIRICPNRGRDIAAKFFGFR-DVYADYDLFIHLHTKRSPHGGAA 243 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR 272 WR +LF +LLG+P +V I+ F IG++ + + G + Sbjct: 244 LAR---WRDYLFDNLLGSPEIVNSILSLF-DDPKIGVVFPQHLFELRGI----LNWGYDY 295 Query: 273 EMICTLAGRMGITFQDQ-KLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 + L RMG+ L+F +G+MFW R+ A P+ +L + P+ +DG + Sbjct: 296 DHARALMKRMGVEIDKNLVLEFPSGSMFWGRSAAFRPLLDLDIDFDDFPQEGGQVDGTLA 355 Query: 332 HAVERCFSLSVKKANFRISDVD 353 HA+ER + + F V Sbjct: 356 HAIERSLLMIAESRGFEWLKVA 377 >gi|312962408|ref|ZP_07776899.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] gi|311283335|gb|EFQ61925.1| lipopolysaccharide biosynthesis protein-like protein [Pseudomonas fluorescens WH6] Length = 1308 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 81/253 (32%), Positives = 134/253 (52%), Gaps = 20/253 (7%) Query: 106 MGKVMQIAIKAKI-----AIVVHLYYIDLWIEIANLLSNLS-ISFDLHVTLVTESASIKS 159 + KV ++A K + A+V+HL+Y DLW +I + L + + +DL+VT+ + SA ++ Sbjct: 1070 ISKVSRMAFKRSVRQADYAVVLHLHYDDLWDDIKSYLDSFGQLEYDLYVTVTSSSAGVR- 1128 Query: 160 EILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 + + +P A I ++EN GRDVLPFL +L+ + Y VCKIH K+S + +GD Sbjct: 1129 -VAQEYPKAHIQLVENRGRDVLPFLKILQVIKDMGYVAVCKIHSKRSLYRD----DGDKI 1183 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA 279 R L LLG+ + ++ F+ +DIG+I Y P+ + T ++ L+ Sbjct: 1184 RGELIGSLLGSKETILSVVDRFERQKDIGVIVPVKYLIPHTDHNMTYC----GAIVTELS 1239 Query: 280 GRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 ++G F +F AG+MFW R +AL+ + ++ S DG I H +ER Sbjct: 1240 SKLGFNF--SYCEFIAGSMFWFRPKALEALLSIDESS--FEVEDGLADGTIAHGIERVLC 1295 Query: 340 LSVKKANFRISDV 352 VKKAN+ + + Sbjct: 1296 NVVKKANYTVETI 1308 >gi|281490695|ref|YP_003352675.1| bifunctional alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] gi|281374464|gb|ADA63985.1| Alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Lactococcus lactis subsp. lactis KF147] Length = 589 Score = 230 bits (587), Expect = 3e-58, Method: Composition-based stats. Identities = 75/288 (26%), Positives = 127/288 (44%), Gaps = 21/288 (7%) Query: 81 ERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNL 140 + L D +P L S + + + + +A+ +H+YY +L E + N Sbjct: 252 PKSLVVDHLSTIGYPDLNFLLPSKMITSLSKTVLHQTVAVHLHVYYPELLEEFLDAFKNF 311 Query: 141 SISFDLHVTLVT-ESASIKSEILKIFPA-ARIHIMENHGRDVLPFLILLETEQLSNYDYV 198 S +DL++T T E I E+LK A A++ NHGRD++PFL L E+L YD V Sbjct: 312 SFDYDLYLTTNTDEKEEIIKEMLKCKDARAKLVRTPNHGRDIVPFLAL--KEELKKYDIV 369 Query: 199 CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYP 258 H K+S + + G+ WR L L+ I+ F+ + +G++ + + Sbjct: 370 GHFHTKRSLEAAF--FAGESWRTELISMLI---EPADNIMAHFEQKQKLGIVIADIPSF- 423 Query: 259 NKYCDYTCSLGKNR---EMICTLAGRMGITFQDQKLDFF-----AGTMFWVRTEALDPIK 310 ++ + +N+ ++ + RM + + DF GT FW +TE L+P+ Sbjct: 424 FRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDFNTFTMSYGTFFWAKTEVLEPLF 483 Query: 311 NLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 NL + P I HA+ER K+ +F+IS L Sbjct: 484 NLEIMDREIPNEPLP-QNTILHAIERVLIYLAWDKEMDFKISPNKLNL 530 >gi|15674835|ref|NP_269009.1| hypothetical protein SPy_0792 [Streptococcus pyogenes M1 GAS] gi|71910421|ref|YP_281971.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] gi|13621968|gb|AAK33730.1| conserved hypothetical protein - possibly involved in cell wall localization and side chain formation of rhamnose-glucose polysaccharide [Streptococcus pyogenes M1 GAS] gi|71853203|gb|AAZ51226.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS5005] Length = 581 Score = 228 bits (582), Expect = 1e-57, Method: Composition-based stats. Identities = 66/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E N + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFENWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 DSYDFRI 518 >gi|32455988|ref|NP_861990.1| rb115 [Ruegeria sp. PR1b] gi|22726340|gb|AAN05136.1| RB115 [Ruegeria sp. PR1b] Length = 963 Score = 228 bits (582), Expect = 1e-57, Method: Composition-based stats. Identities = 67/263 (25%), Positives = 115/263 (43%), Gaps = 18/263 (6%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI-------L 162 K ++ + +HLYY+D+ E+ LL+ L ++F+L ++L + E+ L Sbjct: 149 QPPLPKGRLVVQLHLYYVDMAAEMIALLARLPVTFELLLSLPETAVVADEEMISLFRAGL 208 Query: 163 KIFPAARIHIMENHGRDVLPFLILLETEQ--LSNYDYVCKIHGKKSKRKGYSWWEGDLWR 220 + A + + N GRDV P+++ +E L++ D V +H KKS Y W Sbjct: 209 ERLGAITLRRVPNRGRDVAPWMVSFRSELRALADRDLVLHLHSKKSPHGNYHVG----WG 264 Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 R+L + LLG+ V +++ F ++G++ + + +Y GK ++ L Sbjct: 265 RYLGHSLLGSTAVAAQMLGLFAEDPELGLVAPAYWPALRRAPNY----GKVGDLCAHLFR 320 Query: 281 RMGI-TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 RMG+ DF AG+ F R L P L L P + G + HAVER Sbjct: 321 RMGLGEVDPICADFPAGSFFCARAAVLRPFLTLGLEARDFPAEAGQICGTLAHAVERLLG 380 Query: 340 LSVKKANFRISDVDCILGYRKSL 362 + R V L + ++ Sbjct: 381 QVPARLGLRFDMVAVDLPFEEAA 403 >gi|15672189|ref|NP_266363.1| polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] gi|12723062|gb|AAK04305.1|AE006258_8 polysaccharide biosynthesis protein [Lactococcus lactis subsp. lactis Il1403] Length = 589 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 73/288 (25%), Positives = 124/288 (43%), Gaps = 21/288 (7%) Query: 81 ERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNL 140 + L D +P L S + + + + +A+ +H+YY +L E + N Sbjct: 252 PKSLVVDHLSTVGYPDLNFLLPSKMITPLSKTVLHQTVAVHLHVYYPELLEEFLDAFKNF 311 Query: 141 SISFDLHVTLVT-ESASIKSEILKIFP-AARIHIMENHGRDVLPFLILLETEQLSNYDYV 198 S +DL++T T E I E+LK A++ NHGRD++PFL L E+L YD V Sbjct: 312 SFDYDLYLTTNTDEKEEIIKEMLKCKDAKAKLVRTPNHGRDIVPFLAL--KEELKKYDIV 369 Query: 199 CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYP 258 H K+S + + G+ WR L L+ I+ F+ + +G++ + + Sbjct: 370 GHFHTKRSLEAAF--FAGESWRTELISMLI---EPADNIMAHFEQKQKLGIVIADIPSF- 423 Query: 259 NKYCDYTCSLGKNR---EMICTLAGRMGITFQDQKLDFF-----AGTMFWVRTEALDPIK 310 ++ + +N+ ++ + RM + + DF GT FW + E L+P+ Sbjct: 424 FRFNKIVNADNENKQIAPIMNDIWKRMKMNKKVNFHDFNTFTMSYGTFFWAKIEVLEPLF 483 Query: 311 NLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANFRISDVDCIL 356 NL + P I HA+ER K+ +F IS L Sbjct: 484 NLEIMDREIPNEPLP-QNTILHAIERVLIYLAWDKEMDFNISPNKLNL 530 >gi|209559162|ref|YP_002285634.1| RgpFc protein [Streptococcus pyogenes NZ131] gi|209540363|gb|ACI60939.1| RgpFc protein [Streptococcus pyogenes NZ131] Length = 581 Score = 227 bits (578), Expect = 3e-57, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|306827605|ref|ZP_07460885.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] gi|304430168|gb|EFM33197.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes ATCC 10782] Length = 581 Score = 227 bits (578), Expect = 3e-57, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|56808559|ref|ZP_00366292.1| COG3754: Lipopolysaccharide biosynthesis protein [Streptococcus pyogenes M49 591] Length = 581 Score = 226 bits (577), Expect = 3e-57, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|19745874|ref|NP_607010.1| hypothetical protein spyM18_0853 [Streptococcus pyogenes MGAS8232] gi|19748025|gb|AAL97509.1| conserved hypothetical protein [Streptococcus pyogenes MGAS8232] Length = 581 Score = 226 bits (577), Expect = 4e-57, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|139474025|ref|YP_001128741.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] gi|134272272|emb|CAM30524.1| rhamnan synthesis protein F family protein [Streptococcus pyogenes str. Manfredo] Length = 581 Score = 226 bits (577), Expect = 4e-57, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 105/247 (42%), Gaps = 25/247 (10%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPAD- 398 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT----LAGRMGIT--FQ 287 I+ F+T G P+ + N +I L +M + Sbjct: 399 --SILSAFETDDI----GIIIADIPSFFRFNKIVNAWNEHLIAQEMMSLWRKMDVKKQID 452 Query: 288 DQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV-- 342 Q +D F GT W + +AL + +L L++ P I HA+ER Sbjct: 453 FQAMDTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLLVYIAWG 511 Query: 343 KKANFRI 349 +FRI Sbjct: 512 NSYDFRI 518 >gi|71903253|ref|YP_280056.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] gi|71802348|gb|AAX71701.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase [Streptococcus pyogenes MGAS6180] Length = 581 Score = 225 bits (575), Expect = 6e-57, Method: Composition-based stats. Identities = 64/243 (26%), Positives = 107/243 (44%), Gaps = 17/243 (6%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQDQKL 291 + + T D I +R+ + L + + +L +M + Q + Sbjct: 400 ILSVFETDDIGIII-ADIPSFFRFNKIVNAWNEHLI--AQEMMSLWRKMDVKKQIDFQAM 456 Query: 292 DFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKAN 346 D F GT W + +AL + +L L++ P I HA+ER F + Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLFVYIAWGNSYD 515 Query: 347 FRI 349 FRI Sbjct: 516 FRI 518 >gi|94994091|ref|YP_602189.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] gi|94547599|gb|ABF37645.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus pyogenes MGAS10750] Length = 581 Score = 225 bits (575), Expect = 6e-57, Method: Composition-based stats. Identities = 64/243 (26%), Positives = 107/243 (44%), Gaps = 17/243 (6%) Query: 116 AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIM 173 K+A+ +H++Y+DL E + + +DL +T ++ IK + + A I + Sbjct: 284 QKVAVHLHVFYVDLLDEFLTAFEDWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVT 343 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 N GRD+ P L+L ++LS YDY+ H KKSK + W G+ WR+ L L+ Sbjct: 344 GNRGRDIYPMLLL--KDKLSQYDYIGHFHTKKSKEADF--WAGESWRKELIDMLVKPADS 399 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQDQKL 291 + + T D I +R+ + L + + +L +M + Q + Sbjct: 400 ILSVFETDDIGIII-ADIPSFFRFNKIVNAWNEHLI--AQEMMSLWRKMDVKKQIDFQAM 456 Query: 292 DFFA---GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKAN 346 D F GT W + +AL + +L L++ P I HA+ER F + Sbjct: 457 DTFVMSYGTFVWFKYDALKSLFDLELTQNDIPSEPLP-QNSILHAIERLFVYIAWGNSYD 515 Query: 347 FRI 349 FRI Sbjct: 516 FRI 518 >gi|296135664|ref|YP_003642906.1| glycosyl transferase family 2 [Thiomonas intermedia K12] gi|295795786|gb|ADG30576.1| glycosyl transferase family 2 [Thiomonas intermedia K12] Length = 1414 Score = 225 bits (574), Expect = 7e-57, Method: Composition-based stats. Identities = 88/329 (26%), Positives = 132/329 (40%), Gaps = 30/329 (9%) Query: 43 WTLFYKRSKKLCYDENY-----VVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQL 97 + F++ ++ + N VV + + ++ R + P Sbjct: 198 DSRFFQTTRPTVFPANLETEHLVVKNVTADEYQQLTSLQVFYRARPNPAQNRIF--IPTR 255 Query: 98 LHGWESPAMGKVMQIAIKAK----IAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE 153 P + A+++HLYY DLW E L L D++V+L Sbjct: 256 AASAAQPVQVPSPVLHAGRPHGNGTAVLLHLYYPDLWPEFLAHLKTLPAPCDVYVSLSEG 315 Query: 154 SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG--- 210 + ++I++ P A + N GRD+ P L LL + NY + +HGKKS Sbjct: 316 REELLTDIVRDLPDAVVMRHPNKGRDIAPRLALLRLARAHNYKQLLFLHGKKSPHLKEVE 375 Query: 211 ------YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 +GD WRR L LL A K I F +G+IG + + Sbjct: 376 NIHIPFLQHKDGDRWRRELLAALLDASE---KTIAAFAQQPKLGLIGPHGFWLGLRG--- 429 Query: 265 TCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHK 324 N + A RMGIT + +FAG+MFW R +ALDP+ L L Sbjct: 430 ----DANFPRLSAQAQRMGITPDPARHGYFAGSMFWCRPQALDPLLALDLKDADFEDETG 485 Query: 325 ALDGEIEHAVERCFSLSVKKANFRISDVD 353 DG + H VER F+LS +KA F+I+D Sbjct: 486 QTDGTLAHVVERLFALSAEKAGFQIADTA 514 >gi|116511036|ref|YP_808252.1| lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. cremoris SK11] gi|116106690|gb|ABJ71830.1| Lipopolysaccharide biosynthesis protein [Lactococcus lactis subsp. cremoris SK11] Length = 588 Score = 225 bits (574), Expect = 8e-57, Method: Composition-based stats. Identities = 73/285 (25%), Positives = 124/285 (43%), Gaps = 24/285 (8%) Query: 94 FPQLLHGWESPAMGKVMQIAIKA--KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 +P + E + +++ K KI I +H +Y+DL E N ++DL++T Sbjct: 274 YPDREYMLEEKTLKFTKEVSAKTNLKIGIHLHAFYLDLIPEYLNYFDKYVQNYDLYITTD 333 Query: 152 TESASIKSEILKIFPAARIH---IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 TE EILK +P +I + N GRDVLP++ + +E +++YD H KKSK Sbjct: 334 TE--EKYEEILKNYPLPQIKKVIVTGNKGRDVLPWMQV--SELMTDYDLCGHFHTKKSKD 389 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 W G+ WRR + Y LL I + F+ + +G+I + + + T Sbjct: 390 ND--WIVGESWRRDIEYSLL---EPAQAIFQEFEKNPKLGLIIADVPSFFEHFYGPTYIT 444 Query: 269 GKNR-EMICTLAGRMGITFQDQ---KLDFFA--GTMFWVRTEALDPIKNLRLSRYFEPKV 322 ++ + + ++ + K + GTM W R +AL+ + N+ + P+ Sbjct: 445 ERDIWPDMQEIWQKIDFENSKELKQKDSYVMSYGTMIWYRPQALNNLLNVNIQ-ADVPEE 503 Query: 323 HKALDGEIEHAVERCFSLS--VKKANFRISDVDCILGYRKSLSQN 365 I HA ER +FRIS + G+ + S N Sbjct: 504 PLPY-NSILHAFERLLVYVSWANGYDFRISQIQTNNGFVANFSAN 547 >gi|269978088|ref|ZP_06185038.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|306818459|ref|ZP_07452182.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307700705|ref|ZP_07637730.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] gi|269933597|gb|EEZ90181.1| lipopolysaccharide biosynthesis protein [Mobiluncus mulieris 28-1] gi|304648632|gb|EFM45934.1| rhamnan synthesis protein F [Mobiluncus mulieris ATCC 35239] gi|307613700|gb|EFN92944.1| rhamnan synthesis protein F [Mobiluncus mulieris FB024-16] Length = 613 Score = 225 bits (574), Expect = 8e-57, Method: Composition-based stats. Identities = 75/256 (29%), Positives = 115/256 (44%), Gaps = 10/256 (3%) Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS 156 L + A V+ K +IA V H++Y D+ EI S L + +T T Sbjct: 277 LTQVLDDNAEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKK 336 Query: 157 --IKSEILKIFPAARIHIME-NHGRDVLPFLILL-ETEQLSNYDYVCKIHGKKSKRKGYS 212 I+ ++ + A + I+E N GRDV FL+ + + +D V KIH KKS + Y+ Sbjct: 337 TQIEQQLQTMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGRFDVVAKIHSKKSAQDAYN 396 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR 272 E L++R LF +LL +PG ++ F T +GM+ A Y + N+ Sbjct: 397 AAE--LFKRHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLG--YPTLGHAWFANK 452 Query: 273 EMICTLAGRMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 + L R+GI F D G+MF+ R EAL P+ + P+ + DG + Sbjct: 453 KPALALCERLGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSL 512 Query: 331 EHAVERCFSLSVKKAN 346 H +ER FS S Sbjct: 513 AHVIERIFSYSSLSEG 528 >gi|227875198|ref|ZP_03993340.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] gi|227844103|gb|EEJ54270.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase / alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Mobiluncus mulieris ATCC 35243] Length = 613 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 75/256 (29%), Positives = 113/256 (44%), Gaps = 10/256 (3%) Query: 97 LLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS 156 L + A V+ K +IA V H++Y D+ EI S L + +T T Sbjct: 277 LTQVLDDNAEESVLAANAKLRIAGVAHVFYADMTAEIMKRFSYLGDHAQIFLTTSTPEKK 336 Query: 157 --IKSEILKIFPAARIHIME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYS 212 I+ ++ + A + I+E N GRDV FL+ +D V KIH KKS + Y+ Sbjct: 337 TQIEQQLQTMGRQAEVRIVESNRGRDVSAFLVTCADVLEPGCFDVVAKIHSKKSAQDAYN 396 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNR 272 E L++R LF +LL +PG ++ F T +GM+ A Y + N+ Sbjct: 397 AAE--LFKRHLFENLLPSPGYTANLLHLFATEPYLGMVFPPAVSLG--YPTLGHAWFANK 452 Query: 273 EMICTLAGRMGI--TFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 + L R+GI F D G+MF+ R EAL P+ + P+ + DG + Sbjct: 453 KPALALCERLGIKLPFDDTTPLSPYGSMFFARPEALLPLTKAHFTFNDFPEEGQYSDGSL 512 Query: 331 EHAVERCFSLSVKKAN 346 H +ER FS S Sbjct: 513 AHVIERIFSYSSLSEG 528 >gi|254876593|ref|ZP_05249303.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254842614|gb|EET21028.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 765 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 64/247 (25%), Positives = 105/247 (42%), Gaps = 15/247 (6%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT-LVTESASIKSEILKIFPAA 168 + I K AI +HL+YIDL E L +DL++T + + ++ E AA Sbjct: 523 SEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAA 582 Query: 169 RIHI--MENHGRDVLPFLILLETEQL-SNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 + I ++N GRD+ P + L+ + L Y+ V H KK+ + GD WR +L Sbjct: 583 NVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLN 640 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 +L+G + I IG++ Y +G+N+ + L +G+ Sbjct: 641 NLIGDNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLE 692 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + F G MFW R +A+ I +L + DG HA+ER V+K Sbjct: 693 KICETPLFPLGNMFWARVDAIRDIFSLN-EDMILQEEPLPRDGSYMHALERIIPNIVEKN 751 Query: 346 NFRISDV 352 ++ V Sbjct: 752 GYKYVTV 758 >gi|241668058|ref|ZP_04755636.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 756 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 64/247 (25%), Positives = 105/247 (42%), Gaps = 15/247 (6%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT-LVTESASIKSEILKIFPAA 168 + I K AI +HL+YIDL E L +DL++T + + ++ E AA Sbjct: 514 SEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAA 573 Query: 169 RIHI--MENHGRDVLPFLILLETEQL-SNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 + I ++N GRD+ P + L+ + L Y+ V H KK+ + GD WR +L Sbjct: 574 NVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLN 631 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 +L+G + I IG++ Y +G+N+ + L +G+ Sbjct: 632 NLIGDNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLE 683 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + F G MFW R +A+ I +L + DG HA+ER V+K Sbjct: 684 KICETPLFPLGNMFWARVDAIRDIFSLN-EDMILQEEPLPRDGSYMHALERIIPNIVEKN 742 Query: 346 NFRISDV 352 ++ V Sbjct: 743 GYKYVTV 749 >gi|258654317|ref|YP_003203473.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] gi|258557542|gb|ACV80484.1| Rhamnan synthesis F [Nakamurella multipartita DSM 44233] Length = 631 Score = 223 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 69/246 (28%), Positives = 107/246 (43%), Gaps = 12/246 (4%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILK-IFPAARIH 171 +I ++ H++Y D+ E+ + N+ + FDL VT + + +I+S + + + Sbjct: 295 PLRICVLAHIFYEDMTDEMMGWIGNIPVPFDLVVTTTSAAKKEAIESALEAYALKSVEVR 354 Query: 172 IME-NHGRDVLPFLILLETEQLSN-YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLG 229 ++E N GR FLI S YD V KIH KKS + G + G L++ +LL Sbjct: 355 LVESNRGRAESAFLIACRDVLTSGEYDLVLKIHSKKSPQNGANL--GQLFKHHSVDNLLS 412 Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT--FQ 287 +PG V I+ F + +GM+ + S NRE LA ++GI F Sbjct: 413 SPGYVASILGMFQSQPSLGMVFPPVVNIG--FPTLGHSWFTNREAAHELADQLGIHTIFD 470 Query: 288 DQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKV-HKALDGEIEHAVERCFSLSVKKAN 346 GTMFW R E+L + DG + H +ER + +V A Sbjct: 471 RTTPLAPNGTMFWARPESLAKLARHDFDYSQFAAEHEGWSDGMLGHVIERLYGYAVLDAG 530 Query: 347 FRISDV 352 RI V Sbjct: 531 LRIQCV 536 >gi|167627488|ref|YP_001677988.1| group 1 glycosyl transferase [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167597489|gb|ABZ87487.1| glycosyl transferase, group 1 [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 763 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. Identities = 64/247 (25%), Positives = 105/247 (42%), Gaps = 15/247 (6%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVT-LVTESASIKSEILKIFPAA 168 + I K AI +HL+YIDL E L +DL++T + + ++ E AA Sbjct: 521 SEKNISHKFAIHLHLFYIDLADEFNEYFKLLPKGYDLYITIIDSNNSEFIKEKFSSSGAA 580 Query: 169 RIHI--MENHGRDVLPFLILLETEQL-SNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFY 225 + I ++N GRD+ P + L+ + L Y+ V H KK+ + GD WR +L Sbjct: 581 NVEIVAVDNIGRDIAPMIFGLKDQLLNRGYEIVGHFHSKKTVSAHDNL--GDKWRAYLLN 638 Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT 285 +L+G + I IG++ Y +G+N+ + L +G+ Sbjct: 639 NLIGDNEQISNSILNLFNDEKIGLVFPEDRTY--------IDIGENKFYVDELCTAIGLE 690 Query: 286 FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKA 345 + F G MFW R +A+ I +L + DG HA+ER V+K Sbjct: 691 KICETPLFPLGNMFWARVDAIRDIFSLN-EDMILQEEPLPRDGSYMHALERIIPNIVEKN 749 Query: 346 NFRISDV 352 ++ V Sbjct: 750 GYKYVTV 756 >gi|16124886|ref|NP_419450.1| hypothetical protein CC_0633 [Caulobacter crescentus CB15] gi|221233606|ref|YP_002516042.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] gi|13421844|gb|AAK22618.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|51039815|tpg|DAA00361.1| TPA_exp: conserved hypothetical protein [Caulobacter vibrioides] gi|220962778|gb|ACL94134.1| hypothetical protein CCNA_00669 [Caulobacter crescentus NA1000] Length = 818 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. Identities = 77/319 (24%), Positives = 127/319 (39%), Gaps = 15/319 (4%) Query: 36 GGFFFWFWTLFYKRSKKLCYDENY--VVAYGSRS-GKKFFAQSNLYMMERELHFDGQRIH 92 +F W + + + + V G+ + + + + LH + Sbjct: 508 ESYFGWLSGA--LKHAEARHPKGEALVFVNAWNEWGEGAYLEPDRWFGHGYLHATRTALS 565 Query: 93 HFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV- 151 + L Q A +A ++HL+Y +L A L+ + DL +T+ Sbjct: 566 AWLPRLTNAHPIIAEAQSQFAKRADAVTLLHLFYPELIDWFAERLAATADVLDLMITVPE 625 Query: 152 TESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGY 211 T S + + FP A + I EN GRD+ PF+ L + Y CK+H K+S + Sbjct: 626 TWSEADLARARAAFPTAHLAIAENRGRDIRPFVETLRRARALGYSVFCKLHSKRSPHQA- 684 Query: 212 SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKN 271 +GD WR L LLG + +G++ + R D + N Sbjct: 685 ---KGDQWRTTLVEGLLGGEAAALALRAF-AQDPKLGLLAAAGARMRIGDPDV---MDNN 737 Query: 272 REMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIE 331 R L+ MG+ + + F AG+MFW RTEA P+ +L +DG Sbjct: 738 RAEADRLSAHMGLKPRPETP-FAAGSMFWGRTEAFAPLTDLSDDEIAFGPELGRVDGTTA 796 Query: 332 HAVERCFSLSVKKANFRIS 350 HA+ER + V++A +R S Sbjct: 797 HAIERLTAAIVERAGYRAS 815 >gi|295687882|ref|YP_003591575.1| rhamnan synthesis protein F [Caulobacter segnis ATCC 21756] gi|295429785|gb|ADG08957.1| Rhamnan synthesis F [Caulobacter segnis ATCC 21756] Length = 818 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. Identities = 74/284 (26%), Positives = 118/284 (41%), Gaps = 10/284 (3%) Query: 68 GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYI 127 G+ + + + + LH + + L +A ++HL+Y Sbjct: 541 GEGAYLEPDRWFGHGYLHATRAALSAYQPRLTDAHPLVAQAQAAFVKRADAVTLLHLFYP 600 Query: 128 DLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENHGRDVLPFLIL 186 +L A L+ + DL +T+ T S + + FP A + I EN GRD+ PF+ Sbjct: 601 ELIDWFAERLAATADVLDLMITVPETWSEADLARARATFPMAHLAIAENRGRDIRPFVET 660 Query: 187 LETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRD 246 L + Y CK+H K+S + +GD WR L LLG + Sbjct: 661 LRRARTLGYSVFCKLHSKRSPHRA----KGDEWRAELVDGLLGGEAAALALRAF-AQDAK 715 Query: 247 IGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEAL 306 +G++ + R D + NR+ LA RMG+ + F AG+MFW RTEA Sbjct: 716 LGLLAAAGSRLRIGDPDV---MNNNRQDADRLARRMGLKLAPETP-FSAGSMFWGRTEAF 771 Query: 307 DPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRIS 350 P+ +L + +DG HA+ER + V +A +R S Sbjct: 772 APLSDLTDAEIDFGPELGRVDGTTAHAIERLTAAIVARAGYRAS 815 >gi|304309760|ref|YP_003809358.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] gi|301795493|emb|CBL43691.1| hypothetical protein HDN1F_01080 [gamma proteobacterium HdN1] Length = 1315 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. Identities = 68/262 (25%), Positives = 108/262 (41%), Gaps = 15/262 (5%) Query: 92 HHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 H +P+ L + + A IAI +HLYY DL LS + FDL++++ Sbjct: 422 HPYPEQLKQL---HELTLPKHTSNATIAIHIHLYYADLAPTFVQALSRMERPFDLYISIQ 478 Query: 152 --TESASIKSEILKI--FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSK 207 I++ + KI I N GRD+ PF+ + E L YD + +H KKS Sbjct: 479 VRANPVEIEAVVRKIPCLRGLDIRATPNLGRDLYPFVCIF-GEALRKYDIIAHLHSKKSL 537 Query: 208 RKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCS 267 Y+ W ++ L +P + +I+ G++ + + Y YT Sbjct: 538 ---YNQGATAGWLEYILDSLFRSPEDIARILERLSDASQTGIVYPQNFS-GLPYMAYT-- 591 Query: 268 LGKNREMICTLAGRMGIT-FQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKAL 326 NR + R G+T D+ AG+MFW R +A+ P +L+ Sbjct: 592 WLANRSRAQQVQARFGLTSLPSGYFDYPAGSMFWARADAIAPFFEAQLNEDDFENESGQT 651 Query: 327 DGEIEHAVERCFSLSVKKANFR 348 DG + H +ER L + FR Sbjct: 652 DGTLAHTLERFLVLVPESLGFR 673 >gi|310286583|ref|YP_003937841.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] gi|309250519|gb|ADO52267.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum S17] Length = 662 Score = 220 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 57/267 (21%), Positives = 97/267 (36%), Gaps = 15/267 (5%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 + P + + + + A + H+Y++DL + +S L DL++T + + Sbjct: 289 YVLPTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRD 348 Query: 161 ILK----IFPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYS 212 + P I ++ N GRDV L+ LS YD + H KKS + G+ Sbjct: 349 YMASHGVNHPVTFISVV-NRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHH 407 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-CDYTCSLGKN 271 E + L + L + V I+ F +G + + + G N Sbjct: 408 GTESQGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGAN 467 Query: 272 REMICTLAG-RMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFE-PKVHKAL 326 E+ L R I A G+ +W R EAL P+ P+ Sbjct: 468 FEITKELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGE 527 Query: 327 DGEIEHAVERCFSLSVKKANFRISDVD 353 DG + HA+ER + + + V Sbjct: 528 DGTVSHAIERANGYICQSQGYYPAWVM 554 >gi|224284010|ref|ZP_03647332.1| Lipopolysaccharide biosynthesis protein [Bifidobacterium bifidum NCIMB 41171] gi|313141164|ref|ZP_07803357.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313133674|gb|EFR51291.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 662 Score = 220 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 57/267 (21%), Positives = 97/267 (36%), Gaps = 15/267 (5%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSE 160 + P + + + + A + H+Y++DL + +S L DL++T + + Sbjct: 289 YVLPTVTRNPRTGADVRSAFIYHIYFLDLLGDTCRYISALPEETDLYITTTEDKIDAIRD 348 Query: 161 ILK----IFPAARIHIMENHGRDVLPFLILLETEQLSN-YDYVCKIHGKKS---KRKGYS 212 + P I ++ N GRDV L+ LS YD + H KKS + G+ Sbjct: 349 YMASHGVNHPVTFISVV-NRGRDVSALLVAACDVVLSGKYDVIGFAHDKKSSQNQENGHH 407 Query: 213 WWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY-CDYTCSLGKN 271 E + L + L + V I+ F +G + + + G N Sbjct: 408 GTESQGFAYKLMENTLASRDYVENILTLFSNEPRLGQVAPPPPFHALYFAHTLPHDWGAN 467 Query: 272 REMICTLAG-RMGITFQDQKLDFFA---GTMFWVRTEALDPIKNLRLSRYFE-PKVHKAL 326 E+ L R I A G+ +W R EAL P+ P+ Sbjct: 468 FEITKELLEDRFDIHVPLSPGKPSASAIGSCYWFRVEALKPLFEYGWKYEDFLPEGEMGE 527 Query: 327 DGEIEHAVERCFSLSVKKANFRISDVD 353 DG + HA+ER + + + V Sbjct: 528 DGTVSHAIERANGYICQSQGYYPAWVM 554 >gi|71735705|ref|YP_273244.1| hypothetical protein PSPPH_0972 [Pseudomonas syringae pv. phaseolicola 1448A] gi|71556258|gb|AAZ35469.1| conserved hypothetical protein [Pseudomonas syringae pv. phaseolicola 1448A] Length = 1262 Score = 220 bits (560), Expect = 4e-55, Method: Composition-based stats. Identities = 69/256 (26%), Positives = 106/256 (41%), Gaps = 17/256 (6%) Query: 101 WESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA----- 155 E+ + + +I + +HLYY DL I+ L+N+ ++FDL ++ E Sbjct: 195 LENSSHINFNESIEDMRIGVYLHLYYTDLLGAISKHLNNIPLAFDLFISTPHELDHKKLR 254 Query: 156 SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE 215 I S+ + I + N GRD+ PF+I E L YD +C IH KKS+ Sbjct: 255 KIVSDSVTNVKEISIKHVPNRGRDIAPFIIEFGNE-LQAYDAICHIHTKKSEHTKGL--- 310 Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 W + LLG+ V KI+ +I Y Y +N E+ Sbjct: 311 -SDWGDDILSSLLGSREDVKKILTLLKGDAK--IIYPEGQNY---YMKDPTGWSENHEIA 364 Query: 276 CT-LAGRMGITFQD-QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 L+ + + K +F G+MFW R E + N+ L P+ DG + HA Sbjct: 365 KHILSDHLETDISNFPKAEFPEGSMFWARQEGIQSFLNIPLDWEDFPEEPIPTDGTLAHA 424 Query: 334 VERCFSLSVKKANFRI 349 +ER +S A RI Sbjct: 425 LERIILISAYAAPGRI 440 >gi|125623094|ref|YP_001031577.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|124491902|emb|CAL96823.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris MG1363] gi|300069842|gb|ADJ59242.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase RgpF [Lactococcus lactis subsp. cremoris NZ9000] Length = 588 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 70/286 (24%), Positives = 124/286 (43%), Gaps = 26/286 (9%) Query: 94 FPQLLHGWESPAMGKVMQIAIKA--KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV 151 +P + E + +I K+ KIAI +H +Y+DL E + ++DL +T Sbjct: 274 YPDREYMLEEKTLKLSTEINKKSNLKIAIHLHAFYLDLIPEYLDYFDKYVQNYDLFITTD 333 Query: 152 TESASIKSEILKIFPAARIH---IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKR 208 T +I+K +P +I + N GRDVLP++ + +E +++YD H KKSK Sbjct: 334 T--KDKYEQIIKSYPLNQIKKVLVTGNKGRDVLPWMEI--SELMADYDLCGHFHTKKSKD 389 Query: 209 KGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSL 268 W G+ WRR + Y LL I + F+ + +G++ + + + T Sbjct: 390 ND--WIVGESWRRDIEYSLLKPAQ---AIFQEFEKNPKLGLMIADVPSFFEHFYGPTYIT 444 Query: 269 GKNR-EMICTLAGRM------GITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPK 321 ++ + + ++ G+ +D + GTM W R +AL+ + + + P+ Sbjct: 445 ERDIWPDMEEIWKKINFENPRGLKQKDSYVMS-YGTMIWYRPQALNNLLKVDIEAA-VPE 502 Query: 322 VHKALDGEIEHAVERCFSL--SVKKANFRISDVDCILGYRKSLSQN 365 I HA ER +FRIS + G+ + S N Sbjct: 503 EPLPY-NSILHAFERLLVYTSWANGYDFRISQIQTNNGFVANFSAN 547 >gi|145588508|ref|YP_001155105.1| methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145046914|gb|ABP33541.1| Methyltransferase type 11 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 1082 Score = 217 bits (554), Expect = 2e-54, Method: Composition-based stats. Identities = 75/255 (29%), Positives = 131/255 (51%), Gaps = 15/255 (5%) Query: 96 QLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNL--SISFDLHVTLVTE 153 +L + +P ++++ + + IAIVVH++Y+D W +I ++ + D+++T+ Sbjct: 837 DILKNYINPEHAEIIRESQENSIAIVVHIHYMDTWEDIKKIIKKILSVHDSDIYITITN- 895 Query: 154 SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSW 213 I FP+A I ++EN GRD+LPF+ +L+ NY +CKIH KKS+ + Sbjct: 896 -LEQYQSIKNDFPSANIELVENRGRDILPFINVLKKIIHKNYVAICKIHSKKSEYRS--- 951 Query: 214 WEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNRE 273 +G++ R+ L++ L+ + KI + F+ ++ +GM+ Y + + NRE Sbjct: 952 -DGEVIRKELYFSLINNEITLEKIPKFFEVNKKLGMLVPGKYFLQHNDINMYF----NRE 1006 Query: 274 MICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 I + +G+ F++ K F AG+MFW R AL + L S DG + HA Sbjct: 1007 NISKVCSVIGVNFKESK--FPAGSMFWARPAALQKLLKLE-SGELFDVEEGLADGTVAHA 1063 Query: 334 VERCFSLSVKKANFR 348 VER F L + + F Sbjct: 1064 VERLFGLVSESSGFY 1078 >gi|116071634|ref|ZP_01468902.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] gi|116065257|gb|EAU71015.1| hypothetical protein BL107_05779 [Synechococcus sp. BL107] Length = 934 Score = 212 bits (539), Expect = 1e-52, Method: Composition-based stats. Identities = 59/250 (23%), Positives = 108/250 (43%), Gaps = 21/250 (8%) Query: 110 MQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT-ESASIKSEILKI---- 164 I + ++AI +H+YY + E L+ L L +T T E + EIL+ Sbjct: 38 TSIFQECQVAIYLHIYYPESLHEFLEYLTVLPSQIRLVITTTTSEKKELIIEILERALLI 97 Query: 165 --FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRW 222 ++ EN GRD+ F+ + ++L YD VCK+H KKS G G W R+ Sbjct: 98 NRLDLCHVY-HENKGRDIGAFINIY--DELIKYDVVCKLHAKKSPHLGEF---GKSWFRY 151 Query: 223 LFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRM 282 L +G + I+ +DIG++ +++ N + N ++ +++ + Sbjct: 152 LIRSTIGNQSAIENIVNILYHSKDIGILAPTSFQGTNNH-----DWASNFDISQSISDHI 206 Query: 283 ---GITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 + +KL + + T+FW + EAL+ + + F P+ +DG H++ER Sbjct: 207 FNSELDINKEKLRYPSATVFWFKPEALNQQQFRSIQPDFFPEEPIPIDGTTAHSLERLIP 266 Query: 340 LSVKKANFRI 349 + Sbjct: 267 YISILNGLKT 276 >gi|192359986|ref|YP_001983898.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] gi|190686151|gb|ACE83829.1| Capsule polysaccharide biosynthesis protein family [Cellvibrio japonicus Ueda107] Length = 872 Score = 211 bits (538), Expect = 1e-52, Method: Composition-based stats. Identities = 74/246 (30%), Positives = 102/246 (41%), Gaps = 14/246 (5%) Query: 112 IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARI 170 + +IA+V HLYY DL EI + L + +FDL VTL + I+ + + +P A Sbjct: 597 EHSQKRIAVVAHLYYRDLVPEILSALETIPEAFDLIVTLPDWGTRHIEQMVREAYPEAVF 656 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWE--GDLWRRWLFYDLL 228 + N GRD+ PF+ LL NYD + KI K+ + G LWR F LL Sbjct: 657 YRAVNRGRDIGPFVDLLPLITEKNYDALLKIQTKRGYYRSGRLLPQFGQLWRSETFRALL 716 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD 288 G V I+ T + M+G Y Y + ++ Sbjct: 717 GNKSRVTDILEALRTDPSLNMVGPSPYFLSLTKYPYHDQGDLAQTILN----------NP 766 Query: 289 QKLDFFAGTMFWVRTEALDPIKN-LRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 FFAGTMFWVR L P+ LS A DG H +ER FS + Sbjct: 767 TGNGFFAGTMFWVRPSCLRPLTEPEHLSITAFEPESGANDGATAHLIERLFSQVAFANDG 826 Query: 348 RISDVD 353 +I+ V Sbjct: 827 KIAGVP 832 >gi|332561589|ref|ZP_08415902.1| glycosyltransferase [Rhodobacter sphaeroides WS8N] gi|332274091|gb|EGJ19409.1| glycosyltransferase [Rhodobacter sphaeroides WS8N] Length = 821 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 15/275 (5%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANL 136 M R+L +R+ +H A + + ++ VH +Y D + Sbjct: 547 RRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFYTDDLAQDVRR 606 Query: 137 LSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSN 194 + + + VT ++ + I++ + + A + + N GRD+LPFL L + Sbjct: 607 HAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFLQLFLPGGAAG 666 Query: 195 YDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR 253 D + C +H KKS S GD+WR +L LLG + + +G++ Sbjct: 667 EDEIWCHLHQKKSLATTDS---GDMWRAFLLRILLGDEASLSDAATHL-RNPGVGLVAP- 721 Query: 254 AYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLR 313 + Y +R ++ +A R+ D L F G MF+VR+ + + +L Sbjct: 722 -------FDPYFIPWDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSAVVRAMNDLF 774 Query: 314 LSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 + Y P DG H +ER + + Sbjct: 775 GAGYPWPNEPIPNDGTEFHLIERLWPALAAQCGLT 809 >gi|23009067|ref|ZP_00050256.1| COG3754: Lipopolysaccharide biosynthesis protein [Magnetospirillum magnetotacticum MS-1] Length = 486 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. Identities = 69/280 (24%), Positives = 111/280 (39%), Gaps = 30/280 (10%) Query: 74 QSNLYMMERELHFDGQRIHHFPQLLHGWE--SPAMGKVMQIA-----------IKAKIAI 120 Q+N + D P++ + E PA KV+ + A+I + Sbjct: 218 QANPLVHYLRKGVDLNLSPRGPKVANVAEKKLPASRKVVNLVAAGSSAPAGALANARIGV 277 Query: 121 VVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAA-----RIHIMEN 175 H+++ DL + +N+ ++VT + AS I K F I I N Sbjct: 278 FAHIFHTDLCEYVLKYTNNIPFDTTVYVTTSS--ASKADFIRKTFGRLSKHRYEIVIAPN 335 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 GRD+ P L+ N DY +H KKS + D WR +LF LG+ ++ Sbjct: 336 RGRDIAPMLVGYRN-AFQNCDYAVHVHTKKSLHYSSGF---DAWRDYLFEMNLGSAELIT 391 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK-LDFF 294 I+ +IG + Y K G N + I L G++ + LDF Sbjct: 392 GIVNVLSRS-NIGAVAPDHYAPIAKL----IQWGGNIDAINGLLSFTGLSVASENVLDFP 446 Query: 295 AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAV 334 +G+MFW + +AL + + L Y +DG + HA+ Sbjct: 447 SGSMFWFKPDALSKLMEIHLQSYHFDPELGQVDGTLAHAI 486 >gi|221218294|ref|YP_002524321.1| glycosyltransferase [Rhodobacter sphaeroides KD131] gi|221163321|gb|ACM04287.1| glycosyltransferase [Rhodobacter sphaeroides KD131] Length = 821 Score = 207 bits (527), Expect = 3e-51, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 15/275 (5%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANL 136 M R+L +R+ +H A + + ++ VH +Y D + Sbjct: 547 RRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFYTDDLAQDVRR 606 Query: 137 LSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSN 194 + + + VT ++ + I++ + + A + + N GRD+LPFL L + Sbjct: 607 HAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFLQLFLPGGAAG 666 Query: 195 YDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR 253 D + C +H KKS S GD+WR +L LLG + + +G++ Sbjct: 667 EDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATHL-RNPGVGLVAP- 721 Query: 254 AYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLR 313 + Y +R ++ +A R+ D L F G MF+VR+ + + +L Sbjct: 722 -------FDPYFIPWDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSRVVRAMNDLF 774 Query: 314 LSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFR 348 + Y P DG H +ER + + Sbjct: 775 GAGYPWPNEPIPNDGTEFHLIERLWPAMAAQCGLT 809 >gi|222148479|ref|YP_002549436.1| hypothetical protein Avi_2007 [Agrobacterium vitis S4] gi|221735467|gb|ACM36430.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 513 Score = 206 bits (525), Expect = 4e-51, Method: Composition-based stats. Identities = 76/316 (24%), Positives = 125/316 (39%), Gaps = 23/316 (7%) Query: 50 SKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKV 109 + Y R N + E GQ+ + G + + Sbjct: 180 FSEDRQPRADFDGYYYRKANGLSRGQNPFQHCIESMDGGQKFPANVESRPGVQLFSKPPF 239 Query: 110 MQ---------IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTES-ASIKS 159 + +A++ + I VH +Y++L+ EIA+ L L++ F L VT+ ES A + Sbjct: 240 AEDITIDSSNTVALQLSLCIHVHCFYVELFNEIADRLQCLTLPFYLVVTVCNESDAKVVE 299 Query: 160 EILKIFPAAR---IHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEG 216 +L F + I ++EN GRD+ PFLI D V +H KKS G Sbjct: 300 NLLVDFNQRQNTHILVVENRGRDIAPFLIDASPI-WRKSDLVLHLHTKKSPH----ITWG 354 Query: 217 DLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMIC 276 D WRR+LF +G ++ II F D+GM+ + + + KN++ I Sbjct: 355 DNWRRYLFDQTIGYEPLLKGIIDQFQDRDDMGMMYPENFCMIKHFTE----EEKNKDAIR 410 Query: 277 TLAGRMGITFQDQKLD-FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVE 335 +A ++ + + L + AG+M + R +AL + LDG H +E Sbjct: 411 YIAQKLRLECSFEALGAYAAGSMAFYRVKALASVLEYDALENLFGPEQGQLDGTAAHVLE 470 Query: 336 RCFSLSVKKANFRISD 351 R V+ F Sbjct: 471 RLLPEMVRLNGFETQP 486 >gi|160936497|ref|ZP_02083865.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] gi|158440582|gb|EDP18320.1| hypothetical protein CLOBOL_01388 [Clostridium bolteae ATCC BAA-613] Length = 373 Score = 205 bits (522), Expect = 1e-50, Method: Composition-based stats. Identities = 50/236 (21%), Positives = 102/236 (43%), Gaps = 7/236 (2%) Query: 125 YYIDLWIEIANLLSNLSISFDL-HVTLVTESASIKSEILKIFPAA--RIHIMENHGRDVL 181 +Y DL + + + D+ VT + A + + ++ + EN GRD+ Sbjct: 2 FYEDLLNQCYLYIEQIPKYIDVCFVTSNPKIAFKVKKYINNTKKINYKVLVKENRGRDMA 61 Query: 182 PFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTF 241 L+ + + Y+Y+C +H KKS + G + +G + ++ +L+G+ G++ I+R Sbjct: 62 ALLVTCH-DFIMEYEYLCFVHDKKSLQMG-NDNDGCKFMELIWKNLIGSTGLIENILRYL 119 Query: 242 DTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQD--QKLDFFAGTMF 299 +RD+G++ + N + N + + L ++ + +K G F Sbjct: 120 GNNRDVGLMVPPIPYWGNYIGVFINPWTCNYDNVINLGNQLKLKKNVCYEKEYVTIGGAF 179 Query: 300 WVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCI 355 W RT AL P+ + + A+DG I HA+ER + + ++ + Sbjct: 180 WCRTNALKPLFEYKWKLEDFCQEPMAVDGTISHAIERILGFVALNNGYDVLEIINL 235 >gi|146279467|ref|YP_001169625.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] gi|145557708|gb|ABP72320.1| hypothetical protein Rsph17025_3443 [Rhodobacter sphaeroides ATCC 17025] Length = 823 Score = 202 bits (514), Expect = 8e-50, Method: Composition-based stats. Identities = 59/274 (21%), Positives = 100/274 (36%), Gaps = 16/274 (5%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIAN 135 + R+L + + +H A + + A+ VH +Y D Sbjct: 548 RRRVWRDLAEVEETLSASDNWVHRALRLAPYATVADSSDLPPFALHVHAFYTDDLAADVR 607 Query: 136 LLSNLSISFDLHVTLVTESA--SIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLS 193 ++ + +T E I++ + + ++ N GRD+LPF+ L + Sbjct: 608 SHRAFRLARRIVITTDNERKASEIRTRMGAEGLYPEVILVPNRGRDILPFMQLFLPGGPA 667 Query: 194 NYDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGS 252 D + C +H KKS S GD+WR +L LLG + + +G++ Sbjct: 668 GKDEIWCHLHQKKSLATSDS---GDVWRAFLLRILLGDDAGLSDAVGHL-RDPAVGLVAP 723 Query: 253 RAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNL 312 + Y +R ++ A R+ D L F G MFWVR + + +L Sbjct: 724 --------FDPYHVPWDASRALLPRFAPRLPGPLPDNPLLFPVGNMFWVRAGVVRAMNDL 775 Query: 313 RLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 Y P A DG H VER + + Sbjct: 776 FGPSYPWPNEPIANDGTEFHLVERLWPTMAARCG 809 >gi|78184217|ref|YP_376652.1| lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] gi|78168511|gb|ABB25608.1| Lipopolysaccharide biosynthesis protein-like [Synechococcus sp. CC9902] Length = 519 Score = 199 bits (507), Expect = 4e-49, Method: Composition-based stats. Identities = 68/279 (24%), Positives = 116/279 (41%), Gaps = 21/279 (7%) Query: 95 PQLLHGWESPAMGKVM-QIAIKAKIAIVVHLYYIDLWIEIANLLSNL-----SISFDLHV 148 P+ L+ + P+ + + +A+++H +Y D+ +I L + DL+V Sbjct: 244 PERLNVTQIPSTDERSWGESSPGHLALMIHGFYPDVLDDILLKLPSFCAGMVGTQLDLYV 303 Query: 149 TLVTESASIKSEILKI--FPAARIHIMENHGRDVLPFL-ILLETEQLSNYDYVCKIHGKK 205 + + + L+ F R+ +EN GRDV PFL LL + + + K+H KK Sbjct: 304 STSMDQIDQVEKKLRDLDFACVRLFGVENRGRDVAPFLLHLLPAVAAAGHHFFVKLHTKK 363 Query: 206 SKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYT 265 S + G D W R L LL + + I F D+G + P + Sbjct: 364 SLQFGIDGL--DKWSRHLIESLL-SAAGLEAIRYQFLDDEDLGCLCPSGTLLPLAIALF- 419 Query: 266 CSLGKNREMICTLAGRMGITFQDQKLD-FFAGTMFWVRTEALDPIKNLRLSRYFEPKVHK 324 KN+ + L I + + F AG+MF R EA + + S Sbjct: 420 ----KNKTHLHHLLSHSEINGRWALMQTFVAGSMFAGRVEAFRSLLDQGFSLDDFELEGG 475 Query: 325 ALDGEIEHAVERCFSLSVKKANFRISDVDCILGYRKSLS 363 DG HA+ER SL VK++ ++I ++ G +++ Sbjct: 476 QFDGTFAHALERLISLEVKRSGWQIKEMS---GSPEAVP 511 >gi|209524107|ref|ZP_03272658.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] gi|209495482|gb|EDZ95786.1| glycosyl transferase family 2 [Arthrospira maxima CS-328] Length = 2819 Score = 192 bits (487), Expect = 1e-46, Method: Composition-based stats. Identities = 73/259 (28%), Positives = 115/259 (44%), Gaps = 20/259 (7%) Query: 108 KVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS-IKSEILKIFP 166 K+ + KIA+V+H YY +L E+ + L NLS +DL VT+ + S + K Sbjct: 1729 KLYASSENPKIAVVLHAYYPELLPELFSKLDNLS-DYDLFVTIPENVVDSVTSALDKYTK 1787 Query: 167 AARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYD 226 ++ I++N G D+LPFL ++ Y YVCKIH K+ G LWR L Sbjct: 1788 NYQVSIVKNIGYDILPFLEVISELDTLGYKYVCKIHTKR-----DHPDFGSLWRECLLDA 1842 Query: 227 LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF 286 +LG + +II FD + + ++G Y + K ++MI + + Sbjct: 1843 VLGDKNITEQIITAFDNNPSLQIVGPA-LLYMSMLGTIYDGHEKMKKMIHDFMEPLNL-- 1899 Query: 287 QDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHK--ALDGEIEHAVERCFSLS--- 341 + FF G+MFW R L I + L + + + K G H VER L Sbjct: 1900 -IEDWGFFGGSMFWSRITPLKYIADQILLKPIDWQASKSWLTTGFYYHIVERLLGLVSYI 1958 Query: 342 ----VKKANFRISDVDCIL 356 V +++I D + I+ Sbjct: 1959 NEGQVGLVDYKIIDTENIV 1977 >gi|148556902|ref|YP_001264484.1| glycosyl transferase family protein [Sphingomonas wittichii RW1] gi|148502092|gb|ABQ70346.1| glycosyl transferase, family 2 [Sphingomonas wittichii RW1] Length = 1301 Score = 188 bits (479), Expect = 9e-46, Method: Composition-based stats. Identities = 67/240 (27%), Positives = 106/240 (44%), Gaps = 16/240 (6%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP-AARIHIMEN 175 K A+V+HL+Y ++ +E+ + ++ + S D+ VT L P A + + N Sbjct: 2 KAALVLHLFYPEVAVELIDRVAAIGASVDIFVTHSVALDETVLAALDRLPRKAEVVTVAN 61 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVF 235 G D+ P LL YD + K+H KK WRR + ++G+P +V Sbjct: 62 RGWDIGPLFELLPLLAERGYDLIGKLHSKK-----GGSGYAPEWRRLAYDGMIGSPALVA 116 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT-FQDQKLDFF 294 I+ FD H D+ ++G++ Y L +N E++ LA R+ + FF Sbjct: 117 DIVAAFDAHPDLSLLGAKPL-----YKSVASHLFRNAELLSDLAPRLTAPAYPPADWGFF 171 Query: 295 AGTMFWVRTEALDPIKNLRLSRYFEPKVHKA-LDGEIEHAVERCFSLSVKKANFRISDVD 353 AGT FW R L+ + L F DG + HAVER F L+ +I V+ Sbjct: 172 AGTFFWARRTLLEKVAAL---ADFRDAAPNQDRDGALGHAVERLFGLAPIGLGGKIGLVE 228 >gi|88808074|ref|ZP_01123585.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] gi|88788113|gb|EAR19269.1| Glycosyl transferase, group 1 [Synechococcus sp. WH 7805] Length = 512 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. Identities = 61/238 (25%), Positives = 100/238 (42%), Gaps = 13/238 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKI---FPAAR 169 KI +V+H YY + I L ++ FDL VT+ + + +K + K+ Sbjct: 48 PLKILVVIHAYYPESLATIFPSLRHMPCHFDLVVTVCSCGDKEVVKEYLEKVDLPIDVLD 107 Query: 170 IHIMENHGRDVLPFLILLETEQLSN--YDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 I ++ N GRD+LPF+ +++ +L N YD+V K+H K+S G W +L Sbjct: 108 IKVLTNLGRDLLPFVQVIKGLKLQNKAYDFVLKLHTKRSVASSKGKEFGGKWLEGSLSNL 167 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ 287 LG+P V I+ + ++ ++ C N I L R G+ Sbjct: 168 LGSPENVKYILLELLQTTNCALVSPLISLDVFRF----CKWKNNLAPISHLLDRFGVRES 223 Query: 288 DQK-LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKK 344 + + F AG+MFWV +A I + P +G HA ER ++ Sbjct: 224 PEDFICFPAGSMFWVDFKAAVLIASCFEESR-VPPEPLPSNGSYLHAFERLVPYILES 280 >gi|221634514|ref|YP_002523202.1| hypothetical protein RSKD131_4489 [Rhodobacter sphaeroides KD131] gi|221163387|gb|ACM04349.1| Hypothetical Protein RSKD131_4489 [Rhodobacter sphaeroides KD131] Length = 1042 Score = 185 bits (469), Expect = 1e-44, Method: Composition-based stats. Identities = 65/249 (26%), Positives = 106/249 (42%), Gaps = 17/249 (6%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAARIHIMENHG 177 A ++H++++D+ ++ L +L S D VTL + + + FP A I +EN G Sbjct: 8 AAIIHVWHLDVLDDLTEALEHLHGSADQFVTLPSSFRQEQRDRVTAAFPKATIVEVENRG 67 Query: 178 RDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI 237 +D+ L++ L YD++CKIH KK + WRR L +LG+ V I Sbjct: 68 QDIGALFQLMQKVNLGRYDFICKIHTKKGPNMP------EEWRRALLDGVLGSKRQVTHI 121 Query: 238 IRTFDTHRDIGMIGSRA-YRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAG 296 + +F + + G+R + Y Y + + K E +L G + + F AG Sbjct: 122 VESFRADPKVMLAGARQLFVYGPAYLE--PNADKVAEDYASLIG--DFDVRSEDWGFIAG 177 Query: 297 TMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDCIL 356 T FW+RT L + + DG HA ER F L V + + L Sbjct: 178 TCFWIRTSILQEMAACAV---DFLPADYVTDGAPAHAAERMFGLCVALRGGTV--LLQDL 232 Query: 357 GYRKSLSQN 365 + L + Sbjct: 233 RFADRLPET 241 >gi|158422520|ref|YP_001523812.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] gi|158329409|dbj|BAF86894.1| putative lipopolysaccharide biosynthesis protein [Azorhizobium caulinodans ORS 571] Length = 661 Score = 182 bits (461), Expect = 1e-43, Method: Composition-based stats. Identities = 74/267 (27%), Positives = 108/267 (40%), Gaps = 15/267 (5%) Query: 100 GWESPAMGKVMQIAIK-AKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIK 158 G PA V ++ +A VVH YY DL + L L VT E A Sbjct: 381 GARRPAAAPVTPRPVRTGSLASVVHGYYEDLLPGLIAGLD----PAHLFVTTPPEKAEAV 436 Query: 159 SEILKIFPAARIHI-MENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGD 217 +L A +EN GRDV PFL LL + YD V K+H K+S +G EG Sbjct: 437 RAVLARAAPAARLRVVENRGRDVRPFLSLLPELEAEGYDLVLKVHTKRSPHQG---KEGS 493 Query: 218 LWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICT 277 W + L LL ++ F+ H +G++G+ + Y S G N + Sbjct: 494 DWLQRLSGPLLKL-ARSERLAPVFEAHPQMGLLGAAGHVLDGAL--YAGSAG-NAAWMRR 549 Query: 278 LAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERC 337 LA +G T + AGTMF R P++ DG + HA ER Sbjct: 550 LAAELG-TGAPLTSPYVAGTMFVARLGIFAPLRGASELLDLFDTDMGLKDGTLAHAFERF 608 Query: 338 FSLSVKKANFRISDVDCILGYRKSLSQ 364 F + +A + +V G+ + +++ Sbjct: 609 FGVLAAEAGLSVGEVAAD-GHVEPVAE 634 >gi|77404644|ref|YP_345218.1| glycosyltransferase [Rhodobacter sphaeroides 2.4.1] gi|77390294|gb|ABA81477.1| possible glycosyltransferase [Rhodobacter sphaeroides 2.4.1] Length = 793 Score = 175 bits (443), Expect = 1e-41, Method: Composition-based stats. Identities = 55/255 (21%), Positives = 100/255 (39%), Gaps = 15/255 (5%) Query: 77 LYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANL 136 M R+L +R+ +H A + + ++ VH +Y D + Sbjct: 547 RRRMWRDLAEVEERLADADNWVHRKLRIAPYGTAEATELPRFSLHVHAFYTDDLAQDVRR 606 Query: 137 LSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSN 194 + + + VT ++ + I++ + + A + + N GRD+LPFL L + Sbjct: 607 HAAYRCASRIVVTTDSDRKADEIRTLMAAVGLAPEVLVRPNRGRDILPFLQLFLPGGAAG 666 Query: 195 YDYV-CKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR 253 D + C +H KKS S GD+WR +L LLG + + +G++ Sbjct: 667 EDEIWCHLHQKKSLATTDS---GDIWRAFLLRILLGDEASLSDAATNL-RNPGVGLVAP- 721 Query: 254 AYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLR 313 + Y +R ++ +A R+ D L F G MF+VR+ + + +L Sbjct: 722 -------FDPYFIPWDASRALLPRVAPRLPGPLPDNPLLFPVGNMFFVRSAVVRAMNDLF 774 Query: 314 LSRYFEPKVHKALDG 328 + Y P G Sbjct: 775 GAGYPGPTNPFPTTG 789 >gi|297182567|gb|ADI18727.1| lipopolysaccharide biosynthesis protein [uncultured Rhizobiales bacterium HF4000_32B18] Length = 887 Score = 173 bits (438), Expect = 5e-41, Method: Composition-based stats. Identities = 58/250 (23%), Positives = 98/250 (39%), Gaps = 21/250 (8%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFP 166 G+ ++ VH +Y D + E + + + T TE+ + + Sbjct: 635 GETDAEDDPPPYSVHVHAHYTDGFAEDLAGFAAWRHAARVVATTDTEAKAAEIAAAGRNG 694 Query: 167 AARI--HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 I ++ N GRDVLPFL L + + N C +H KKS G + G +WR +L Sbjct: 695 GVAIETRVVANRGRDVLPFLELFDGSEDDN-ALWCHVHLKKSVGLGPTSP-GAVWRAFLM 752 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG- 283 LLG P + + + G++G+ + Y +R ++ L R+ Sbjct: 753 RILLGGPERLSTAL-ALIRAPEAGLVGA--------FDPYVMGWTGSRRLLAPLQARLDG 803 Query: 284 -------ITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 D L F G MFWV+ ++ ++ L + Y P DG + H +ER Sbjct: 804 WEADGGRRPLPDHPLLFPVGDMFWVKAGVVNAMRRLFGADYPWPGEPLPGDGTVYHLIER 863 Query: 337 CFSLSVKKAN 346 + + A Sbjct: 864 LWPTAAALAG 873 >gi|50982351|gb|AAT91804.1| hypothetical protein [Yersinia enterocolitica] Length = 358 Score = 170 bits (432), Expect = 2e-40, Method: Composition-based stats. Identities = 62/317 (19%), Positives = 108/317 (34%), Gaps = 33/317 (10%) Query: 42 FWTLFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGW 101 L + Y+ + + NL + E+ H ++ Sbjct: 66 LSNLVLSQKPAGKYNSSGFIDISKLDENNNIYSINLLLSEKNKHVSNDGSFELNKIKPNT 125 Query: 102 ESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEI 161 ++ K I+VH +Y EI N L +D+ +T + I + Sbjct: 126 DN-------------KKLIIVHAFYQREAEEIFNRLVAF-TDYDIVITSPYNN--IICKA 169 Query: 162 LKIFPAARI--HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 +I R+ IM N+GRD+LPFLI L+ + Y+Y K+H K+S+ + W Sbjct: 170 KEILGQERVIGFIMPNYGRDILPFLICLQLIVIEKYEYFVKVHTKRSQHLNDNG----AW 225 Query: 220 RRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLA 279 L+G + Y + N + L Sbjct: 226 FNNNLDYLVGNKNATDGLFSIMSDD--------EPQIYGEYILPIQDHIANNIHWLTYLL 277 Query: 280 GRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFS 339 + + + F GTMF L I++L+L + K + LDG HA+ER F Sbjct: 278 EKEPASVE---ASFIPGTMFIGNRAFLVLIRDLQLHLFQIEKENGQLDGCCVHAIERYFG 334 Query: 340 LSVKKANFRISDVDCIL 356 + ++ ++ Sbjct: 335 YIASVNGGKCCSIETLI 351 >gi|14090418|gb|AAK53494.1| putative methyltransferase [Xanthomonas campestris pv. campestris] Length = 212 Score = 158 bits (401), Expect = 1e-36, Method: Composition-based stats. Identities = 45/181 (24%), Positives = 74/181 (40%), Gaps = 13/181 (7%) Query: 106 MGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKI 164 V +V+H +Y+D+ E+ + + + +T T+ + I + Sbjct: 37 APDVATEICSPSACVVLHAWYLDVLDEMLDAIVECGTPLRIIITTDLTKVIEVTKCIQRR 96 Query: 165 FPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLF 224 A + EN GRD+LPFL + N V K+H KKS + +G+ WR + Sbjct: 97 GIQAEVEGFENRGRDILPFLHVANRLLDENVQLVLKLHTKKSTHRD----DGNAWRGEML 152 Query: 225 YDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREM--ICTLAGRM 282 LLG P V I+ F T +G+ + P T +G N E + + R+ Sbjct: 153 TALLG-PQRVDAIVNAFSTDPLVGLAAPEDHLLP-----VTEFIGGNAERTGLSYCSHRL 206 Query: 283 G 283 G Sbjct: 207 G 207 >gi|301632931|ref|XP_002945533.1| PREDICTED: o-antigen export system ATP-binding protein rfbB-like, partial [Xenopus (Silurana) tropicalis] Length = 367 Score = 146 bits (369), Expect = 4e-33, Method: Composition-based stats. Identities = 45/170 (26%), Positives = 68/170 (40%), Gaps = 10/170 (5%) Query: 192 LSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 + Y + ++H K+S G+ WR L+ L G+ V I+ TF+TH +GM+ Sbjct: 201 IPRYALILRLHSKRSLH--IPGQVGEEWRALLYTSLAGSRQRVNAIVDTFNTHPKLGMLC 258 Query: 252 SRAYRYPNKYCDYTCSLGKNREMICTLAGRMGIT-FQDQKLDFFAGTMFWVRTEALDPIK 310 + G N + +C L GIT DQ +DF G+MFW R +AL Sbjct: 259 PAVIDHYAD----CLHFGGNYKRMCALLQPHGITLPPDQPIDFPMGSMFWCRPQALSVWL 314 Query: 311 NLRLSRYFEPKVHKA---LDGEIEHAVERCFSLSVKKANFRISDVDCILG 357 + + DG + HA+ER F V +L Sbjct: 315 EPGFTFDDFTPTNDLDTDRDGTLAHALERLFFFGCGLKGLGWGRVPELLP 364 >gi|291520449|emb|CBK75670.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 486 Score = 146 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 73/174 (41%), Gaps = 5/174 (2%) Query: 117 KIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESAS---IKSEILKIFPAARIHIM 173 K+A+V HLYY++++ + L+ + D+ +T ++ I+ K + + Sbjct: 291 KVAVVAHLYYVEMFELCMDYLAKVPYGIDIIITTNSDDKKQNIIEVASEKGVKLTEVIVA 350 Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGV 233 EN GR++ L+ L Y Y C +H KKS K + G +R L+ L + G Sbjct: 351 ENRGRELAALLVGCGK-FLLKYKYFCFVHDKKSSAKEH-LSVGLAFRDILWDSSLYSEGY 408 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ 287 + II F+ + +G+ + + + N E L+ + I Sbjct: 409 IRNIIDMFEQNECMGLAVPPTVYCGSYFYPFPDYWVGNYEKTIELSKILNINVD 462 >gi|291520444|emb|CBK75665.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 424 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 55/139 (39%), Gaps = 2/139 (1%) Query: 216 GDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMI 275 G + ++ LLG+ +V +++ F + +G++ + + S + Sbjct: 7 GRDFNNRIWQSLLGSKELVEEVLSAFSDEKYLGLLMPSMVTHGEYFHTAIDSWTICYDGT 66 Query: 276 CTLAGRMGITFQD--QKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 LA ++G+ + GT FW RT+AL+ + S P +DG I H Sbjct: 67 VELAKKIGLNVPIYGDRNPLSLGTAFWARTKALEKLFEYNFSYDMFPGEPFPVDGSISHY 126 Query: 334 VERCFSLSVKKANFRISDV 352 +ER F A + V Sbjct: 127 IERIFPYVALDAGYYTGIV 145 >gi|148238469|ref|YP_001223856.1| sulfotransferase [Synechococcus sp. WH 7803] gi|147847008|emb|CAK22559.1| Possible sulfotransferase [Synechococcus sp. WH 7803] Length = 476 Score = 114 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 37/154 (24%), Positives = 62/154 (40%), Gaps = 8/154 (5%) Query: 189 TEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIG 248 ++L +D V H K++ + G+ WR+ L P + +T + G Sbjct: 4 RDRLKEFDLVVHCHTKRTPHAPDGF--GESWRQSLLQCTFPNPDRCQE-FQTLLHKPEAG 60 Query: 249 MIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLD-FFAGTMFWVRTEALD 307 +I +R + + + G N L MG T + L F AG+ FW R ++L Sbjct: 61 LIMPWPHR----FVAHNVNWGSNFTQTRALMNLMGHTIRRDTLLAFPAGSFFWARVDSLL 116 Query: 308 PIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLS 341 + +L L DG + H++ERC L Sbjct: 117 ALLDLTLRWEDFAAEPLPGDGRLAHSLERCLGLL 150 >gi|308813905|ref|XP_003084258.1| conserved domain protein (ISS) [Ostreococcus tauri] gi|116056142|emb|CAL58323.1| conserved domain protein (ISS) [Ostreococcus tauri] Length = 684 Score = 84.9 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 86/250 (34%), Gaps = 55/250 (22%) Query: 142 ISFDLHVTLVT------ESASIKSEILKIFPAARIHIMENHGRDVLPFLILLET--EQLS 193 + L+++L E A L+ R+ +++ G D+ FL L +L Sbjct: 103 VQLQLYLSLTPTVANAPEVAYFTERFLRNEKNIRVVHVKDEGYDIGAFLKQLHRFRHELQ 162 Query: 194 NYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSR 253 + Y+ K+H K +W L G+ V I++ F+T + ++ Sbjct: 163 VHQYILKVHSKSDP----------IWLERAVESLCGSEHQVKSILKAFETQSTLDIVSPM 212 Query: 254 AY-----------------RYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF-- 294 +Y NK T K + L ++G+ + Sbjct: 213 GSTFSATTSKDAVFPHLKRKYFNKVDLATAFDDKTMHTMERLCAQLGLEACPYFEKYLAS 272 Query: 295 --AGTMFWVR---------TEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVK 343 AGTMFW R + I+N LS+ + + + IEHA+ER + Sbjct: 273 ITAGTMFWARNSRLYTEHLPRLFESIRN-ELSQDY------SNNNRIEHALERLIPTLSR 325 Query: 344 KANFRISDVD 353 I D+ Sbjct: 326 LNGRMIGDIQ 335 >gi|291520445|emb|CBK75666.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 109 Score = 83.4 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 30/94 (31%), Positives = 47/94 (50%), Gaps = 5/94 (5%) Query: 115 KAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVTESA--SIKSEILKIF--PAARI 170 + + A+ +L++ DL+ E SNL D++V TE I I K+ ++ Sbjct: 13 QNRYAVFAYLFFDDLFEESLRYFSNLPNYVDIYVATNTEEKVDVINGYIPKMLFRHNVKV 72 Query: 171 HIMENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 + N GRDV L+LL+ SNYD +C +H K Sbjct: 73 LLHNNKGRDVSALLVLLKR-YYSNYDVICFVHDK 105 >gi|329944274|ref|ZP_08292533.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] gi|328531004|gb|EGF57860.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 170 str. F0386] Length = 699 Score = 81.9 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 88/262 (33%), Gaps = 47/262 (17%) Query: 130 WIEIANLLSNLSISFDLHVTLVTE--SASIKSEILKIFPAARIHIMENHGRDVLPFLILL 187 ++A L++L + + VT +E +A ++ + ++ ++ G + FL Sbjct: 341 ADDLAERLASLPEHWRVVVTSPSELNAADLERVTGRRTTFRKVRDLDPRG--TIAFLTEC 398 Query: 188 ETE------------------------QLSNYDYVCKIHGKKS--KRKGYSWWEGDLWRR 221 + ++ D V I S G S D+ RR Sbjct: 399 DDLWDPAHAGDVGASDGGDGTDTTDTAEVDRVDLVLTI----SAGPLSGSSERADDVARR 454 Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 + LL +PG V ++ F H +G++ A Y L+ R Sbjct: 455 QVLDCLLASPGYVAGLLDLFGRHPSLGVVMPAACHIGQPY--VGPQWDGLVGAADALSRR 512 Query: 282 MGIT--FQDQKLDFFAGTMFWVRTEALDPIKN-----LRLSRYFEPKVHKALD----GEI 330 +G+T + G+MF R EAL + +RL+ P ++++ Sbjct: 513 LGLTAALDEIAPVAPVGSMFLARPEALRTLSEGAKELVRLTEQAAPSTERSVERLRRARA 572 Query: 331 EHAVERCFSLSVKKANFRISDV 352 +E + + +V Sbjct: 573 AEVLELLTVYAAMSNGYHPREV 594 >gi|291520448|emb|CBK75669.1| Lipopolysaccharide biosynthesis protein [Butyrivibrio fibrisolvens 16/4] Length = 625 Score = 78.0 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 24/56 (42%) Query: 297 TMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDV 352 + FW RTEAL + S F PK + HA+ER F A + S + Sbjct: 4 SCFWCRTEALKKLLEYDFSYNFFPKEPMDANLTTSHAIERIFPYVACDAGYYTSTI 59 Score = 44.5 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 36/96 (37%), Gaps = 5/96 (5%) Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 ++ F +L+ G + I F ++ +G++G+ + K + I Sbjct: 421 QYTFDELIKNNGYISAICEVFKENQSVGVVGNIYGEIIFQINSNMNIYSKYEDEILEFEK 480 Query: 281 RMGITFQ---DQKLDFFAGTMFWVRTEALDPIKNLR 313 R F L + G FW+R +AL I + Sbjct: 481 RFNFDFNRGGKHSLLNYNG--FWLRRDALQMIADCE 514 >gi|320531345|ref|ZP_08032317.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] gi|320136436|gb|EFW28412.1| rhamnan synthesis protein F [Actinomyces sp. oral taxon 171 str. F0337] Length = 678 Score = 76.5 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 69/204 (33%), Gaps = 30/204 (14%) Query: 130 WIEIANLLSNLSISFDLHVTLVTE-SASIKSEILKIFPAAR-------------IHIMEN 175 +A L++L + + VT T A+ + PA + ++ Sbjct: 323 ADGLAQRLASLPAHWRVVVTSPTHLDAADLERLTGRRPADEAAAPGGAAVAFRAVRDLDP 382 Query: 176 HGRDVLPFLILLETEQLSNY--------DYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDL 227 G +PFL D V +I S G D+ RR + L Sbjct: 383 RG--TIPFLTECGDLWDPGRATGSDGGGDLVLRI-TVGSP-SGPESKADDVARRQVLDCL 438 Query: 228 LGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQ 287 L +PG +I F+ H +G+ A + + TL+ R+G+T + Sbjct: 439 LASPGYTAGLIDLFERHPGLGVAMPAASHIGQAHG--GPTWDGLAGAAKTLSRRLGLTVE 496 Query: 288 DQKLDF--FAGTMFWVRTEALDPI 309 + G MF R EAL + Sbjct: 497 LDPVAPVVPVGAMFMARPEALRTL 520 >gi|324991549|gb|EGC23482.1| rhamnosyltransferase [Streptococcus sanguinis SK353] Length = 556 Score = 71.5 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/261 (13%), Positives = 78/261 (29%), Gaps = 31/261 (11%) Query: 109 VMQIAIKAKIA--IVVHLYYID--LWIEIANLLSNLSISFDLHVTLV-TESASIKSEILK 163 + Q + A +++H++ D ++ + + L +LS + VT E L Sbjct: 273 LSQATSNYRAALPVLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTTDQPEVLKQLQTALG 332 Query: 164 IFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRR 221 ++ R L L + E L +Y Y+ + S + G R Sbjct: 333 HLGNKVQIVLSQKSRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRS 386 Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 L ++ I + +G++ P D R + + Sbjct: 387 DLINMMV---DYADASIEALEKESAVGLVIPD---LPRLVRDGLFESEPPRPRLAAVWQE 440 Query: 282 MGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVER 336 G+ + + G+ W + AL + ++ + D +E Sbjct: 441 AGLHKSFDFIITPSLTRVYGSFVWFKYSALASLFQMKSLESLPSFEQELSD-----VLEH 495 Query: 337 CFSLSV--KKANFRISDVDCI 355 +F+I + + Sbjct: 496 LLVYLAWDSHYDFKIMPLSSL 516 >gi|325690859|gb|EGD32860.1| rhamnosyltransferase [Streptococcus sanguinis SK115] Length = 556 Score = 63.0 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 73/248 (29%), Gaps = 29/248 (11%) Query: 120 IVVHLYYIDL--WIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENH 176 +++H++ DL + + N L +LS + VT+ E L ++ Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L +Y Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQDYAYIGHL----STHRIMENQAVFDQAMRSDLINLMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ R + L R + + G+ + Sbjct: 397 DASIEALEQESAVGLVIPDLPRLVRDGLFESEPL---RPRLAAIWQEAGLHKSFDFMTPP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 348 RISDVDCI 355 +I + + Sbjct: 509 KIMPLSSL 516 >gi|326772087|ref|ZP_08231372.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505] gi|326638220|gb|EGE39121.1| hypothetical protein HMPREF0059_00469 [Actinomyces viscosus C505] Length = 681 Score = 62.6 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 33/91 (36%), Gaps = 4/91 (4%) Query: 221 RWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAG 280 R LL +PG +I F H +G+ A + + L+ Sbjct: 439 RQALDCLLDSPGYTAGLIDLFARHPGLGVAMPAAGHIGQAHG--GPTWDGLAGAAKALSR 496 Query: 281 RMGITFQDQKLDF--FAGTMFWVRTEALDPI 309 R+G++ + L G MF R EAL + Sbjct: 497 RLGLSAELDPLAPVAPPGAMFMARPEALRTL 527 >gi|324993910|gb|EGC25829.1| rhamnosyltransferase [Streptococcus sanguinis SK405] gi|324994771|gb|EGC26684.1| rhamnosyltransferase [Streptococcus sanguinis SK678] Length = 556 Score = 61.8 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 71/242 (29%), Gaps = 29/242 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS ++ +T E L ++ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 R L L + E L +Y Y+ + S + G R L ++ Sbjct: 346 SRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ P D + + + + Sbjct: 397 DASIEALEQESAVGLVIPD---LPRLVRDGLFESEPPLPSLTAVWQEAVLHKSFDFMTAP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E V +F Sbjct: 454 SLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIVWDSHYDF 508 Query: 348 RI 349 +I Sbjct: 509 KI 510 >gi|327489888|gb|EGF21677.1| rhamnosyltransferase [Streptococcus sanguinis SK1058] Length = 556 Score = 61.8 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 33/266 (12%), Positives = 73/266 (27%), Gaps = 28/266 (10%) Query: 95 PQLLHGWESPAMGKVMQ-IAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL-VT 152 P L E + + + + +H+ ++ + + L +LS ++ +T Sbjct: 262 PDLPCLLEDKYLSQATSNYRTDQPVLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQP 321 Query: 153 ESASIKSEILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--G 210 E L ++ R L L + E L +Y Y+ + S + G Sbjct: 322 EVLKQLQTALGHLGNKVQLVLSQKSRAWLAMLE--QKEILQDYAYIGHL----STHRLVG 375 Query: 211 YSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK 270 R L ++ I + +G++ P D Sbjct: 376 NQAVFDQAMRSDLINMMV---DYADASIEALEQESAVGLVIPD---LPRLVRDGLFESEP 429 Query: 271 NREMICTLAGRMGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKA 325 + + + + + G W + AL + ++ + Sbjct: 430 PLPSLTAVWQEAVLHKSFDFMTAPSLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQEL 489 Query: 326 LDGEIEHAVERCFSLSV--KKANFRI 349 D +E +F+I Sbjct: 490 SD-----VLEHLLVYIAWDSHYDFKI 510 >gi|327463172|gb|EGF09493.1| rhamnosyltransferase [Streptococcus sanguinis SK1] gi|327474781|gb|EGF20186.1| rhamnosyltransferase [Streptococcus sanguinis SK408] Length = 556 Score = 61.4 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 70/242 (28%), Gaps = 29/242 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS ++ +T E L ++ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 R L L + E L +Y Y+ + S + G R L ++ Sbjct: 346 SRAWLAMLE--QKEILQDYAYIGHL----STHRLVGNQAVFDQAMRSDLINMMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ P D + + + + Sbjct: 397 DASIEALEQESAVGLVIPD---LPRLVRDGLFESEPPLPSLTAVWQEAVLHKSFDFMTAP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFLWFKYSALTSLFRMKSLESLPSSEQELSD-----VLEHLLVYIAWDSHYDF 508 Query: 348 RI 349 +I Sbjct: 509 KI 510 >gi|327470704|gb|EGF16160.1| rhamnosyltransferase [Streptococcus sanguinis SK330] Length = 556 Score = 61.4 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 72/248 (29%), Gaps = 29/248 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS + VT+ E L ++ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L +Y Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQDYAYIGHL----STHRIMENQAVFDQAMRSDLINMMV---YYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ P D R + + G+ + Sbjct: 397 DTSIEALEQESAVGLVIPD---LPRLVRDGLFESEPPRPRLAAIWQEAGLHKSFDFMTPP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 348 RISDVDCI 355 +I + + Sbjct: 509 KIMPLSSL 516 >gi|325067617|ref|ZP_08126290.1| hypothetical protein AoriK_07344 [Actinomyces oris K20] Length = 233 Score = 61.1 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 13/133 (9%) Query: 231 PGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQK 290 PG V +I F H +G+ A + + L+ R+G+T + Sbjct: 1 PGYVAGLIDLFARHPGLGVAMPAAGHIGQAHG--GATWDGLAGAATALSRRLGLTVELDP 58 Query: 291 LDF--FAGTMFWVRTEALDPIKN-----LRLSRYFEPKVHKALD----GEIEHAVERCFS 339 L G MF R AL + +RL+ P ++++ +E Sbjct: 59 LAPVVPVGAMFLARPAALRTLSEGAKELVRLTEQAAPSAEQSVERLKRARAAEVLELLTV 118 Query: 340 LSVKKANFRISDV 352 + + + +V Sbjct: 119 YAAMSSGYHPREV 131 >gi|332361934|gb|EGJ39736.1| rhamnosyltransferase [Streptococcus sanguinis SK49] Length = 556 Score = 60.3 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 75/247 (30%), Gaps = 27/247 (10%) Query: 120 IVVHLYYIDL--WIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENH 176 +++H++ DL + + + L +LS + VT+ E L ++ Sbjct: 286 VLLHIHVTDLPIFQQYQDKLFSLSSQYQYLVTVAQPEMLKQLQTALGHLGNKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG-YSWWEGDLWRRWLFYDLLGAPGVVF 235 + L + E L NY Y+ + S + + D R +L+ Sbjct: 346 SYAWIAMLE--QKEILQNYAYIGHL----STHRLVENQAAFDQTMRSDLINLMV--DYAD 397 Query: 236 KIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFFA 295 I + +G++ P D R + + G+ + + Sbjct: 398 ASIEALEKEAAVGLVIPD---LPRLVRDGLFESEPARPRLAAIWQEAGLHKSFDFMTAPS 454 Query: 296 -----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANFR 348 G W + AL + ++ + D +E+ +F+ Sbjct: 455 LTRVYGGFLWFKYSALASLFPMKSLENSPSSEQELSD-----VLEQLLVYLAWDNHYDFK 509 Query: 349 ISDVDCI 355 I + + Sbjct: 510 IMPLSSL 516 >gi|328946538|gb|EGG40677.1| rhamnosyltransferase [Streptococcus sanguinis SK1087] Length = 556 Score = 60.3 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 70/249 (28%), Gaps = 31/249 (12%) Query: 120 IVVHLYYIDLWIEIANL---LSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMEN 175 +++H++ D L +LS + VT E L ++ Sbjct: 286 VLLHIHVTDF-PIFQQYQDNLFSLSSQYQYLVTTGQPEVLKQLQTSLAHLGNKVQIVLSQ 344 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGV 233 L L + E L NY Y+ + S + R L ++ + Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMVDSAD- 397 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 I + + D+G++ P D R + ++ G+ + Sbjct: 398 --ASIEALEKNSDLGLVIPD---LPRLVRDGLFESEPPRPRLTSVWQDAGLHKSFNFMST 452 Query: 294 FA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKAN 346 + G W + AL ++ + D +E + Sbjct: 453 PSLTRVYGGFLWFKYSALASWFQMKSLESLPSSDQELSD-----VLEHLLVYLAWDSHYD 507 Query: 347 FRISDVDCI 355 F+I + + Sbjct: 508 FKIMPLSSL 516 >gi|325694904|gb|EGD36809.1| rhamnosyltransferase [Streptococcus sanguinis SK150] Length = 556 Score = 60.3 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 30/253 (11%), Positives = 66/253 (26%), Gaps = 27/253 (10%) Query: 107 GKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIF 165 + + +H+ ++ + L +L+ + VT E L Sbjct: 275 QSTSSYRTNQSVLLHIHVTNFPIFQQYQEKLFSLASQYQYLVTTNLPEMLKQLQTALAHL 334 Query: 166 PAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWL 223 ++ +L L + E L NY Y+ + S + R L Sbjct: 335 DDKVQIVLSQKSHALLAMLE--QKEILQNYVYIGHL----STHRIMENQAVFDQAMRSDL 388 Query: 224 FYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMG 283 ++ I + +G++ P D + + G Sbjct: 389 INMMV---DYADASIEALEQESAVGLVIPD---LPRLVRDGLFESEPPLPSLTAVWQEAG 442 Query: 284 ITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCF 338 + + + G W + AL + ++ + D +E Sbjct: 443 LHKSFDFMTAPSLTRVYGGFLWFKYSALTSLFQMKSLESLPSSEQELSD-----VLEHLL 497 Query: 339 SLSV--KKANFRI 349 +F+I Sbjct: 498 VYIAWDSHYDFKI 510 >gi|332359457|gb|EGJ37277.1| rhamnosyltransferase [Streptococcus sanguinis SK355] Length = 556 Score = 59.5 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 64/243 (26%), Gaps = 31/243 (12%) Query: 120 IVVHLYYIDLWIEIANL---LSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMEN 175 +++H++ D L +L + VT E L ++ Sbjct: 286 VLLHIHVTDF-PIFLQYQDKLFSLLSQYQYLVTTDQPEVLKQLQTALVHLGNKVQLVLSQ 344 Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGV 233 L L + E L NY Y+ + S + R L ++ Sbjct: 345 KSHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAIFDQAIRSDLINMMV---DY 395 Query: 234 VFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF 293 I + +G++ P D R + + G+ + Sbjct: 396 ADASIEALEQESALGLVIPD---LPRLVRDGLFESEPPRSWLAAVWQEAGLHKSFDFMIA 452 Query: 294 FA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKAN 346 + G W + AL + ++ + D +E + Sbjct: 453 PSLTRVYGGFLWFKYSALASVFQMKSLESLPSSDQELSD-----VLEHLLVYLAWDSHYD 507 Query: 347 FRI 349 F+I Sbjct: 508 FKI 510 >gi|332362506|gb|EGJ40306.1| rhamnosyltransferase [Streptococcus sanguinis SK1056] Length = 556 Score = 59.5 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 73/248 (29%), Gaps = 29/248 (11%) Query: 120 IVVHLYYIDL--WIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENH 176 +++H++ DL + + N L +LS + VT+ E L ++ Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L +Y Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQDYAYIGHL----STHRIMENQAVFDQAMRSDLINLMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ R + L R + + G+ + Sbjct: 397 DASIEALEQESAVGLVIPDLPRLVRDGLFESEPL---RPRLAAIWQEAGLHKSFDFMTPP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFVWFKYSALASVFRMKSLESLPSSEQEFSD-----VLEHLLVYLAWDNHYDF 508 Query: 348 RISDVDCI 355 +I + + Sbjct: 509 KIMPLSSL 516 >gi|325696073|gb|EGD37964.1| rhamnosyltransferase [Streptococcus sanguinis SK160] Length = 556 Score = 59.5 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 31/264 (11%), Positives = 78/264 (29%), Gaps = 37/264 (14%) Query: 109 VMQIAIKAKIA--IVVHLYYIDLWIEIANL---LSNLSISFDLHVTL--VTESASIKSEI 161 + Q + A +++H++ D + L +LS + VT+ +++ + Sbjct: 273 LSQATSNYRAAQPVLLHIHVTDF-PIFQHYQDKLFSLSSQYQYLVTVAQPEMLKQLQTAL 331 Query: 162 LKIFPAARIHIMENHGRDVLPFLI-LLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDL 218 + ++ + + +L L + E L +Y Y+ + S + Sbjct: 332 AHLGDKVQLVLSQAS----HAWLAMLDQKEILQDYAYIGHL----STHRLVENQAVFDQA 383 Query: 219 WRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTL 278 R L ++ I + +G++ P D R + + Sbjct: 384 MRSDLINMMV---YYADTSIEALEQESAVGLVIPD---LPRLVRDGLFESEPPRPRLAAI 437 Query: 279 AGRMGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHA 333 + + + G W + AL + ++ + D Sbjct: 438 WQEADLHKSFDCMTPPSLTRVYGGFVWFKYSALASLFQMKSLESLPSSEQELSD-----V 492 Query: 334 VERCFSLSV--KKANFRISDVDCI 355 +E +F+I + + Sbjct: 493 LEHLLVYLAWDSHYDFKIMPLSSL 516 >gi|327461067|gb|EGF07400.1| rhamnosyltransferase [Streptococcus sanguinis SK1057] Length = 556 Score = 59.1 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 73/248 (29%), Gaps = 29/248 (11%) Query: 120 IVVHLYYIDL--WIEIANLLSNLSISFDLHVTLV-TESASIKSEILKIFPAARIHIMENH 176 +++H++ DL + + N L +LS + VT+ E L ++ Sbjct: 286 VLLHIHVTDLPIFQQYQNKLFSLSSQYQYLVTVTQPEMLKQLQTTLAHLGDKVQLVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L +Y Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQDYAYIGHL----STHRIMENQAVFDQAMRSDLINLMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF 294 I + +G++ R + L R + + G+ + Sbjct: 397 DASIEALEQESAVGLVIPDLPRLVRDGLFESEPL---RPRLAAIWQEAGLHKSFDFMTPP 453 Query: 295 A-----GTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 + G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFVWFKYSALASVFRMKSLESLPSSEQEFSD-----VLEHLLVYLAWDNHYDF 508 Query: 348 RISDVDCI 355 +I + + Sbjct: 509 KIMPLSSL 516 >gi|332367199|gb|EGJ44935.1| rhamnosyltransferase [Streptococcus sanguinis SK1059] Length = 556 Score = 58.7 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/281 (11%), Positives = 79/281 (28%), Gaps = 46/281 (16%) Query: 95 PQLLHGWESPAMGKVMQ-IAIKAKIAIVVHL-------YYIDLWIEIANLLSNLSISFDL 146 P L + + + + + +H+ +Y D L +LS + Sbjct: 262 PDLPCLLQDKYLSQATSNYRADQPVLLHIHVTDFPIFQHYQD-------KLFSLSSQYQY 314 Query: 147 HVTL--VTESASIKSEILKIFPAARIHIMENHGRDVLPFLI-LLETEQLSNYDYVCKIHG 203 VT+ +++ + + ++ + + +L L + E L +Y Y+ + Sbjct: 315 LVTVAQPEMLKQLQTALAHLGDKVQLVLSQAS----HAWLAMLDQKEILQDYAYIGHL-- 368 Query: 204 KKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKY 261 S + R L ++ I + +G++ P Sbjct: 369 --STHRLVENQAVFDQAMRSDLINMMV---YYADTSIEALEQESAVGLVIPD---LPRLV 420 Query: 262 CDYTCSLGKNREMICTLAGRMGITFQDQKLDFFA-----GTMFWVRTEALDPIKNLRLSR 316 D R + + + + + G W + AL + ++ Sbjct: 421 RDGLFESEPPRPRLAAIWQEADLHKSFDFMTPPSLTRVYGGFVWFKYSALASLFQMKSLE 480 Query: 317 YFEPKVHKALDGEIEHAVERCFSLSV--KKANFRISDVDCI 355 + D +E +F+I + + Sbjct: 481 SLPSSEQELSD-----VLEHLLVYLAWDSHYDFKIMPLSSL 516 >gi|315221431|ref|ZP_07863352.1| rhamnan synthesis protein F [Streptococcus anginosus F0211] gi|315189550|gb|EFU23244.1| rhamnan synthesis protein F [Streptococcus anginosus F0211] Length = 555 Score = 58.4 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 48/174 (27%), Gaps = 23/174 (13%) Query: 182 PFLILL-ETEQLSNYDYVCKIHGKKSKRKGYSWWEG-DLW-RRWLFYDLLGAPGVVFKII 238 P L + + E+L Y Y+ + S G D W R LF ++ I Sbjct: 348 PLLAMFAQAERLKTYKYIGHL----STHTLIPEVAGLDQWMRDDLFNMMIENMNY---SI 400 Query: 239 RTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDFF---- 294 + ++G+I Y L +E + L + + D Sbjct: 401 NALEHCSNLGLIIPDLPSVVRNGLFYQKPL---KEEMEKLWKLLSCRKSFKFTDAVTLTR 457 Query: 295 -AGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANF 347 G W + EA++ + I E N+ Sbjct: 458 VYGGWMWFKYEAVESLFKASFKT--FSSYSLQEQSTIL---ENLLVYVAWDKNY 506 >gi|323351266|ref|ZP_08086922.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sanguinis VMC66] gi|322122490|gb|EFX94201.1| alpha-L-Rha alpha-1,2-L-rhamnosyltransferase/alpha-L-Rha alpha-1,3-L-rhamnosyltransferase [Streptococcus sanguinis VMC66] Length = 556 Score = 56.4 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 71/242 (29%), Gaps = 29/242 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS ++ +T E L ++ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYEYLLTTNQPEVLKQLQTALGHLGNKVQIVLSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 R L L + E L +Y Y+ + S + R L ++ Sbjct: 346 SRAWLAMLE--QKEILQDYAYIGHL----STHRLVENQAVFDQAMRSDLINLMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF- 293 I + +G++ R T L R + + G+ + Sbjct: 397 DASIEALEQESAVGLVIPDLPRLVRDGLFETEPL---RPSLSAVWQEAGLHKSFDFMTAS 453 Query: 294 ----FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPSSDQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 348 RI 349 +I Sbjct: 509 KI 510 >gi|125718317|ref|YP_001035450.1| lipopolysaccharide biosynthesis protein [Streptococcus sanguinis SK36] gi|125498234|gb|ABN44900.1| Lipopolysaccharide biosynthesis protein, putative [Streptococcus sanguinis SK36] Length = 556 Score = 53.0 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 68/242 (28%), Gaps = 29/242 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS + +T E L I+ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L NY Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQAMRSDLINMMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF- 293 I + G++ P D + R + + G+ + Sbjct: 397 DASIEALEQDSAEGLVIPD---LPRLVRDGLFEIEPPRPSLSAVWQEAGLHKSFDFMTAS 453 Query: 294 ----FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPSSDQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 348 RI 349 +I Sbjct: 509 KI 510 >gi|325687210|gb|EGD29232.1| rhamnosyltransferase [Streptococcus sanguinis SK72] Length = 556 Score = 52.2 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 68/242 (28%), Gaps = 29/242 (11%) Query: 120 IVVHLYYID--LWIEIANLLSNLSISFDLHVTL-VTESASIKSEILKIFPAARIHIMENH 176 +++H++ D ++ + + L +LS + +T E L I+ Sbjct: 286 VLLHIHVTDFPIFQQYQDKLFSLSSQYQYLLTTNQPEVLKQLQTALGHLGNKVQIILSQK 345 Query: 177 GRDVLPFLILLETEQLSNYDYVCKIHGKKSKRK--GYSWWEGDLWRRWLFYDLLGAPGVV 234 L L + E L NY Y+ + S + R L ++ Sbjct: 346 SHAWLAMLE--QKEILQNYAYIGHL----STHRLVENQAVFDQTMRSDLINMMV---DYA 396 Query: 235 FKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQKLDF- 293 I + G++ P D + R + + G+ + Sbjct: 397 DASIEALEQDSAEGLVIPD---LPRLVRDGLFEIEPPRPSLSAVWQEAGLHKSFDFMTAS 453 Query: 294 ----FAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSV--KKANF 347 G W + AL + ++ + D +E +F Sbjct: 454 SLTRVYGGFLWFKNSALASLFQMKSLESLPSSDQELSD-----VLEHLLVYLAWDSHYDF 508 Query: 348 RI 349 +I Sbjct: 509 KI 510 >gi|254431846|ref|ZP_05045549.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001] gi|197626299|gb|EDY38858.1| hypothetical protein CPCC7001_1737 [Cyanobium sp. PCC 7001] Length = 205 Score = 47.2 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 55/182 (30%), Gaps = 23/182 (12%) Query: 172 IMENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAP 231 + N+G D F L + S+ K+ KKS G G W + + Sbjct: 29 RVTNYGEDWSSFHHLFYSGAFSSRGATFKLQTKKSSNLGADG--GMAWVDEALQPIASSY 86 Query: 232 GVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITF-QDQK 290 +I+ + + G N +++ R+G+ ++ Sbjct: 87 RATATVIKNLK---------AGTIKLAASKLCKRTGFGANPQLVAEYIHRLGLNEQSAKR 137 Query: 291 LDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGE-------IEHAVERCFSLSVK 343 F G+MF + L + L DG HA+ER F Sbjct: 138 QSFCMGSMFAADND-LIQLFYSSLGDVDY---RITSDGGSQFCGRYPGHAIERAFFYYSY 193 Query: 344 KA 345 +A Sbjct: 194 QA 195 >gi|160894490|ref|ZP_02075266.1| hypothetical protein CLOL250_02042 [Clostridium sp. L2-50] gi|156863801|gb|EDO57232.1| hypothetical protein CLOL250_02042 [Clostridium sp. L2-50] Length = 783 Score = 46.8 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 46/290 (15%), Positives = 81/290 (27%), Gaps = 48/290 (16%) Query: 38 FFFWFWTLFYKRSKKLCYDENYVVAYGSRSGKKFFAQSNLYMMERELHFDGQRIHHFPQL 97 F+ LF+ K+ D + + SR +F Q Y L + + Sbjct: 227 CPFFKRKLFFYPMKEKIVDSSSYI---SRVFYQFIKQETPYDETLILKNLIRTAP-MQDI 282 Query: 98 LHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLS------ISFDLHVTLV 151 + G + I KIA+V+ E L + DL+ Sbjct: 283 VDGLGLNVIPDSECERIAEKIAVVI--------DEDF-YLQHQPDIDDLETHADLYYWGS 333 Query: 152 TESASIKSE-------ILKIFPAARIHIMENHGRDVLPFLILLETEQLSNYDYVCKI-HG 203 ES K A ++ V F Y+Y+C + + Sbjct: 334 EESFHQKKNWEEMHLLECTTGNFAEVYYA------VGAF--------AKEYEYICFLVNE 379 Query: 204 KKSKRKGYSWWEGDLWRRWLFYD-LLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYC 262 +S W + + +LG + I+ + + IG++ Y Sbjct: 380 DRSYIAENLDNGHTGW---IIENSILGKGVSLGNIVSCLNDNSGIGLVYPPNSSQSLYYS 436 Query: 263 DYTCSLGKNREMICTLAGRMGITFQDQKLDFFAG---TMFWVRTEALDPI 309 I + I K+ G FW R++ L + Sbjct: 437 RQYKERELISCEIQQILEDSDIHLNIAKVRGSIGQYTGCFWCRSQVLQNL 486 >gi|254788145|ref|YP_003075574.1| glycosyltransferase family 2 domain-containing protein [Teredinibacter turnerae T7901] gi|237685039|gb|ACR12303.1| glycosyltransferase family 2 domain protein [Teredinibacter turnerae T7901] Length = 307 Score = 46.0 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 48/131 (36%), Gaps = 32/131 (24%) Query: 128 DLWIEIANLLSNLSISFDLHVTL----VTESASIKSEILKIFPAARIHIMENHG-RDVLP 182 + + + + N ++ DL V + E+ +I ++ + +P ++ EN G R V P Sbjct: 14 EFMRKTLDSVCNQTVPPDLWVVVDDGSTDETPAILADYSERYPFIQVITRENRGHRSVGP 73 Query: 183 -----FLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKI 237 F + +S +DYVCK L DL P + Sbjct: 74 GVIEAFYYGYDKIDVSQFDYVCKF--------------------DLDLDL--PPRYFEIL 111 Query: 238 IRTFDTHRDIG 248 I + + +G Sbjct: 112 IDRMEKNPRLG 122 >gi|312278781|gb|ADQ63438.1| Rhamnosyltransferase [Streptococcus thermophilus ND03] Length = 547 Score = 44.5 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 43/198 (21%), Positives = 77/198 (38%), Gaps = 28/198 (14%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAARIHIME-- 174 AI VH+ DL E + S+ +S + TL + + +++ +L F R ++ Sbjct: 287 AIHVHI--SDL--ERLKVFSDKKLSAFYYFTLSSHLDKNIVENTLLNSFDKDRFQLVSQK 342 Query: 175 --NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPG 232 NH + L Q S YD+V H + G D R ++L Sbjct: 343 FDNH---YYALVSLAS--QFSEYDFVGHFHT---EDFGNEGKFVDEATRLALVNMLLDEE 394 Query: 233 VVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK-NREMICTLAGRMGITFQDQKL 291 V I F ++G++ + + +T ++G N+ L T ++ L Sbjct: 395 RVASIFDHF---PEVGLVFADLSKELY----WTDAIGTLNQNQAAKLDNECQKTIKNS-L 446 Query: 292 DFFAGTMFWVRTEALDPI 309 F G+M W+ + L+ I Sbjct: 447 HVFQGSM-WLSKDFLEKI 463 >gi|228477260|ref|ZP_04061898.1| rhamnosyltransferase [Streptococcus salivarius SK126] gi|228251279|gb|EEK10450.1| rhamnosyltransferase [Streptococcus salivarius SK126] Length = 547 Score = 43.0 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 47/226 (20%), Positives = 86/226 (38%), Gaps = 26/226 (11%) Query: 91 IHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYIDLWIEIANLLSNLSISFDLHVTL 150 +H P L + +G K + +V+H++ DL E + + +S + TL Sbjct: 257 RYHSPDSLVISDEKIIGPSFITLSKHEYRMVIHVHISDL--ERLKVFFDSKLSAFYYFTL 314 Query: 151 VT--ESASIKSEILKIFPAARIHIM----ENHGRDVLPFLILLETEQLSNYDYVCKIHGK 204 + + +++ +L F R ++ ENH + L LS YD+V H Sbjct: 315 SSHLDKNKVENTLLNSFDKDRFQLVSKTFENH---YHALVFLAS--HLSEYDFVGHFHT- 368 Query: 205 KSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDY 264 + G D R ++L V I F ++G++ + + + Sbjct: 369 --EAFGNEGKLVDEDTRHALVNMLSDEEKVVSIFDHF---PEVGLVFADLSKELY----W 419 Query: 265 TCSLGK-NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPI 309 T ++G N+E LA + L F G M W+ + L+ I Sbjct: 420 TDAIGTLNQEQTVKLANEC-RKVINHSLHVFQGFM-WLTKDFLENI 463 >gi|156740479|ref|YP_001430608.1| glycosyl transferase family protein [Roseiflexus castenholzii DSM 13941] gi|156231807|gb|ABU56590.1| glycosyl transferase family 2 [Roseiflexus castenholzii DSM 13941] Length = 303 Score = 41.8 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 66/224 (29%), Gaps = 57/224 (25%) Query: 143 SFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFL-------ILLETEQLSNY 195 SFDL V S + FP ++ ++EN L + L Y Sbjct: 35 SFDLLVIDNASSDGTPEVVRARFP--QVKVIEN-----GANLGFAAGNNVGLRYALRHGY 87 Query: 196 DYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAY 255 DY L D AP + ++I ++ IG +G Sbjct: 88 DYAL----------------------LLNNDTEVAPDFLARLIDAAESDPKIGAVGPTIT 125 Query: 256 RYPNKYCDY----TCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALD--PI 309 Y + + ++ G + +DF G V+ L+ + Sbjct: 126 YYERPDLIWSAGGMIDWKRGIAVMRANEVDSGQYHEPADVDFVTGCALLVKRAVLERAGL 185 Query: 310 KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVD 353 + R YFE + + +++A FRI V Sbjct: 186 LDERFFMYFEETE---------------WCVRIRRAGFRILHVP 214 >gi|322510485|gb|ADX05799.1| putative N-acetyl glucosaminyl transferase [Organic Lake phycodnavirus 1] Length = 690 Score = 41.4 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 34/94 (36%), Gaps = 23/94 (24%) Query: 121 VVHLY--------YIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAARIHI 172 +H Y Y D +++ F + VT K+E + + Sbjct: 314 HLHCYDISQFTTIYKDYIYDLSKY-------FHIIVTYTIGYLDKKNEYITLLK------ 360 Query: 173 MENHGRDVLP--FLILLETEQLSNYDYVCKIHGK 204 + N+G D+ ++ ++ +Y Y+ +H K Sbjct: 361 IPNNGYDIGAKMMMVKYLKDKNIDYKYIYFMHSK 394 >gi|148658318|ref|YP_001278523.1| glycosyl transferase family protein [Roseiflexus sp. RS-1] gi|148570428|gb|ABQ92573.1| glycosyl transferase, family 2 [Roseiflexus sp. RS-1] Length = 300 Score = 40.6 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 42/225 (18%), Positives = 64/225 (28%), Gaps = 57/225 (25%) Query: 143 SFDLHVTLVTESASIKSEILKIFPAARIHIMENHGRDVLPFL-------ILLETEQLSNY 195 SFDL V S + FP ++ ++EN L + L Y Sbjct: 32 SFDLLVVDNASSDGTPQAVRAQFP--QVTVIEN-----GANLGFAAGNNVGLRYALRHGY 84 Query: 196 DYVCKIHGKKSKRKGYSWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAY 255 DY L D AP + ++I + DIG G Sbjct: 85 DYAL----------------------LLNNDTEVAPDFLTRLIDVAGSDPDIGAAGPTIT 122 Query: 256 RYPNKYCDY----TCSLGKNREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALD--PI 309 Y + + T G + +DF G V+ AL + Sbjct: 123 YYERPDLIWSAGGMIDWRHGIARMRTNEVDSGQYCEPADVDFVTGCALLVKRAALQRAGL 182 Query: 310 KNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKANFRISDVDC 354 + R YFE + + +++A FRI V Sbjct: 183 LDERFFMYFEETE---------------WCVRIRRAGFRIVHVPQ 212 >gi|312863645|ref|ZP_07723883.1| conserved hypothetical protein [Streptococcus vestibularis F0396] gi|311101181|gb|EFQ59386.1| conserved hypothetical protein [Streptococcus vestibularis F0396] Length = 262 Score = 40.3 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 45/202 (22%), Positives = 76/202 (37%), Gaps = 36/202 (17%) Query: 119 AIVVHLYYIDLWIEIANLLSNLSISFDLHVTLVT--ESASIKSEILKIFPAARIHIME-- 174 AI VH+ DL E + + +S + TL + +++ +L F R I+ Sbjct: 2 AIHVHI--SDL--ERLKVFFDSKLSAFYYFTLSGHLDKNQVENNLLNSFDKDRFQIVSQK 57 Query: 175 --NHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEG----DLWRRWLFYDLL 228 NH + L QLS YD++ H EG + R L LL Sbjct: 58 FDNH---YHALVSLAS--QLSEYDFIGHFHT------ADFGNEGKLVDEATRLALIDMLL 106 Query: 229 GAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGK-NREMICTLAGRMGITFQ 287 V I F ++G++ + + +T ++G N+ L T + Sbjct: 107 -DEKKVSSIFADF---PEVGLVFADLSKELY----WTDAIGTLNQNQAAKLDNECQKTIK 158 Query: 288 DQKLDFFAGTMFWVRTEALDPI 309 + L F G+M W+ + L+ I Sbjct: 159 NS-LHVFQGSM-WLSKDFLEKI 178 >gi|168216438|ref|ZP_02642063.1| putative glycosyl transferase [Clostridium perfringens NCTC 8239] gi|182381532|gb|EDT79011.1| putative glycosyl transferase [Clostridium perfringens NCTC 8239] Length = 289 Score = 39.5 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 46/139 (33%), Gaps = 29/139 (20%) Query: 226 DLLGAPGVVFKIIRTFDTHRDIGMIGSRA---------------YRYPNKYCDYTCSLGK 270 D+L + +I + + GMI +Y + L K Sbjct: 92 DVLIEEKCIINLINELNKNSTFGMITGTMLNSNKEISKGLAWKLPKYKDDIITSFIVLNK 151 Query: 271 NREMICTLAGRMGITFQDQKLDFFAGTMFWVRTEALDPIKNLRLSRYFEPKVHKALDGEI 330 + + ++ + ++D G+ F + EAL ++ F + Sbjct: 152 IFKPLEYNNKKL-LNNTLNEVDVIPGSFFIAKAEALKKVR-------FFDED------TF 197 Query: 331 EHAVERCFSLSVKKANFRI 349 + ER S +KK N++I Sbjct: 198 LYCEERILSYKMKKMNYKI 216 >gi|109521889|ref|YP_655326.1| gp49 [Mycobacterium phage Pipefish] gi|88910620|gb|ABD58546.1| gp49 [Mycobacterium phage Pipefish] Length = 354 Score = 38.3 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 32/91 (35%), Gaps = 22/91 (24%) Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 + +G V +++R RA +P Y + KN + LA R Sbjct: 107 HMSDMWIGPAAWVGEVLRA----------EGRALHFPTAYDRFALPF-KNETGVRELARR 155 Query: 282 MGITFQDQKLDFFAGT-----------MFWV 301 +G T QD + F AGT M W Sbjct: 156 VGQTVQDNGVKFPAGTPWVGTAFSQGSMIWC 186 >gi|229112680|ref|ZP_04242216.1| Glycosytransferase [Bacillus cereus Rock1-15] gi|228670812|gb|EEL26120.1| Glycosytransferase [Bacillus cereus Rock1-15] Length = 355 Score = 37.9 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 8/66 (12%) Query: 115 KAKIAIVVH-----LYYIDLWIEIANLLSNLSISFDLHVTLVTESASIKSEILKIFPAAR 169 K K+ + +H ++Y + +I N + + D+ +TL S K +I I Sbjct: 99 KKKVVLHIHGAEFMVFYQESSEDIRNQIREILNQVDVIITL---SQKWKEDIESITNNRN 155 Query: 170 IHIMEN 175 + ++ N Sbjct: 156 VKVIYN 161 >gi|222435717|ref|YP_002564145.1| gp47 [Mycobacterium phage Phlyer] gi|222088270|gb|ACM42211.1| gp47 [Mycobacterium phage Phlyer] Length = 354 Score = 37.9 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 32/91 (35%), Gaps = 22/91 (24%) Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 + +G V +++R RA +P Y + KN + LA R Sbjct: 107 HMSDMWIGPAAWVGEVLRA----------EGRALHFPTAYDRFALPF-KNETGVRELARR 155 Query: 282 MGITFQDQKLDFFAGT-----------MFWV 301 +G T QD + F AGT M W Sbjct: 156 VGQTVQDNGVKFPAGTPWVGTAFSQGSMIWC 186 >gi|194303242|ref|YP_002014656.1| gp45 [Mycobacterium phage Phaedrus] gi|194150974|gb|ACF34009.1| gp45 [Mycobacterium phage Phaedrus] Length = 354 Score = 37.9 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 32/91 (35%), Gaps = 22/91 (24%) Query: 222 WLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGR 281 + +G V +++R RA +P Y + KN + LA R Sbjct: 107 HMSDMWIGPAAWVGEVLRA----------EGRALHFPTAYDRFALPF-KNETGVRELARR 155 Query: 282 MGITFQDQKLDFFAGT-----------MFWV 301 +G T QD + F AGT M W Sbjct: 156 VGQTVQDNGVKFPAGTPWVGTAFSQGSMIWC 186 >gi|326784138|ref|YP_004324531.1| hypothetical protein Syn1_160 [Prochlorococcus phage Syn1] gi|310004870|gb|ADO99261.1| hypothetical protein Syn1_160 [Prochlorococcus phage Syn1] Length = 398 Score = 37.9 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 29/110 (26%), Positives = 44/110 (40%), Gaps = 18/110 (16%) Query: 36 GGFFFWFWTLFYKRSKKLCYDENYV---------------VAYGSRSGKKFFAQSNLYMM 80 G FF ++F K K+CY E + + Y S+ G K Q +L Sbjct: 67 GRFFVGTKSVFNKTEPKICYSEKAIDKLYEGDLAEKLKFALRYFSQLGIKGVIQGDLMFT 126 Query: 81 EREL---HFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLYYI 127 + L DG+R+H F + P + + A AKI +V H +Y Sbjct: 127 DSTLKTETVDGERLHTFRPNTITYAIPVDHPIGKAAGAAKIGVVFHTHYT 176 >gi|328948788|ref|YP_004366125.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens DSM 2489] gi|328449112|gb|AEB14828.1| 1-deoxy-D-xylulose-5-phosphate synthase [Treponema succinifaciens DSM 2489] Length = 589 Score = 37.6 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 EN G D+ + L E + ++ V IH K K + + W Sbjct: 215 EN-GNDIGAMIALFEKVKDIDHPVVLHIHTLKGKGYAPAEKNKEAW 259 >gi|153811516|ref|ZP_01964184.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174] gi|149832257|gb|EDM87342.1| hypothetical protein RUMOBE_01908 [Ruminococcus obeum ATCC 29174] Length = 589 Score = 37.6 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 11/46 (23%), Positives = 21/46 (45%), Gaps = 1/46 (2%) Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 EN G D+ + L + +++ V IH +K K + + + W Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263 >gi|153814764|ref|ZP_01967432.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756] gi|145847795|gb|EDK24713.1| hypothetical protein RUMTOR_00979 [Ruminococcus torques ATCC 27756] Length = 589 Score = 37.6 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 11/46 (23%), Positives = 21/46 (45%), Gaps = 1/46 (2%) Query: 174 ENHGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKGYSWWEGDLW 219 EN G D+ + L + +++ V IH +K K + + + W Sbjct: 219 EN-GNDIASLISLFRKVKDTDHPIVVHIHTQKGKGYEIAEKDKEGW 263 >gi|146341712|ref|YP_001206760.1| glycosyl transferase family protein [Bradyrhizobium sp. ORS278] gi|146194518|emb|CAL78543.1| putative glycosyltransferase (family 2) [Bradyrhizobium sp. ORS278] Length = 316 Score = 37.6 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 6/57 (10%) Query: 151 VTESASIKSEILKIFPAARIHIMENHG-RDVLP-----FLILLETEQLSNYDYVCKI 201 E+ +I E + P R+ + G R V P F LET +L ++DYVCK+ Sbjct: 45 TDETPAILDEYARRLPYLRVVRRADRGGRKVGPGVIEAFYAGLETVRLEDFDYVCKL 101 >gi|297569722|ref|YP_003691066.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2] gi|296925637|gb|ADH86447.1| glycosyl transferase family 2 [Desulfurivibrio alkaliphilus AHT2] Length = 318 Score = 37.2 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 32/84 (38%), Gaps = 10/84 (11%) Query: 128 DLWIEIANLLSNLSISFDLHVTL----VTESASIKSEILKIFPAARIHIMENHG-RDVLP 182 D + + ++ DL V + ++ I +E + +I N G R V P Sbjct: 17 DYMRHTLDSMVAQTVRPDLWVIVDDGSTDQTPQILAEYAAKYDFIKIVPKANRGHRSVGP 76 Query: 183 -----FLILLETEQLSNYDYVCKI 201 F + +++Y+CK+ Sbjct: 77 GVIEAFYAGYRAVRPDDFEYICKL 100 >gi|261338088|ref|ZP_05965972.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum DSM 20093] gi|270276707|gb|EFA22561.1| 1-deoxy-D-xylulose-5-phosphate synthase [Bifidobacterium gallicum DSM 20093] Length = 639 Score = 37.2 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 46/147 (31%), Gaps = 24/147 (16%) Query: 176 HGRDVLPFLILLETEQLSNYDYVCKIHGKKSKRKG-----YSWWEG----DLWRRWLFYD 226 HG DV + L + +++ V IH K Y EG + W+ L Sbjct: 218 HGNDVHAMIQALREVKDTDHPVVLHIHTCKGLGLDQQDAHYGVLEGRCEANHWQNPLAQA 277 Query: 227 L--LGAPGVVFKII-----RTFDTHRDIGMIGSRAY-------RYPNKYCDYTCSLGKNR 272 LG+ + I + FD + +I + + + G Sbjct: 278 NAPLGSRKTYGRAIMAMLEQRFDEEPGLMVISPATPGSNGITREFRERAGAHYIDTGITE 337 Query: 273 EMICTLAGRMGITFQDQKLDFFAGTMF 299 E T A +G + + + T F Sbjct: 338 EHAVTFASGVG-KAGGRPVVATSATFF 363 >gi|282600847|ref|ZP_05979893.2| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] gi|282571128|gb|EFB76663.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] Length = 475 Score = 36.8 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 38/129 (29%), Gaps = 25/129 (19%) Query: 230 APGVVFKIIRTFDTHRDIGMIGSRAYRYPNKYCDYTCSLGKNREMICTLAGRMGITFQDQ 289 A G + + F +G++ +P +R ++ ++ + + Sbjct: 331 AEGRLAEAASLFAEKPLLGVLSPALPLWPKALQTVKSRWHADRSLLD---KKLSVPLGND 387 Query: 290 KLDFF-AGTMFWV--RTEALDPIKNLRLSRYFEPKVHKALDGEIEHAVERCFSLSVKKAN 346 AG W R++A P V + +D L ++ Sbjct: 388 PPPAPHAG---WALVRSDAF---------PQGVPIVEQEID-CW------LLPLIAQQNG 428 Query: 347 FRISDVDCI 355 + + + + Sbjct: 429 YYSATFESV 437 >gi|256784290|ref|ZP_05522721.1| transferase [Streptomyces lividans TK24] gi|289768176|ref|ZP_06527554.1| transferase [Streptomyces lividans TK24] gi|289698375|gb|EFD65804.1| transferase [Streptomyces lividans TK24] Length = 314 Score = 36.4 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 46/134 (34%), Gaps = 30/134 (22%) Query: 137 LSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIME-NHGRDVLPFLILLETEQLSNY 195 L L D+ V + ++ + + FP R+ + N G Sbjct: 46 LLALPERPDVLVVDNASTDDTRAMLARDFPQVRVLPLPYNRG------------------ 87 Query: 196 DYVCKIHGKKSKRKGY-SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA 254 + + HG ++ Y ++ + D W APG + R FD H +G++ +R Sbjct: 88 -ALARTHGVRALDTPYVAFSDDDSW---------WAPGALGAAARHFDDHPRLGLLSART 137 Query: 255 YRYPNKYCDYTCSL 268 P D L Sbjct: 138 LVGPGDAADPLNDL 151 >gi|153007604|ref|YP_001368819.1| glycosyl transferase family protein [Ochrobactrum anthropi ATCC 49188] gi|151559492|gb|ABS12990.1| glycosyl transferase family 2 [Ochrobactrum anthropi ATCC 49188] Length = 658 Score = 36.4 bits (83), Expect = 8.0, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 7/61 (11%) Query: 199 CKIHGKKSKRKGYSWWEGDLWRR-------WLFYDLLGAPGVVFKIIRTFDTHRDIGMIG 251 H K+ K G+ DL++R L D + G +++ F++ IG+ Sbjct: 60 LFHHIKREKNLGFLGNVNDLFKRATGDVVVLLNSDTILPAGWAERVLAAFESDPTIGLAC 119 Query: 252 S 252 Sbjct: 120 P 120 >gi|21224513|ref|NP_630292.1| transferase [Streptomyces coelicolor A3(2)] gi|4455734|emb|CAB36597.1| putative transferase [Streptomyces coelicolor A3(2)] Length = 317 Score = 36.4 bits (83), Expect = 8.1, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 46/134 (34%), Gaps = 30/134 (22%) Query: 137 LSNLSISFDLHVTLVTESASIKSEILKIFPAARIHIME-NHGRDVLPFLILLETEQLSNY 195 L L D+ V + ++ + + FP R+ + N G Sbjct: 46 LLALPERPDVLVVDNASTDDTRAMLARDFPQVRVLPLPYNRG------------------ 87 Query: 196 DYVCKIHGKKSKRKGY-SWWEGDLWRRWLFYDLLGAPGVVFKIIRTFDTHRDIGMIGSRA 254 + + HG ++ Y ++ + D W APG + R FD H +G++ +R Sbjct: 88 -ALARTHGVRALDTPYVAFSDDDSW---------WAPGALGAAARHFDDHPRLGLLSART 137 Query: 255 YRYPNKYCDYTCSL 268 P D L Sbjct: 138 LVGPGDAADPLNDL 151 >gi|289742609|gb|ADD20052.1| beta-1,6-N-acetylglucosaminyltransferase [Glossina morsitans morsitans] Length = 311 Score = 36.0 bits (82), Expect = 10.0, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 52/135 (38%), Gaps = 14/135 (10%) Query: 8 KGYFLFTSHFKSWNFYSDHFNIEKVNLWGGFFFWFWTLFYKRSKKLCYDENYVVAYGSRS 67 K Y L + F + N ++ + K + WG W LF K + E ++A +R Sbjct: 130 KDYGLLKTGFPAENVCNNSVLLVKTHEWGPKA---WALFSKALLLVRDPEKAILAEFNRQ 186 Query: 68 --GKKFFAQSNLYMMERELHFDGQRIHHFPQLLHGWESPAMGKVMQIAIKAKIAIVVHLY 125 G FA + Y R ++ LHGWE + KI ++ Sbjct: 187 SGGHVGFASPDRYKRTRGKYWQQ----FVANKLHGWELMNLNWARNFTGNIKI-----IF 237 Query: 126 YIDLWIEIANLLSNL 140 Y DL ++ +L N+ Sbjct: 238 YEDLVNKLEPVLRNI 252 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.313 0.151 0.476 Lambda K H 0.267 0.0464 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 7,184,896,618 Number of Sequences: 14124377 Number of extensions: 305538913 Number of successful extensions: 948982 Number of sequences better than 10.0: 244 Number of HSP's better than 10.0 without gapping: 250 Number of HSP's successfully gapped in prelim test: 190 Number of HSP's that attempted gapping in prelim test: 947188 Number of HSP's gapped (non-prelim): 569 length of query: 365 length of database: 4,842,793,630 effective HSP length: 140 effective length of query: 225 effective length of database: 2,865,380,850 effective search space: 644710691250 effective search space used: 644710691250 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 82 (36.0 bits)