BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781203|ref|YP_003065616.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62] (478 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781203|ref|YP_003065616.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62] gi|254040880|gb|ACT57676.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62] gi|317120669|gb|ADV02492.1| hypothetical protein SC1_gp035 [Liberibacter phage SC1] gi|317120813|gb|ADV02634.1| hypothetical protein SC1_gp035 [Candidatus Liberibacter asiaticus] Length = 478 Score = 811 bits (2094), Expect = 0.0, Method: Composition-based stats. Identities = 478/478 (100%), Positives = 478/478 (100%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ Sbjct: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 Query: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP Sbjct: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV Sbjct: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 Query: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV Sbjct: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 Query: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH Sbjct: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 Query: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL Sbjct: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 Query: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP Sbjct: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 Query: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL Sbjct: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 >gi|268589386|ref|ZP_06123607.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315413|gb|EFE55866.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 594 Score = 299 bits (765), Expect = 8e-79, Method: Composition-based stats. Identities = 74/345 (21%), Positives = 135/345 (39%), Gaps = 57/345 (16%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKW--------HTGLGKEVINMPARSL----DK 48 M + ++ I + + Q P S D + +TGL +I P + L D Sbjct: 1 MSYFGLNPTRINQQLDDAMQSPENSGDADFFDGAFTSTYTGLYSGLIAKPEQVLWGIADT 60 Query: 49 LVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPL 108 +V+P E ++Q + S A + + SL P A AG+++ L Sbjct: 61 VVSPIAREVNEQFDINDTSEQFIQEQRKNAE--KQVRSLTPDRATTGTAGQVM----FSL 114 Query: 109 TRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSA 164 + G AL PL L + ++ + +GVDK TA A E + Sbjct: 115 FDIGGEALTGAMIGGPLGGAMLVGGVQGFSDYE-KLRADGVDKNTAINKATGEGLFAGLG 173 Query: 165 LLAP-------GAIASQSI---------------------AKTVASGAVLNVPFGMVERG 196 +L P G I ++SI + + N+ GM +RG Sbjct: 174 VLTPMTLGFKGGGILAESIGAQFTARGGTLSSLAGTAARATPDIVYASGSNIAMGMAQRG 233 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS---KQVQNMSLRLVN--DL 251 ++S++L++ GY +A Y ++D +++ DG++G FGGM + +N+ L + + Sbjct: 234 FASQILKERGYNQLASQYDVYDKQAIAIDGVLGVAFGGMGRYINSRGENVPLPEFDTPHV 293 Query: 252 KEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 +T H PG+ + + + H + ++ L +G Sbjct: 294 DAALTANQQL-HLEADLPPGIPINAMSLDGHLAAMNKAMNDLSQG 337 >gi|309702800|emb|CBJ02131.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 600 Score = 258 bits (658), Expect = 2e-66, Method: Composition-based stats. Identities = 69/349 (19%), Positives = 124/349 (35%), Gaps = 63/349 (18%) Query: 1 MYFNAVSDEDIRDNIKEWAQRP----------RVSPDIKWHTGLGKEVINMPARSL---- 46 M + ++ + + E A P S +GL ++ P + L Sbjct: 1 MSYFGLNAVNQNQQLDEAASNPAGFNTDVGFFDNSGTAA-VSGLYSGLVAKPDQLLWAGM 59 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 DK+V+P + ++ + S A + + L P A AG++L Sbjct: 60 DKIVSPIAKFVNENTSINDTSAEYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLHG--- 114 Query: 107 PLTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHT 162 L + G A+ P A L +E +GVD TA + + Sbjct: 115 -LFDMGGQAVVGTLLSGPAGGAAAVTALQGFSEFE-RLTAQGVDFRTAQEAGLVQGVTAG 172 Query: 163 SALLAP-------GAIASQSI----------------------AKTVASGAVLNVPFGMV 193 + L P G ++S+ A +A A N+ FGM Sbjct: 173 AGTLIPMSLGLRAGGALAESVGAQLARTGESAVRNVAATAVRAAPDIAYAAGTNIAFGMA 232 Query: 194 ERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRL 247 +RG ++K L D GY +MA Y +FD +S+ D ++G FGG+ + Sbjct: 233 QRGLTAKTLRDGGYNEMANQYDVFDRQSIAIDAVLGVAFGGVGRFLNARGESAAAPEFSP 292 Query: 248 VNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 ++ + + H +PG+ + + +AH L ++ + +G Sbjct: 293 A-EVDAALAANASH-HAEIDVAPGVPVNVLSRDAHIQALQKAMNDVSQG 339 >gi|298381706|ref|ZP_06991305.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279148|gb|EFI20662.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 600 Score = 252 bits (644), Expect = 8e-65, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 126/348 (36%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGSALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + + + +PG+ + + +H L + + +G Sbjct: 294 D-VDAALAANAAHH-AEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|332344342|gb|AEE57676.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 600 Score = 252 bits (644), Expect = 9e-65, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 125/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + + + +PG+ + + +H L + + G Sbjct: 294 D-VDAALAANAAHH-AEIDIAPGVPINVLSRNSHIQALRKAMSDVSEG 339 >gi|218700978|ref|YP_002408607.1| hypothetical protein ECIAI39_2668 [Escherichia coli IAI39] gi|218370964|emb|CAR18791.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 600 Score = 252 bits (643), Expect = 1e-64, Method: Composition-based stats. Identities = 67/348 (19%), Positives = 125/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|300898439|ref|ZP_07116780.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357906|gb|EFJ73776.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 600 Score = 251 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 126/348 (36%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPISLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + + + +PG+ + + +H L + + +G Sbjct: 294 D-VDAALAANAAHH-AEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|323948673|gb|EGB44578.1| hypothetical protein ERKG_04896 [Escherichia coli H252] Length = 600 Score = 251 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 67/348 (19%), Positives = 125/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|324008548|gb|EGB77767.1| hypothetical protein HMPREF9532_01735 [Escherichia coli MS 57-2] Length = 600 Score = 251 bits (642), Expect = 1e-64, Method: Composition-based stats. Identities = 67/348 (19%), Positives = 125/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|117624700|ref|YP_853613.1| hypothetical protein APECO1_4053 [Escherichia coli APEC O1] gi|115513824|gb|ABJ01899.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 600 Score = 251 bits (641), Expect = 2e-64, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 125/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 + + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 VFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GALIPMSLWLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|323156121|gb|EFZ42280.1| hypothetical protein ECEPECA14_1896 [Escherichia coli EPECa14] Length = 600 Score = 251 bits (640), Expect = 2e-64, Method: Composition-based stats. Identities = 67/348 (19%), Positives = 126/348 (36%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRP-RVSPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPVGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVIGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGEATSTPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + + + SPG+ + + +H L + + +G Sbjct: 294 D-VDAALAANAAHH-AEIDISPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|89152440|ref|YP_512273.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] gi|74055463|gb|AAZ95912.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] Length = 600 Score = 250 bits (639), Expect = 3e-64, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 128/348 (36%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAALNPVGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A +AG++L Sbjct: 61 KIVSPIAQLVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGIAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + + + +PG+ ++ + +H L + + +G Sbjct: 294 D-VDAALAANAAHH-AEIDIAPGVPSNVLSRNSHIQALRKAMSDVSQG 339 >gi|215487809|ref|YP_002330240.1| hypothetical protein E2348C_2742 [Escherichia coli O127:H6 str. E2348/69] gi|215265881|emb|CAS10290.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 600 Score = 246 bits (627), Expect = 7e-63, Method: Composition-based stats. Identities = 70/349 (20%), Positives = 126/349 (36%), Gaps = 63/349 (18%) Query: 1 MYFNAVSDEDIRDNIKEWAQRP----------RVSPDIKWHTGLGKEVINMPARSL---- 46 M + ++ + + E A P S +GL ++ P + L Sbjct: 1 MSYFGLNAVNQNQQLDEAASNPAGFNTDVGFFDNSGTAA-VSGLYSGLVAKPDQLLWAGM 59 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 DK+V+P + ++ + S A + + L P A AG++L+ Sbjct: 60 DKIVSPIAKFVNENTSINDTSAEYIGEQRKLAE--QQVKRLTPDAATTGTAGQVLNG--- 114 Query: 107 PLTRLAGLALQSAPLAAGALYAYL----SHKAESSIHHQIEGVDKETADALAWREAIVHT 162 L + G A+ LA A A +E +GVD TA + + Sbjct: 115 -LFDMGGQAVVGTLLAGPAGGAAAVTALQGFSEFE-KLTAQGVDFRTAQEAGLVQGVTAG 172 Query: 163 SALLAP-------GAIASQSI----------------------AKTVASGAVLNVPFGMV 193 + L P G ++S+ A +A A N+ FGM Sbjct: 173 AGTLIPMSLGLRAGGALAESVGAQLARTGESAVRNVAATAVRAAPDIAYAAGTNIAFGMA 232 Query: 194 ERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRL 247 +RG ++K L D GY +MA Y +FD +S+ D ++G FGG+ + Sbjct: 233 QRGLTAKTLRDGGYNEMAAQYDVFDRQSIAIDAVLGVAFGGVGRFLNARGESAATPEFSP 292 Query: 248 VNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 ++ + + H +PG+ + + +AH L ++ + +G Sbjct: 293 A-EVDAALAANASH-HAEIDVAPGVPVNVLSRDAHIQALQKAMNDVSQG 339 >gi|331648164|ref|ZP_08349254.1| hypothetical protein ECIG_04090 [Escherichia coli M605] gi|331043024|gb|EGI15164.1| hypothetical protein ECIG_04090 [Escherichia coli M605] Length = 600 Score = 246 bits (627), Expect = 7e-63, Method: Composition-based stats. Identities = 66/346 (19%), Positives = 125/346 (36%), Gaps = 57/346 (16%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGSAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS---KQVQNMSLRLVNDL 251 R ++K L D GY +MA Y + D +++ D ++G FGG+ + + S + + Sbjct: 234 RVLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGEPTSAPNFSPV 293 Query: 252 K-EGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + H +PG+ + + +H L + + +G Sbjct: 294 DIDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|327252172|gb|EGE63844.1| hypothetical protein ECSTEC7V_3019 [Escherichia coli STEC_7v] Length = 600 Score = 239 bits (610), Expect = 7e-61, Method: Composition-based stats. Identities = 66/348 (18%), Positives = 124/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQSAPLAAGAL----YAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ L A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVIGTTLGGPAGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 + P G ++ +A +A A N+ FGM + Sbjct: 174 GTMIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVSATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS------KQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G FGG+ + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|85059172|ref|YP_454874.1| hypothetical protein SG1194 [Sodalis glossinidius str. 'morsitans'] gi|84779692|dbj|BAE74469.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 235 bits (600), Expect = 9e-60, Method: Composition-based stats. Identities = 63/344 (18%), Positives = 121/344 (35%), Gaps = 51/344 (14%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSP---DIKWHTGLGKEVI-------NMPARS----L 46 M + S + ++ P + D + G G + + L Sbjct: 1 MSYFGFSPTQQNKALAYASEHPIGTGTLQDAAFFDGAGTALFEGLWSGVRQADQVGWAAL 60 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 D +++P E + S A + + L P AG++L + Sbjct: 61 DTVMSPVAEAVSETFGVRDSSADFFKEQRKLAE--KSVRELTPDPGTTGTAGQVLYSLGQ 118 Query: 107 PLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALL 166 + +L P A A L ++ + +GVD TA A ++ Sbjct: 119 LGGQAIAGSLMGGPWGAAATVGTLQGFSDYE-KSRADGVDYGTAVDKALVTGGTAALGVV 177 Query: 167 AP-------GAIASQSIAKTVASG---------------------AVLNVPFGMVERGWS 198 P G ++ ++ ++ G A N+ GM +RG S Sbjct: 178 LPMSLGLRAGGAVAEGVSAALSVGRGASGALAGAVARAAPDLFYSAGTNIAMGMAQRGLS 237 Query: 199 SKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS---KQVQNMSLRLV--NDLKE 253 ++ L GY DMA+ Y + D ++L TD ++G FGG+ + +++ +R V ++ Sbjct: 238 AETLRRGGYEDMARQYDVMDAQALATDAVLGVAFGGLGRFINSRGEDVPVRRVSPEEIDA 297 Query: 254 GITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 +T + +PG+ S + AH + + ++ GE Sbjct: 298 ALTSSSHVNF-EVTVAPGVPVSVLSRNAHAQAMNKAMTDVLAGE 340 >gi|320175033|gb|EFW50146.1| 16 [Shigella dysenteriae CDC 74-1112] Length = 600 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 65/348 (18%), Positives = 122/348 (35%), Gaps = 61/348 (17%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPARSL----D 47 M + ++ + + E A P + D+ + +GL ++ P + L D Sbjct: 1 MSYFGLNPVNQNQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMD 60 Query: 48 KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTP 107 K+V+P + ++ + S + A + + L P A AG++L Sbjct: 61 KIVSPIAQFVNENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVL----YG 114 Query: 108 LTRLAGLALQS----APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 L + G A+ P+ A L +E +GVD TA + I + Sbjct: 115 LFDMGGQAVVGTTLGGPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGA 173 Query: 164 ALLAP-------GAIASQSIA----------------------KTVASGAVLNVPFGMVE 194 L P G ++ +A +A A N+ FGM + Sbjct: 174 GTLIPMSLGLRAGGALAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQ 233 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAF------FGGMHSKQVQNMSLRLV 248 RG ++K L D GY +MA Y + D +++ D ++G F + + V Sbjct: 234 RGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVVFGGVGRFINSRGEPTSAPNFSPV 293 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + + H +PG+ + + +H L + + +G Sbjct: 294 D--IDAALAANAAHHAEIDIAPGVPINVLSRNSHIQALRKAMSDVSQG 339 >gi|304398391|ref|ZP_07380265.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. aB] gi|304354257|gb|EFM18630.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. aB] Length = 625 Score = 223 bits (568), Expect = 5e-56, Method: Composition-based stats. Identities = 72/319 (22%), Positives = 127/319 (39%), Gaps = 37/319 (11%) Query: 16 KEWAQRPRVSPD---IKWHTGLGKEVINMPAR----------SLDKLVAPFREETHDQPN 62 + A + PD +W+ G G + A KL + D P Sbjct: 16 DDQAASKQAQPDDYDPRWYAGSGSALFRGAAEGTIGLGQTLVETAKLSPTYSALRGDLPE 75 Query: 63 YYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLA 122 + +V L + S+ P +A ++L + T + P+A Sbjct: 76 LDEIVDQNFSAVQKS--LNDARNSVKPAPNSQGMAAEILEGLGT-FAPAIAATAVAGPVA 132 Query: 123 AGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVAS 182 GA+ S+++ +GV+++TA LA +A + + P + + +A + S Sbjct: 133 GGAVAFGSSYESTRQDFL-AKGVNEDTAGTLALEQAGANALGMALPAGVGGR-LATRLLS 190 Query: 183 GAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN 242 G +N FG V R + LE++GY ++A+ YR++D ++L+ DG++GA FGG+H Sbjct: 191 GVGINTGFGAVNRFALGETLEENGYDELAKQYRVWDKQALLVDGVLGAAFGGVHHLTSPR 250 Query: 243 MSLRLVN------------DLKEGI-TERLPYKHGVKSSSP---GLH-TSFDAYEAHTDT 285 L + D + + P + V SP G ++D+ A Sbjct: 251 ADTPLADPAPVSAGESAVTDAPAALRADADPAQTVVAEDSPLPAGEPAVTYDSRIAEMQD 310 Query: 286 LAHGVDSLVRGEYPHFDQE 304 LA V + RG+ QE Sbjct: 311 LAGQV--ISRGDRKALAQE 327 >gi|85059663|ref|YP_455365.1| hypothetical protein SG1685 [Sodalis glossinidius str. 'morsitans'] gi|84780183|dbj|BAE74960.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 61/344 (17%), Positives = 119/344 (34%), Gaps = 51/344 (14%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSP---DIKWHTGLGKEVINM-------PARS----L 46 M + + S + A+ P + D + G G + + L Sbjct: 1 MSYFSFSPTQQNKALAYAAEHPIGTGTLQDAAFFDGAGTALFKGLWSGVRQADQVGWAAL 60 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 D ++P + + S + A + L P + AG++L + Sbjct: 61 DTAISPVADAVSETFGVRDFSADFFKAQRKLAET--RVRELTPDLGTTGTAGQVLFSLGQ 118 Query: 107 PLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALL 166 + +L P +A A L + + +GVD TA A + Sbjct: 119 LGGQAIAGSLMGGPWSAAATVGTLQGFSYYE-KSRADGVDYGTAVDKALVTGGTAALGAV 177 Query: 167 AP-------GAIASQSIAKTVASG---------------------AVLNVPFGMVERGWS 198 P G ++ ++ ++ G A N+ GM +RG S Sbjct: 178 LPMSLGLRAGGAVAEGVSAALSVGRGASGALAGAVARAAPDLFYSAGTNIAMGMAQRGLS 237 Query: 199 SKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS---KQVQNMSLRLV--NDLKE 253 ++ L GY DMA+ Y + ++L TD ++G GG+ + +++ +R V ++ Sbjct: 238 AETLRRGGYEDMARQYDVMASQALATDAVLGLAPGGLGRFINSRGEDVPVRRVSPEEIDA 297 Query: 254 GITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 +T + +PG+ S + AH + + ++ GE Sbjct: 298 ALTSSSHVNF-EVTVAPGVPVSVLSCNAHAQAMNKAMAGVLAGE 340 >gi|298485994|ref|ZP_07004068.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159471|gb|EFI00518.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 448 Score = 206 bits (523), Expect = 8e-51, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 110/281 (39%), Gaps = 19/281 (6%) Query: 31 HTGLGKEVINMPARSLDKLVAPFREET------HDQPNYYRGSRTDPHSVGTGAHLVEGL 84 + LGK ++ + + + +Y + + S + L Sbjct: 36 YDSLGKGLVRGAIEGGAAAESTYWNAILSGGPEQNIFDYTQSTTLSRESQQKIGDDLNTL 95 Query: 85 TS--------LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAES 136 L P A +AG+++ L R A+ + P A + + Sbjct: 96 REETASAVMDLRPDPAEVGIAGQIIGEAAAILPRAVIGAVAAGPAGAAIAAGAPAGYSRR 155 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERG 196 ++ EG+D+ TA L E +V + + P A + + A NV GM RG Sbjct: 156 AVSM-AEGIDENTATLLGLSEGVVTGAGAILPAAQFVKPVLGDAAIAIGANVGLGMAHRG 214 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGIT 256 ++ +L+ +GY A YR D ++ TD ++GA F G+ ++ + + + +T Sbjct: 215 TAAALLDSNGYAAQAAQYRAMDGTAIATDAILGAAFFGIGRSSMRRPT---TDQVDAALT 271 Query: 257 ERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 ER +H ++PGL + AH D L ++ + RGE Sbjct: 272 ER-NAQHADIDTAPGLPVDPRSAIAHQDALRAAIEQINRGE 311 >gi|332160978|ref|YP_004297555.1| hypothetical protein YE105_C1356 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665208|gb|ADZ41852.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862134|emb|CBX72298.1| hypothetical protein YEW_AK02350 [Yersinia enterocolitica W22703] Length = 430 Score = 177 bits (449), Expect = 3e-42, Method: Composition-based stats. Identities = 77/340 (22%), Positives = 139/340 (40%), Gaps = 32/340 (9%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGS---RTDPHSVGTGAHLVEGLTSLAP 89 GL K ++ + L++P + + + A + AP Sbjct: 50 GLNKVAFA-ASQGVSTLLSPVAQAIDRATGTNANAFFDGSWTEGFRKTAEIQ------AP 102 Query: 90 YIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKET 149 AG++L+ + ++R G + + PL L + + +G+D T Sbjct: 103 EATVTTTAGQILNGLGDVMSRAVGGTVAAGPLGGAVLAGGTEAIFANDEGLR-KGLDPLT 161 Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A + + + L P A ++++ VA+GA N+ G V+RG +++ LE GY D Sbjct: 162 AAGKGVLDGVSLGAGTLVPAAPFAKTLLSRVAAGAASNIAIGAVQRGTTAEWLEQRGYKD 221 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGM-HSKQVQNMSLRLVNDLKEGITERLPYKHGVKSS 268 MAQ Y+++D +++ DG++GA FGG+ H + + +T R +H + + Sbjct: 222 MAQQYKVWDATAMLADGVLGAAFGGLAHIGAAATP-----DSVDAALTAR-NAQHFREDT 275 Query: 269 SPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH-------FDQEKLQTIADNTLEDPHFKP 321 +PG+ T + AH L D + RGE FD + N E P Sbjct: 276 APGIPTDIPSNIAHQRALETATDQINRGEPVDVANIDGVFDAHFIARDGSNFAEQP---- 331 Query: 322 HLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELS 361 E P P + + Q P + AE P+ + R+++ Sbjct: 332 --AEIAPRPVAESEATFQ-PEKTTAETATPEADPILRDIN 368 >gi|301028421|ref|ZP_07191667.1| conserved domain protein [Escherichia coli MS 196-1] gi|299878532|gb|EFI86743.1| conserved domain protein [Escherichia coli MS 196-1] Length = 686 Score = 160 bits (405), Expect = 4e-37, Method: Composition-based stats. Identities = 49/210 (23%), Positives = 95/210 (45%), Gaps = 6/210 (2%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA 92 G K +I+ PA + P+ + ++G L + + + P Sbjct: 58 GFSKRLISDPA-FTADVAPTVNIFREMFPDADKTLNDTYDTIGK--QLQDARSYVKPDAG 114 Query: 93 GAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 +A ++L+ + G + PL A +++ + +GVD+ TA Sbjct: 115 SQGMAAEVLNELG-KFVPAIGTTMFGGPLIGAATAFSSTYEQSYQDF-KGKGVDEATARN 172 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 LA ++++ + + P A+ + ++A +ASG +N FG + R LE+ GY +MA+ Sbjct: 173 LATQQSLFNAVGMALPAAVGT-TLATRIASGVAINTGFGGLNRYSVGATLEEKGYTEMAK 231 Query: 213 HYRIFDMESLITDGLIGAFFGGMHSKQVQN 242 YR+FD ++++ D ++G FGG+H N Sbjct: 232 QYRVFDGQAMLVDAVLGGVFGGVHHLTTHN 261 Score = 38.6 bits (88), Expect = 2.2, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 63/166 (37%), Gaps = 23/166 (13%) Query: 208 PDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKS 267 P+ + + ++ + + FG ++++ + + L EG+ H Sbjct: 459 PEQLRLLVSMRLRNMKLEAAVEKVFGIRARERIKPSDIDAAHILNEGL-------HYDIE 511 Query: 268 SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH-------FDQEKLQTIADNTLEDPHFK 320 SSP LHTS ++ +H D + L G+ + D I+D E H Sbjct: 512 SSPVLHTSNESINSHVDAMDEAYRQLNDGQPVNVGGMARGLDGPLRSDISDTYQEQYH-- 569 Query: 321 PHLPEPEPLPQYKEHSDR-QKPSEPLAEHPHPKRKEVERELSEIEG 365 E ++E+ R + SEP++E P P+ + E G Sbjct: 570 ------EIQKVFEENGVRYETSSEPISESPVPRAESAFSSAGEHRG 609 >gi|30387395|ref|NP_848224.1| hypothetical protein epsilon15p16 [Enterobacteria phage epsilon15] gi|30266050|gb|AAO06079.1| 16 [Salmonella phage epsilon15] Length = 634 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 52/247 (21%), Positives = 101/247 (40%), Gaps = 6/247 (2%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA 92 G K +I+ PA + P+ + ++G L + + P Sbjct: 58 GFSKRLISDPA-FTADVAPTVNIFRVMFPDADKALNETYDTIGK--QLQDARGYVKPDAG 114 Query: 93 GAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 A ++L + G + P A +++ + +GVD+ TA Sbjct: 115 SQGTAAEVLYGLG-QFVPAIGATIFGGPTVGAATAFSSTYEQSYQDF-KGKGVDETTARN 172 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 LA ++++ + + + P A+ + ++ +ASG +N FG + R + LE+ GY +MA+ Sbjct: 173 LATQQSLFNAAGMALPAAVGT-TLTTRIASGVAINTGFGGLNRYSVGETLEEKGYTEMAK 231 Query: 213 HYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGL 272 YR+FD ++++ D ++GA FGG H +N + D + I S P Sbjct: 232 QYRVFDGQAMLVDAVLGAAFGGAHHLAARNADVPPPPDSEAPIPAAEVQSVPDNSPQPQA 291 Query: 273 HTSFDAY 279 ++ Sbjct: 292 ESAPQPA 298 >gi|330007167|ref|ZP_08305909.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] gi|328535514|gb|EGF61974.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] Length = 632 Score = 149 bits (377), Expect = 8e-34, Method: Composition-based stats. Identities = 55/248 (22%), Positives = 102/248 (41%), Gaps = 10/248 (4%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA 92 G K +I+ PA D + P+ + +G L + P Sbjct: 58 GFSKRLISDPA-FTDNVAPTINMFRVMFPDADKALNESYDDLGK--QLSSAREYIKPEAG 114 Query: 93 GAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 +A +++ + G ++ P+ A A +++ +GVD++TA Sbjct: 115 SQGVAAQVIHGLG-QFAPAIGASVIGGPVVGAAAAAGSTYEQAYQDAL-AKGVDEQTART 172 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 +A ++ + + P A+ + +A + SG +N FG + R + LED+GY DMA+ Sbjct: 173 VAAEQSGFNAVGMGLPAAVGGR-LATRLLSGVGINAAFGGLNRFAVGETLEDNGYADMAK 231 Query: 213 HYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVND----LKEGITERLPYKHGVKSS 268 YR+FD ++++ D ++GA FGG H + S+ D + +G T + P Sbjct: 232 QYRVFDGQAILIDSVLGAAFGGAHHFAARGNSVDARADSTPAVDDGTTAQEPAATAEIQP 291 Query: 269 SPGLHTSF 276 S Sbjct: 292 QEQPPVSP 299 >gi|319793416|ref|YP_004155056.1| phage-like protein [Variovorax paradoxus EPS] gi|315595879|gb|ADU36945.1| phage-like protein [Variovorax paradoxus EPS] Length = 937 Score = 123 bits (309), Expect = 5e-26, Method: Composition-based stats. Identities = 69/321 (21%), Positives = 115/321 (35%), Gaps = 26/321 (8%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA 92 GL + + PA L P + + G+ D L L A Sbjct: 43 GLARGTVAKPALLLGDAATPLLRTSAQAVDKTLGTSLDAWLTDQQKRNTTALEQLRSDPA 102 Query: 93 GAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 AG+++ L L A+ P A L Y +GV TA A Sbjct: 103 TTGFAGQVVGG----LFDLGSSAILYTPEGAAVLEGY-----GRRQELIGQGVAPGTATA 153 Query: 153 LAWREAIVHTSALLAPG-------AIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDH 205 + + AP +++A+ +A GA +V G+ ERG+S +L+ Sbjct: 154 VGAVSGAATYVGVKAPITLGQQAIGQGGRAMAQNLAYGATASVAGGVAERGFSRDLLKAA 213 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGV 265 GY + A +D +L + +GA F G + ++R +T H Sbjct: 214 GYGEQAAPLEPYDKTALAAEATLGALFSGGAAALHARSTVRGQAATDAALTV-TTVDHAQ 272 Query: 266 KSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 + ++PG T A AH L+ ++ ++R E + ++ +AD P +P Sbjct: 273 RGTAPGTPTDARAASAHASALSTAIEQVLRNEPANVGEQ----MADTAFVRP-----VPS 323 Query: 326 PEPLPQYKEHSDRQKPSEPLA 346 PE + + H P P A Sbjct: 324 PEIRAELQAHVADLLPVGPAA 344 >gi|317120710|gb|ADV02532.1| hypothetical protein SC2_gp040 [Liberibacter phage SC2] gi|317120771|gb|ADV02592.1| hypothetical protein SC2_gp040 [Candidatus Liberibacter asiaticus] Length = 408 Score = 116 bits (291), Expect = 7e-24, Method: Composition-based stats. Identities = 80/381 (20%), Positives = 142/381 (37%), Gaps = 44/381 (11%) Query: 9 EDIRDNIKEWAQR------PRVSPDIKWHTGLG-------KEVINMPARSLDKLVAPFRE 55 E + IK P PD + T + E I A ++ Sbjct: 10 EKLLQQIKHAMDAGFYRYDPPKKPDYGFWTNITNDVASIPSEFIKGTAEGQVDVITSIST 69 Query: 56 ETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLS--FIPTPLTRLAG 113 + + + ++V ++ G+ A G LS L + Sbjct: 70 SLGYYTPHNKITSKPWYNVAEDVGVMGGV---------AHGIGHFLSAFGTGFSLFAINP 120 Query: 114 LALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIAS 173 + L ++P A + S + EGV ETA A + + Sbjct: 121 VTLPASPFIGLATASSASGTRRYKE-LRDEGVAHETAKIGALITTGTTFAGGSV-SGVIG 178 Query: 174 QSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFG 233 +S+ +G NV FG+ ER L+ G+ D+AQHYR D T+ +IGA G Sbjct: 179 KSLVSKAVTGGATNVAFGLGERQSIGAYLDYKGHKDLAQHYREVDGIHTTTEFIIGAGLG 238 Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKS----SSPGLHTSFDAYEAHTDTLAHG 289 +H K ++ ++ + + +R S+P + T+ + E H TL Sbjct: 239 ALHGKGGKHPDIKPSDVDIAQVVKR------DIDDIYHSAPAIATTSRSAELHAQTLEQA 292 Query: 290 VDSLVRGEYPHFDQEKLQTIADNTLEDP--HFKPHLPEPEPLPQYKEHSDRQKPSEPLA- 346 ++ + RGE + D + + + + + P F P L + L Q ++ +Q+ S+P A Sbjct: 293 IEKMRRGEEINVDPKSIDLMTKDMITKPEVEFSPEL--KKQLKQGEDFLAQQEVSKPKAL 350 Query: 347 --EHPHPKR-KEVERELSEIE 364 + P + E ER L+++E Sbjct: 351 KEQDPLSSQVPEYERRLTDLE 371 >gi|315122889|ref|YP_004063378.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496291|gb|ADR52890.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 363 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 44/245 (17%), Positives = 97/245 (39%), Gaps = 14/245 (5%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGAL--------YAYLSHKAES 136 +L G++ + ++ A+ + A + Sbjct: 68 NALTVDPEETGAIGQIGHSLLHSVSAFGIGAMAGGSIGGPLGALAGGFLSVALAEGRRAF 127 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERG 196 + EG D TA + ++ + L P S+ K+ + A +N+ ++R Sbjct: 128 EN-ARDEGQDSSTATKGGMKTGVISGAGALIPAG-FGVSVVKSAIASAGVNLGLSKLDRM 185 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN----MSLRLVNDLK 252 +L+ +GY ++A+H D S+ TD ++G FGG+H+K + + ++ Sbjct: 186 GDYAILKANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARRNKKLVGMKPTPSEG 245 Query: 253 EGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADN 312 + T ++ + + T+ +++E H +A +LV GE D +KL+ + Sbjct: 246 DIATGAKNELMTSRTLNDAIPTTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALERG 305 Query: 313 TLEDP 317 +++ P Sbjct: 306 SIKKP 310 >gi|315121927|ref|YP_004062416.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495329|gb|ADR51928.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 326 Score = 108 bits (269), Expect = 3e-21, Method: Composition-based stats. Identities = 44/245 (17%), Positives = 95/245 (38%), Gaps = 14/245 (5%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGAL--------YAYLSHKAES 136 +L G++ + ++ A+ + A + Sbjct: 31 NALTVDPEETGAIGQIGHSLLHSVSAFGIGAMTGGSIGGPLGALAGGFLSVALAEGRRAF 90 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERG 196 + EG D TA + ++ + L P + +AS A +N+ ++R Sbjct: 91 EN-ARDEGQDSSTATKGGMKTGVISGAGALIPAGFGVSVVKSAIAS-AGVNLGLSKLDRM 148 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN----MSLRLVNDLK 252 +L+ +GY ++A+H D S+ TD ++G FGG+H+K + ++ Sbjct: 149 GDYAILKANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARRNKKLAGMKPTPSEG 208 Query: 253 EGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADN 312 + T ++ + + T+ +++E H +A +LV GE D +KL+ + Sbjct: 209 DIATGAKNELMTSRTLNDAVPTTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALERG 268 Query: 313 TLEDP 317 +++ P Sbjct: 269 SIKKP 273 >gi|332875213|ref|ZP_08443046.1| cation diffusion facilitator family transporter [Acinetobacter baumannii 6014059] gi|332736657|gb|EGJ67651.1| cation diffusion facilitator family transporter [Acinetobacter baumannii 6014059] Length = 957 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 53/325 (16%), Positives = 92/325 (28%), Gaps = 36/325 (11%) Query: 7 SDEDIRDNIKEWAQRPRVSP-DIKWHTGLGKEVINMPA----RSLDKLVAPFREETHD-Q 60 + +D + Q P P D G A + D + AP Sbjct: 12 NQQDFEKLNSQGLQHPDTRPNDPGVFDGAISSPFRGMAIGLNKVGDAISAPIDAVVDRVS 71 Query: 61 PNYYRGSR-------TDPHSVGTGAHLVEGLTSLA--PYIAGAALAGKLLSFIPTPLTRL 111 + S + + A ++A + G + + L R Sbjct: 72 YSLKDVSTNEFIEPYEEFKAKREKARDNLVYGTIADLEDKDNTGIVGNIGVGVGDYLWRG 131 Query: 112 AGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 A L A L + + +GVD+ TA +A A+ P Sbjct: 132 ALGVATGGTLGAATLTGGSTGNYVYTD-LTRKGVDENTALKVAGVNAVGDAIGTALPIGY 190 Query: 172 ASQS---IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI 228 + + A + S ++L+ +GY A+ Y + ES+ TD LI Sbjct: 191 GFKGTGGLVADAALSVGGATGLNTGMQYASEQLLKSNGYDKQAKQYEV-TGESVATDLLI 249 Query: 229 ------GAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVK----------SSSPGL 272 GA + G Q+ +N L E ++ P Sbjct: 250 NSLMFGGARYLGSKQNQLDQDVDAEINQLNSDDFETRNDALNDALVKNSFEFEDTTLPVQ 309 Query: 273 HTSFDAYEAHTDTLAHGVDSLVRGE 297 T H L + +++G+ Sbjct: 310 TTDPVQQNKHYQNLDVATEQILKGQ 334 >gi|254251752|ref|ZP_04945070.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] gi|124894361|gb|EAY68241.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] Length = 764 Score = 89.0 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 46/231 (19%), Positives = 79/231 (34%), Gaps = 28/231 (12%) Query: 87 LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVD 146 L P +++ + L ++ A+ P+A A+ S + EGVD Sbjct: 116 LRPDPQNTTTTDQIVQGAVSGLVQIVPAAVLGGPVAGAAVGGASIGLGRSEE-LKREGVD 174 Query: 147 KETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHG 206 T A+ E + + + P +IA+T+ AV + + +L++ G Sbjct: 175 VGTRTAVGAVEGALGAAGAVLPAG--GSTIARTLGLVAVGGPGMAIGQSTAEKAILKNAG 232 Query: 207 YPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVN--------DLKEGITER 258 Y +A D +L L+ FFGG+H+ + + + N L + Sbjct: 233 YDHLADQIDPLDPTNLAASTLMAGFFGGLHAGGLASAARTARNADPSTPLPSLDVAARKA 292 Query: 259 LPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL------VRGEYPHFDQ 303 LPY + A GV RGE + DQ Sbjct: 293 LPYNSPILD-----------AYATQAAQREGVPPALLLFIKNRGEMSNSDQ 332 >gi|169795395|ref|YP_001713188.1| phage-like protein [Acinetobacter baumannii AYE] gi|169148322|emb|CAM86187.1| hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 954 Score = 89.0 bits (219), Expect = 2e-15, Method: Composition-based stats. Identities = 53/310 (17%), Positives = 91/310 (29%), Gaps = 35/310 (11%) Query: 21 RPRVSPDIKWHTGLGKEVINMPA----RSLDKLVAPFREETHD-QPNYYRGSR------- 68 +P V ++ G A + D + AP + S Sbjct: 26 KPTVQKEVGIFDGAISSPFRGMAIGLNKVGDAISAPIDAVVDRVSYSLKDVSTNEFIEPY 85 Query: 69 TDPHSVGTGAHLVEGLTSLA--PYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGAL 126 + + A ++A + G + + L R A S L A L Sbjct: 86 EEFKAKREKARDNLVYGTIADLEDKDNTGIVGNIGVGVGDYLWRGALGVATSGTLGAATL 145 Query: 127 YAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP---GAIASQSIAKTVASG 183 + + +GVD+ TA +A A+ P G S + A Sbjct: 146 TGGSTGNYVYTD-LTRKGVDENTALKVAGVNAVGDAIGTALPISYGFKGSGGLVADAALS 204 Query: 184 AVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI------GAFFGGMHS 237 + S ++L+ +GY A+ Y + ES+ TD LI GA + G Sbjct: 205 VGGATGLNTGMQYTSEQLLKSNGYDKQAKQYEV-TGESVATDLLINSLMFGGARYLGTRQ 263 Query: 238 KQVQNMSLRLVNDLKEGITERLPYKHGVK----------SSSPGLHTSFDAYEAHTDTLA 287 Q+ +N L E ++ P T H L Sbjct: 264 NQLDQDVDAEINQLNSDDFETRNDALNDALVRNSFEFEDTTFPVRTTDPVQQNKHYQNLD 323 Query: 288 HGVDSLVRGE 297 + +++G+ Sbjct: 324 AATEQILKGQ 333 >gi|293609610|ref|ZP_06691912.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828062|gb|EFF86425.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 954 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 52/325 (16%), Positives = 94/325 (28%), Gaps = 36/325 (11%) Query: 7 SDEDIRDNIKEWAQRPRVSP-DIKWHTGLGKEVINMPA----RSLDKLVAPFREETHD-Q 60 + +D + + Q P + P + +G A + D + AP Sbjct: 12 NQQDFEELNSKGLQHPDIRPNEPSAFSGAISSPFRGAAIGLNKVGDAISAPIDAVVDRVS 71 Query: 61 PNYYRGSRTDPHS--VGTGAHLVEGLTSLA-------PYIAGAALAGKLLSFIPTPLTRL 111 S + A + +L + G+ L R Sbjct: 72 YTLKDVSTNEFIEPYEEYKAKREKARDNLVYGAIDKLEDKENTGIVGRFGVGAGDYLWRG 131 Query: 112 AGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP--- 168 A A L A L + + +GVD+ TA +A A+ P Sbjct: 132 ALGAATGGTLGAATLTGGSTGNYIYTD-LTRKGVDENTALQVAGINAVGDAIGTALPMSY 190 Query: 169 GAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI 228 G + + A + S+++L+ G A+ + + ES+ TD + Sbjct: 191 GFRGTGGLVGDAALSVGGATALNTGVQYTSNQILKAAGNEKEAKQFEV-TGESVATDLAL 249 Query: 229 ------GAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVK----------SSSPGL 272 GA + G KQ+ +N L E + ++ P Sbjct: 250 NALLFGGARYLGSRQKQLDQDVDAEINQLNADDIETRNDQINDTLVRNSFEFEDTTLPVR 309 Query: 273 HTSFDAYEAHTDTLAHGVDSLVRGE 297 T H L D +++G+ Sbjct: 310 TTDPVQQNKHYQNLDAATDQILKGQ 334 >gi|294648410|ref|ZP_06725909.1| hypothetical protein HMP0015_0118 [Acinetobacter haemolyticus ATCC 19194] gi|292825715|gb|EFF84419.1| hypothetical protein HMP0015_0118 [Acinetobacter haemolyticus ATCC 19194] Length = 837 Score = 78.3 bits (191), Expect = 3e-12, Method: Composition-based stats. Identities = 46/267 (17%), Positives = 85/267 (31%), Gaps = 22/267 (8%) Query: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 + + ++ + A AG + S I T + + P Sbjct: 47 TSVSNAASRFVEGDEVADKRMQQVNE-AFTPLNQGTAGHIASGI-TEVVSAGAVGAPLGP 104 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP-GAIASQSIAKT 179 A + E + Q GVD++TAD + + + P + +S+ Sbjct: 105 YGMAATVGLGTRAIEHTKLTQQLGVDQDTADTASNIYGATNAALAFLPVSNVFKKSLIAD 164 Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIF--DMESLITDGLIGAFFGGMHS 237 A+ V G L+ GY Y+ D ++ + IG+ F Sbjct: 165 YAALVVAPTAVGQGMTYAEGAYLDSKGYKKQGAMYKDMATDPNAIFMNMAIGSTFFAAG- 223 Query: 238 KQVQNMSLRLVNDLKEGITERLPYK----------HGVKSSSPGLHTSFDAYEAHTDTLA 287 + M+ + DL E + SS P + + D H L Sbjct: 224 ---RYMNAKGNADLPEAEVHKAEADFNATVEQAQTDADVSSMPNIADTVDDLAQHEANLN 280 Query: 288 HGVDSLVRGEYPHFDQE---KLQTIAD 311 +D +++GE + + KL+T+ D Sbjct: 281 QAIDQVMKGEKVNISEATGGKLKTLDD 307 >gi|48697206|ref|YP_024936.1| SLT domain-containing tail structural protein [Burkholderia phage BcepC6B] gi|47779012|gb|AAT38375.1| gp16 [Burkholderia phage BcepC6B] Length = 763 Score = 73.6 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 69/199 (34%), Gaps = 9/199 (4%) Query: 77 GAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAES 136 GA + + P A + + + + L ++ A+ PLA A+ + + Sbjct: 105 GARAYDLSDTFKPDPTRATAIDQTVQGVVSGLAQIVPAAVLGGPLAGAAVGGASIGMSRA 164 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERG 196 + +GVD T A+ E + + + P +A ++ +T+ A + + Sbjct: 165 ED-LKRQGVDVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTIGLVAAGGPGAAIAQAT 221 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGIT 256 +L + GY +A D +L L+ F G+H+ + + Sbjct: 222 IEKAILRNAGYDHLADQINPLDPINLAAATLMAGTFAGVHTAATARTARQ------NAPA 275 Query: 257 ERLPYKHGVKSSSPGLHTS 275 +P + + L Sbjct: 276 ATVPLQSLAIDARRALPYD 294 >gi|221213943|ref|ZP_03586916.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD1] gi|221166120|gb|EED98593.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD1] Length = 749 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 66/188 (35%), Gaps = 4/188 (2%) Query: 86 SLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGV 145 + P + + + + LT++ A+ PL A+ + + + +GV Sbjct: 114 TFKPDPTRTTAIDQTVQGVVSGLTQIVPAAVLGGPLTGAAVGGTSIGMSRAED-LKRQGV 172 Query: 146 DKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDH 205 D T A+ E + + + P +A ++ +TV A + + +L + Sbjct: 173 DVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTVGLVAAGGPGAAIAQASIEKAILRNA 230 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITE-RLPYKHG 264 Y +A D ++ L+ F G H+ + + + L + Sbjct: 231 DYDHLADQIDPLDPVNIAASTLMAGVFAGAHTVATARTARQTATAPTASLQSLSLDARRA 290 Query: 265 VKSSSPGL 272 + ++P L Sbjct: 291 LPYNAPEL 298 >gi|221201509|ref|ZP_03574548.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2M] gi|221207935|ref|ZP_03580941.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2] gi|221172120|gb|EEE04561.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2] gi|221178777|gb|EEE11185.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2M] Length = 749 Score = 67.9 bits (164), Expect = 4e-09, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 55/147 (37%), Gaps = 3/147 (2%) Query: 86 SLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGV 145 + P + + + + + LT++ A+ PLA A+ + + + +GV Sbjct: 114 TFKPDPTRTTVIDQTVQGVMSGLTQIVPAAVLGGPLAGAAVGGTSIGMSRAED-LKRQGV 172 Query: 146 DKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDH 205 D T A+ E + + + P +A ++ +TV A + + +L + Sbjct: 173 DVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTVGLVAAGGPGAAIAQASIEKAILRNA 230 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFF 232 Y +A D ++ L+ F Sbjct: 231 DYDHLADQIDPLDPVNIAASTLMAGVF 257 >gi|226953661|ref|ZP_03824125.1| possible phage-like protein [Acinetobacter sp. ATCC 27244] gi|226835533|gb|EEH67916.1| possible phage-like protein [Acinetobacter sp. ATCC 27244] Length = 876 Score = 57.5 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 72/243 (29%), Gaps = 22/243 (9%) Query: 72 HSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS---APLAAGALYA 128 A + L P G+ + TR+ A+ + + AL + Sbjct: 55 GDKKAAALRAQNLEIFKPD--DLGGVGEFTYGLTKDFTRIGWNAVTTLGTGGVPGLALNS 112 Query: 129 YLSHKAESSIH---HQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAV 185 L +G D +TA + + P ++S+ + Sbjct: 113 GLFGYQTFEAEKSDLLNKGADIKTARTGGAIKGVTDALGFAIPTHGVAKSVVADAVATTA 172 Query: 186 LNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM-------HSK 238 L G+ LE++ +AQ+ + L A GGM +K Sbjct: 173 LATGAGVAGDYLEGSFLENNENKKVAQYGEALKENATSPSTL--AANGGMALLLNLWANK 230 Query: 239 QVQNMSL----RLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLV 294 V+ + + + +H ++P T+ +H D L ++S + Sbjct: 231 GRLRPEQIKDHSNVDTMNDAAHIQANIEHAE-GTNPFSPTNAKEANSHFDALDSAMESAL 289 Query: 295 RGE 297 E Sbjct: 290 NDE 292 >gi|262371857|ref|ZP_06065136.1| predicted protein [Acinetobacter junii SH205] gi|262311882|gb|EEY92967.1| predicted protein [Acinetobacter junii SH205] Length = 876 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 74/243 (30%), Gaps = 22/243 (9%) Query: 72 HSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS---APLAAGALYA 128 A + L P G+ + TR+ A+ + + AL + Sbjct: 55 GDKKAAALRAQNLEIFKPD--DLGGVGEFTYGLTKDFTRIGWNAVTTLGTGGVPGLALNS 112 Query: 129 YLSHKAESSIH---HQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAV 185 L +G D +TA + + ++ P ++S+ + Sbjct: 113 GLFGYQTFEAEKSDLLNKGADVKTARTGGAIKGLADAASFAIPTHGVAKSVVADAVATTA 172 Query: 186 LNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM-------HSK 238 L G+ L+ + +AQ+ +L L A GGM +K Sbjct: 173 LATGAGVAGDYLEGSFLKTNENKKVAQYGEALKENALSPSTL--AANGGMALLLNLWANK 230 Query: 239 QVQNMSL----RLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLV 294 V+ + + + +H ++P T+ +H D L ++S + Sbjct: 231 GRLRPEQIKDHSNVDTMNDAAHIQANIEHAE-GTNPFSPTNAKEANSHFDALDSAMESAL 289 Query: 295 RGE 297 E Sbjct: 290 NDE 292 >gi|158425958|ref|YP_001527250.1| hypothetical protein AZC_4334 [Azorhizobium caulinodans ORS 571] gi|158332847|dbj|BAF90332.1| conserved hypothetical exported protein [Azorhizobium caulinodans ORS 571] Length = 386 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 63/206 (30%), Gaps = 36/206 (17%) Query: 31 HTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGL-TSLAP 89 + +G + N +LD+ V P T + + + +V A ++ + Sbjct: 159 YVMVGAILTNKAPVTLDQ-VTPIARLTEETEAIAVPAASPIKTVQELAEAIKANPAKVTW 217 Query: 90 YIAGAALAGKLL------------SFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESS 137 A + S I G AL A + G + A +S E Sbjct: 218 AGGSAGGVDHIAAALFAQAAGADPSKINYIPFSGGGEAL--AAVLGGKVTAGISGYGEFE 275 Query: 138 IHHQI--------------EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASG 183 + EGVD T + I + ++AP + + + A Sbjct: 276 SQVKAGKLRILAVTAGERVEGVDAPTLTEAGLKLKITNWRGVVAPPGLNPEQVKTLTA-- 333 Query: 184 AVLNVPFGMVERGWSSKVLEDHGYPD 209 M + ++VL+ G+ D Sbjct: 334 ----TVEKMAKSPAWAEVLKQKGWDD 355 >gi|315122596|ref|YP_004063085.1| hypothetical protein CKC_04240 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495998|gb|ADR52597.1| hypothetical protein CKC_04240 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 283 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 54/146 (36%), Gaps = 5/146 (3%) Query: 96 LAGKLLSF-IPTPLTRLAGLALQSAPLAA---GALYAYLSHKAESSIHHQIEGVDKETAD 151 A ++ + + + G + + P A G L ++ ++S + + G+D+ T+ Sbjct: 103 TAHSIVEGAVIYGIGNIIGSSFSANPFVASLVGLLTISATYGHQTSENMKHLGIDESTSQ 162 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 L + + P K + +GA + E+ ++ L GY + Sbjct: 163 TLGLLSGGFYMLSFAIPYIHRGDVSLKKIINGAGQQIATRTTEQLTTNGTLYFQGY-EKE 221 Query: 212 QHYRIFDMESLITDGLIGAFFGGMHS 237 + + ++I D ++ G + Sbjct: 222 EPTEGWSNYTVIVDVILTVGLGLISR 247 >gi|183986749|ref|NP_001116963.1| BCL2-associated transcription factor 1 [Xenopus (Silurana) tropicalis] gi|171846367|gb|AAI61609.1| bclaf1 protein [Xenopus (Silurana) tropicalis] Length = 894 Score = 43.2 bits (100), Expect = 0.091, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 53/143 (37%), Gaps = 20/143 (13%) Query: 232 FGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVD 291 F G + + L + L R P H V P + + D D Sbjct: 598 FKGCGKTLNERFTDCLKDTLDHVSHLRRPEIHRVIDIPPNIP---KKHIRIQDE-----D 649 Query: 292 SLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHP 351 ++ E +++K +++D + H K H E L +E S+ QK +P Sbjct: 650 KAIKKETAKVEKKKKSSLSDQRCDVQHKKEHSKERVDLTCSRESSNSQKKEKP------- 702 Query: 352 KRKEVERELSEIEGAKKESSARK 374 ++EL E + K+ES +K Sbjct: 703 -----QKELKEFKIFKEESKRKK 720 >gi|320168701|gb|EFW45600.1| G protein-coupled receptor [Capsaspora owczarzaki ATCC 30864] Length = 4644 Score = 43.2 bits (100), Expect = 0.11, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 73/267 (27%), Gaps = 30/267 (11%) Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 L +P D + + T + G V + I A +L + Sbjct: 2503 ASLASPLSVTFGDGSS---QASTFITILNNGLPRVTSTAVITLSIGSAGAYARLGTNTTF 2559 Query: 107 PL-------------TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKET-ADA 152 L AL APLA A + ++ G T Sbjct: 2560 TLTIPAHNNPHGAVSFAAGSQALTVAPLAGAA--------SSIQLNLTRTGGSIGTLVVT 2611 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 V G + +A TV A F MV + D G+ + Sbjct: 2612 YQTSAGGVAGIEAATAGEDFTPIVAATVTIPAGSASAFVMVTIPSNVAPELDRGFQVLLT 2671 Query: 213 HYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGL 272 + + D+ + +G+ + + N++L ND + +S L Sbjct: 2672 NVAVSDLTNTGATPSLGS-----GAASMSNVTLSAQNDPNGVFEFAVTSVVADSTSGSYL 2726 Query: 273 HTSFDAYEAHTDTLAHGVDSLVRGEYP 299 + + G S+ +YP Sbjct: 2727 LVVHRSAGTVGAAVLTGTASVSGQQYP 2753 >gi|117619037|ref|YP_856875.1| hypothetical protein AHA_2352 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117560444|gb|ABK37392.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 229 Score = 42.4 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 39/138 (28%), Gaps = 2/138 (1%) Query: 12 RDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDP 71 D +KE A P W GL M A L T Y + + Sbjct: 48 NDRLKEEAGELTRVPKADWLVGLAGGSHVMAA--FTHLNPEGARFTTGDFGGYYCAPSLD 105 Query: 72 HSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLS 131 ++ + E + A + + L + G A + PL Y Y Sbjct: 106 TAIKETVYHQERVFGYTREPAQKVQMRVIHAEFSASLVDITGEAFLATPLYHATDYGYSQ 165 Query: 132 HKAESSIHHQIEGVDKET 149 A ++G+ + Sbjct: 166 AFAREQKALDVDGICYRS 183 >gi|189240286|ref|XP_973010.2| PREDICTED: similar to K11G12.5 [Tribolium castaneum] Length = 287 Score = 42.1 bits (97), Expect = 0.20, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 179 TVASGAVLNVPFGMVERGWSSKVLEDHGYP-------DMAQHYRIFDMESLITDGLIGAF 231 + V + G S+ +L Y + + + D +S + + A Sbjct: 47 RLTVNIVKKQGVTALYNGLSASLLRQLTYSTTRFGIYESVKQ--LMDKDSSFSARVALAA 104 Query: 232 F----GGMHSKQVQNMSLRLVNDLKEGITERLPYKHGV-----KSSSPGLHTSFDAYEAH 282 F GG+ +++R+ ND+K + +RL YKH + G+ F A Sbjct: 105 FAGSAGGLVGTPADKINVRMQNDIKLPLDKRLNYKHALDGLLRVYKEEGIPRLFSGATAA 164 Query: 283 TDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED---PHFKPHLPE-------PEPLPQY 332 T + ++ G+ +DQ K + + ED HF L +PL Sbjct: 165 --TFRAALMTI--GQLSFYDQIKKTLLTTDYFEDNLTTHFVSSLTAGAIATTLTQPLDVL 220 Query: 333 KEHSDRQKPSE 343 K + KP E Sbjct: 221 KTRTMNAKPGE 231 >gi|270011578|gb|EFA08026.1| hypothetical protein TcasGA2_TC005615 [Tribolium castaneum] Length = 286 Score = 42.1 bits (97), Expect = 0.20, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 179 TVASGAVLNVPFGMVERGWSSKVLEDHGYP-------DMAQHYRIFDMESLITDGLIGAF 231 + V + G S+ +L Y + + + D +S + + A Sbjct: 46 RLTVNIVKKQGVTALYNGLSASLLRQLTYSTTRFGIYESVKQ--LMDKDSSFSARVALAA 103 Query: 232 F----GGMHSKQVQNMSLRLVNDLKEGITERLPYKHGV-----KSSSPGLHTSFDAYEAH 282 F GG+ +++R+ ND+K + +RL YKH + G+ F A Sbjct: 104 FAGSAGGLVGTPADKINVRMQNDIKLPLDKRLNYKHALDGLLRVYKEEGIPRLFSGATAA 163 Query: 283 TDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED---PHFKPHLPE-------PEPLPQY 332 T + ++ G+ +DQ K + + ED HF L +PL Sbjct: 164 --TFRAALMTI--GQLSFYDQIKKTLLTTDYFEDNLTTHFVSSLTAGAIATTLTQPLDVL 219 Query: 333 KEHSDRQKPSE 343 K + KP E Sbjct: 220 KTRTMNAKPGE 230 >gi|157849706|gb|ABV89636.1| catalytic/coenzyme binding protein [Brassica rapa] Length = 624 Score = 42.1 bits (97), Expect = 0.21, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 40/107 (37%), Gaps = 8/107 (7%) Query: 260 PYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHF 319 PY P T + +DTLA GE T+A E+ Sbjct: 473 PYASYENLKPPSSPTPKASGIQKSDTLAPVPTDSDTGES--------STVATTVTEEAEA 524 Query: 320 KPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGA 366 P +P+ PL Y ++D + P+ P PK+ E+SE+ G Sbjct: 525 PPAIPKMRPLSPYAAYADLKPPTSPTPASTGPKKTAPAEEISELPGG 571 >gi|311899845|dbj|BAJ32253.1| hypothetical protein KSE_64930 [Kitasatospora setae KM-6054] Length = 385 Score = 42.1 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 75/245 (30%), Gaps = 17/245 (6%) Query: 65 RGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAG 124 + P G + T L P A LA + G L A Sbjct: 94 NTTSFHPVGHRVGPDDILKTTVLTPPPAPTGLAPDHGPSAGGGRVTITGRHLTGATAVDF 153 Query: 125 ALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPG-------AIASQSIA 177 A + +S V A A + +PG + A Sbjct: 154 GGVAATAFTVDSDTRITAT-VPAGKATGKAEVT-VTTAGGTGSPGQYTYDVPTPGGYTFA 211 Query: 178 KTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGA-FFGGMH 236 K+ A + V G +R + + HG + + D+ ++ D ++GA G Sbjct: 212 KSAAPASGSTVRVG--DRVTYTVTVRQHGDGAVTGARVVDDLSGVLDDAVLGADVAAGSG 269 Query: 237 SKQVQNMSLRLVNDLKEG----ITERLPYKHGVKSSSPGLHTSFDAYEAH-TDTLAHGVD 291 + V+N L DL G IT + K+G G ++ D D + + Sbjct: 270 TVAVRNGKLTWNGDLPVGGSTTITYSVTVKNGGDRRLSGAVSAPDDARGTCDDGKSCATE 329 Query: 292 SLVRG 296 VRG Sbjct: 330 HTVRG 334 >gi|258591977|emb|CBE68282.1| Membrane protein involved in aromatic hydrocarbon degradation [NC10 bacterium 'Dutch sediment'] Length = 447 Score = 42.1 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 7/101 (6%) Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 A + F P LTRL G L + +H + S H + ++ Sbjct: 41 AALGEDASTVFFNPAGLTRLKGSQLSMV----ASAVGPSAHFSNSRSHPSTSAI---SSI 93 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 L + S + P + + + G N PFG+ Sbjct: 94 PLTGGDGGDAGSWAMVPAGYYATDVTSRLKFGVGFNAPFGL 134 >gi|83644487|ref|YP_432922.1| choline dehydrogenase-like flavoprotein [Hahella chejuensis KCTC 2396] gi|83632530|gb|ABC28497.1| Choline dehydrogenase and related flavoprotein [Hahella chejuensis KCTC 2396] Length = 1963 Score = 42.1 bits (97), Expect = 0.24, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 51/179 (28%), Gaps = 26/179 (14%) Query: 89 PYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKE 148 P A ++G+L + L G A+ P + KA++ + GVD Sbjct: 534 PDDASTVMSGQLPGGRVITVHPLGGCAMGDGPDTGVVNHYGQVFKADN----RAHGVD-- 587 Query: 149 TADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYP 208 A A E + + P A+ A + W + ++ P Sbjct: 588 ---APALHEGLYVLDGSILPAALGVNPFLTISALSLRAAEAI-QKQHDWLAPT-QERVDP 642 Query: 209 DMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKS 267 ++AQ + T S + + E + RL + Sbjct: 643 ELAQALSPMRQPAATTTA---------------RPSPTVTLSISEQMFGRLQAQEVHTD 686 >gi|254523015|ref|ZP_05135070.1| outer membrane autotransporter barrel domain protein [Stenotrophomonas sp. SKA14] gi|219720606|gb|EED39131.1| outer membrane autotransporter barrel domain protein [Stenotrophomonas sp. SKA14] Length = 3615 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 55/189 (29%), Gaps = 34/189 (17%) Query: 75 GTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS-APLAAGALYAYLSHK 133 G G+ + G T+L A G ++ L LA L +S Sbjct: 1485 GNGSLVKNGATTLTLSAANTYTGGTTINDGTLALGLGGSLAAAGDVTLGNAGAAFDISGA 1544 Query: 134 AESSIHHQIEGV------------DKETADALAWREAIVHTSALLAPGAIASQSIAKTVA 181 + S + GV TA A+ ++ S L Q+++ Sbjct: 1545 SGSQTIGALNGVGGTTLALGGNSLTFGTASNAAFG-GVISGSGGLVKVGAGVQTLSGANT 1603 Query: 182 SGAVLN-------------VPFGMVERGWSSKVLEDHGYPDMA------QHYRIFDMESL 222 G + V G + G +S L+ G +A + +L Sbjct: 1604 FGGGVTLNAGGLVLGNDAAVGTGALTVGGAS-TLDTTGLATLANNIALNAGLTVLGTNAL 1662 Query: 223 ITDGLIGAF 231 +G++ Sbjct: 1663 TLNGVLSGA 1671 Score = 37.0 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 46/145 (31%), Gaps = 14/145 (9%) Query: 70 DPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQ--------SAPL 121 + ++G+GA V G +L ALA + + L AL + + Sbjct: 621 NAAALGSGALSVGGNVTLDGTTGALALANTVNLGAGSILNLPGNQALTFNGVIGGTGSLV 680 Query: 122 AAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVA 181 GA L++ S + TA L + L G + ++ Sbjct: 681 KNGATTLTLNNANTFSGGLSL------TAGGLVLGNGGALGTGALNVGGAVTLDAGSALS 734 Query: 182 SGAVLNVPFGMVERGWSSKVLEDHG 206 G +N+ G + S L G Sbjct: 735 VGNGINLGVGGLLNVLGSNALTLGG 759 >gi|209515507|ref|ZP_03264372.1| short-chain dehydrogenase/reductase SDR [Burkholderia sp. H160] gi|209503974|gb|EEA03965.1| short-chain dehydrogenase/reductase SDR [Burkholderia sp. H160] Length = 268 Score = 41.7 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 61/200 (30%), Gaps = 43/200 (21%) Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 +E+ I ++ + A + + A NV Sbjct: 2 FSETQIGARLAQI-FNLTGKTAVVTGSAQGLGRET----------ARLLAEAGANVVIAD 50 Query: 193 VERGWSSKV---LEDHGYPDMAQHYRIFDMESL-ITDGLIGAFFGGM--------HSK-- 238 + +S +E G M + D S+ ++ A FGG+ H Sbjct: 51 LNPNAASATAADIEASGGIAMPCQVDVADEASVKALFAVVDAKFGGVNILINNAAHRSKA 110 Query: 239 -----------QVQNMSLRLVNDL-KEGITERLPYKHG-----VKSSSPGLHTSFDAYEA 281 Q+QN++LR +E IT R+ K SS L + A Sbjct: 111 EFFEMSVEQWDQMQNVTLRGTFLCCREAIT-RMKAKSSGGSIVNISSVGALRPTLWGVNA 169 Query: 282 HTDTLAHGVDSLVRGEYPHF 301 H D GVDS+ R F Sbjct: 170 HYDAAKAGVDSITRSLASEF 189 >gi|322504305|emb|CAM41701.2| hypothetical protein, unknown function [Leishmania braziliensis MHOM/BR/75/M2904] Length = 2392 Score = 41.7 bits (96), Expect = 0.30, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 56/200 (28%), Gaps = 27/200 (13%) Query: 31 HTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPH---SVGTGAHLVE-GLTS 86 + G ++ LV P + + SV + E + S Sbjct: 734 FSAPGASLLRG---HTALLVPPNADGRSPTTCPTSATAEHIQLPISVPPSSTAGEIAVAS 790 Query: 87 LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAG---ALYAYLSHK---------- 133 P + A + L + +AP L L Sbjct: 791 FTPASSTVGTAYLVCVGQAGAFVPTGALTVATAPTVTADPSPLAFGLPAYVFFSSAIMSL 850 Query: 134 AESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMV 193 +++ I+ D T+D A V + + P A+Q S AV +V + Sbjct: 851 SQADTFTVIKVTDSCTSD---LSVATVLATGSIIPSTGAAQPFLVPSLSSAVTSVRLCVA 907 Query: 194 ERG-WSSKVLEDHGYPDMAQ 212 +R K L GY D Sbjct: 908 QRSQLVDKTL---GYADAGA 924 >gi|154332420|ref|XP_001562584.1| hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 2392 Score = 41.7 bits (96), Expect = 0.30, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 56/200 (28%), Gaps = 27/200 (13%) Query: 31 HTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPH---SVGTGAHLVE-GLTS 86 + G ++ LV P + + SV + E + S Sbjct: 734 FSAPGASLLRG---HTALLVPPNADGRSPTTCPTSATAEHIQLPISVPPSSTAGEIAVAS 790 Query: 87 LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAG---ALYAYLSHK---------- 133 P + A + L + +AP L L Sbjct: 791 FTPASSTVGTAYLVCVGQAGAFVPTGALTVATAPTVTADPSPLAFGLPAYVFFSSAIMSL 850 Query: 134 AESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMV 193 +++ I+ D T+D A V + + P A+Q S AV +V + Sbjct: 851 SQADTFTVIKVTDSCTSD---LSVATVLATGSIIPSTGAAQPFLVPSLSSAVTSVRLCVA 907 Query: 194 ERG-WSSKVLEDHGYPDMAQ 212 +R K L GY D Sbjct: 908 QRSQLVDKTL---GYADAGA 924 >gi|115398972|ref|XP_001215075.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114191958|gb|EAU33658.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 1004 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 20/98 (20%) Query: 309 IADNTLEDPHFKPHLPEPEPLPQYKEH---------------SDRQKPSEPLAEHPHPKR 353 +AD T PH + +P PEP +E+ +D+QKP+ P AEHP PKR Sbjct: 1 MADTT---PHSEEPIPRPEPTEPSQENNDTTASPAPAQNGSPADKQKPTPP-AEHPLPKR 56 Query: 354 KEV-ERELSEIEGAKKESSARKFFDEGSPDHSPFKGER 390 + + ER + SA D P +P + ++ Sbjct: 57 RRMEERHQKPRRRGRTPPSAYSRRDGDEPSATPTRNDQ 94 >gi|284034782|ref|YP_003384713.1| major facilitator superfamily protein [Kribbella flavida DSM 17836] gi|283814075|gb|ADB35914.1| major facilitator superfamily MFS_1 [Kribbella flavida DSM 17836] Length = 417 Score = 41.3 bits (95), Expect = 0.35, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 48/165 (29%), Gaps = 9/165 (5%) Query: 82 EGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQ 141 + +AP AL G+ L + + LT A +A AL E Sbjct: 186 DETEEVAPGTGRTAL-GRYLLMLRSGLTEAATSRTVRKAVALVALLGGFLSFDEY-FPLL 243 Query: 142 IEGVDKETADALAWREAIVHTSALLAPGAIASQSI--AKTVASGAVLNVPFGMVERGWSS 199 V T V A+ G L V G++ G S Sbjct: 244 AREVGASTGLVPLLIAGTVAAQAI---GGALGGPAYRLPATVFAVGLAVTAGLIAWGSLS 300 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 G+ +A Y + + ++ D + G V ++S Sbjct: 301 GT--AGGFLPIAVGYGVMQLVIIVADARLQDAIEGPARATVTSVS 343 >gi|327184111|gb|AEA32558.1| membrane protein [Lactobacillus amylovorus GRL 1118] Length = 1241 Score = 41.3 bits (95), Expect = 0.35, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 70/221 (31%), Gaps = 19/221 (8%) Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 D + + + A L + P A A L + + L ++ A Q Sbjct: 760 DTSSLTSLASQVSTLKQSIAQLAQASNQALPGAATA------LKQLSSGLGQVQAAASQG 813 Query: 119 APLA-----AGALYAYLSHKAESSIHHQIEGVDKETADALAWREA---IVHTSALLAPGA 170 A A + + S + G + +A A + LA GA Sbjct: 814 VAGAQRLNSGAAALNSGAGRLNSGLGTLSAGAGRLSAGAGQLDSGAGQLQSGLGTLANGA 873 Query: 171 IASQSIAKTVASGAVL-NVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIG 229 S T+A+GA N G + G + ++ ++G +A + G Sbjct: 874 GQLNSGLGTLANGAGTLNTGLGTLANG-AGQL--NNGVGQLASQAPQLISGIGQLNSGAG 930 Query: 230 AFFGGMHSKQVQNMSLRL-VNDLKEGITERLPYKHGVKSSS 269 G + L ++ + G+ + Y G+ SS+ Sbjct: 931 QLASGAGKLASRVPQLTTGIDTVNSGLGQGETYLKGLGSSA 971 >gi|237838371|ref|XP_002368483.1| hypothetical protein TGME49_090990 [Toxoplasma gondii ME49] gi|211966147|gb|EEB01343.1| hypothetical protein TGME49_090990 [Toxoplasma gondii ME49] Length = 2520 Score = 41.3 bits (95), Expect = 0.36, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 54/153 (35%), Gaps = 10/153 (6%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTR-----LAGLALQSAPLAAGALYAYLSHKAESSIH 139 + L P + LAG +++F LA S P++ G H A + Sbjct: 197 SELNPSPSS--LAGGIVTFASPDFFPATVSGLAASTSMSWPVSVGISAGGRGHAATTPFA 254 Query: 140 HQIEGVDKETADALAWREAIVH--TSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + A +V+ T+ ++ PG + + A GA +P G Sbjct: 255 APPGAAAYPLSHARIPGGDLVYYLTAGVVLPGGAGA-GVVPAGALGAGTILPPGATFLSH 313 Query: 198 SSKVLEDHGYPDMAQHYRIFDMESLITDGLIGA 230 +S + G AQ+ R+ D +L GA Sbjct: 314 ASAGENNGGAMVSAQNARVGDHVALAGKAKQGA 346 >gi|256391150|ref|YP_003112714.1| MMPL domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256357376|gb|ACU70873.1| MMPL domain protein [Catenulispora acidiphila DSM 44928] Length = 760 Score = 40.9 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 87/293 (29%), Gaps = 29/293 (9%) Query: 9 EDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPN--YYRG 66 + + + + A P V+ + G + ++ F +E HD P+ Sbjct: 86 QRMTGALNQIATAPGVAGVTGPYDGPRGALQVSKDQTTAYATINFAQEAHDLPDAEVQHI 145 Query: 67 SRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLL-----SFIPTPLTRLAGLALQSAPL 121 + T + G +++ A L+ + + R AG A+ Sbjct: 146 IDVAQGARETNLQVELGGQAISQAERKIGGAADLIGVLAALLVLGLVFRAAGAAVMPILT 205 Query: 122 AAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSAL-------LAPGAIASQ 174 + + + S I A + I + + L G Sbjct: 206 GVAGVATGILGTGQLSHLFAISSTAPTLATLVGLGVGIDYALFIVNRHRKGLMSGLSVED 265 Query: 175 SIAKTV---------ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITD 225 SIAK + A G V+ GM G S +G A M L Sbjct: 266 SIAKALNTSGRAVIFAGGTVVIALLGMFALGLSFL----NGMAIGAAV--TVSMTVLAAI 319 Query: 226 GLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDA 278 L+ A G + + + R + + G+ +PY H + G+ + Sbjct: 320 TLLPAMLGFLKLRVLSKKQRRELAARQAGVGVLVPYAHASRRRPSGVPGHPET 372 >gi|253570523|ref|ZP_04847931.1| predicted protein [Bacteroides sp. 1_1_6] gi|251839472|gb|EES67555.1| predicted protein [Bacteroides sp. 1_1_6] Length = 642 Score = 40.9 bits (94), Expect = 0.47, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 90/253 (35%), Gaps = 33/253 (13%) Query: 9 EDIRDNIKEWAQRPRVSPDIKWHTGLG--KEVINMPARSLDKLVAPFREETHDQPNYYRG 66 ++IRD+ + + P +++ T + + NM + LD ++A T Sbjct: 71 DNIRDSFNDAIE-----PGVRFETAVAEMSGITNMEGKELD-VLATKARNTAKAFGVDAS 124 Query: 67 SRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGAL 126 + + + L+ + P + A A +++S L++ + A +A Sbjct: 125 NAMVVYK--------DLLSKITPELKKAPDALEIMSNNVMTLSKTMQNDVPGA--SAAMS 174 Query: 127 YAYLSHKAESSIHHQIEGV--DKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGA 184 A +K + D A E + ++++ +T + Sbjct: 175 TAMNQYKVSLDDPMKAAQTMTDYMNIMAAGTVEGSAEIREV-------AEALKQTGSVAK 227 Query: 185 VLNVPFGMVERGWSSKVLEDHGYPD----MAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 V F E ++L+ G +A I +++ TD + G++ K + Sbjct: 228 TFGVEF--AETNSLIQLLDKSGKKGSEGGIALRNTIVKLQAPTTDAIKQLKAAGVNIKTM 285 Query: 241 QNMSLRLVNDLKE 253 QN SL L + L+ Sbjct: 286 QNQSLSLTDRLRA 298 >gi|311245483|ref|XP_001925661.2| PREDICTED: ninein isoform 1 [Sus scrofa] Length = 2136 Score = 40.9 bits (94), Expect = 0.50, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 77/187 (41%), Gaps = 23/187 (12%) Query: 245 LRLVNDLKEGITERLPYKH------GVKSSSPGLHTSFDAYEAHTDTLA-HGVDSLVRGE 297 LR+ KE + + + H G K+ +P + T + L+ +D L+ E Sbjct: 1792 LRMTQQEKEALKQEVMSLHKQLQNAGDKNWAPEVATHPSGFPNQQQRLSWDKLDQLMNEE 1851 Query: 298 YPHF--DQEKLQTIADNT-LEDPHFKPHLPEPEP---LPQYKEH---SDRQKPSEPL--- 345 + E+LQT+ NT E H + + + E LP++++H S KP E Sbjct: 1852 QQLLWQENERLQTVVQNTKAELIHSREKVRQLESNLLLPKHQKHLSSSGTMKPPEQEKLS 1911 Query: 346 ----AEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGAD 401 E +R R++S++ ++E +EG E+ ++ +R Sbjct: 1912 LKRECEQVQKERSPTNRKVSQMNSLERELETIHLENEGLKKKQVKLDEQLMEMQHLRSTM 1971 Query: 402 FTDAPHA 408 F+ +P+A Sbjct: 1972 FSPSPNA 1978 >gi|253748668|gb|EET02688.1| Hypothetical protein GL50581_25 [Giardia intestinalis ATCC 50581] Length = 3182 Score = 40.9 bits (94), Expect = 0.52, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 57/156 (36%), Gaps = 12/156 (7%) Query: 123 AGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP----GAIASQSIAK 178 A ++ + + + K+ AD++ E + T+ P G S+ + + Sbjct: 597 ATSVAGANQGTSNLTPIRTRQRSHKDWADSVIANEGLYETADTAIPDYRDGFCGSKYVGR 656 Query: 179 TVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSK 238 T++ A G ++G + L+ A H ++ + L ++GA G Sbjct: 657 TISGTARAISGAGSSQQGRALSALQRPSSNR-ALHSSMYVAQHLSNSAVLGASSGAQRRD 715 Query: 239 QVQNMSLRL--VNDLKEGITERLPYKHGVKSSSPGL 272 S + ++D+ E P+ SS G+ Sbjct: 716 SSARPSQKWARLDDIDEN-----PHSPATTSSKEGV 746 >gi|311245485|ref|XP_003121856.1| PREDICTED: ninein isoform 2 [Sus scrofa] Length = 2049 Score = 40.5 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 77/187 (41%), Gaps = 23/187 (12%) Query: 245 LRLVNDLKEGITERLPYKH------GVKSSSPGLHTSFDAYEAHTDTLA-HGVDSLVRGE 297 LR+ KE + + + H G K+ +P + T + L+ +D L+ E Sbjct: 1792 LRMTQQEKEALKQEVMSLHKQLQNAGDKNWAPEVATHPSGFPNQQQRLSWDKLDQLMNEE 1851 Query: 298 YPHF--DQEKLQTIADNT-LEDPHFKPHLPEPEP---LPQYKEH---SDRQKPSEPL--- 345 + E+LQT+ NT E H + + + E LP++++H S KP E Sbjct: 1852 QQLLWQENERLQTVVQNTKAELIHSREKVRQLESNLLLPKHQKHLSSSGTMKPPEQEKLS 1911 Query: 346 ----AEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGAD 401 E +R R++S++ ++E +EG E+ ++ +R Sbjct: 1912 LKRECEQVQKERSPTNRKVSQMNSLERELETIHLENEGLKKKQVKLDEQLMEMQHLRSTM 1971 Query: 402 FTDAPHA 408 F+ +P+A Sbjct: 1972 FSPSPNA 1978 >gi|159118813|ref|XP_001709625.1| Hypothetical protein GL50803_113986 [Giardia lamblia ATCC 50803] gi|157437742|gb|EDO81951.1| hypothetical protein GL50803_113986 [Giardia lamblia ATCC 50803] Length = 272 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 48/134 (35%), Gaps = 6/134 (4%) Query: 103 FIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHT 162 F P L L AL+ P+ S+ +D A+ + + +V Sbjct: 41 FDPIGLGDLG--ALEVTPVTFDPKIPGFILDTIRSLCPCALSIDDAAEGAMTFEQQVV-- 96 Query: 163 SALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA-QHYRIFDMES 221 + IA + A A RG + + GY +A H+ + D+E+ Sbjct: 97 -GMDVSAKIARLDASGQEARKASACSVCSRCRRGTLASFVSAGGYDALALGHHLLDDLET 155 Query: 222 LITDGLIGAFFGGM 235 L G+ GA F G+ Sbjct: 156 LAITGVHGASFFGL 169 >gi|225375687|ref|ZP_03752908.1| hypothetical protein ROSEINA2194_01312 [Roseburia inulinivorans DSM 16841] gi|257437541|ref|ZP_05613296.1| conserved hypothetical protein [Faecalibacterium prausnitzii A2-165] gi|225212457|gb|EEG94811.1| hypothetical protein ROSEINA2194_01312 [Roseburia inulinivorans DSM 16841] gi|257199848|gb|EEU98132.1| conserved hypothetical protein [Faecalibacterium prausnitzii A2-165] Length = 701 Score = 40.5 bits (93), Expect = 0.59, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 88/237 (37%), Gaps = 26/237 (10%) Query: 244 SLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHG-------------V 290 + R++ + T + K K+ + + + TD Sbjct: 81 TERVMKHIDAAHTRKASKKAVRKAQAEATAGTKSSRLQFTDEERAAPELEKYIKKSDKAA 140 Query: 291 DSLVRGEYPHFDQEKL--QTIADNTLEDPHFKPHLPEPEPLPQYKE-HSDRQKPSEPLAE 347 D L + + ++KL + D T + H E + P +KE H+ +P++ Sbjct: 141 DRLDKAKAAIPKEKKLVKERTFDETTGKGKTRLHFEEKDKPPGFKEKHNPLSRPTQEAGI 200 Query: 348 HPHPKRKEVERELSEIEGA-KKESSARKFFDEGSPDHSPFKGERNQKLDP----MRGADF 402 H K VE++ S +EGA K E +A + G+ +G RN KL P + Sbjct: 201 LVHNKIHSVEKDNSGVEGAHKSEEAAERGLKYGARKIK--QGYRNHKLKPYREAAKAEKA 258 Query: 403 TDAPHAKFDATTFTESLPHVDEQTMHRF---SELKERHPVEAREVLEGLQEKLQGTK 456 + F P + + RF ++K ++ EAR +G++ + T+ Sbjct: 259 AFRANMDFQYHKTLHENPQLTSNPISRFWQKQKIKRQYAKEARNTAKGIKGAAERTR 315 >gi|119476211|ref|ZP_01616562.1| ammonium transporter [marine gamma proteobacterium HTCC2143] gi|119450075|gb|EAW31310.1| ammonium transporter [marine gamma proteobacterium HTCC2143] Length = 431 Score = 40.5 bits (93), Expect = 0.61, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 35/126 (27%), Gaps = 1/126 (0%) Query: 63 YYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLA 122 G +V G G T++ P+ + G + ++ + Sbjct: 191 ITAGVAALVSAVVLGNRKGFGETAMPPHNMTMTIMGAGMLWVGWFGFNAGSALAANGDAG 250 Query: 123 AGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVAS 182 L +LS A + IE + AL +V + P + + Sbjct: 251 MAMLVTHLSAAAGTFTWLTIEWIKYGKPSALGAVTGMVAGLGTITPASGYVGP-GGALVI 309 Query: 183 GAVLNV 188 G + Sbjct: 310 GLSAGI 315 >gi|116748759|ref|YP_845446.1| hypothetical protein Sfum_1320 [Syntrophobacter fumaroxidans MPOB] gi|116697823|gb|ABK17011.1| hypothetical protein Sfum_1320 [Syntrophobacter fumaroxidans MPOB] Length = 702 Score = 40.5 bits (93), Expect = 0.67, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 5/61 (8%) Query: 270 PGLHTSFDAYEAHTDTLA-----HGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLP 324 P +HT++ AY +H + + H SL + +L+ N L P+F+PHLP Sbjct: 464 PDVHTAYSAYVSHEEDMRSRLAEHAGRSLKEWRRMFYLDSRLEPTRRNVLSSPYFRPHLP 523 Query: 325 E 325 + Sbjct: 524 D 524 >gi|159113851|ref|XP_001707151.1| Hypothetical protein GL50803_114336 [Giardia lamblia ATCC 50803] gi|157435254|gb|EDO79477.1| hypothetical protein GL50803_114336 [Giardia lamblia ATCC 50803] Length = 272 Score = 40.1 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 47/134 (35%), Gaps = 6/134 (4%) Query: 103 FIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHT 162 F P L L AL+ P+ S+ +D A+ + + +V Sbjct: 41 FDPIGLGDLG--ALEVTPVTFDPKIPGFILDTIRSLCPCALSIDDAAEGAMTFEQQVV-- 96 Query: 163 SALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA-QHYRIFDMES 221 + IA A A RG + + GY +A H+ + D+E+ Sbjct: 97 -GMDVSAKIARLDALGQEARKASACSVCSRCRRGTLASFVSAGGYDALALGHHLLDDLET 155 Query: 222 LITDGLIGAFFGGM 235 L G+ GA F G+ Sbjct: 156 LAITGVHGASFFGL 169 >gi|254513039|ref|ZP_05125105.1| betaine aldehyde dehydrogenase [Rhodobacteraceae bacterium KLH11] gi|221533038|gb|EEE36033.1| betaine aldehyde dehydrogenase [Rhodobacteraceae bacterium KLH11] Length = 486 Score = 40.1 bits (92), Expect = 0.80, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 83/304 (27%), Gaps = 23/304 (7%) Query: 7 SDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRG 66 +D IRD + + Q P D + + ++L+ E + Sbjct: 42 ADAAIRDADRAFRQGPWADLDPSGRADMLDALATQLETRWEELIE--AEIRDNGKRITEV 99 Query: 67 SRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLA---- 122 G H L P A+AG P + + ++PL Sbjct: 100 RGQFSALHGWYRHFAAQARKLTPVPQDNAIAGVTSVGHWMPYGVVVAITPWNSPLMILAW 159 Query: 123 --AGALYAYLSHKAESSIHHQIEGVDK-ETADALAW-------REAIVHTSALLAPGAIA 172 A AL A + + S ++ + A H Sbjct: 160 KLAPALAAGNTVVVKPSEMASASTLEFAQLAHEAGLPPGVLNVVTGFGHEVGEALVRHPL 219 Query: 173 SQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFF 232 ++ + T + V +R LE G D E+ +G++ F Sbjct: 220 TRKVTFTGSDAGGRKVAMAASDR-VIPTTLELGGKSPQIVFADC-DPET-TVNGVLSGIF 276 Query: 233 GGMHSKQVQNMSLRLVNDLKEG----ITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAH 288 V L + + +K+ +TER P H A E H + Sbjct: 277 LSNGQTCVAGSRLIVEHSIKDAFVARLTERARSLKVGDPMDPATHIGPLANEPHLRKVIA 336 Query: 289 GVDS 292 ++ Sbjct: 337 MIEQ 340 >gi|167757768|ref|ZP_02429895.1| hypothetical protein CLOSCI_00099 [Clostridium scindens ATCC 35704] gi|167664650|gb|EDS08780.1| hypothetical protein CLOSCI_00099 [Clostridium scindens ATCC 35704] Length = 702 Score = 40.1 bits (92), Expect = 0.81, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 89/237 (37%), Gaps = 26/237 (10%) Query: 244 SLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHG-------------V 290 + R++ + T + K K+ + + + TD Sbjct: 82 TERVMEHIDAAHTRKASKKAVRKAQAEATAQTKSSRLQFTDEERAAPELEKYIKKSDKAA 141 Query: 291 DSLVRGEYPHFDQEKL--QTIADNTLEDPHFKPHLPEPEPLPQYKE-HSDRQKPSEPLAE 347 D L + + ++KL + D + H E + P +KE H+ +P++ Sbjct: 142 DRLDKAKAAIPKEKKLTKERTFDEATGKGKTRLHFEEKDKPPGFKEKHNPLSRPTQEAGI 201 Query: 348 HPHPKRKEVERELSEIEGA-KKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDA- 405 H K VE++ S +EGA K E +A + G+ +G R+ KL P R A + Sbjct: 202 FVHNKIHSVEKDNSGVEGAHKTEEAAERGVKYGARKIK--QGYRSHKLKPYREAAKAEKA 259 Query: 406 ---PHAKFDATTFTESLPHVDEQTMHRF---SELKERHPVEAREVLEGLQEKLQGTK 456 + F P + + RF ++K ++ EAR +G++ + T+ Sbjct: 260 AFQANVDFQYHKTLHDNPQLTSTPLSRFWQKQKIKRQYAKEARTTAKGIKGAAERTR 316 >gi|312109421|ref|YP_003987737.1| S-layer domain-containing protein [Geobacillus sp. Y4.1MC1] gi|311214522|gb|ADP73126.1| S-layer domain-containing protein [Geobacillus sp. Y4.1MC1] Length = 1047 Score = 40.1 bits (92), Expect = 0.84, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 52/165 (31%), Gaps = 15/165 (9%) Query: 34 LGKEVINMPA--RSLDKLVAPFREETHDQP-NYYRGSRTDPHSVGTGAHLVEGLTSLAPY 90 +G V+N + L AP + + G+ D S L +L Sbjct: 679 VGSPVVNGKDKTELIIPLTAPVAQTAKFYTVSVSAGTVKDLSSQQNS----NALATLTAD 734 Query: 91 IAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETA 150 ++ A+ GK +T ++G+A A + + +A + GVD T Sbjct: 735 VSAGAVTGK--DTAAPSITSISGVAAVKATSTGNQITFTIQDQANAGE--TASGVDFTTV 790 Query: 151 DALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVER 195 + P + + + +PFG + + Sbjct: 791 TDV----NNYRLDGAPLPSGSYVKVTGSDPSYTVTIQLPFGAISK 831 >gi|92109730|ref|YP_572016.1| protein of unknown function DUF395, YeeE/YedE [Nitrobacter hamburgensis X14] gi|91802812|gb|ABE65184.1| protein of unknown function DUF395, YeeE/YedE [Nitrobacter hamburgensis X14] Length = 151 Score = 40.1 bits (92), Expect = 0.84, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 29/92 (31%), Gaps = 6/92 (6%) Query: 151 DALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKV------LED 204 ++ ++ L+ I + + + + V G ++ Sbjct: 3 VLISLAAGLIFGLGLIISQMINPEKVLAFLDVAGDWDPSLAFVLAGAAAVSGLGYFFSRR 62 Query: 205 HGYPDMAQHYRIFDMESLITDGLIGAFFGGMH 236 P +A + I D L +IGA F G+ Sbjct: 63 RSAPLLAAQFDIPDRRDLDARLIIGAAFFGVG 94 >gi|115504195|ref|XP_001218890.1| hypothetical protein [Trypanosoma brucei TREU927] gi|83642372|emb|CAJ16234.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 1443 Score = 40.1 bits (92), Expect = 0.88, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 72/255 (28%), Gaps = 30/255 (11%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEG 144 T+ + K++ L + A L S A + Sbjct: 556 TTTTSDPSSDTTINKMVDPGTFSLAAVLPTAAIVPDFFGQCLVVRNSSSALREYYANAIR 615 Query: 145 VDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKV--- 201 DK L E + ++ I S S A SG + G V RG Sbjct: 616 ADKGKILELMLLEYGAEWGSSISGDGITSSSAASFPISGTSSSTGLGAVSRGLRGTTRIA 675 Query: 202 ------LEDHGYPDMAQHYRIFDMESLITDG-----------LIGAFFGGMHSKQVQNMS 244 + + +R L+ + ++GA G+ + V + + Sbjct: 676 PVAQTPADRSQSSQINATFRHSHTRPLLPNSGSSVDRSRSPFVLGAETPGVATAGVASAA 735 Query: 245 LRLVNDLKEGITERLPYKHGVKSSSP--GLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFD 302 R L + +S+P L T+ + D L D + E PH Sbjct: 736 GRRHESLFPFLISL--ALQAQTTSTPRDNLPTTSAPAASSGDVLELAPDREKQQEEPH-- 791 Query: 303 QEKLQTIADNTLEDP 317 +T A + + P Sbjct: 792 ----RTTASHVITAP 802 >gi|159116221|ref|XP_001708332.1| Hypothetical protein GL50803_115232 [Giardia lamblia ATCC 50803] gi|157436443|gb|EDO80658.1| hypothetical protein GL50803_115232 [Giardia lamblia ATCC 50803] Length = 552 Score = 40.1 bits (92), Expect = 0.89, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 47/132 (35%), Gaps = 6/132 (4%) Query: 105 PTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSA 164 P L L AL+ P+ S+ +D A+ + + +V Sbjct: 323 PIGLGDLG--ALEVTPVTFDPKIPGFILDTIRSLCPCALSIDDAAEGAMTFEQQVV---G 377 Query: 165 LLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA-QHYRIFDMESLI 223 + IA + A A RG + + GY +A H+ + D+E+L Sbjct: 378 MDVSAKIARLDASGQEARKASACSVCSRCRRGTLASFVSAGGYDALALGHHLLDDLETLA 437 Query: 224 TDGLIGAFFGGM 235 G+ GA F G+ Sbjct: 438 ITGVHGASFFGL 449 >gi|240146330|ref|ZP_04744931.1| conserved hypothetical protein [Roseburia intestinalis L1-82] gi|257201544|gb|EEU99828.1| conserved hypothetical protein [Roseburia intestinalis L1-82] Length = 375 Score = 40.1 bits (92), Expect = 0.90, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 66/205 (32%), Gaps = 23/205 (11%) Query: 95 ALAGKLLSFIPTPLTRLAGLALQSAPLAA--GALYAYLSHKAESSIHHQIEGVDKETADA 152 A +S + + +A AL A + A ++ + + ++ E + ++ A A Sbjct: 21 GAACIAMSLLLPGIGTIAAGALMGAGIGAVSASVAGAVGDYSSGNVRSAEEAI-RDVAIA 79 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 A AI + PG ++ VER + + D + Sbjct: 80 -AISGAITGAIGVKFPG--------MNRLVEGGVDTTVATVERAAYAALDGDMTLEEKLA 130 Query: 213 HYRIFDMESLITDGLIGAFFG----GMHSK---QVQNMSLRLVNDLKEGITERLPYKHGV 265 + IFD + D + G F G G+ K +N +N+L K Sbjct: 131 Y--IFDPGQMAVDFVTGVFIGEAVDGIAKKLPGGWKNRGGSELNNLDAQGISSKSAKGSD 188 Query: 266 KSSSPG-LHTSFDAYEAHTDTLAHG 289 G L + +D L H Sbjct: 189 TFKQNGNLPNGVRTEISGSD-LRHS 212 >gi|225378270|ref|ZP_03755491.1| hypothetical protein ROSEINA2194_03931 [Roseburia inulinivorans DSM 16841] gi|225209933|gb|EEG92287.1| hypothetical protein ROSEINA2194_03931 [Roseburia inulinivorans DSM 16841] gi|291535807|emb|CBL08919.1| Phage late control gene D protein (GPD) [Roseburia intestinalis M50/1] Length = 852 Score = 40.1 bits (92), Expect = 0.90, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 66/205 (32%), Gaps = 23/205 (11%) Query: 95 ALAGKLLSFIPTPLTRLAGLALQSAPLAA--GALYAYLSHKAESSIHHQIEGVDKETADA 152 A +S + + +A AL A + A ++ + + ++ E + ++ A A Sbjct: 498 GAACIAMSLLLPGIGTIAAGALMGAGIGAVSASVAGAVGDYSSGNVRSAEEAI-RDVAIA 556 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 A AI + PG ++ VER + + D + Sbjct: 557 -AISGAITGAIGVKFPG--------MNRLVEGGVDTTVATVERAAYAALDGDMTLEEKLA 607 Query: 213 HYRIFDMESLITDGLIGAFFG----GMHSK---QVQNMSLRLVNDLKEGITERLPYKHGV 265 + IFD + D + G F G G+ K +N +N+L K Sbjct: 608 Y--IFDPGQMAVDFVTGVFIGEAVDGIAKKLPGGWKNRGGSELNNLDAQGISSKSAKGSD 665 Query: 266 KSSSPG-LHTSFDAYEAHTDTLAHG 289 G L + +D L H Sbjct: 666 TFKQNGNLPNGVRTEISGSD-LRHS 689 >gi|254489013|ref|ZP_05102218.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214045882|gb|EEB86520.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 936 Score = 39.7 bits (91), Expect = 1.0, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 50/165 (30%), Gaps = 9/165 (5%) Query: 68 RTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALY 127 R T L A + G S + + + R A LA GA++ Sbjct: 68 REYEKLERTLVDLRRAQERWNRAAAASRRVGSTFSNMASGIGRNVRQIAIGASLAGGAIF 127 Query: 128 AYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN 187 + A+ + +TAD L ++ A + + + N Sbjct: 128 GIANSTADLGDNVA------KTADKLGIGLGVLQELRYAAERSGVATATFDGALEKMTKN 181 Query: 188 VPFGMVERGWSSKVLEDHGY--PDMAQHYRIFDMESLITDGLIGA 230 + M G L+ G D+A D ++I D L G Sbjct: 182 IGLAMEGTGAQKDALDALGLSAADLASKLPE-DALAMIADRLQGV 225 >gi|271501409|ref|YP_003334434.1| outer membrane autotransporter barrel domain-containing protein [Dickeya dadantii Ech586] gi|270344964|gb|ACZ77729.1| outer membrane autotransporter barrel domain protein [Dickeya dadantii Ech586] Length = 1075 Score = 39.7 bits (91), Expect = 1.0, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 73/266 (27%), Gaps = 18/266 (6%) Query: 16 KEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVG 75 ++W Q + G + ++ A++ +V E P+ + Sbjct: 188 EQWVQSGGSTTGTVISAGGYQ-LVKNGAQASGTVVNTGAE---GGPDAENSDGMFVSGIA 243 Query: 76 TGAHLVEGLTSLAPYIAG-AALAGK------LLSFIPTPLTRLAGLALQSAPLAAGALYA 128 T + G + + + + + + LA G Sbjct: 244 TDTLIHAGGRQIVAAGGSSTGTTIQAGGDQSVHGQAQSTTLDGGNQYVHAGALATGTTVN 303 Query: 129 YLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVA---SGAV 185 + L+ ++ L G S A TV+ S Sbjct: 304 A-GGWQVVQQSGTADATTVNRDGKLSVSAGGTASNVTLNAGGALVTSTAATVSGINSLGG 362 Query: 186 LNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSL 245 NV ++ +LE+ G D+ D ++ G++ GG+ V N Sbjct: 363 FNV--DAATASATNVLLENGGRLDVLSGGSA-DTTTVSNGGVLAVATGGVAQHIVMNEGG 419 Query: 246 RLVNDLKEGITERLPYKHGVKSSSPG 271 L+ D ++ ++ G Sbjct: 420 VLIADSGSTVSGTNTAGTFGIDAATG 445 >gi|291222913|ref|XP_002731460.1| PREDICTED: protein kinase D1-like [Saccoglossus kowalevskii] Length = 822 Score = 39.7 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 10/101 (9%) Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 A W +AI + P + AS + ++ +++ E + D++ Sbjct: 462 AKGWEKAIRAALMPVTP--------QPSEASATTPALHVDKAQQ-AAAEKCEFIKHEDIS 512 Query: 212 QHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLK 252 QHY+IF E ++ G G +GG+H K + +++++++ L+ Sbjct: 513 QHYQIFPDE-ILGSGQFGIVYGGVHRKSGRQVAIKVIDKLR 552 >gi|296165610|ref|ZP_06848133.1| possible (R)-6-hydroxynicotine oxidase [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295899026|gb|EFG78509.1| possible (R)-6-hydroxynicotine oxidase [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 472 Score = 39.7 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 42/139 (30%), Gaps = 14/139 (10%) Query: 65 RGSRTDPHSVGTGAHLVEGLTSLA-PYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAA 123 R S P ++ + + ++ G LA R G + + + Sbjct: 34 RMSEQQPSAIARALDADDVIAAVRFAAEHGRGLA-----------IRAGGHGVDGSAMPD 82 Query: 124 GALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASG 183 AL LS E S+ V L + + L+ P S + + G Sbjct: 83 DALVVDLSEFKEISVEPGSRRVRLGAGVLLGEMDGALAEYGLVVPAGTVSTTGVAGLTIG 142 Query: 184 AVLNVPFGMVERGWSSKVL 202 V + M RG + L Sbjct: 143 GG--VGYNMRARGATVDSL 159 >gi|291337005|gb|ADD96528.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C29] Length = 3493 Score = 39.7 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 60/165 (36%), Gaps = 5/165 (3%) Query: 118 SAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSAL---LAPGAIAS- 173 LA ++ ++ + A + + LAPG ++ Sbjct: 1 GGALADAERAYKAQGFSDEEAFNKAQAPALAQGLGTALITRGFGKTGVESILAPGMKSTF 60 Query: 174 QSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDM-ESLITDGLIGAFF 232 +++K V GA + ++ W S V + P++ +M E+ + G++G Sbjct: 61 VNVSKAVVKGAGMEATEEWYDQLWQSVVRKMSYQPELTFEQAFGEMAEAGVIGGILGGAV 120 Query: 233 GGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFD 277 G+ + + + S L + + ITER +S PG + Sbjct: 121 SGVKAVEGEVKSKMLDRQMDKEITERGRAIALQESLPPGFAGEPE 165 >gi|261326093|emb|CBH08919.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 1443 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 40/253 (15%), Positives = 68/253 (26%), Gaps = 26/253 (10%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEG 144 T+ + K++ L + A L S A + Sbjct: 556 TTTTSDPSSDTTINKMVDPGTFSLAAVLPTAAIVPDFFGQCLVVRNSSSALREYYANAIR 615 Query: 145 VDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKV--- 201 DK L E + ++ I S S A SG + G V RG Sbjct: 616 ADKGKILELMLLEYGAEWGSSISGDGITSSSAASFPISGTSSSTGLGAVSRGLRGTTRIA 675 Query: 202 ------LEDHGYPDMAQHYRIFDMESLITDG-----------LIGAFFGGMHSKQVQNMS 244 + + R L+ + ++GA G+ + V + + Sbjct: 676 PVAQTPADRSQSSQINATLRHSHTRPLLPNSGSSVDRSRSPFVLGAETPGVATAGVASAA 735 Query: 245 LRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQE 304 R L + + L T+ + D L D + E PH Sbjct: 736 GRRHESLFPFLISLASQAQTTSTPRDNLPTTSAPAASSGDVLELAPDREKQQEEPH---- 791 Query: 305 KLQTIADNTLEDP 317 +T A + + P Sbjct: 792 --RTTASHVITAP 802 >gi|148244422|ref|YP_001219116.1| ammonium transporter [Candidatus Vesicomyosocius okutanii HA] gi|146326249|dbj|BAF61392.1| ammonium transporter [Candidatus Vesicomyosocius okutanii HA] Length = 431 Score = 39.4 bits (90), Expect = 1.3, Method: Composition-based stats. Identities = 11/96 (11%), Positives = 25/96 (26%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEG 144 + P+ + G + ++ A L ++S + E Sbjct: 211 RPIPPHNMTMTITGAAMLWVGWFGFNGGSALAVGGNAAMAILVTHISAATGAITWMFYEW 270 Query: 145 VDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 + AL +V + P + + V Sbjct: 271 IKFGRPTALGTVTGMVAGLGTITPASGFVGPVGALV 306 >gi|172063894|ref|YP_001811545.1| outer membrane autotransporter [Burkholderia ambifaria MC40-6] gi|171996411|gb|ACB67329.1| outer membrane autotransporter barrel domain protein [Burkholderia ambifaria MC40-6] Length = 2366 Score = 39.4 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 1/142 (0%) Query: 97 AGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWR 156 AG L+ T AG + L+ S+++ + + + Sbjct: 1261 AGGSLASTGTVNLAGAGATFDLGGASGAQTIGALTGATGSTVNLGANALTLSGSGNNTFG 1320 Query: 157 EAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRI 216 + + L +Q++ + G S L G ++A Sbjct: 1321 -GAIGGTGSLTLAGAGTQTLTGANTYTGGTTINGGSTLALVSGGSLASTGTVNLAGTGAT 1379 Query: 217 FDMESLITDGLIGAFFGGMHSK 238 FD+ IGA G + Sbjct: 1380 FDVSGAAGAETIGALSGAAGTT 1401 >gi|221196139|ref|ZP_03569186.1| putative esterase [Burkholderia multivorans CGD2M] gi|221202812|ref|ZP_03575831.1| putative esterase [Burkholderia multivorans CGD2] gi|221176746|gb|EEE09174.1| putative esterase [Burkholderia multivorans CGD2] gi|221182693|gb|EEE15093.1| putative esterase [Burkholderia multivorans CGD2M] Length = 307 Score = 39.4 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 30/80 (37%), Gaps = 14/80 (17%) Query: 152 ALAWREAIVHTS-----ALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDH- 205 ALA + + A AP A+ SIA ASGA LN +V R + S L+ Sbjct: 9 ALALIAGSIAAARATPVASTAPSGAAASSIATPAASGATLNPGSSIVLRTFRSASLQRDW 68 Query: 206 --------GYPDMAQHYRIF 217 GY Y + Sbjct: 69 SYTVYLPPGYNAEGARYPVM 88 >gi|221505772|gb|EEE31417.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 2520 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 52/141 (36%), Gaps = 8/141 (5%) Query: 97 AGKLLSFIPTPLTR-----LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 AG +++F P LA S P++ G H A ++ + Sbjct: 207 AGGIVTFAPPDFFPATVSGLAASTSMSWPVSVGISAGGRGHAATTAFAAPPGAAAYPLSH 266 Query: 152 ALAWREAIVH--TSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A +V+ T+ ++ PG + + A GA ++P G +S + G Sbjct: 267 ARIPGGDLVYYLTAGVVLPGGAGA-GVVPAGALGAGTSLPPGATFLSHASAGENNGGAMV 325 Query: 210 MAQHYRIFDMESLITDGLIGA 230 AQ+ R+ D +L GA Sbjct: 326 SAQNARVGDHVALAGKAKQGA 346 >gi|221484245|gb|EEE22541.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 2520 Score = 39.0 bits (89), Expect = 1.7, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 52/141 (36%), Gaps = 8/141 (5%) Query: 97 AGKLLSFIPTPLTR-----LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 AG +++F P LA S P++ G H A ++ + Sbjct: 207 AGGIVTFAPPDFFPATVSGLAASTSMSWPVSVGISAGGRGHAATTAFAAPPGAAAYPLSH 266 Query: 152 ALAWREAIVH--TSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A +V+ T+ ++ PG + + A GA ++P G +S + G Sbjct: 267 ARIPGGDLVYYLTAGVVLPGGAGA-GVVPAGALGAGTSLPPGATFLSHASAGENNGGAMV 325 Query: 210 MAQHYRIFDMESLITDGLIGA 230 AQ+ R+ D +L GA Sbjct: 326 SAQNARVGDHVALAGKAKQGA 346 >gi|161525705|ref|YP_001580717.1| hypothetical protein Bmul_2536 [Burkholderia multivorans ATCC 17616] gi|189349573|ref|YP_001945201.1| collagen alpha chain precursor [Burkholderia multivorans ATCC 17616] gi|160343134|gb|ABX16220.1| hypothetical protein Bmul_2536 [Burkholderia multivorans ATCC 17616] gi|189333595|dbj|BAG42665.1| collagen alpha chain precursor [Burkholderia multivorans ATCC 17616] Length = 1860 Score = 39.0 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 41/246 (16%), Positives = 73/246 (29%), Gaps = 22/246 (8%) Query: 32 TGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYI 91 TG V+N +++ +VA DQ S +VGT + G Sbjct: 373 TGALNGVVNTATNTVNTIVAAHGAL--DQAVLDLASNGLNGAVGTVTGALGGNNPTGALN 430 Query: 92 AGAALAGKLLSFIPTPLTRLAGL-ALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETA 150 L P L G+ + + L L+ ++ + G + T Sbjct: 431 GVIGTVTGALGGANNPTGALNGVVSTVTGALGGNDPAGALNGVVG-TVTGALGGANNPTG 489 Query: 151 DALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDM 210 + P + + + N P G + G S V G D Sbjct: 490 ALNGVVGTVTGALGGNDPAGALNGVVGTVTGALGGANNPTGALN-GVVSTVTGALGGNDP 548 Query: 211 AQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRL---VNDLKEGITERLP--YKHGV 265 A +G++G G + N + L V+ + + P +GV Sbjct: 549 AG----------ALNGVVGTVTGALGGAN--NPTGALNGVVSTVTGALGGNGPTGALNGV 596 Query: 266 KSSSPG 271 +++ G Sbjct: 597 VTTAQG 602 >gi|39974363|ref|XP_368572.1| hypothetical protein MGG_00672 [Magnaporthe oryzae 70-15] gi|145018413|gb|EDK02692.1| hypothetical protein MGG_00672 [Magnaporthe oryzae 70-15] Length = 740 Score = 39.0 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 5/83 (6%) Query: 156 REAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLED-HGYPDMAQHY 214 + + P S + ASG N G + S+ +L GY A Y Sbjct: 655 SHGLTASLIEPTPAPANSGGLTPLGASGLSGNSNNGGTQN--SAALLTTLGGYDANATAY 712 Query: 215 RIFDMESLITDGLI--GAFFGGM 235 FD + + D L+ G FGG+ Sbjct: 713 DFFDPQHWMLDNLVDFGYSFGGV 735 >gi|317507692|ref|ZP_07965399.1| mechanosensitive ion channel [Segniliparus rugosus ATCC BAA-974] gi|316254019|gb|EFV13382.1| mechanosensitive ion channel [Segniliparus rugosus ATCC BAA-974] Length = 327 Score = 39.0 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 63/207 (30%), Gaps = 20/207 (9%) Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWR---EAIVHTSALLAPGAIASQSIA 177 + A A E + E + A+ I A+L I+ Sbjct: 52 VKASANVIARGSFKEQEPQIRGEAARQRATLLSAFIWVFTVIQIFVAVLMIATALELPIS 111 Query: 178 KTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA-QHYRIFDMESLITDGLIGAFFGGMH 236 AV G + VL G+ +A + YRI D+ L G G + Sbjct: 112 GFAPLAAVAGAGLGFGAQRIVQDVL--SGFFIIAEKQYRIGDLVQLAVLGTTNDPIGTVE 169 Query: 237 SKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 ++ LR + E +G S L + A +++R Sbjct: 170 QVTLRVTKLRSTDG------ELYTVPNGQIIKSTNLSKDWAQAVVDIPVPASSDIAVLR- 222 Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHL 323 EKL + D +DP+ KP L Sbjct: 223 -------EKLTEVCDTAKDDPNLKPLL 242 >gi|85094267|ref|XP_959849.1| hypothetical protein NCU05858 [Neurospora crassa OR74A] gi|28921305|gb|EAA30613.1| hypothetical protein NCU05858 [Neurospora crassa OR74A] Length = 1134 Score = 39.0 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 13/132 (9%) Query: 12 RDNIKEWAQRPRVSPDI---KWHTGLGKEVINMPARSLDKLVAPFRE---------ETHD 59 RD++ +WA RP P K H G + N + ++ + +++ Sbjct: 695 RDHLFDWA-RPTRQPKPHQVKTHAGAQHVLDNDKEGTSGAYMSSWHAGLESLLGKPLSNN 753 Query: 60 QPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSA 119 P+ + R D H A + + + A LAG+ + L L Sbjct: 754 NPDAHDSQRRDIHEQLYSAEWADQVKAFFAQTTDALLAGESYNVGGHLLVDLVRDVGNIV 813 Query: 120 PLAAGALYAYLS 131 P A +S Sbjct: 814 PTLFAAKVFGIS 825 >gi|257414269|ref|ZP_05591966.1| conserved hypothetical protein [Roseburia intestinalis L1-82] gi|257200595|gb|EEU98879.1| conserved hypothetical protein [Roseburia intestinalis L1-82] Length = 401 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 36/144 (25%), Positives = 61/144 (42%), Gaps = 11/144 (7%) Query: 322 HLPEPEPLPQYKE-HSDRQKPSEPLAEHPHPKRKEVERELSEIEGA-KKESSARKFFDEG 379 H E + P +KE HS +P++ H K VE++ S +EGA K E +A + G Sbjct: 175 HFEEQDKPPGFKEKHSPLSRPAQEAGILVHNKIHSVEKDNSGVEGAHKSEETAERGLKYG 234 Query: 380 SPDHSPFKGERNQKLDPMR----GADFTDAPHAKFDATTFTESLPHVDEQTMHRF---SE 432 + +G R+ KL P R + F P + + RF + Sbjct: 235 ARKIK--QGYRSHKLKPYREAAKAEKAAFKANVDFQYHKTLHDNPQLTSNPISRFWQKQK 292 Query: 433 LKERHPVEAREVLEGLQEKLQGTK 456 +K ++ EAR +G++ + T+ Sbjct: 293 IKRQYAKEARNTAKGIKGAAERTR 316 >gi|261337975|ref|ZP_05965859.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] gi|270277472|gb|EFA23326.1| conserved hypothetical protein [Bifidobacterium gallicum DSM 20093] Length = 668 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 37/119 (31%), Gaps = 10/119 (8%) Query: 123 AGALYAYLSHKAESS--IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 ++ S + EGV++ T + +V + P +S A + Sbjct: 57 GFTAVTKTQEFSKESQGDFLRYEGVEEGTRVDTSTPITVVESLGPGVPKGTVGKSEADAI 116 Query: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDG-----LIGAFFGG 234 + + VP E SSK G M D +L D ++G G Sbjct: 117 TAVKDMGVPLSRAEVVVSSKTDVKPGDVAMTAP---ADGTALAKDDADRGIVLGVAAKG 172 >gi|237807961|ref|YP_002892401.1| sensor protein KdpD [Tolumonas auensis DSM 9187] gi|237500222|gb|ACQ92815.1| Osmosensitive K channel His kinase sensor [Tolumonas auensis DSM 9187] Length = 897 Score = 38.6 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 56/171 (32%), Gaps = 21/171 (12%) Query: 13 DNIKEWAQRPRVSPDIKWHT--------GLGKE---VINMPARSLDKLVAPFREETHDQP 61 D + + Q R S + WHT G G ++ + AR +L + + P Sbjct: 229 DRVDDQMQAYRHSGEPVWHTRDAILVCIGPGSGNEKLVRVAARLASRLGCVWHAVYVETP 288 Query: 62 NYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFI-PTPLTRLAGLALQSAP 120 +R + S+ + H + L + + A +L + L ++ Q P Sbjct: 289 RLHRLPEQERRSILSTLHFAQELGAETSTLPAQDEADAILYYAREHNLGKILIGRHQKKP 348 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 Y ++ + +G D + H ++ L P I Sbjct: 349 W-------YRWGQSRFAHRLGTKGPDLDLLIVSLTDTE--HAASTLLPADI 390 >gi|302541829|ref|ZP_07294171.1| PE-PGRS family protein [Streptomyces hygroscopicus ATCC 53653] gi|302459447|gb|EFL22540.1| PE-PGRS family protein [Streptomyces himastatinicus ATCC 53653] Length = 455 Score = 38.6 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 63/186 (33%), Gaps = 9/186 (4%) Query: 9 EDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSR 68 E IR + + +R ++ ++ LG + +PA L + D G Sbjct: 239 ERIRLRLPDGTRR-ELTIGARFDRSLGFADLVVPAAVLRPHTP---DPLLDAVYLVTGPD 294 Query: 69 TDPHSVGTGAHLVEGLTSLAPY----IAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAG 124 P A L + S + A AG + P L A + LA Sbjct: 295 HRPSLDRDLARLTKAWPSARAADRDQVQEAGAAGAVDETWPVYLFSALIAAFTALALANT 354 Query: 125 ALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSI-AKTVASG 183 + A L E ++ I + +AW +V +L A+A + A ++A Sbjct: 355 VVMATLVRTGEFAMLRLIGATRRNVLALVAWESLVVAGCGVLLGAAVAGIVLSATSLALT 414 Query: 184 AVLNVP 189 +++ Sbjct: 415 GGIHIS 420 >gi|227505988|ref|ZP_03936037.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] gi|227197422|gb|EEI77470.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] Length = 475 Score = 38.6 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 52/182 (28%), Gaps = 24/182 (13%) Query: 30 WHTGLGKEVINMPARSLD---KLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTS 86 +H G+ K + + D +L A E G++ V+ + Sbjct: 225 YHDGVYKA-VEGAKQLNDGTKQLDAKVDEALSGVKQLDDGAKKVDGMAKQNQSKVQEVQR 283 Query: 87 LAPYIAGAALAGKLLSFIPTPLTRLA---GLALQSA-----------PLAAGALYAYLSH 132 P G +LLS + L + G A A PL G + + Sbjct: 284 ALPAPTGGVQDVQLLSPVVALLIAVVLTLGGACLGAFVVCSGRSPWLPLFGGVVVLAV-- 341 Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 AE G A + LA + + + GA L V G+ Sbjct: 342 LAEIMFFLLATG----PTGEAALWVGLAAAVTSLASAGLTTALLRYFGKVGAGLAVVLGL 397 Query: 193 VE 194 + Sbjct: 398 AQ 399 >gi|34498638|ref|NP_902853.1| long-chain fatty acid transport protein [Chromobacterium violaceum ATCC 12472] gi|34104491|gb|AAQ60849.1| long-chain fatty acid transport protein precursor [Chromobacterium violaceum ATCC 12472] Length = 445 Score = 38.6 bits (88), Expect = 2.6, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 42/129 (32%), Gaps = 10/129 (7%) Query: 82 EGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYL----SHKAESS 137 + +T + AGA +AG LS + L + G YA + S + S Sbjct: 35 QSVTGMGRAYAGAGMAGDDLSAVFYN--PAGMTLLSGTRVQGGLTYAEIDAPFSGRNTSV 92 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 H G T A + + P + + + G + PFG+ Sbjct: 93 SHLP--GTPPATVTTSANDNG--RGAGEVIPNGYLTHQVNDQLFLGLGVTTPFGLGASYS 148 Query: 198 SSKVLEDHG 206 + D+G Sbjct: 149 DNWGGRDNG 157 >gi|145608240|ref|XP_360722.2| hypothetical protein MGG_03265 [Magnaporthe oryzae 70-15] gi|145015734|gb|EDK00224.1| hypothetical protein MGG_03265 [Magnaporthe oryzae 70-15] Length = 976 Score = 38.2 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 55/182 (30%), Gaps = 13/182 (7%) Query: 19 AQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGA 78 RP SP + + + +PA + L P + + +DQ + R + G Sbjct: 798 MIRPLNSPKVGFLQAVNSPKRPLPADDFEDLNPPKKIQRNDQREFQRAESPLKGAAGRRL 857 Query: 79 HLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRL-AGLALQSAPLAAGALYAYLSHKAESS 137 + P A A + I L++L S L+A + L Sbjct: 858 ENQRRIHGQGPASYNTAPAPAIPREINFLLSQLPGAEVYNSTRLSASRVVDTLR------ 911 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 H + ++ R+ S P S + G PFG R Sbjct: 912 DTHVPDYQSWKSRQDKGLRQ-----SGAQMPSD-FSNPGYGRDSPGLRTGSPFGGERRIA 965 Query: 198 SS 199 S+ Sbjct: 966 SA 967 >gi|29840115|ref|NP_829221.1| hypothetical protein CCA00351 [Chlamydophila caviae GPIC] gi|29834463|gb|AAP05099.1| conserved hypothetical protein [Chlamydophila caviae GPIC] Length = 583 Score = 38.2 bits (87), Expect = 3.0, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 46/161 (28%), Gaps = 13/161 (8%) Query: 108 LTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALL- 166 L RL + GA + + + T L + ++ Sbjct: 203 LIRLGNNIVSKLSKGGGAFSLKMQRLSSTMSKVH-------TGITLGLVVGGIAAVGVIA 255 Query: 167 --APGAIASQSIAKTVASGAVLNV-PFGMVERGWS--SKVLEDHGYPDMAQHYRIFDMES 221 PG I + + A G L V SK + D+ I D++ Sbjct: 256 AVIPGGIFALPMIIAAAIGIGLAVLGLSYAIEAILERSKTNKKQLLKDLKSTIDIQDLKD 315 Query: 222 LITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYK 262 + D + + + Q M+L + +E R + Sbjct: 316 MTLDQTVLMNMLKVSLQADQQMTLDHKDFYEEYNRIRDNLQ 356 >gi|332561007|ref|ZP_08415325.1| bacteriophge tail fiber protein [Rhodobacter sphaeroides WS8N] gi|332274805|gb|EGJ20121.1| bacteriophge tail fiber protein [Rhodobacter sphaeroides WS8N] Length = 532 Score = 38.2 bits (87), Expect = 3.1, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 41/113 (36%), Gaps = 3/113 (2%) Query: 80 LVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAY---LSHKAES 136 L LTS + A A GK+L PL + +AP AA + ++ Sbjct: 81 LNNTLTSTSQAQALTAAQGKVLQDTKAPLASPGLTGVPTAPTAAAGTDTGQLATTAFVQN 140 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVP 189 I + + TA + A + A+ + +A +A A L+ P Sbjct: 141 QIAASVPDATEATAGKVRLASAAQIAAGTAGALAVTAARLAPLLAEKAGLDSP 193 >gi|289767365|ref|ZP_06526743.1| FecCD-family membrane transporter [Streptomyces lividans TK24] gi|289697564|gb|EFD64993.1| FecCD-family membrane transporter [Streptomyces lividans TK24] Length = 368 Score = 38.2 bits (87), Expect = 3.2, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 45/124 (36%), Gaps = 9/124 (7%) Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQI 142 +T+ + A A + S + L L G S P+AAGA+ L H S+ Sbjct: 193 AVTTFMVFAAEHGEAAR--SAMMWLLGSLGGANWSSVPIAAGAVLGGLLHLGWSARRLNA 250 Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASG-AVLNVP------FGMVER 195 + ETA AL + L A+ +A + A G L VP G R Sbjct: 251 LAMGDETAAALGVDPGRLRKELFLTASAVTGAVVAVSGAIGFVGLMVPHAARMLVGADHR 310 Query: 196 GWSS 199 + Sbjct: 311 RLLA 314 >gi|256783487|ref|ZP_05521918.1| FecCD-family membrane transport protein [Streptomyces lividans TK24] Length = 336 Score = 38.2 bits (87), Expect = 3.2, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 45/124 (36%), Gaps = 9/124 (7%) Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQI 142 +T+ + A A + S + L L G S P+AAGA+ L H S+ Sbjct: 161 AVTTFMVFAAEHGEAAR--SAMMWLLGSLGGANWSSVPIAAGAVLGGLLHLGWSARRLNA 218 Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASG-AVLNVP------FGMVER 195 + ETA AL + L A+ +A + A G L VP G R Sbjct: 219 LAMGDETAAALGVDPGRLRKELFLTASAVTGAVVAVSGAIGFVGLMVPHAARMLVGADHR 278 Query: 196 GWSS 199 + Sbjct: 279 RLLA 282 >gi|254466410|ref|ZP_05079821.1| oxoglutarate dehydrogenase (succinyl-transferring), E1 component [Rhodobacterales bacterium Y4I] gi|206687318|gb|EDZ47800.1| oxoglutarate dehydrogenase (succinyl-transferring), E1 component [Rhodobacterales bacterium Y4I] Length = 911 Score = 37.8 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 60/199 (30%), Gaps = 30/199 (15%) Query: 42 PARSLDKLVAPFREETHDQPNYYRGSRTDPHS--VGTGAHLVEGLTSLAPYIAGAALAGK 99 P ++ + A F+ + +++ + + + G +HL + A+A + Sbjct: 534 PEGEIEDMKAAFQAQLNEEFEAGKDYKPNKADWLDGRWSHLNK--KDADYQRGSTAIAPE 591 Query: 100 LLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAI 159 L+ I T L+R+ PL + G ET + W Sbjct: 592 TLAEIGTALSRVPD----GFPL-----------HRTVARFLDARGKMFETGEGFDWATGE 636 Query: 160 VHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYP-----DMAQHY 214 L + + G F G + E+ YP Y Sbjct: 637 AMAFGSLLLEGYPVRLAGQDATRGT-----FSQRHSGIVDQETEERYYPLNNIRAGQSQY 691 Query: 215 RIFDMESLITDGLIGAFFG 233 + D +L ++G +G Sbjct: 692 EVID-SALSEYAVLGFEYG 709 >gi|212715502|ref|ZP_03323630.1| hypothetical protein BIFCAT_00400 [Bifidobacterium catenulatum DSM 16992] gi|212661584|gb|EEB22159.1| hypothetical protein BIFCAT_00400 [Bifidobacterium catenulatum DSM 16992] Length = 521 Score = 37.8 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 37/152 (24%), Positives = 51/152 (33%), Gaps = 12/152 (7%) Query: 265 VKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLP 324 S+P D H +HG++ Q A N +ED F P +P Sbjct: 377 DAPSAPAAPVIPDTPVVHQPEESHGINIAPDSSLAALAQMAQNIDAPNPVEDT-FTPRMP 435 Query: 325 EPEP--LPQYKEHSDR--QKPSEPLAEHPHPKRK-EVERELSEIEGAKKESSARKFFDEG 379 LPQ S P+ P + P P + + +E AK +K +E Sbjct: 436 SLSTPNLPQVNTESINLGTLPTVPPSFTPEPATSADHSTTATPVESAKPTVEEKK--NET 493 Query: 380 SPDHSPFKGERNQKLDPMRGADFTDAPHAKFD 411 P +P G N LD D D FD Sbjct: 494 KPATNPMFGPTNSNLD----VDIPDLSFPSFD 521 >gi|85707582|ref|ZP_01038648.1| phosphoenolpyruvate-protein phosphotransferase [Erythrobacter sp. NAP1] gi|85689116|gb|EAQ29119.1| phosphoenolpyruvate-protein phosphotransferase [Erythrobacter sp. NAP1] Length = 756 Score = 37.8 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 47/125 (37%), Gaps = 12/125 (9%) Query: 136 SSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVER 195 S + E VD+E A +L+ +++ T + L T+ G V R Sbjct: 155 SELITNAELVDEEEALSLSPQQSGTQTLSGL------------TLVRGLGAGVAAYHQPR 202 Query: 196 GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGI 255 ++V+ D + + YR FD DGL G+ + + + + EG Sbjct: 203 VQITQVMADDIEAERQRVYRAFDKMREQIDGLTNQADFGVGGEHEEVLETYKMFAYDEGW 262 Query: 256 TERLP 260 + R+ Sbjct: 263 SRRIN 267 >gi|5880612|gb|AAD54768.1|AF120157_1 endo-1,4-beta-xylanase [Xylanimicrobium pachnodae] Length = 1183 Score = 37.8 bits (86), Expect = 4.2, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 45/138 (32%), Gaps = 4/138 (2%) Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQI 142 SL+P A + + R+ +A +A GAL + ++ + Sbjct: 3 SRQSLSPGRVPGGPAPEHVGRTSRGWRRVIASGATAALIAGGALVGGALTSSAAAEPTVV 62 Query: 143 EGVDKETADALAWREAIVHTSALL-APGAIASQSIA--KTVASGAVLNVPFGMVERGWSS 199 VD E W ++ T A++ +P + A + P G+ G + Sbjct: 63 SAVDFEDGTTGTWTQSGSPTLAVVESPDGADDGQVLSITRAADYEGIQSPTGIFTPGQTY 122 Query: 200 K-VLEDHGYPDMAQHYRI 216 + D+A + Sbjct: 123 DFTMRARLAADVAGTADV 140 >gi|115359112|ref|YP_776250.1| outer membrane autotransporter [Burkholderia ambifaria AMMD] gi|115284400|gb|ABI89916.1| outer membrane autotransporter barrel domain protein [Burkholderia ambifaria AMMD] Length = 2371 Score = 37.8 bits (86), Expect = 4.2, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 39/142 (27%), Gaps = 1/142 (0%) Query: 97 AGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWR 156 AG L+ T AG + L S+++ + + + Sbjct: 1266 AGGSLASTGTVNLAGAGATFDLGGASGAETIGALIGATGSTVNLGANALTLSGSGNNTFG 1325 Query: 157 EAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRI 216 + + L +Q++ + G S L G ++A Sbjct: 1326 -GAIGGTGSLTLAGAGTQTLTGANTYTGGTTINGGSTLALVSGGSLASTGTVNLAGTGAT 1384 Query: 217 FDMESLITDGLIGAFFGGMHSK 238 FD+ IGA G + Sbjct: 1385 FDVSGAAGAETIGALSGAAGTN 1406 >gi|332967975|gb|EGK07062.1| xylulokinase [Desmospora sp. 8437] Length = 511 Score = 37.8 bits (86), Expect = 4.3, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 37/125 (29%), Gaps = 3/125 (2%) Query: 99 KLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADAL---AW 155 K+L+ + +L G + L S + G+D E L + Sbjct: 155 KVLNAKDYIVFKLTGAFVTDYSDGNSMGCFDLEDLKWSERILEASGIDPEKLPNLQPSTY 214 Query: 156 REAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYR 215 V A A G + G N+ G VE G + L + Sbjct: 215 VAGGVTEEAAKATGMALGTKVVIGAGDGVTANIGAGSVEEGKTYCSLGTSAWVTTTAKKP 274 Query: 216 IFDME 220 IFD E Sbjct: 275 IFDPE 279 >gi|187918967|ref|YP_001887998.1| methylmalonate-semialdehyde dehydrogenase [Burkholderia phytofirmans PsJN] gi|187717405|gb|ACD18628.1| methylmalonate-semialdehyde dehydrogenase [Burkholderia phytofirmans PsJN] Length = 501 Score = 37.8 bits (86), Expect = 4.3, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 28/78 (35%), Gaps = 8/78 (10%) Query: 221 SLITDGLIGAFFGGMHSKQVQNMSLRLVNDLK----EGITERLPYKHGVKSSSPGLHTSF 276 ++ TD LIGA FG + + V D+ + ER ++PG Sbjct: 267 AMATDALIGAAFGSAGERCMAISVAVAVGDVGDRLVAALAERTRALKIDDGTAPGAEMGP 326 Query: 277 DAYEAHTDTLAHGVDSLV 294 T ++SL+ Sbjct: 327 ----VITAAARERIESLI 340 >gi|226303686|ref|YP_002763644.1| hypothetical protein RER_01970 [Rhodococcus erythropolis PR4] gi|226182801|dbj|BAH30905.1| hypothetical protein RER_01970 [Rhodococcus erythropolis PR4] Length = 1112 Score = 37.8 bits (86), Expect = 4.5, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 76/243 (31%), Gaps = 19/243 (7%) Query: 44 RSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSF 103 + L P T + + ++VG+ G +P A +A Sbjct: 319 ETTTSLSVPATAITGTAVDLTATVAPN-NAVGSVQFKSNGTAIGSPVAVSAGVA------ 371 Query: 104 IPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTS 163 + AG +A AGA + S A++ VD ET +L+ + S Sbjct: 372 TLSHSFDAAGAQSVTADFTAGAGFVSSSASAQTVTVSDPAPVDVETTTSLSVPATAITGS 431 Query: 164 ALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLI 223 A+ ++A A G V G S V +G ++ + +S+ Sbjct: 432 AVDLTA-----TVAPNNAVGTVQFKSNGAA---IGSPVTVSNGTATLSHAFDAAGAQSIT 483 Query: 224 TDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHT 283 D GA F S Q +++ + T L + G A A Sbjct: 484 ADFTAGAGFVS-SSASAQTVTVSDPAPVDVETTTSLSVPATAIT---GTAVDLTATVAPN 539 Query: 284 DTL 286 + + Sbjct: 540 NAV 542 >gi|296128141|ref|YP_003635391.1| membrane protein-like protein [Cellulomonas flavigena DSM 20109] gi|296019956|gb|ADG73192.1| membrane protein-like protein [Cellulomonas flavigena DSM 20109] Length = 982 Score = 37.4 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 64/218 (29%), Gaps = 22/218 (10%) Query: 90 YIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKET 149 G A A L + +T L + +A+ + + + G+ T Sbjct: 653 DQTGEAKAAGLREGL--AMTDGGLDLLTAGVVASVRAAVGVEGQTAEDETLRG-GIASLT 709 Query: 150 ADALAWREAI---VHTSALLAPGAIASQSIAKTVASGAVLNVPFGM------------VE 194 A V L+ GA + + ++ GA + Sbjct: 710 AGVGELSTGGQALVDGLGELSAGAAELRDGSARLSVGAGTLADGAADLAAGTGRLAPGAQ 769 Query: 195 RGWSSKVLEDHGYPDMAQHYR-IFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKE 253 R + G +A R + + ++DG+ A G + + L + Sbjct: 770 RLSAGLRDAADGSQTLADRLRPAAEGSAALSDGVRAAADGALTLADRLRPAADGSRALAD 829 Query: 254 GITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVD 291 G R K SS G+ T+ D +D L D Sbjct: 830 G--ARTAADGATKLSS-GIRTAADGSRELSDGLRDAAD 864 >gi|271967415|ref|YP_003341611.1| signal transduction histidine kinase-like protein [Streptosporangium roseum DSM 43021] gi|270510590|gb|ACZ88868.1| Signal transduction histidine kinase-like protein [Streptosporangium roseum DSM 43021] Length = 403 Score = 37.4 bits (85), Expect = 4.9, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 48/165 (29%), Gaps = 28/165 (16%) Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASG---------------AVLNVPFGMVERG 196 AL W + + PG + + + A + R Sbjct: 125 ALWWVDLGAIGVGGVLPGVLLTAPLQPDTPLSLSLPLALAGAVIMPTAAYPITAWAGARA 184 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGIT 256 ++ L P++A + + + D + ++++ +T Sbjct: 185 TMARALLGSPDPELA---EVVRSRARLVDA------FEIERRRIERDLHDGAQQRLVALT 235 Query: 257 ERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVD--SLVRGEYP 299 +L PG + EAH + + + L+RG +P Sbjct: 236 LKLGM--AQLDLEPGSPAAERVAEAHEEAMRALAELRELIRGVHP 278 >gi|110679794|ref|YP_682801.1| flagellar motor switch protein FliG, putative [Roseobacter denitrificans OCh 114] gi|109455910|gb|ABG32115.1| flagellar motor switch protein FliG, putative [Roseobacter denitrificans OCh 114] Length = 352 Score = 37.4 bits (85), Expect = 5.1, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 7/101 (6%) Query: 192 MVER--GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQ---VQNMSLR 246 MV R + + + D+ + R D E+L+T L GA GM + ++NMS R Sbjct: 244 MVRRAIFTFANIPQRIAARDIPRVVRALDQEALVT-ALAGAEAAGMQASAEFILENMSGR 302 Query: 247 LVNDLKEGITERLPYKHGVKSSSPGLHT-SFDAYEAHTDTL 286 + + L+E + ER K + L + EA D L Sbjct: 303 MADQLREEVQERETVKSADMEEASALIVQAIRELEASGDLL 343 >gi|322499046|emb|CBZ34118.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 884 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 44/131 (33%), Gaps = 10/131 (7%) Query: 90 YIAGAALAGKL-LSFIPTPLTRLAGLALQSAPLAAGALYAYLS----HKAESSIHHQIEG 144 A A+ + LS + G A+ APL A A + A+ S + Sbjct: 714 EAATQAIPEMVRLSVLHYTSRAAEGAAVDGAPLPATAEVTASQDDGRNVADRSPFSPADV 773 Query: 145 VDKETADALAWREAIVHT----SALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSK 200 + AD + E + ++L+ P A+ + V ER Sbjct: 774 TTADAADGKSVGEGRAASRHLKASLVRPTALTWNQV-DRVLVMLGTVTQLSEAERSLFRA 832 Query: 201 VLEDHGYPDMA 211 +L+D G ++ Sbjct: 833 LLDDDGSDSLS 843 >gi|146086661|ref|XP_001465607.1| hypothetical protein [Leishmania infantum JPCM5] gi|134069706|emb|CAM68030.1| conserved hypothetical protein [Leishmania infantum JPCM5] Length = 884 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 44/131 (33%), Gaps = 10/131 (7%) Query: 90 YIAGAALAGKL-LSFIPTPLTRLAGLALQSAPLAAGALYAYLS----HKAESSIHHQIEG 144 A A+ + LS + G A+ APL A A + A+ S + Sbjct: 714 EAATQAIPEMVRLSVLHYTSRAAEGAAVDGAPLPATAEVTASQDDGRNVADRSPFSPADV 773 Query: 145 VDKETADALAWREAIVHT----SALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSK 200 + AD + E + ++L+ P A+ + V ER Sbjct: 774 TTADAADGKSVGEGRAASRHLKASLVRPTALTWNQV-DRVLVMLGTVTQLSEAERSLFRA 832 Query: 201 VLEDHGYPDMA 211 +L+D G ++ Sbjct: 833 LLDDDGSDSLS 843 >gi|21225493|ref|NP_631272.1| FecCD-family membrane transport protein [Streptomyces coelicolor A3(2)] gi|8546927|emb|CAB94639.1| putative FecCD-family membrane transport protein [Streptomyces coelicolor A3(2)] Length = 368 Score = 37.4 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 30/124 (24%), Positives = 45/124 (36%), Gaps = 9/124 (7%) Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQI 142 +T+ + A A + S + L L G S P+AAGA+ + H S+ Sbjct: 193 AVTTFMVFAAEHGEAAR--SAMMWLLGSLGGANWSSVPIAAGAVLGGILHLGWSARRLNA 250 Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASG-AVLNVP------FGMVER 195 + ETA AL + L A+ +A + A G L VP G R Sbjct: 251 LAMGDETAAALGVDPGRLRKELFLTASAVTGAVVAVSGAIGFVGLMVPHAARMLVGADHR 310 Query: 196 GWSS 199 + Sbjct: 311 RLLA 314 >gi|228909190|ref|ZP_04073018.1| Transketolase [Bacillus thuringiensis IBL 200] gi|228850511|gb|EEM95337.1| Transketolase [Bacillus thuringiensis IBL 200] Length = 673 Score = 37.4 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 122 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 181 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 182 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIQKRAESVHWQYVRVEDGNDVDAITKAIT 240 >gi|218234257|ref|YP_002368092.1| transketolase [Bacillus cereus B4264] gi|218162214|gb|ACK62206.1| transketolase [Bacillus cereus B4264] Length = 664 Score = 37.4 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 113 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 172 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 173 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIEKRAESVHWQYVRVEDGNDVDAITKAIT 231 >gi|229151569|ref|ZP_04279771.1| Transketolase [Bacillus cereus m1550] gi|228631813|gb|EEK88440.1| Transketolase [Bacillus cereus m1550] Length = 664 Score = 37.4 bits (85), Expect = 5.5, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 113 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 172 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 173 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIQKRAESVHWQYVRVEDGNDVDAITKAIT 231 >gi|72388488|ref|XP_844668.1| nucleoporin (NUP54/57) [Trypanosoma brucei TREU927] gi|62360145|gb|AAX80565.1| nucleoporin (NUP54/57), putative [Trypanosoma brucei] gi|70801201|gb|AAZ11109.1| nucleoporin (NUP54/57), putative [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 641 Score = 37.4 bits (85), Expect = 5.5, Method: Composition-based stats. Identities = 20/125 (16%), Positives = 30/125 (24%), Gaps = 4/125 (3%) Query: 88 APYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDK 147 AP + A G + T + AGA A + G Sbjct: 18 APACSTAGGFGSGFNTATTGGFGAGANTATTGGFGAGANTATTGGFGAGANTVTTGG--F 75 Query: 148 ETADALAWREAIVHTSALLAPGAIAS-QSIAKTVASGAVLNVPFGMVERGWSSKVLEDHG 206 A + + G + + A T GA N G + G Sbjct: 76 GAGANTATTGGFGAGANTVTTGGFGAGANTATTGGFGAGANTAT-TGGFGAGANTATTGG 134 Query: 207 YPDMA 211 + A Sbjct: 135 FGAGA 139 >gi|228901875|ref|ZP_04066044.1| Transketolase [Bacillus thuringiensis IBL 4222] gi|228857765|gb|EEN02256.1| Transketolase [Bacillus thuringiensis IBL 4222] Length = 673 Score = 37.4 bits (85), Expect = 5.8, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 122 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 181 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 182 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIQKRAESVHWQYVRVEDGTDVDAITKAIT 240 >gi|228966278|ref|ZP_04127336.1| Transketolase [Bacillus thuringiensis serovar sotto str. T04001] gi|228793411|gb|EEM40956.1| Transketolase [Bacillus thuringiensis serovar sotto str. T04001] Length = 673 Score = 37.4 bits (85), Expect = 5.8, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 122 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 181 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 182 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIQKRAESVHWQYVRVEDGTDVDAITKAIT 240 >gi|115377331|ref|ZP_01464538.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|115365651|gb|EAU64679.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] Length = 274 Score = 37.0 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 44/127 (34%), Gaps = 15/127 (11%) Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 G+ L+ L + + A + + GA L+ + + + V Sbjct: 160 NGSGLSASLSTLFSNSPGLIGSAARWAGVVGNGASAV-LNGISAYQEAMRGDYV------ 212 Query: 152 ALAWREAIVHTSALLAPG-AIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDM 210 + L+A G + + S+ A+ V V ++ GW + + Y + Sbjct: 213 -------GAAGTGLMAVGSGVLAGSVFTGTAAPGVAVVGAALIGAGWVTNQFSEADYETI 265 Query: 211 AQHYRIF 217 A+ + ++ Sbjct: 266 ARQHGLY 272 >gi|237750952|ref|ZP_04581432.1| chaperonin GroEL [Helicobacter bilis ATCC 43879] gi|229373397|gb|EEO23788.1| chaperonin GroEL [Helicobacter bilis ATCC 43879] Length = 547 Score = 37.0 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 36/94 (38%), Gaps = 5/94 (5%) Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKE-TADALAWREAIVHTSALLAPGAIASQSIA 177 A L+ G + +E + + + VD +A A E IV A S+ A Sbjct: 369 AKLSGGVAVIKVGAPSEVEMKEKKDRVDDALSATKAAVEEGIVIGGGAALIHAA-SKVNA 427 Query: 178 KTVASGAVLNVPFGMVERGW---SSKVLEDHGYP 208 K + N+ F ++ R +++ + GY Sbjct: 428 KNASLKGDENIGFDIIHRAVKAPLAQIATNAGYD 461 >gi|260431300|ref|ZP_05785271.1| histidinol dehydrogenase [Silicibacter lacuscaerulensis ITI-1157] gi|260415128|gb|EEX08387.1| histidinol dehydrogenase [Silicibacter lacuscaerulensis ITI-1157] Length = 433 Score = 37.0 bits (84), Expect = 6.2, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 37/123 (30%), Gaps = 3/123 (2%) Query: 53 FREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLA 112 + D P+ V A + L L + L+F +T+ Sbjct: 20 LSAKREDSPDVDAVVAQIIADVR--ARGDAAVIELTAKFDRLQLTPETLAFSADEVTQAI 77 Query: 113 GLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIA 172 A A + E + E D+ A L WR + V + L PG +A Sbjct: 78 ATVSADDRAALELAAARIRAYHERQMPQDQEWTDESGA-TLGWRWSAVSAAGLYVPGGLA 136 Query: 173 SQS 175 S Sbjct: 137 SYP 139 >gi|23573417|gb|AAN38708.1| hemolysin/hemagglutinin-like protein HecA [Erwinia chrysanthemi] Length = 3848 Score = 37.0 bits (84), Expect = 6.2, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 78/286 (27%), Gaps = 52/286 (18%) Query: 27 DIKWHTGLGKEVINMPARSLDK--------------LVAPFREETHDQPNYYRGSRTDPH 72 D + G G I DK + P E + Q Y + Sbjct: 2067 DQSAYVGGGSSPITKQLDLADKFEIQNKHYSINYKPVGEPTSELINGQT--YAATIQAGG 2124 Query: 73 SVGTGAHLVEGLTSLAPYIAGA--ALAGKLLSFIPTPLTRLAGLA-----------LQSA 119 ++ TSL P G ALA L+ + + T + A + + Sbjct: 2125 AITASFTQNISNTSLQPGSGGVMPALATPTLAGV-SAFTPVGAQAGRELSGGTAAAVSGS 2183 Query: 120 PLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKT 179 PL+ L+ +AE TA R L P I Sbjct: 2184 PLSGTGNGVALAGQAERP----------GTAAGAVTRAGTDAGGGTLTPAGI-------D 2226 Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQ 239 G V G + G L G +A +GL A G Sbjct: 2227 SGLGTAAPVAPGALSPGDLQAALRQ-GLAQVAGPSLTDYPLPTSQNGLFVADTAGDSRYL 2285 Query: 240 VQ-NMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTD 284 ++ N +L + + + L G+ +PG + TD Sbjct: 2286 IRSNPTLSQLGQVDNSLFGDL---RGLLGQTPGTSVPVETTPTLTD 2328 >gi|218898457|ref|YP_002446868.1| transketolase [Bacillus cereus G9842] gi|218542934|gb|ACK95328.1| transketolase [Bacillus cereus G9842] Length = 664 Score = 37.0 bits (84), Expect = 6.3, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 12/119 (10%) Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN----VPFGMVERGWSSKVLEDH 205 A + I + + A + K S N V G + G + + + Sbjct: 113 ATTGPLGQGIANAVGMAMAEAHLAAKFNKDGHSIIDHNTYALVGDGDLMEGVAYEAMSMA 172 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFG--------GMHSKQVQNMSLRLVNDLKEGIT 256 G+ + + ++D + DG +G F +H + V+ V+ + + IT Sbjct: 173 GHMKLGKLIVLYDSNEISLDGELGIAFSEDIQKRAESVHWQYVRVEDGTDVDAITKAIT 231 >gi|288919619|ref|ZP_06413948.1| D-alanine/D-alanine ligase [Frankia sp. EUN1f] gi|288349017|gb|EFC83265.1| D-alanine/D-alanine ligase [Frankia sp. EUN1f] Length = 369 Score = 37.0 bits (84), Expect = 6.8, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 40/111 (36%), Gaps = 8/111 (7%) Query: 77 GAHLVEGLTS-LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAE 135 A VE L S +A G+ L LAG+ +P+ AGAL + Sbjct: 76 LAGAVEVLRSCVAAVPMLHGPGGE--DGTLAALCELAGVPYVGSPVRAGALA-----MDK 128 Query: 136 SSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVL 186 + E V TA + A A++AP + ++ + G L Sbjct: 129 WATKLVAEAVGVRTAPGILVNRARTAAGAVMAPLPAVVKPVSAGSSYGVSL 179 >gi|301105238|ref|XP_002901703.1| inositol transporter, putative [Phytophthora infestans T30-4] gi|262100707|gb|EEY58759.1| inositol transporter, putative [Phytophthora infestans T30-4] Length = 488 Score = 37.0 bits (84), Expect = 7.0, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 30/103 (29%), Gaps = 8/103 (7%) Query: 87 LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVD 146 + P + L S L+ L A+ ++ ++ A LS + Sbjct: 1 MTPSGVISGALVLLQSPQGFALSDLQSEAVVASAVSGAIAGAALSGIGNDKFGRR----- 55 Query: 147 KETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVP 189 LA + L+A + IA + G + Sbjct: 56 ---QVILASSALFTVGAGLMAVAGSFLELIAGRLIVGVGIGCA 95 >gi|297202669|ref|ZP_06920066.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197713244|gb|EDY57278.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 378 Score = 37.0 bits (84), Expect = 7.0, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 60/187 (32%), Gaps = 18/187 (9%) Query: 30 WHTGLGKEVINMPA--RSLDKLVAPFREETHDQPNYYRGSRTDPH----SVGTGAHLVEG 83 + TG +++ A + K+V R T +V T ++ Sbjct: 129 FFTGFISDLVANTAVAERVAKIVDLVRLFTSAAERVAGLLERFSGLSAETVATLERMLTA 188 Query: 84 LTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIE 143 + ++ A L +F+ + + A+ P+ GA + Sbjct: 189 VARVSASFARTGLESFATNFVADSGSLMVTQAVNGQPVTVGADL--------RNGALLAG 240 Query: 144 GVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLE 203 G TA A A + + L G + + T A+GA+ NV G+ + + Sbjct: 241 GTAGFTAGAGAIGARVTGVAGDLLRG----EGLLGTAANGALGNVTGGVTADYANGQDAS 296 Query: 204 DHGYPDM 210 G + Sbjct: 297 TMGQDAL 303 >gi|83747470|ref|ZP_00944509.1| Hypothetical Protein RRSL_02855 [Ralstonia solanacearum UW551] gi|83725927|gb|EAP73066.1| Hypothetical Protein RRSL_02855 [Ralstonia solanacearum UW551] Length = 433 Score = 37.0 bits (84), Expect = 7.3, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 69/209 (33%), Gaps = 44/209 (21%) Query: 10 DIRDNIKEWAQRPRVSPDIKWHTGLGKE--------VINMPARSLDKLVA------PFRE 55 D++ + +A+ P + G G ++ + L A P Sbjct: 146 DVKQQLTAFAKTPAQVGAV---VGAGSGVADAFGGSLVTKATSNTQWLAASAAHLEPVMA 202 Query: 56 ETHD--QPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA---GAALAGKLLSFIPTPLTR 110 + H QP+ R + + + T +AP GAA A + S+I Sbjct: 203 QAHQAVQPSLRRLAVEVSGAFQAYSLRNVVRTGVAPLATHVLGAATAANVDSWI------ 256 Query: 111 LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGV----DKETADALAWREA-------I 159 A P+A A Y + H E+ E + D ET L +++ Sbjct: 257 ----AAVGGPVAGAAAYMAMQHMNETQHRTGAEYLLGRTDWETQFTL-LKQSTWTDPLKG 311 Query: 160 VHTSALLAPGAIASQSIAKTVASGAVLNV 188 A P + ++++A T + N+ Sbjct: 312 AAQRAAKLPVDLLTETLAATRSLFTATNI 340 >gi|220911249|ref|YP_002486558.1| D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding [Arthrobacter chlorophenolicus A6] gi|219858127|gb|ACL38469.1| D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding [Arthrobacter chlorophenolicus A6] Length = 328 Score = 37.0 bits (84), Expect = 7.6, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 82 EGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP-LAAGALYAYLSHKAESS-IH 139 + L L+P L G + P + P + AGA+ L + + Sbjct: 13 QLLADLSPLP--EGLRGVVWDMQGEPDAAHGSIDGVILPYINAGAVLGNLDKVQDLKFVQ 70 Query: 140 HQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSS 199 Q G D + + PGA + + A+ A L V + + Sbjct: 71 TQSTGFD-----------GVREAAG---PGAAVANASGVHAAATAELAVGLILAKLRGID 116 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERL 259 + + D + A R +SL ++ GG+ + + + V + G TER Sbjct: 117 QAVRDQATENWAPQRR----QSLADRRVLLLGIGGIGQELARRLEPFEVTVTRVGSTERT 172 Query: 260 PYKHGVKSSSPGLHT 274 +HG SS L T Sbjct: 173 D-EHGQVHSSAQLET 186 >gi|219682767|ref|YP_002469150.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis AD011] gi|241190343|ref|YP_002967737.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195749|ref|YP_002969304.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|219620417|gb|ACL28574.1| putative nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis AD011] gi|240248735|gb|ACS45675.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250303|gb|ACS47242.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|289178066|gb|ADC85312.1| Nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis BB-12] gi|295793330|gb|ADG32865.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis V9] Length = 486 Score = 36.7 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 49/231 (21%), Positives = 82/231 (35%), Gaps = 23/231 (9%) Query: 107 PLTRLAGLALQSAP-LAAGALYAYLSHKAESSIH-HQIEGVDKETA---DALAWREAIVH 161 L L P + A L H +E QI + T D EA+ Sbjct: 226 GTANLLAAKLYDLPAIGTAAHCFTLVHDSERQAFESQIAALGTNTTLLVDTYNIEEAVKT 285 Query: 162 TSALLAPGAIASQSIAKTVASGA--VLNV--PFGMVE-RGWSSKVLEDHGYPDMAQHYRI 216 + P + + +A+ A V N G + + L++ Y + Sbjct: 286 AVEVAGPNLGGVRIDSGDLAALAQRVRNQLDALGATNTKITVTNDLDE--YAIASLQTAP 343 Query: 217 FDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSF 276 D + T + G+ G V ++ R N G E + K K+++PG +F Sbjct: 344 VDSYGVGTQLVTGS--GAPTCAMVYKLTERANN---AGHMEPVAKKSVDKATAPGAKLAF 398 Query: 277 DAYE---AHTDTLAHGVDSLVRGEYPHFDQEKLQT--IADNTLEDPHFKPH 322 +YE A + + G +S + P E L T + + T+ DP F+ H Sbjct: 399 RSYEYSLADCEHVISGSESALENFVPGEGWEDLLTDFVVNGTV-DPQFQGH 448 >gi|183601852|ref|ZP_02963221.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis HN019] gi|183218737|gb|EDT89379.1| nicotinate phosphoribosyltransferase [Bifidobacterium animalis subsp. lactis HN019] Length = 440 Score = 36.7 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 49/231 (21%), Positives = 82/231 (35%), Gaps = 23/231 (9%) Query: 107 PLTRLAGLALQSAP-LAAGALYAYLSHKAESSIH-HQIEGVDKETA---DALAWREAIVH 161 L L P + A L H +E QI + T D EA+ Sbjct: 180 GTANLLAAKLYDLPAIGTAAHCFTLVHDSERQAFESQIAALGTNTTLLVDTYNIEEAVKT 239 Query: 162 TSALLAPGAIASQSIAKTVASGA--VLNV--PFGMVE-RGWSSKVLEDHGYPDMAQHYRI 216 + P + + +A+ A V N G + + L++ Y + Sbjct: 240 AVEVAGPNLGGVRIDSGDLAALAQRVRNQLDALGATNTKITVTNDLDE--YAIASLQTAP 297 Query: 217 FDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSF 276 D + T + G+ G V ++ R N G E + K K+++PG +F Sbjct: 298 VDSYGVGTQLVTGS--GAPTCAMVYKLTERANN---AGHMEPVAKKSVDKATAPGAKLAF 352 Query: 277 DAYE---AHTDTLAHGVDSLVRGEYPHFDQEKLQT--IADNTLEDPHFKPH 322 +YE A + + G +S + P E L T + + T+ DP F+ H Sbjct: 353 RSYEYSLADCEHVISGSESALENFVPGEGWEDLLTDFVVNGTV-DPQFQGH 402 >gi|56708986|ref|YP_165031.1| histidinol dehydrogenase [Ruegeria pomeroyi DSS-3] gi|81819866|sp|Q5LL27|HISX3_SILPO RecName: Full=Histidinol dehydrogenase 3; Short=HDH 3 gi|56680671|gb|AAV97336.1| histidinol dehydrogenase [Ruegeria pomeroyi DSS-3] Length = 433 Score = 36.7 bits (83), Expect = 8.0, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 38/136 (27%), Gaps = 4/136 (2%) Query: 40 NMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGK 99 P A + D P+ V A + L AL + Sbjct: 8 RQPDFETA-FTALLGAKREDSPDVDAVVAGIIADVR--ARGDAAVIELTERFDRVALTPQ 64 Query: 100 LLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAI 159 L F + + A A + E + + D +T L WR + Sbjct: 65 SLRFSTEEIAQAVDEVPAPERAALELAAARIRAYHERQMPQDADWTD-DTGARLGWRWSA 123 Query: 160 VHTSALLAPGAIASQS 175 V + L PG +AS Sbjct: 124 VSAAGLYVPGGLASYP 139 >gi|154687620|ref|YP_001422781.1| hypothetical protein RBAM_032200 [Bacillus amyloliquefaciens FZB42] gi|154353471|gb|ABS75550.1| NagA [Bacillus amyloliquefaciens FZB42] Length = 396 Score = 36.7 bits (83), Expect = 8.5, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 68/200 (34%), Gaps = 14/200 (7%) Query: 109 TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHT--SALL 166 T LQ A A +L A SS HH+ G A+ +A L Sbjct: 203 TDAGAELLQKAADAGAVHMTHL-FNAMSSFHHRKPG---------GIGTALACGRITAEL 252 Query: 167 APGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDG 226 I S +A +A A + M+ +K L+D Y Q + +L++DG Sbjct: 253 ITDGIHSHPLAVKLAYLAKGSKNLIMITDSMRAKGLKDGEYEFGGQKVTVRGDTALLSDG 312 Query: 227 LIGAFFGGM--HSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTD 284 + M + ++ + D+ + + G+ + DA TD Sbjct: 313 TLAGSILKMNEGAALMRRFTNCSWLDIANMTSANAARRLGIFDRKGSIAEGKDADVVLTD 372 Query: 285 TLAHGVDSLVRGEYPHFDQE 304 + ++ RG + +E Sbjct: 373 GQCGVLATICRGNTAYISRE 392 >gi|167583581|ref|YP_001671771.1| hypothetical protein phi32_26 [Enterobacteria phage phiEco32] gi|164375419|gb|ABY52827.1| hypothetical protein phi32_26 [Enterobacteria phage phiEco32] Length = 1473 Score = 36.7 bits (83), Expect = 8.6, Method: Composition-based stats. Identities = 24/117 (20%), Positives = 40/117 (34%), Gaps = 7/117 (5%) Query: 99 KLLSFIPTPLTRLAGLA-----LQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADAL 153 K+ I + + AG+A + PLAA A+ A + EG DK + Sbjct: 116 KIEDGIGKTVGQYAGVAGDIGMTVANPLAAAAIIAGRETGRAYADQTPEEGEDK-SILDA 174 Query: 154 AWREAIVHTSALLAPGAIA-SQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A + + + PGA+ ++S + N G + Y D Sbjct: 175 ALVGGANYAAQRILPGAVGTAESTLGRIGQNVASNAVAGAKGGALVGAAEVQNKYGD 231 >gi|115380545|ref|ZP_01467507.1| salicylate biosynthesis isochorismate synthase [Stigmatella aurantiaca DW4/3-1] gi|310821605|ref|YP_003953963.1| isochorismate synthase [Stigmatella aurantiaca DW4/3-1] gi|115362446|gb|EAU61719.1| salicylate biosynthesis isochorismate synthase [Stigmatella aurantiaca DW4/3-1] gi|309394677|gb|ADO72136.1| Isochorismate synthase [Stigmatella aurantiaca DW4/3-1] Length = 453 Score = 36.7 bits (83), Expect = 9.1, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 52/186 (27%), Gaps = 21/186 (11%) Query: 21 RPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHS---VGTG 77 P ++ +W G+ P +D L P D P + V Sbjct: 23 APALAGQERWVGGMLYLAAVDPLAGVDVLGEP--SLYWDSPQMREVVAGWGEAGAMVAGS 80 Query: 78 AHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESS 137 A + L A AG++ + +P P A + G E Sbjct: 81 AQEAREVLRLLSSAATVRWAGEVPASLPGPWFGGMRFAAEGKDEGWGPFGFGRWTLPER- 139 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + WRE +A P ++ + + G N P G + Sbjct: 140 ---------------MVWREGDRLAAAAFVPEGPGAEEQVRALLVGLGANFPAGPLPSRR 184 Query: 198 SSKVLE 203 +++ L Sbjct: 185 TAQALR 190 >gi|71013544|ref|XP_758617.1| hypothetical protein UM02470.1 [Ustilago maydis 521] gi|46098275|gb|EAK83508.1| hypothetical protein UM02470.1 [Ustilago maydis 521] Length = 405 Score = 36.7 bits (83), Expect = 9.3, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 5/72 (6%) Query: 338 RQKPSEPLAEHPHPKRKEVERELSEIEGAKK-----ESSARKFFDEGSPDHSPFKGERNQ 392 R P + A P P R RE + + + + R F G+PD++P +G R+ Sbjct: 265 RGDPYDRYARGPPPPRDYAARERDYLGPPPRGGPGMDYAPRDFAPRGAPDYAPPRGYRDM 324 Query: 393 KLDPMRGADFTD 404 P RGA + D Sbjct: 325 SPPPPRGARYDD 336 >gi|325092850|gb|EGC46160.1| cell cycle inhibitor Nif1 [Ajellomyces capsulatus H88] Length = 767 Score = 36.7 bits (83), Expect = 9.7, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 37/95 (38%), Gaps = 5/95 (5%) Query: 262 KHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY--PHFDQEKLQTIADNTLEDPHF 319 K G + S + S D+ D S + +Y P ++ L+ PHF Sbjct: 334 KRGNRPSPITVPESPDSSAKVDDAQTSAPTSYIYAKYALPRGRSVSRDSLVFTGLQTPHF 393 Query: 320 KPHLP--EPEPLPQYKEHSD-RQKPSEPLAEHPHP 351 + + P E P P E Q+PS P A H HP Sbjct: 394 EWNEPLFESSPSPSAPEKETLEQEPSSPAATHAHP 428 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.127 0.329 Lambda K H 0.267 0.0394 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,556,503,376 Number of Sequences: 14124377 Number of extensions: 126862999 Number of successful extensions: 462223 Number of sequences better than 10.0: 705 Number of HSP's better than 10.0 without gapping: 68 Number of HSP's successfully gapped in prelim test: 637 Number of HSP's that attempted gapping in prelim test: 460697 Number of HSP's gapped (non-prelim): 1832 length of query: 478 length of database: 4,842,793,630 effective HSP length: 143 effective length of query: 335 effective length of database: 2,823,007,719 effective search space: 945707585865 effective search space used: 945707585865 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 83 (36.6 bits)