BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] (864 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1] gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus] Length = 864 Score = 1782 bits (4616), Expect = 0.0, Method: Compositional matrix adjust. Identities = 864/864 (100%), Positives = 864/864 (100%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK Sbjct: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 Query: 61 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE 120 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE Sbjct: 61 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE 120 Query: 121 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS 180 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS Sbjct: 121 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS 180 Query: 181 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA 240 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA Sbjct: 181 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA 240 Query: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI Sbjct: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ Sbjct: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 Query: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG 420 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG Sbjct: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG 420 Query: 421 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG 480 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG Sbjct: 421 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG 480 Query: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR 540 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR Sbjct: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR 540 Query: 541 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ 600 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ Sbjct: 541 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ 600 Query: 601 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE 660 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE Sbjct: 601 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE 660 Query: 661 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL 720 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL Sbjct: 661 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL 720 Query: 721 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV 780 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV Sbjct: 721 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV 780 Query: 781 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK 840 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK Sbjct: 781 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK 840 Query: 841 KGIELFQNMDEGLPHRLPFPFGED 864 KGIELFQNMDEGLPHRLPFPFGED Sbjct: 841 KGIELFQNMDEGLPHRLPFPFGED 864 >gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 825 Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 209/874 (23%), Positives = 357/874 (40%), Gaps = 98/874 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R SL + L+ AER R AG A Sbjct: 2 MRQECIQAVQQAAKRTLTAREIQDIEDRIYRNMRSLARDDPASWRQLTDAERLRRAGQLA 61 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 ++ Q+E R +L + ++ Q G GK AL + F A S + + Sbjct: 62 SDELQREAALKKRRVALTISARQRLDNFINNYQ-GADGKLGALNRTIAFSADGKSNFLSV 120 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 121 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVFEMRGQNTGNAKARKGAKAW 180 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 181 GEVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVTKDKWVSDVIGKLDRKYYTRSD 240 Query: 231 GTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G +S SE+ +F+GE + D + S R R HFKD+ +++ Y Sbjct: 241 GQLMSDSELTAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 300 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGNK 345 + +G ++ I+ L +SKDI + GPN D + ++ Q A A S K Sbjct: 301 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 359 Query: 346 VLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGAL 405 V +L E + + + V N A W +R+ AS LG + + Sbjct: 360 V-------ERLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSF 410 Query: 406 LEDG--FISRQMLSRVGIDKEAIQRINKMPLKERME--LLSDVGLYAEGVVAHGRNMMEG 461 + G ++S + ++ + +++ ++ M R E L GL E ++ Sbjct: 411 SDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELALARRAGLAMESLLGSVNRWAMD 469 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRLD 520 + + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 470 NMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRIL 529 Query: 521 PSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYH 580 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 530 KS-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG--------- 575 Query: 581 RKKLKNSKTLSPEQ-RQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHT 639 PE+ + E ++L +E+++ + V +Q Sbjct: 576 ----------EPERVKFEAMRKLLGAVTEEVDMAVITPGAREQMFVGSGLQ--------- 616 Query: 640 SLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHV 699 RGT GE R F + P + + S + MP A Sbjct: 617 --------------RGTWKGELTRSVFLFKSFPISVVMR--HWSRAMGMPSAGGRAAYIA 660 Query: 700 WIQYSATMALAGIGVAS--IKALLRGEDPS------LPEVIYDGTLANGALLPYMDRLTK 751 S TM +G S I L+ G +P + + + L G Y D L Sbjct: 661 TFLASTTM----LGALSMQITDLINGRNPKEMTGDHMVKFWINAFLKGGGAGLYGDFLFS 716 Query: 752 LVSKGDRAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNM 807 ++ A+ +LGPV +V ++ A + NE + + K + +P N+ Sbjct: 717 DHTRYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANL 776 Query: 808 WYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 WYLK + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 777 WYLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 810 >gi|332344341|gb|AEE57675.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 824 Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 206/873 (23%), Positives = 366/873 (41%), Gaps = 96/873 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGN 344 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A S Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KV +L + E + + + V N A W +R+ AS LG + + Sbjct: 358 KV-------ERLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSS 408 Query: 405 LLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMME 460 + G ++S + ++ + +++ ++ M R EL GL E ++ Sbjct: 409 FSDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAM 467 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRL 519 + + + + + SG ++ + + +G + L+ L D R+ Sbjct: 468 DNMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDYDFRI 527 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAY 579 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 528 LKS-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG-------- 574 Query: 580 HRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHT 639 PE+ +K + K+ V + V +V Sbjct: 575 -----------EPER------------------VKFEAMRKLLGAVTEEVDMAV-----I 600 Query: 640 SLFDRQRLGLLT-YKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNH 698 + R+R+ + + +RGT GE R F + P + + + MP A Sbjct: 601 TPGARERMFVGSGLQRGTWKGELTRSVFLFKSFPISVVMR--HWHRAMGMPSAGGRAAYI 658 Query: 699 VWIQYSATMALAGIGVASIKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKL 752 + A+ + G I L+ G +P ++ + + L G Y D L Sbjct: 659 A--TFLASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSD 716 Query: 753 VSKGDRAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMW 808 ++ A+ +LGPV +V ++ A + NE + + K + +P N+W Sbjct: 717 HTRYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLW 776 Query: 809 YLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 YLK + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 824 Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 204/873 (23%), Positives = 363/873 (41%), Gaps = 96/873 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDQMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINNYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKV 346 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 347 LKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALL 406 + +L + E + + + V N A W +R+ AS LG + + Sbjct: 358 SVE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFS 410 Query: 407 EDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGS 462 + G ++S + ++ + +++ ++ M R EL GL E ++ + Sbjct: 411 DLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDN 469 Query: 463 DAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRLDP 521 + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 470 MGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILK 529 Query: 522 SIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR 581 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 530 S-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG---------- 574 Query: 582 KKLKNSKTLSPEQ-RQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTS 640 PE+ + E ++L +E+++ + V +Q Sbjct: 575 ---------EPERVKFEAMRKLLGAVTEEVDMAVITPGAREQMFVGSGLQ---------- 615 Query: 641 LFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVW 700 RGT GE R F + P + + + MP A Sbjct: 616 -------------RGTWKGELTRSVFLFKSFPISVVMR--HWHRAMGMPSAGGRAAYIAT 660 Query: 701 IQYSATMALAGIGVAS--IKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKL 752 S TM +G S I L+ G +P ++ + + L G Y D L Sbjct: 661 FLASTTM----LGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSD 716 Query: 753 VSKGDRAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMW 808 ++ A+ +LGPV +V ++ A + NE + + K + +P N+W Sbjct: 717 HTRYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLW 776 Query: 809 YLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 YLK + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 824 Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 204/873 (23%), Positives = 362/873 (41%), Gaps = 96/873 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E++ F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSEFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKV 346 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 347 LKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALL 406 + +L + E + + + V N A W +R+ AS LG + + Sbjct: 358 SVE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFS 410 Query: 407 EDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGS 462 + G ++S + ++ + +++ ++ M R EL GL E ++ + Sbjct: 411 DLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDN 469 Query: 463 DAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRLDP 521 + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 470 MGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILK 529 Query: 522 SIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR 581 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 530 S-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG---------- 574 Query: 582 KKLKNSKTLSPEQ-RQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTS 640 PE+ + E ++L +E+++ + V +Q Sbjct: 575 ---------EPERVKFEAMRKLLGAVTEEVDMAVITPGAREQMFVGSGLQ---------- 615 Query: 641 LFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVW 700 RGT GE R F + P + + + MP A Sbjct: 616 -------------RGTWKGELTRSVFLFKSFPISVVMR--HWHRAMGMPSAGGRAAYIAT 660 Query: 701 IQYSATMALAGIGVAS--IKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKL 752 S TM +G S I L+ G +P ++ + + L G Y D L Sbjct: 661 FLASTTM----LGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSD 716 Query: 753 VSKGDRAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMW 808 ++ A+ +LGPV +V ++ A + NE + + K + +P N+W Sbjct: 717 HTRYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLW 776 Query: 809 YLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 YLK + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605] gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605] Length = 824 Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 209/877 (23%), Positives = 362/877 (41%), Gaps = 104/877 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE Q+E + R +L ++ Q G GK AL + F A S + Sbjct: 61 SEE-LQREAALNKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKF-NEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQGAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGN 344 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A S Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KV +L E + + + V N A W +R+ AS LG + + Sbjct: 358 KV-------ERLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSS 408 Query: 405 LLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMME 460 + G ++S + ++ + +++ ++ M R EL+ GL E ++ Sbjct: 409 FSDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELVRARRAGLAMESLLGSVNRWAM 467 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRL 519 + + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 468 DNMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRI 527 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAY 579 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 528 LKS-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG-------- 574 Query: 580 HRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHT 639 PE+ +K + K+ V + V +V Sbjct: 575 -----------EPER------------------VKFEAMRKLLGAVTEEVDMAV------ 599 Query: 640 SLFDRQRLGLLT---YKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMAL 696 + R L+T +RGT GE R F + P + + + MP A Sbjct: 600 -ITPGAREQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMR--HWHRAMGMPSAGGRAA 656 Query: 697 NHVWIQYSATMALAGIGVAS--IKALLRGEDP------SLPEVIYDGTLANGALLPYMDR 748 S TM +G S I L+ G +P ++ + + L G Y D Sbjct: 657 YIATFLASTTM----LGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDF 712 Query: 749 LTKLVSKGDRAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPF 804 L ++ A+ +LGPV +V ++ A + NE + + K + +P Sbjct: 713 LFSDHTRYGSGALASMLGPVVGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPG 772 Query: 805 MNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 N+WYLK + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 773 ANLWYLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] Length = 921 Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 219/945 (23%), Positives = 386/945 (40%), Gaps = 119/945 (12%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLDGK----GLSKAERYR----L 49 MK C+ + + GR+ EL+ +ED I VR ++ + G A+ Y+ L Sbjct: 1 MKQACVDAITQTLGRQPLASELKNIEDLISDSVRQVSRMNARAGKSGFPDADTYKQAADL 60 Query: 50 AGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--- 106 A + D K+ R +AI L ++ + SQ +F+ G Sbjct: 61 AARRVVHDVFKKRQRLAQNAIAINNVTETLNRNVPAPEQTPKNLSQFIFSGRRVADGKEI 120 Query: 107 ---SAEV-----------PLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFG----L 148 SAE L ++ AA V F + +G + D++ G L Sbjct: 121 DVVSAEELATGAFQDWSRQLSAEMTAAGGDVQKFFEQAQALGEQRFRNIFDQRVGKSSQL 180 Query: 149 DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRA 207 + E+ G+ T N A ++ + + + +++G D ++ +P D +RA Sbjct: 181 QLLKEIYGEDTGNPAAKKIASIWSDVTSRARQEMNDSGFDIGQRDDWHLPYVDEADLVRA 240 Query: 208 TKKDDFVRSML----------------DWL------------DLSRYKDIDGTPLSRSEI 239 +++++ ++ DW D S++ + DGTP++ + Sbjct: 241 AGREEWLATLPLAERTQARLAGRMPPGDWARRAWVDDIYNTQDRSQFVNPDGTPMNDVQY 300 Query: 240 ASFVGEVFAERVRSTSFK-DPSIPSSEVGVKREFE--RVFHFKDSQAHMDYMEHFGVSTN 296 + +F + + K DP + G+K RV FKD+++H YME + Sbjct: 301 REALEYIFETKATDGAQKLDPGAFAGSGGLKNRGSQSRVLAFKDAESHFGYMEKY-TQQP 359 Query: 297 VNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKL 356 V ++ S L + S+D+ + + GP+A + K +I I N V D G Sbjct: 360 VVGVMMSHLQTASRDLGVVKAFGPDAGTNFK-LIADRIYQ-------NAVKVDGAGHPIA 411 Query: 357 EVRQEAML--QMWEVMRYGETVENTG-WANWMAGLRSAAGASMLGQHPIGALLEDGFISR 413 E+ +E L +M++ M V +T +++ + GLR+ ++MLG I A D + R Sbjct: 412 EMNKERELVQRMFDSMAGLNGVNSTSVFSSAVGGLRNLMTSAMLGSSVITAT-SDQAVMR 470 Query: 414 QMLSRVGIDKEAIQ----RINKMPLKERMELLSDVGLYAEG---VVAHGRNMMEGSDAFQ 466 +G D+ ++ I + + +++GL + V+A M G D + Sbjct: 471 AAAQALGFDRNGMRLSATTIRNLFSGDAKRANAELGLLVDAHSAVIAK----MGGFDLTR 526 Query: 467 -IGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKA 525 I K KWSG +D+ ++ L++Y IG +T YA+L LK + S K Sbjct: 527 GITGWFAEKTLKWSGLIAMDRANKAAFGLLMYKNIGELTRRYATLDALKGSDKALLSSKG 586 Query: 526 FFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRD-LARMSDKI-AYHRKK 583 + + D+ ++ A+ TP I + D +R LA D++ A + Sbjct: 587 WSAE----DWAIMNAAELKPLTTSGHMGITPDAIYAVPDEKVRQILAGQIDRVRAGADEA 642 Query: 584 LKNSKTLSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLF 642 L N ++ + L+Q A++E+ ++++ + L L + A+ T+ Sbjct: 643 LANLGAMTDSRATNLRQAYDAEVEQTISRMVRNARAEAAQKL-LGVTHGEMSQAITTAT- 700 Query: 643 DRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLS-NSAKMPKGASMALNHVWI 701 G+ TY R + GE + F F TTP F ++ + N ++P AL + Sbjct: 701 -----GIDTYAR-DQGGELYKSFMLFKTTPFAGFRQMVTRAQNLDRVP-----ALKFL-A 748 Query: 702 QYSATMALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDR 758 Y L G+ + ALL G DP + P TL G Y D L + ++ Sbjct: 749 AYIGGTTLTGMFANQLNALLSGNDPIDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGS 808 Query: 759 AAIGGLLGPVPSMVTNLTSSAV---ELATKDNENS-KVNATKAIRKTLPFMNMWYLKNSF 814 + L GP + +L + + A + E S +A K R PF N+WY K Sbjct: 809 SIAATLGGPSLGLAESLMKLLITNPQKAMQGEETSFGADAIKTARMITPFANLWYTKAVT 868 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 +HLIL Q+ E NPGY DR + + + + + + N + P R P Sbjct: 869 NHLILQQLQEMANPGYNDRVRDRAQNQFDVTSWWNPGDTEPRRTP 913 >gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans'] gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 824 Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 204/894 (22%), Positives = 359/894 (40%), Gaps = 138/894 (15%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + A+ R L+ E++ +ED IV+ L + LS++ER + AG A Sbjct: 1 MRQECIQAITAASKRTLTSAEIQGIEDRIVKNMRHLARNDPTSWRSLSESERMQRAGHMA 60 Query: 55 EEDFQKELI---RSVNDAIDEAYKRHQLRSDLDRVQAGVYGKS---QALFNKLFFKA-GS 107 E ++E R V I R LD AG GK +AL + F A G Sbjct: 61 AEALEREATLKKRRVALTI-------AARQRLDNFIAGYKGKGGKLEALNRTIAFHADGK 113 Query: 108 AE-VPLEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQAS 165 A + +E + KA LS+ +E ++ + + DKQ D+ EM+G+ T N +A Sbjct: 114 APFLSVESRTKATRDYALSQLDELFSAIDPRFFQLFEDKQGISDLVYEMRGQDTGNVRAK 173 Query: 166 RLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLS 224 + + + L + ++AG D E+ +PQ S++K+ + D+V ++ LD + Sbjct: 174 KGAEAWKNVSELLRRRFNDAGGDIGHLEDWGMPQHHSMEKVGKATQSDWVGFVMGKLDRN 233 Query: 225 RYKDIDGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDS 281 +Y +G +S ++A F+G + D S R ER HFKD+ Sbjct: 234 KYVKENGELMSDKDVADFLGHAYKTIATGGMNKLGDSGRRLSGARANRGNAERQIHFKDA 293 Query: 282 QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN--DQE 339 + ++ Y + FG ++ IL + L +SKDI + GPN D + ++ + A D+ Sbjct: 294 EGYLAYQQRFG-EKSMWDILVNHLDGMSKDIALVETYGPNPDQVFRSLLDELAAKTADET 352 Query: 340 ASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQ 399 S K+ KL+ + E + + + + N A W +R+ AS LG Sbjct: 353 PSRTGKI-------KKLKNKTEDLYNF--IAGKTQPIANPHIARWADHVRNWLVASRLGS 403 Query: 400 HPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPL---------------KERMELLSDV 444 I +L ++G + ++N +P+ K+ + L Sbjct: 404 ALISSLSDNGTMY------------LTAKVNNLPMAQLLRNQLAAMNPANKDEIRLARGA 451 Query: 445 GLYAEGVVAH----GRNMMEGSDAFQIGHKL--HSKMHKWSGAEYLDKKRISSHALIVYN 498 GL E ++ + M S + + + + S + WS A KR ++ + + Sbjct: 452 GLAMETLLGSVNRWATDNMGPSPSRWVANAVMRASGLSAWSDAH----KR--AYGVTMMG 505 Query: 499 QIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPST 558 IG + +A + ++ D D ++K +K +SS D ++ Sbjct: 506 GIGNLVRKHADI-----------------AKIADEDARILK-SKGISSQDWKIW------ 541 Query: 559 IKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQEL-QQQLADLERKEINILKDKV 617 K A+ D N+ L+PE + ++LA L E +V Sbjct: 542 ----KLAEQEDWGN------------GNTTMLTPESIMRIPNEKLAALGNAE------RV 579 Query: 618 SNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFL 677 + +L V V A+ T + + +RG GE +R F + P + + Sbjct: 580 KFEAMRKLLGAVSEEVDMAVVTPGARERMVTGAAMQRGDWRGELVRSVFLFKSFPIAVMM 639 Query: 678 NILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDP------SLPEV 731 S + MP A + A+ + G I ++ G +P + Sbjct: 640 R--HWSRALNMPSAGGRAAYLA--AFLASTTVLGAMSQQISEVIAGRNPRDITGDKALQF 695 Query: 732 IYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTN----LTSSAVELATKDN 787 + L G Y D L ++ A+ +LGPV +V + L + Sbjct: 696 WVNAFLKGGGAGLYGDFLLSDHTRYGSGALASMLGPVAGVVDDAIKLLQGIPLNAVEGKP 755 Query: 788 ENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 E + + K + +P N+WY K FDH++ NQ+ E +PGYL R + + +K+ Sbjct: 756 EQTGGDLVKFAKGMIPGQNLWYTKAVFDHMVFNQLQEIFSPGYLRRMEKRSRKE 809 >gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] Length = 924 Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 221/951 (23%), Positives = 384/951 (40%), Gaps = 128/951 (13%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYV---SLDGK-GLSKAERYRLAGLK 53 MK CI + GR+ E++ +ED I VR + +GK G+ AE YR A Sbjct: 1 MKQACIDAVANTLGRQPKADEIKNIEDRIKDAVRVIARRNAREGKTGIPDAETYRQAAEL 60 Query: 54 AEED-----FQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSA 108 A F+K R +AI A R L + + Q +F+ + G Sbjct: 61 AAAQAVHAVFKKRQ-RVAQNAIAIAKVRDTLNKAIPENEQTPIALQQFIFSG---RRGRD 116 Query: 109 EVP---------------------LEMKIKAAETKVLSKFNEYAEVGSKNLG--FTLDKQ 145 + P L ++ AA V F + +G + L D++ Sbjct: 117 KQPDINVVSAEEMATGAYQDWTRQLSAELTAAGDDVQKFFYQSQALGEQRLRNLLPFDRE 176 Query: 146 FG----LDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPM 200 L + E+ G+ T N A ++ K + + + +++G D ++ +P Sbjct: 177 ASRSGQLQILKEIYGEDTGNPAAKKIAKVWGDVTSRARQEMNDSGFDIGLRDDWHLPYVD 236 Query: 201 SVDKLRATKKDDFVRSM---------------------LDWLD-------LSRYKDIDGT 232 + +RA +D+++ S+ W+D S+Y ++DG+ Sbjct: 237 DAELIRAAGRDEWLSSLPLNERAAAIAAGRQPPQDFARQAWVDDVWNTQDRSQYVNLDGS 296 Query: 233 PLSRSEIASFVGEVFAERVRSTSFK-DPSIPSSEVGVKREFE--RVFHFKDSQAHMDYME 289 P++ E + ++ +V + K DP G+K RV FKD+++H YME Sbjct: 297 PMNDIEYRQALEAIYETKVTEGANKIDPGAFMGSGGIKNRGSQSRVMAFKDAKSHFSYME 356 Query: 290 HFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKD 349 + V ++ S L S S+D+ + + GP+A S K ++ Q + G + Sbjct: 357 RY-TQQPVVGVMMSHLQSSSRDLGVVKAFGPDAASNFKLLMDQIYQRATSTTGGGHDIGT 415 Query: 350 WLGRNKLEVRQ-EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLED 408 + +L R +M + V G GLR+ ++MLG A D Sbjct: 416 MNDQRQLVERMFNSMAGLNGVASSSVFSSAVG------GLRNLMTSAMLGTSVFTAA-SD 468 Query: 409 GFISRQMLSRVGIDKEAIQRINKMPLK-----ERMELLSDVGLYAEGVVAHGRNMMEGSD 463 I R +G D+ + R++ L+ + +++GL + A M Sbjct: 469 QAIMRANAQALGFDRNGM-RLSANTLRNLFNGDAKRANAELGLLVDAHAAVVSKMGGFDL 527 Query: 464 AFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSI 523 + I K KWSG +D+ ++ L+++ IG ++ Y SL L R + Sbjct: 528 SRGITGWFAEKTLKWSGLIAMDRANKAAFGLLMFKNIGELSRKYKSLDALTGSDRTVLAN 587 Query: 524 KAFFKQLDDTDFTVIKRAKAMS-SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR- 581 K + + D+ ++ A+ +PDG+ TP I ++ D +R++ ++D+I R Sbjct: 588 KGWTPE----DWAIMSAAELRPLTPDGH-KGMTPDAIYDVPDETVRNI--LADRIEKVRV 640 Query: 582 ---KKLKNSKTLSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAM 637 + L ++ +R+ L+Q A++E+ ++++ + L L + A+ Sbjct: 641 GSDQALAALGDMTDAKRKTLKQAFDAEVEQTISRMVRNARAEAAQHL-LGITHGEMTSAV 699 Query: 638 HTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTP-TGMFLNILDLSNSAKMPKGASMAL 696 T+ GL + R T +G+ L+ F F TTP GM + L + MP A Sbjct: 700 TTAT------GLDAFARDT-SGDLLKSFMLFKTTPMAGMRQFVTRLQDLETMPAVKFFA- 751 Query: 697 NHVWIQYSATMALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLV 753 Y A LAG+ + ALL G DP + P+ L G+ Y D L + Sbjct: 752 -----AYVAGTTLAGMFANQMNALLSGNDPLDMTKPQTWLQALLKGGSFGIYGDFLFQDH 806 Query: 754 SKGDRAAIGGLLGPVPSMV-----TNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMW 808 ++ + G L GPV T LT+S +A ++ + +A K R PF N+W Sbjct: 807 TQYGSSIAGILGGPVLGFAEQLSKTVLTNSQKAMAGEETTFT-ADALKTARMITPFANLW 865 Query: 809 YLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 Y K +HLIL Q+ E NPGY R + + ++ + E P R P Sbjct: 866 YTKAITNHLILQQLQEMANPGYNARVRDRAMREFNTTSWWEPGEETPRRAP 916 >gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15] gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15] Length = 918 Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 207/926 (22%), Positives = 376/926 (40%), Gaps = 119/926 (12%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGK-GLSKAERYRLAG-- 51 MK C++ + + GR+ EL+ +ED I A + +GK G+ A+ Y A Sbjct: 1 MKQACVEAIAQTLGRQPKADELKGIEDRIKEAVRQVHKKNAKEGKTGIPDAQTYMEAADL 60 Query: 52 --LKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQ-----------AG---VYGKSQ 95 + D K+ R +AI + L +++ Q AG GK Sbjct: 61 VRQRVVHDVYKKRQRVAQNAIAISRVTDTLDANIPPEQQTPANLQQFIFAGRRTTDGKDI 120 Query: 96 ALFNKLFFKAGSAE---VPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFG----L 148 A+ + G+ + L ++ A V F + +G + D+Q Sbjct: 121 AVTSAEELSTGAYQDWSRQLSAELLKAGDDVRKFFEQSKALGEQRFRSLFDQQAAKSAQF 180 Query: 149 DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRA 207 + E+ G+ T N QA ++ + + + + ++ G D ++ +P D +R Sbjct: 181 QILKELYGEDTGNPQAKKIAQVWNDVTSRARQEMNDNGFDIGLRDDWHLPYVDDADFIRN 240 Query: 208 TKKDDFVRSM---------------------LDWL-------DLSRYKDIDGTPLSRSEI 239 +D+++ S+ W+ D S Y + DG+P++ E Sbjct: 241 AGRDEWLASLPAAERAKAQLSGRQPPIEFARQAWVDDVYNTQDRSNYVNPDGSPMNDIEY 300 Query: 240 ASFVGEVFAERVRSTSFK-DPSIPSSEVGVKREFE--RVFHFKDSQAHMDYMEHFGVSTN 296 + +F + + K DP G+K RV FKD+Q+H YME + Sbjct: 301 RQALEAIFETKATDGANKIDPGAFMGTGGIKNRGSQNRVMAFKDAQSHFAYMERY-TQQP 359 Query: 297 VNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKL 356 V ++ S L S S+D+ + + GP+A ++ + Q A G K + + Sbjct: 360 VAGVMMSHLQSSSRDLGVVKAFGPDAARNFSLVLDRVY---QRAVTGGKAV------GHM 410 Query: 357 EVRQEAMLQMWEVMR-YGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQM 415 ++ + +M+ M ++ + + + GLR+ ++MLG + A D I R Sbjct: 411 NEERKMVERMFNSMAGLNGAATSSVFTSAVGGLRNLMTSAMLGTSVLTA-TSDQAIMRAN 469 Query: 416 LSRVGIDKEAIQ----RINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKL 471 +G ++ ++ I + + +++GL + A M + I Sbjct: 470 AQALGFTRDGMRLSANTIKNLFSGDAKRANAELGLLVDSHAAVVSKMGGFDLSRGITGWF 529 Query: 472 HSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLD 531 K KWSG +D+ ++ L++Y IG +T + +L D+K D +I A K Sbjct: 530 AEKTLKWSGLIAMDRANKAAFGLLMYKNIGELTRKFKTLDDVKGS---DKTILA-NKGWS 585 Query: 532 DTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR----KKLKNS 587 + D+ ++ A+ TP I + D + + M+D+IA R + L Sbjct: 586 NEDWAIMAAAELQPMTTAGHMGMTPDAIYAVPDNVITGI--MADRIAQVRAGSEEVLAAL 643 Query: 588 KTLSPEQRQELQQQL-ADLERKEINILKD---KVSNKMHALVLDNVQTSVRGAMHTSLFD 643 L PE+ + ++Q A+ E+ ++++ + + K+ + + ++V A Sbjct: 644 GDLPPERLKRMRQAFDAEAEQTITRMVRNARVEAAQKLLGITHGEMTSAVTTAT------ 697 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA-KMPKGASMALNHVWIQ 702 GL TY R AG+ ++ F F TTP F +++ +N +P +A Sbjct: 698 ----GLDTYARDD-AGQLIKSFMLFKTTPFAGFRQLVNRANDLDTVPAIKFLA------S 746 Query: 703 YSATMALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDRA 759 Y A LAG+ + +LL G DP + P L G+ Y D L + ++ + Sbjct: 747 YIAGTTLAGMFANQMNSLLTGNDPLDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSS 806 Query: 760 AIGGLLGPVPSMVTNLTSSAV---ELATKDNENS-KVNATKAIRKTLPFMNMWYLKNSFD 815 + GPV S LT + + A + E S +A K R PF N+WY K + Sbjct: 807 IAATIGGPVLSFAEQLTKLLITNPQKALQGEETSFGADALKTARMITPFANLWYAKAITN 866 Query: 816 HLILNQILEELNPGYLDRQQSKKKKK 841 HLIL Q+ E NPGY DR + + +++ Sbjct: 867 HLILQQLQEMANPGYNDRVRDRAQRE 892 >gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str. E2348/69] gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 824 Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 103/346 (29%), Positives = 167/346 (48%), Gaps = 15/346 (4%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + L+ AER R AG A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLNDAERLRRAGQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E+ R +L + ++ Q G GK AL + F A S + + Sbjct: 61 AEELQREVALKKRRVALTIAARQRLDNFINSYQ-GADGKLGALNRTIAFSADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNEYAE-VGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E E V + G D+ D+ EM+G+KT N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEVFEAVDPRFFGLFEDEAGVRDLVFEMRGQKTGNAKAMKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 GEVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDTELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQT 333 + +G ++ I+ L +SKDI + GPN D + ++ QT Sbjct: 300 QQMYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQT 344 Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 10/90 (11%) Query: 759 AAIGGLLGPVPSMVTNLTS-------SAVELATKDNENSKVNATKAIRKTLPFMNMWYLK 811 A+ +LGPV +V ++ +AVE ++ V K + P N+WYLK Sbjct: 723 GALASMLGPVAGLVDDVIKIGQGIPLNAVEGKSEQTGGDLVKLGKGL---TPGANIWYLK 779 Query: 812 NSFDHLILNQILEELNPGYLDRQQSKKKKK 841 + DH+I NQ+ E +PGYL + + + KK+ Sbjct: 780 AALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2] gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus] Length = 809 Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 214/899 (23%), Positives = 376/899 (41%), Gaps = 151/899 (16%) Query: 1 MKPECIQVLNKAAGR-ELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQ 59 MK ECI + AAG +LS ++ +E I ++ + +G+ +A A L ++ + Sbjct: 1 MKEECINAVRVAAGELKLSDVDIEHIEHHI---RIAWEQEGVKQAG---FADLPLDQQIK 54 Query: 60 KELIRSVNDAIDEA--YKRHQLRSDLDRVQAGVYGKSQA--LFNKLFFKAGSAEVPLEMK 115 + ++ + ++ YK ++L S G++Q L ++L A S +EM Sbjct: 55 RVSKKAKSSFFSDSDRYKPYELLSTFK-------GENQVTELGHRLAHHATSGG-SIEMS 106 Query: 116 IKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQ 175 IK +KV +F +Y G+K GF D ++ ++G K N +A +L + ET Sbjct: 107 IKGLRSKVFDRFKDYHTYGTKAFGFKNDVNAHTELLRALRGDKGVNPEALKLASIFHETM 166 Query: 176 RELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYK----DIDG 231 L +A G+ + +N PQPM K+ KD+FV L LD + Y+ D +G Sbjct: 167 DFLVKEAKAVGIKFNPRDNYTPQPMDFRKISLVTKDEFVDRTLPRLDWAEYQKRGLDNEG 226 Query: 232 TPLSRSEIASFVGEVFA-------ERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAH 284 + + FV +V+ +V ++ KD S S +G + R H+ Q Sbjct: 227 S------LRQFVEDVYETLASEGRNKVIASGGKDHSGIS--LGGRLRQVRQLHYT-PQGL 277 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS--- 341 ++ M+ FG V +++ +L +DI IARE G NA+ ++ D+E Sbjct: 278 VEAMKEFGSDLTVEGMMSRSFDNLIRDIAIAREFGANANENFNFVLASMFERDREDINSR 337 Query: 342 -AGNKVLKDWLGRNKLEVRQEAMLQM-WEVMRYGETVENTGWANWMAGLRSAAGA----S 395 G+K K NKL+ ++E +QM W+ + G +T M + +A A + Sbjct: 338 LEGDKKTK---ALNKLK-KEEMQVQMDWDGLTMGRKQPST-----MDKIVDSATAWTVIT 388 Query: 396 MLGQHPI---GALLEDGFISRQMLSRVGID-KEAIQRI-NKMPL--KERMELLSDVGLYA 448 LG + ++E F+ Q R+G K I I N P+ KER E + + + Sbjct: 389 KLGSQSLYIPKEIIESAFMGSQ---RMGYTWKTNIANIWNASPVAGKERKEFIKSITVGL 445 Query: 449 EGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYA 508 E + +E + +G + K W G LD + + + + +G T + Sbjct: 446 EHMATGFTRDLETNSQSVLG-VMAKKTMDWQGLTTLDNMMVRGLSATLQDYVGGFTRNFK 504 Query: 509 SLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLR 568 + LK K++ + F I + D +K L AD Sbjct: 505 DMDSLK-------------KKIGEQSFKSIIDEHRFNERD----------LKLLSLADTE 541 Query: 569 DL----ARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHAL 624 ++DK Y ++ ++K L+P ++ D+ R LK ++NK Sbjct: 542 SFKGKGTYLTDKNIY---RIDDTK-LTP-----FLKKGEDIYR-----LKSDLANKYRTF 587 Query: 625 VLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLS 683 + VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ Sbjct: 588 IWSTVQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIP 643 Query: 684 NSAKMPKGASMALNHVWIQYSATMALAGI-GVASIK----ALLRGEDPSL---PEVIYDG 735 +S G S + Y A + GI G I+ L+ G++P L Y Sbjct: 644 SSL---VGVSSQV------YRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIK 694 Query: 736 TLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNAT 795 L NG + + +R + S G +LGP S L + E + + Sbjct: 695 ALING--ITHYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 747 Query: 796 KA------IRKTLPFMNMWYLKNSFDHLILNQILEELNPG-------YLDRQQSKKKKK 841 +A + +PF N+WY + +F+H + N I + LNPG Y RQ+ KK++K Sbjct: 748 QAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRK 806 >gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 918 Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 207/939 (22%), Positives = 375/939 (39%), Gaps = 145/939 (15%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGK-GLSKAERYRLAGLK 53 MK C++ + + GR+ EL+ +ED I A + +GK G+ A+ Y A Sbjct: 1 MKQACVEAIAQTLGRQPKADELKNIEDRIKEAVQHVHRKNAKEGKSGIPDAQTYMDAA-- 58 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQ--------LRSDLDRVQAGVYGKSQALFN--KLFF 103 EL+R + + YK+ Q + D + A + Q N + F Sbjct: 59 -------ELVR--QRVVHDVYKKRQRVAQNAIAISKITDTLDANIPPDQQTPVNLQQFIF 109 Query: 104 KAGSAEVPLEMKIKAAETKVLSKFNEYAE----------------------VGSKNLGFT 141 + ++ + +AE + + +++ +G + Sbjct: 110 AGRRSRDKADISVTSAEELAIGAYQDWSRQLSAELLKAGDDVRKFFEQSRALGEQRFRSV 169 Query: 142 LDKQFG----LDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RI 196 D+Q L + E+ G+ T N A ++ + + + + + ++ G D E+ Sbjct: 170 FDRQAAKSAQLQILKEIYGEDTGNPLAKKIAQIWKDVTGRVRHEMNDNGFDIGLREDWHT 229 Query: 197 PQPMSVDKLRATKKDDFVRSM---------------------LDWL-------DLSRYKD 228 P D +R +++++ S+ W+ D S Y + Sbjct: 230 PYVDDADLIRNAGREEWLASLPVAEQATARLSGRQPPIEFARQKWVDDAYNTQDRSNYVN 289 Query: 229 IDGTPLSRSEIASFVGEVFAERVRSTSFK-DPSIPSSEVGVKREF--ERVFHFKDSQAHM 285 DG+ ++ E + +F + + K +P G+K RV FKD+Q+H Sbjct: 290 PDGSIMNDVEYRQALEAIFETKATDGANKIEPGTFMGAGGIKSRGSQHRVMAFKDAQSHF 349 Query: 286 DYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNAD---SFVKQMIVQTIANDQEASA 342 YME + V ++ S L S S+D+ + + GP+A+ S V I + A Sbjct: 350 AYMERYTQQPLVG-VMMSHLQSSSRDLGVVKAFGPDAERNFSLVLDRIY------KRAVT 402 Query: 343 GNKVLKDWLGRNKLEVRQEAML--QMWEVMR-YGETVENTGWANWMAGLRSAAGASMLGQ 399 G G+ K E+ EA L +M+ M ++ +++ + GLR+ ++MLG Sbjct: 403 G--------GKRKKEMEDEAKLVARMFNSMAGLNGVASSSVFSSAVGGLRNLMTSAMLGT 454 Query: 400 HPIGALLEDGFISRQMLSRVGIDKE----AIQRINKMPLKERMELLSDVGLYAEGVVAHG 455 + A D I R +G + ++ I + + + +++GL + A Sbjct: 455 SVLTA-TSDQAIMRANAQALGFTRGGMRLSVNTIKNLFSGDAKKANAELGLLVDSHAAVV 513 Query: 456 RNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKA 515 M + I K KWSG +D+ +S L++Y IG +T + +L D+K Sbjct: 514 SKMGGFDLSRGITGWFAEKTLKWSGLIAMDRANKASFGLLMYKNIGELTRKFKTLDDMKG 573 Query: 516 DPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSD 575 D +I A K + D+ ++ A+ TP I + D + D+ M+D Sbjct: 574 ---TDKTILA-NKGWSNEDWAIMAAAELRPMTTAGHMGMTPDAIYAVPDNVIADI--MAD 627 Query: 576 KIAYHR----KKLKNSKTLSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQ 630 +I R K L L PE+ + +++ A+ E+ ++++ + L L Sbjct: 628 RITRIRAGSEKALAALGDLPPERLKRMKEAFDAEAEQTITRMIRNARAEAAQKL-LGITH 686 Query: 631 TSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA-KMP 689 + A+ T+ G+ TY R AGE ++ F F TTP F +++ + +P Sbjct: 687 GEMTNAVTTA------TGIDTYARDD-AGELMKSFMLFKTTPFAGFRQLVNRTRDLDTVP 739 Query: 690 KGASMALNHVWIQYSATMALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYM 746 +A Y LAG+ + +LL G DP + P L G+ Y Sbjct: 740 AIKFLA------SYIGGTTLAGMFAIQMNSLLNGNDPLDMTKPTTWVQALLKGGSFGIYG 793 Query: 747 DRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV---ELATKDNENS-KVNATKAIRKTL 802 D + + ++ + + GPV S LT + + A + E S +A K R Sbjct: 794 DFIFQDHTQYGSSIGATMGGPVLSFAEQLTKLLITNPQKALQGEETSFGADALKTARMIT 853 Query: 803 PFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 PF N+WY K +HLIL Q+ E NPGY DR + + +++ Sbjct: 854 PFANLWYAKAITNHLILQQLQEMANPGYNDRVRDRAQRE 892 >gi|332160979|ref|YP_004297556.1| hypothetical protein YE105_C1357 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665209|gb|ADZ41853.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862135|emb|CBX72299.1| hypothetical protein YEW_AK02360 [Yersinia enterocolitica W22703] Length = 841 Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 213/921 (23%), Positives = 373/921 (40%), Gaps = 151/921 (16%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL--DGKG---LSKAERYRLAGLKAE 55 M+ ECIQ + A GR +++ E++ +E+ I + + L D G +SKA+R R Sbjct: 1 MRAECIQAVVNAIGRSITQAEVKGIENRINQHHKRLAQDTPGWMAMSKADRLR------- 53 Query: 56 EDFQKELIRSVNDAIDEAYKRHQLRSDL-----DRVQAGVYGKSQALFNKL----FFKAG 106 E +S D I K + R+ L DRV+ V + N L F + Sbjct: 54 -----EAAKSAADEITREAKLKKWRTALTILAHDRVKNYVESSTDTPVNALGRLIAFDSD 108 Query: 107 --SAEVPLEMKIKAAETKVLSKFNEYAEVG-SKNLGFTLDKQFGLDVFDEMKGKKTQNEQ 163 S + +E + KA S+ + K L D + V E+ G+ + N Sbjct: 109 QKSGVLSVESQAKAIRDIAYSQMLTLIDTTKGKFLSLLSDPESSKAVIKELHGEHSGNAA 168 Query: 164 ASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATK-KDDFVRSMLDWLD 222 A + K++ + L + + +G E+ P S +L+ K ++ +V + W D Sbjct: 169 AKQSAKEFKDVAEFLRQRFNNSGGAIGRLES-WAMPRSHSQLKVAKNREAWVDDHVKWAD 227 Query: 223 LSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREF-----ERVFH 277 Y + DG+ +S +++ F A R +T + P +G R H Sbjct: 228 RRSYVNEDGSRMSDAQLREFF--THAARTIATGGINKVEPGRFIGGSLRANHGSESRSIH 285 Query: 278 FKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNAD-SFVKQMIV--QTI 334 +KD+ + + + +G ++ +LT + L++DI + LGPN+D F QM + Q++ Sbjct: 286 YKDADSFILAQQKYG-DKDLLALLTGHIDRLARDIALTETLGPNSDLQFRTQMDMAQQSM 344 Query: 335 ANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGW-ANWMAGLRSAAG 393 N + A K+E + ++++ + + T W RS Sbjct: 345 INAEPAKF-----------KKIESEMLRVERLYKDVAGQNDIPETPWLKEAFDTYRSINV 393 Query: 394 ASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPL----KERMELLS------- 442 AS LG I A+ + G + M++ ++N +P+ + ++LL+ Sbjct: 394 ASKLGSAAITAITDQGNL---MVT---------AKVNNLPVMQVFAQELKLLNPADSASR 441 Query: 443 --------DVGLYAEGVVAHGRNMMEGSDAFQIG------HKLHSKMHKWSGAEYLDKKR 488 + Y G+ G + GS G K+ + + SG + Sbjct: 442 EAARRAGLGINYYLNGLQRFGAETL-GSAGDTSGALSSSAQKIAGFVLRASGLNAMTAAG 500 Query: 489 ISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPD 548 + +++ + IG MT +A+L L A R + + + D+ V ++A Sbjct: 501 NQAFGMVMLDTIGGMTRKHANLAHLNAKDR----TRLQGMGVTEADWAVWRKA------- 549 Query: 549 GYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERK 608 D+ DL+ M D + H + L LS L +Q A K Sbjct: 550 -----------------DVSDLSGMGDTVLTHNEIL----ALSDSALTPLAKQFATTPAK 588 Query: 609 EINILKDKVSNKMHALVLDNVQTSV--RGAMHTSLFDRQRLGL-LTYKRGTRAGEALRMF 665 L++ + K+ +V D Q +V GA R+R+ L RGT +GE R Sbjct: 589 ----LRNTAATKLLGVVQDEAQMAVVEPGA-------RERVTLHRGTTRGTWSGEIWRSA 637 Query: 666 QQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGED 725 QF + P M ++ ++ A GA I ++T+ L G+ + + + G D Sbjct: 638 TQFKSFPIAM---VMRHAHRALAQDGAGKGTYAAAIIAASTL-LGGMAI-QLNEIASGRD 692 Query: 726 P---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDR---AAIGG-LLGPVPSMVTNLTSS 778 P + PE L GAL Y D L ++G A+IGG L G + S+V + Sbjct: 693 PRDMTKPEFWGGAFLKGGALGLYGDFLLTNQTQGGNSFIASIGGPLAGDIESVVKMTQGA 752 Query: 779 AVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDR-QQSK 837 A + + ++ N + I+ P N+WY K + DH+I + I E+ +PGYL R +Q Sbjct: 753 AFKAIDGKDPHTAANVVRFIKGHTPGANLWYAKAALDHMIFHDIQEQFSPGYLSRMRQRA 812 Query: 838 KKKKGIELFQNMDEGLPHRLP 858 +K+ + + E P R P Sbjct: 813 QKEYDQQFWWAPGETAPDRAP 833 >gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14] Length = 824 Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 145/590 (24%), Positives = 262/590 (44%), Gaps = 35/590 (5%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S V R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGVRANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKV 346 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 347 LKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALL 406 + +L + E + + + V N A W +R+ AS LG + + Sbjct: 358 SVE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFS 410 Query: 407 EDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGS 462 + G ++S + ++ + +++ ++ M R EL GL E ++ + Sbjct: 411 DLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDN 469 Query: 463 DAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRLDP 521 + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 470 MGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILK 529 Query: 522 SIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA 571 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 530 S-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG 574 Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1] gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252] Length = 824 Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 149/592 (25%), Positives = 260/592 (43%), Gaps = 39/592 (6%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRMLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARNGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E++SF+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSSFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGN 344 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A S Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KV +L E + + + V N A W +R+ AS LG + + Sbjct: 358 KV-------ERLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSS 408 Query: 405 LLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMME 460 + G ++S + ++ + +++ ++ M R EL GL E ++ Sbjct: 409 FSDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAM 467 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRL 519 + + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 468 DNMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRI 527 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA 571 S K + DTD++V K AK +G TP +I + D+ ++ L Sbjct: 528 LKS-----KGITDTDWSVWKLAKQEDWGNGNNTMLTPESIMRIPDSAVKHLG 574 Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2] Length = 824 Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 149/592 (25%), Positives = 260/592 (43%), Gaps = 39/592 (6%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRMLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQTTGNAKARNGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGRLDRKYYIRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E++SF+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSSFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGN 344 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A S Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KV +L E + + + V N A W +R+ AS LG + + Sbjct: 358 KV-------ERLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSS 408 Query: 405 LLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMME 460 + G ++S + ++ + +++ ++ M R EL GL E ++ Sbjct: 409 FSDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAM 467 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRL 519 + + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 468 DNMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRI 527 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA 571 S K + DTD++V K AK +G TP +I + D+ ++ L Sbjct: 528 LKS-----KGITDTDWSVWKLAKQEDWGNGNNTMLTPESIMRIPDSAVKHLG 574 Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v] Length = 824 Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 147/592 (24%), Positives = 261/592 (44%), Gaps = 39/592 (6%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA--SAGN 344 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A S Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTG 357 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KV +L E + + + V N A W +R+ AS LG + + Sbjct: 358 KV-------ERLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSS 408 Query: 405 LLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMME 460 + G ++S + ++ + +++ ++ M R EL GL E ++ Sbjct: 409 FSDLGTMYLSAK-VTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAM 467 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRL 519 + + + + + SG ++ + + +G + L+ L +D R+ Sbjct: 468 DNMGPSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRI 527 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA 571 S K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 528 LKS-----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG 574 Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] Length = 838 Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 203/907 (22%), Positives = 366/907 (40%), Gaps = 126/907 (13%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL----DGKGLSKAERYRLAGLKAEE 56 MKP CI + +A GR +S EL+ +ED I R L DG L+ +R+ A +A E Sbjct: 1 MKPACIDAVIEAVGRPMSDAELKGIEDRIGRELRRLGNGPDGLRLTGEQRFFEAARRARE 60 Query: 57 DFQKEL-IRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE---VPL 112 F E +++ DA+ A +H + +++ AG G A +L G A+ + + Sbjct: 61 SFLGEQELKARRDAL--AVLKH---AQVEQALAGFPGDKIAGLRRLLAFHGDAKGSTLSV 115 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVF-DEMKGKKTQNEQASRLVKQY 171 E K +A E + E + + G+ EM G+ + +A ++ Sbjct: 116 ESKAEAIEADAFRQMLGTLEATNPKFFGLFESPEGVRALVREMFGEDSGVREAKEGAAEF 175 Query: 172 FETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 + EL + ++AG + E+ +P S +K+ A + +V L+ RY++ D Sbjct: 176 KKVADELLGRFNDAGGKIRPREDWGLPHHHSQNKIAAAGEAVWVEKTFPLLNRDRYRNED 235 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVK-----REFERVFHFKDSQAHM 285 G+ ++ S++ +F+ E + + +T + P + G R H++ + ++ Sbjct: 236 GSRMNDSQVLAFLRESY--QTLATGGVNTLEPGAGGGETMRANLHAAAREIHYRSADDYL 293 Query: 286 DYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQM--IVQTIANDQEASAG 343 Y + FG + +LT + L+ I + GPN D K + Q + + Sbjct: 294 AYQKDFG-ERGLYDVLTGHVRGLADSIAMVETFGPNPDHAFKYFRDLAQREMTVADPTKH 352 Query: 344 NKVLKDWLGRNKL---------EVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGA 394 K+ K +G + L V E + Q ++ +R W+ A Sbjct: 353 GKIAKQLVGLDNLYNYVSGKTLPVASEWLAQGFDSLR-----------KWLV-------A 394 Query: 395 SMLGQHPIGALLEDGFISRQMLSRVG-IDKEAIQR--INKMPLKERME--LLSDVGLYAE 449 S LG I +L ++ + Q+ +RV ID + R + + +ME + GL + Sbjct: 395 SRLGSAFISSLPDEATM--QLTARVNNIDGMQVFRNELAALNPANQMEKRMAQRAGLALQ 452 Query: 450 GVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYAS 509 ++ + + + K+ + + SG + + R + + + + +G +T Sbjct: 453 TMIGSLNRFGDENMRNTLATKMATFTMRASGLNAITEARRRAFGVTMMSSLGHLT----- 507 Query: 510 LKDLKADPRLDPSIKAFF--KQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADL 567 +D +A +LDP K + D D+ V KRA+ G TP I + D L Sbjct: 508 -RDAEAPSKLDPMDHRILLSKGITDADWQVWKRAELEDWGGGNGTMLTPEAIYRIPDEAL 566 Query: 568 RDLARMSDKIAYHRKKLKNSKTLSPEQ-RQELQQQLADLERKEINILKDKVSNKMHALVL 626 + + +P+Q R++ +L + +E N+ + ++ A + Sbjct: 567 VGIGNLD---------------ANPQQLRRDAATRLLGVVLEEQNMAVVEPGSRERAALY 611 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA 686 N+Q RGT GE R F T P M + + S Sbjct: 612 SNLQ-----------------------RGTWKGELTRSVFLFKTMPIAMLMRHWERGMSG 648 Query: 687 KMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVIYDGT---------- 736 P S A + S T + G+ I LL+G DP + ++G Sbjct: 649 --PDARSKAGYIGALMVSTT--VMGMLALQIDELLKGRDP-VNMNPFEGKAGARNWVRAF 703 Query: 737 LANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVT---NLT-SSAVELATKDNENSKV 792 L G+L Y D L ++ I LGPV V LT + V+L + ++ Sbjct: 704 LKGGSLGIYGDFLFSEQNQHGGGPIASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGA 763 Query: 793 NATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDE 851 K + P N+WYLK + +HLI NQ+ E ++PGYL R +S+ +++ G + + + Sbjct: 764 ELLKFAKGMTPGANLWYLKAATNHLIFNQLQEMVSPGYLARVKSRAQREFGTTEWWDSRQ 823 Query: 852 GLPHRLP 858 +P R P Sbjct: 824 AVPDRAP 830 >gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] Length = 824 Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 103/355 (29%), Positives = 172/355 (48%), Gaps = 17/355 (4%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y Sbjct: 179 WREVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRA 238 Query: 230 DGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 DG ++ +E+++F+GE + D + S R R HFKD+ +++ Sbjct: 239 DGQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQ 298 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS 341 Y + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 299 YQQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATAN 352 Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|85059662|ref|YP_455364.1| hypothetical protein SG1684 [Sodalis glossinidius str. 'morsitans'] gi|84780182|dbj|BAE74959.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 507 Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 117/430 (27%), Positives = 196/430 (45%), Gaps = 38/430 (8%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + A+ R L+ E++ +ED IV+ L + LS++ER + AG A Sbjct: 6 MRQECIQAITAASKRTLTSAEIQGIEDRIVKNMRHLARNDPTSWRSLSESERMQRAGHMA 65 Query: 55 EEDFQKELI---RSVNDAIDEAYKRHQLRSDLDRVQAGVYGKS---QALFNKLFFKA-GS 107 E ++E R V I R LD AG GK +AL K+ F A G Sbjct: 66 AEALEREATLKKRRVALTI-------AARQRLDNFIAGYKGKGGKLEALNRKIAFHADGK 118 Query: 108 AE-VPLEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQAS 165 A + +E + KA LS+ +E ++ + + DKQ+ D+ EM+G+ T N +A Sbjct: 119 APFLSVESRTKATRDYALSQLDELFSAIDPRFFQLFEDKQWIRDLVYEMRGQDTGNVRAK 178 Query: 166 RLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLS 224 + + + L + ++AG D E+ +PQ S++K+ + D+V ++ LD + Sbjct: 179 KGAEAWKNVSELLRRRFNDAGGDIGHLEDWGMPQYHSMEKVGKATQSDWVGFVIGKLDRN 238 Query: 225 RYKDIDGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDS 281 +Y +G +S ++A F+G + D S R ER HFKD+ Sbjct: 239 KYVKENGELMSDKDVADFLGHAYKTIATGGMNKLGDSGRRLSGARANRGNAERQIHFKDA 298 Query: 282 QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN--DQE 339 + ++ Y + FG ++ IL + L +SKDI + GPN D + ++ + A D+ Sbjct: 299 EGYIAYQQRFG-EKSMWDILVNHLDGISKDIALVETYGPNPDHVFRSLLDELAAKTADET 357 Query: 340 ASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQ 399 S K+ KL+ + E + + + V N A W +R+ AS LG Sbjct: 358 PSRTGKI-------KKLKNKTEDLYNF--IAGKTQPVANPHIARWADHVRNWLVASRLGS 408 Query: 400 HPIGALLEDG 409 I +L ++G Sbjct: 409 ALISSLSDNG 418 >gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 810 Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 201/902 (22%), Positives = 352/902 (39%), Gaps = 176/902 (19%) Query: 1 MKPECIQVLNKAAGR-ELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQ 59 M PECI+ + K AG +L ++L ++E + + L+GL+ E F+ Sbjct: 1 MHPECIERVKKLAGEWKLEPEDLDQIE----------------RVSKQALSGLELNESFK 44 Query: 60 KELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQAL-----------FNKL-FFKAGS 107 +++ + + K H L ++ G + S+ L N L F Sbjct: 45 N--LKTADKVKALSEKAHLLL-----LENGAFAMSETLGGVGRAKHGEQLNTLKNFLRYE 97 Query: 108 AEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRL 167 +E +IK + F+++ ++GSKNLGF+ D + ++G +T + Q ++ Sbjct: 98 TTASIESRIKGEQANARKAFHDFEDLGSKNLGFSADPITNEKITKALRGVETDDPQVNKF 157 Query: 168 VKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYK 227 + Y + + + +QA + GL + PQP K+RA K ++ +++ W+D+ Y Sbjct: 158 GRAYRKIRDRVTAQAEDMGLLHPLDNWGSPQPDDALKIRAKGKKAWIETIMPWVDVEAY- 216 Query: 228 DIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPS--------SEVGVKREFERVFHFK 279 D L + F+G V+ +S+ ++ + S + VG R+ R Sbjct: 217 --DKKGLYGKGLTEFLGHVW--DTKSSEGRNKILASGGAEQAGKASVGGSRKQPRHLFLL 272 Query: 280 DSQAHMDYMEHFG-VSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAND- 337 D + + DY FG N ++ + L +DI IAR G NAD+ + +I Q ND Sbjct: 273 D-EHYSDYNAAFGKTGLNAEDLVRMTIDPLIRDIEIARTFGSNADNNFRWVITQAYENDL 331 Query: 338 QEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASML 397 + A + V K +G + +EA + +W+ + + + +N LR Sbjct: 332 KSAKTASDVTK--MG----GLYKEANI-LWDRLTISSEMLDHELSNAQINLRE------- 377 Query: 398 GQHPIGALLEDGFISRQMLSRVGID-----KEAIQRI------NKMP-----LKERMELL 441 L+ GF + Q++ G+ E I + MP L E L Sbjct: 378 --------LKSGFSTFQVVKSFGMQIFSALPETINCVVMGSHRQGMPFWSRALPEFKRHL 429 Query: 442 SDVGLYA---------EGVVAHGRNMMEGSDAFQIGHK-LHSKMHKWSGAEYLDKKRISS 491 ++ A E + N F G K L K KW G + LD+ + Sbjct: 430 TNANYKASIRAFAPAGEMAITGMMNEFHNQSKFVSGMKVLAEKTVKWQGLKALDRFQRDL 489 Query: 492 HALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYL 551 + +G +T + L+D K+ + + T T+IK Sbjct: 490 SFGFTSSWMGEVTRGFKGLEDFKS------------RYGEQTFKTLIKD----------- 526 Query: 552 YARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ----QLADLER 607 Y T S + L +L D R+ L+P+ +E + LA E Sbjct: 527 YGFTQSDMHALSKVEL-DAGRL----------------LTPDSIRECRHPDLVTLARSEN 569 Query: 608 KEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQ 667 K I + +S+KM + Q + RG++ +SL D + T RG G L + Q Sbjct: 570 KSIERMMGDLSSKMSGYIWSQTQDNARGSVGSSLRDTK----YTSSRGGIPG--LSLVTQ 623 Query: 668 FTTTPTGMFLNILDLSNSAKMPKGASMALN--HVWIQYSATMA----LAGIGVASIKALL 721 F TTP M L +PK N W + +A L GI + + L Sbjct: 624 FLTTPISMAEKHL-----WAVPKTLVGGANGMSAWSYRAKFLAFGIVLEGIVANTARKAL 678 Query: 722 RGE---DPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSS 778 G+ D + P+V+ L L + DR + + + PV S V L + Sbjct: 679 TGQELDDFTDPKVL---ALMTARTLTHYDRFFNEYHHDFKDLLHSV--PVASTVIGLGDA 733 Query: 779 AVELA-------TKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYL 831 +E++ + + K + +P N++Y+K +F ++++ + E N GY Sbjct: 734 GLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLKNLFYVKAAFQKMVVDNLCEYFNEGYK 793 Query: 832 DR 833 DR Sbjct: 794 DR 795 >gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 841 Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 195/890 (21%), Positives = 355/890 (39%), Gaps = 122/890 (13%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ LS +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLSAQEATDIEARINETMRNLARKDINNWRNLSDAEKLTEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGK--SQALFNKLFFKAG--SAE 109 D Q++L R A + K+ Q + LD +GK S + +++ G S Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALD------HGKLSSMEVIDRMVAAHGDMSGI 114 Query: 110 VPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRL-- 167 ++ K + + + ++ LG D++ + E G+ T + A ++ Sbjct: 115 QSIDSKARGIASIYRGELVDFYTNIKGGLGVFTDQELVQKIVRERFGENTGDALAKKISD 174 Query: 168 -VKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSR 225 + FET R+ + + G D +N +PQ +++K+ K+ +V +D + Sbjct: 175 KMGDVFETMRD---RFNRNGGDIGKLDNWGLPQTHNLEKIAKAGKEAWVNKAESLIDTRQ 231 Query: 226 YKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIP-------SSEVGVKREFERVFHF 278 Y +G S+ EI S + E + + S + +S+V + RV HF Sbjct: 232 YVHENGDYYSQQEIRSLL-EYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHF 290 Query: 279 KDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQ 338 KD+++ ++Y FG V+ ++ + + LSKDI + LG N + +K ++ D Sbjct: 291 KDAESWLEYQSEFGGMQFVD-LVEAHINGLSKDIAMVENLGSNPKTALKILMDAAAKKDW 349 Query: 339 EASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLG 398 E K K R ++E M++ + G + ++ AN RS A+MLG Sbjct: 350 EGQIPEKTTKRV--RKRIET-------MFDELSGGNSPQSEVLANLGVLYRSMNVAAMLG 400 Query: 399 QHPIGALLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAH 454 I ++ + I++ LS E + ++N +R EL +GL E ++ Sbjct: 401 GTTISSITDQAMIAKTANVHGLSYRKTFGELVDQLNPANKADR-ELAHSLGLATEEMI-- 457 Query: 455 GRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISS-HALIVYNQIG---RMTDTYASL 510 G D + K+ + S R+S +AL +++G + + Y L Sbjct: 458 GSIARWSDDGLTSTYGKSEKLARISSGIASQVMRVSGLNALTAASKVGFTKLLMEKYGRL 517 Query: 511 KDLKADPRLDPSIKAFFKQ--LDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLR 568 KA LD + LD+ + V + A + D + Sbjct: 518 SRSKAWNDLDAQDRELLSNTGLDERAWQVFQLADPV--------------------VDRK 557 Query: 569 DLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDN 628 MS + Y K + P+Q +KD+VS+++ A +LD Sbjct: 558 GNQLMSARSIYEIPDEKLTAFGDPKQ------------------VKDQVSSQLQAHLLDE 599 Query: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKM 688 +V + L R++ + RGT GE +R QF + + + + + Sbjct: 600 QGLAV---VEAGL--REKTLINVGARGTITGEIVRGLAQFKSFSAAFLMRHGSRAFAQEG 654 Query: 689 PKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVIYDGT------------ 736 KG + +++ T+ L G V +K LL G D P+ IYD Sbjct: 655 IKGKAGYAVPLFV----TLTLLGGLVVQLKELLNGND---PQTIYDSNDPKKAGSFFIRS 707 Query: 737 -LANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN-- 793 + G L D L R A + GP+ + T L V T+ NE N Sbjct: 708 AVQGGGLSFLGDILVAGTDTSGRDANSFVAGPLGNDFTALLGLTVGNLTQYNEGKDTNFG 767 Query: 794 --ATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 A K ++ +P N+WY K + + ++ +++ + + PGY ++ K +++ Sbjct: 768 NEAFKFVKGKIPAQNLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQ 817 >gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 841 Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 198/891 (22%), Positives = 360/891 (40%), Gaps = 124/891 (13%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSLDGK------GLSKAERYRLAGLK 53 MK +C Q + KA G++ LS +E ++E I A ++ K LS +E+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLSAQEAIKIESRINEAMRNMARKDIDKWRNLSDSEKLIEASKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLE 113 D Q++L R A ++ + + + LD + + + +++ G Sbjct: 61 VAIDIQEQLKRKHKIAANDILTQSKNLAKLDHTRL----LASEVVDRMVAPHGDMSGIQS 116 Query: 114 MKIKA---AETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 + KA A+ + Y + LG DK+ + E + T + A ++ + Sbjct: 117 ISSKADGIADIYEGELVDFYTNI-KGGLGIFTDKELVHKIVRERFNENTGDPLAKKISNK 175 Query: 171 ---YFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRY 226 FET R+ + + +G D +N +PQ +++K+ K +V +D +Y Sbjct: 176 MGDVFETMRD---RFNRSGGDIGMLDNWGLPQTHNLEKIAKAGKKAWVNKAESLIDTRQY 232 Query: 227 KDIDGTPLSRSEIASFVGEVF-------AERVRSTSFKDPSIPSSEVGVKREFERVFHFK 279 +G S+ EI S + + A ++ + +S+V K RV HFK Sbjct: 233 VHENGDYYSQQEIRSLLEYTYDTLSSDGANKIE-VGRQATGAGTSKVTNKHSESRVLHFK 291 Query: 280 DSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQE 339 D+++ ++Y FG V+ ++ + + LSKDI + LG N + K I++ A+ ++ Sbjct: 292 DAESWLEYQSDFGGMQFVD-LVNAHIKGLSKDIALVENLGSNPKTAFK--ILKNAADKKD 348 Query: 340 ASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQ 399 AG KD N+ +V M++ G + ++ AN RS SMLG Sbjct: 349 REAGRITTKDNPALNRAQV-------MFDEFSGGNSPQSQVLANLGIAYRSMNIFSMLGG 401 Query: 400 HPIGALLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHG 455 + + + I++ LS E I+++N +R EL +GL E ++ G Sbjct: 402 TTVVSTTDQATIAKTAHVHGLSYRKAFGELIRQLNPANKADR-ELAHSLGLATEEML--G 458 Query: 456 RNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRIS-SHALIVYNQIG---RMTDTYASLK 511 D H K+ + S R+S +AL +++G + + Y L Sbjct: 459 SIARWSDDGLTSTHGKSEKLARISSGVASLVMRVSLLNALTAASKVGFTKLLMEKYGRLS 518 Query: 512 DLKADPRLDPSIKAFFKQ--LDDTDFTVIKRAKAMSSPDG--YLYARTPSTIKNLKDADL 567 KA LD + LD+ + V + A+ + G + AR+ I + K A Sbjct: 519 RSKAWGDLDIQDRELLSNTGLDERAWQVFQLAEPVVDRKGNQLMSARSIYEIPDEKLAAF 578 Query: 568 RDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLD 627 D P+Q +KD+V++++ A +LD Sbjct: 579 GD----------------------PKQ------------------VKDQVASQLQAHLLD 598 Query: 628 NVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAK 687 +V + L R++ + RGT GE R QF + + + + + Sbjct: 599 EQGMAV---IEAGL--REKTLINVGARGTITGEIFRGIVQFKSFSAAFLMRHGSRTMAQE 653 Query: 688 MPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVIYDG------------ 735 KG + +++ T L G+ V +K LL G D P+ IYD Sbjct: 654 GLKGKAAYAIPLFVM---TTLLGGL-VVQLKELLNGND---PQTIYDSNDPKKASNFFVR 706 Query: 736 TLANGALLPYM-DRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN- 793 + G L ++ D L R A + GP+ S +L S V T+ NE N Sbjct: 707 SAVQGGGLSFLGDILVAGTDTSGRDAHSFVAGPLGSDFESLLSLTVGNLTQYNEGKDTNF 766 Query: 794 ---ATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 A + +++ +P N+WY K + + ++ ++I + + PGY ++ K ++K Sbjct: 767 GNEAFQFVKRKIPAQNLWYTKAAINRMVFDEIQDFIAPGYREKALRKAEEK 817 >gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] Length = 823 Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 84/344 (24%), Positives = 160/344 (46%), Gaps = 19/344 (5%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ CI+ + A+ R+L+ +E++ +ED I+ + +L + LS++ER + AG A Sbjct: 1 MRTACIEAIQNASKRQLTAREVQNIEDRIISSMRNLARNDPASWRLLSESERLQRAGQMA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 + Q+E R +L ++ Q K +AL + F A S + + Sbjct: 61 ATELQREADLKQRRVALTIAARQRLDEHINNFQG---SKLEALNRTIAFSADGKSNFMSV 117 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGL-DVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E E + Q G+ D+ EMKG+ T+N +A + + Sbjct: 118 ETRAKATINYALSQLQEAFEAVDPKFFQLFEDQNGVRDLIFEMKGQDTRNVRAKKGAAAW 177 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 L + + AG D E+ +PQ S+ ++ +D +V ++ LD ++Y D Sbjct: 178 HNVTGMLRNSFNRAGGDIGHLEDWGLPQSHSMQRVGKVTQDKWVSDVIGKLDRNKYIKED 237 Query: 231 GTPLSRSEIASFVGEVFAERVRS---TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMD 286 G+ ++ +E+ F+ + E + + D I S + R R HFKD++++++ Sbjct: 238 GSVMNDAELKQFLDSAY-ETIATGGLNKINDRPIGVSGMRANRGNASRQIHFKDAESYLE 296 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMI 330 Y + +G ++ I+ + +SKDI + GPN D + ++ Sbjct: 297 YQQLYG-EKSLWDIMVGHIEGISKDIGLIETYGPNPDHVFQSLL 339 Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 4/86 (4%) Query: 760 AIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFD 815 A LLGPV +V + A + E + + K ++ +P N+WY K D Sbjct: 722 AFASLLGPVAGVVDDAIKLAQGIPLNAVEGKPEQTGGDTVKFVKGLIPGQNLWYTKAVLD 781 Query: 816 HLILNQILEELNPGYLDRQQSKKKKK 841 H++ NQ+ E +PGYL R + + KK+ Sbjct: 782 HMVFNQLQEYFSPGYLRRMEKRSKKE 807 >gi|332875212|ref|ZP_08443045.1| hypothetical protein HMPREF0022_02678 [Acinetobacter baumannii 6014059] gi|332736656|gb|EGJ67650.1| hypothetical protein HMPREF0022_02678 [Acinetobacter baumannii 6014059] Length = 841 Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 193/894 (21%), Positives = 355/894 (39%), Gaps = 130/894 (14%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ L+ +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLTAQEATDIEARINETMRNLARKDINNWRNLSDAEKLSEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 D Q++L R A + K+ Q + LD + S + +++ G S Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALDHSKL----SSMEVIDRMVAAHGDMSGIQS 116 Query: 112 LEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRL---V 168 ++ K + + + ++ LG D++ + E G+ T + A ++ + Sbjct: 117 IDSKARGIASIYRGELVDFYTNIKGGLGIFTDQELVQKIVRERFGENTGDALAKKISDKM 176 Query: 169 KQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYK 227 FET R+ + + G D +N +PQ +++K+ K+ +V +D +Y Sbjct: 177 GDVFETMRD---RFNRNGGDIGKLDNWGLPQTHNLEKIAKAGKEAWVNKAESLIDTRQYV 233 Query: 228 DIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIP-------SSEVGVKREFERVFHFKD 280 +G S+ EI S + E + + S + +S+V + RV HFKD Sbjct: 234 HENGDYYSQQEIRSLL-EYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHFKD 292 Query: 281 SQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA 340 +++ ++Y FG V+ ++ + + LSKDI + LG N + +K ++ +A Sbjct: 293 AESWLEYQSEFGGMQFVD-LVEAHINGLSKDIAMVENLGSNPKTALKILM--------DA 343 Query: 341 SAGNKVLKDW---LGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASML 397 +A KDW + NK + ++ M++ G T ++ AN RS ASML Sbjct: 344 AAK----KDWEKGIDENKTQSSRKRAQVMFDEFSGGNTPQSQVLANLGIAYRSMNVASML 399 Query: 398 GQHPIGALLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVA 453 G I +L + I++ LS ++++N +R E +GL E ++ Sbjct: 400 GGTTIASLADQATIAKTAHVHNLSYRKAFGGIVEQLNPANKADR-EFAHGLGLATEEML- 457 Query: 454 HGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISS-HALIVYNQIG---RMTDTYAS 509 G D + K+ + S R+S +AL +++G + + Y Sbjct: 458 -GSIARWSDDGLTSTYGKSEKLARISSGVATQVMRVSFLNALTSASKVGFTKLLMEKYGR 516 Query: 510 LKDLKADPRLDPSIKAFFKQ--LDDTDFTVIKRAKAMSSPDG--YLYARTPSTIKNLKDA 565 L KA LD + LD+ + V + A+ + G + AR+ I + K Sbjct: 517 LSRSKAWNELDVQDRELLSNTGLDERAWQVFQLAEPVVDRKGNQLMSARSIYEIPDEKLT 576 Query: 566 DLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALV 625 D ++ D++A +LQ L D + + Sbjct: 577 AFGDPKQVKDQVA-----------------SQLQAHLLDEQGMAV--------------- 604 Query: 626 LDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNS 685 ++ +R +R + +GT GE + QF + + + + Sbjct: 605 ---IEAGLR----------ERTWMTVGAKGTITGEVFKGLMQFKSFSASFLMRQGSRAMA 651 Query: 686 AKMPKG-ASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVIYDG--------- 735 + KG A+ A I +M L G V ++ +L G D P+ IYD Sbjct: 652 QEGLKGKAAYA-----IPLMVSMTLLGGLVVQLREILNGND---PQTIYDSNDPKKATSF 703 Query: 736 ---TLANGALLPYM-DRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSK 791 +L G LP + D L R A + GP+ S T L V T+ NE Sbjct: 704 FMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSGPLGSDFTALLGLTVGNLTQYNEGKD 763 Query: 792 VN----ATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 N A K ++ +P N+WY K + + + +++ + + PGY ++ K +++ Sbjct: 764 TNFGNEAFKFVKGKIPAQNLWYTKAAINRMFFDEVQDTIAPGYREKALRKAERQ 817 >gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 530 Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 116/501 (23%), Positives = 207/501 (41%), Gaps = 100/501 (19%) Query: 380 GWANWMAGLRSAAGASMLGQHPIGALLEDGF----ISRQMLSRVGIDKEAIQRI-NKMPL 434 G A W A R+ + LG I A + G +S Q S +G E + + + Sbjct: 72 GVAKWSAITRAVGNTAKLGGAVISAAADLGIYGSEMSFQGRSFLGGMYEGFKGLARRKNT 131 Query: 435 KERMELLSDVGLYAEGVV-------AHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKK 487 +++ +L+ +G A+GVV G N+ +G Q ++ + W+ Sbjct: 132 QDKKDLVEGMGFLADGVVYDVSGRHTVGDNLTKGWTRIQRTFFKYNLLSWWTNT------ 185 Query: 488 RISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFK--QLDDTDFTVIKRAKAMS 545 + N + M + YA K+L D +L+ ++ FF +D + VI++ Sbjct: 186 -------LKENSMLGMANYYAKQKNLSFD-KLNKPLQEFFGLYNIDSVKWDVIRKNGMAK 237 Query: 546 SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 + DG + + + + DAD++ + + + L Sbjct: 238 ADDGTEFINI-ANLDQISDADIKKITGIDN-----------------------------L 267 Query: 606 ERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYK--RGTRAGEALR 663 + E+ I KDK + ++LD S+ + D + G++T GT GEA+R Sbjct: 268 SKTELQIEKDKFKYSVSGILLDR---SIYAVIEP---DARVKGIMTQGLLAGTGMGEAIR 321 Query: 664 MFQQFTTTPTGMFLNILDL-----------------SNSAKMPKG----ASMALNHVWIQ 702 QF P + +L + A++ +G A++ + ++ Sbjct: 322 FVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMAALVITSGFMG 381 Query: 703 YSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALLPYMDRLTKLVSKGDRA 759 Y A ++K LL+G++P P + I G L G L Y D L K + + Sbjct: 382 YMAM---------TMKDLLKGKEPRDPTKFKTIMAGFLQGGGLGIYGDVLFK-EQRDAGS 431 Query: 760 AIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLIL 819 I GL+GP P+ V +L + + S A +AI +PF+N++Y+K +FD+LI Sbjct: 432 VIAGLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRAISSNIPFLNLFYIKIAFDYLIG 491 Query: 820 NQILEELNPGYLDRQQSKKKK 840 QI+E +NPG L + + + KK Sbjct: 492 FQIMETVNPGVLKKVERRMKK 512 >gi|303328566|ref|ZP_07359001.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861332|gb|EFL84271.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 855 Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 170/779 (21%), Positives = 286/779 (36%), Gaps = 153/779 (19%) Query: 143 DKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQREL-HSQAHEAGLDYKFFENRIPQPMS 201 DK F VF EM+ + ++ +R + F E + + AG D + PQ Sbjct: 130 DKAFHDSVFREMREPDSTGDKNARAIADIFSRYTEQSRVRLNAAGADIGKLDGWTPQTHD 189 Query: 202 VDKLRA---TKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRS----- 253 KL A + +V ML LDL R DG VG V A R R Sbjct: 190 PYKLMAGGEAGRAKWVDFMLPRLDLER--TFDG-----------VGLVDANRARELLNGV 236 Query: 254 ----TSFKDPSIPSSEVGVKREF------------ERVFHFKDSQAHMDYMEHFGVSTNV 297 T ++P +P G RV HFKD+Q ++Y + +G N+ Sbjct: 237 YDTLTMGRNPHMPGDFTGGGASVPGPRNLASGMGKSRVLHFKDAQGALEYHDAYG-RGNI 295 Query: 298 NTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLE 357 + L ++ + + LGPN +++++ + LKD N + Sbjct: 296 FDAMLRHLEQDARALALMERLGPNPQYTLERLLAHE----------KRALKD----NAVL 341 Query: 358 VRQEAMLQMWE--------VMRYGET----VENTGWANWM---------AGLRSAAGASM 396 +E QM E ++R G E TG +W A LR++ S Sbjct: 342 TPEEKARQMRELDNAFSGGIIRQGRVSAWLAELTGETSWAVHPTLARVGAVLRASQNLSK 401 Query: 397 LGQHPIGALLEDGFISRQMLSRV-------GIDKEAIQRINKMPLKERMELLSDVGLYAE 449 LG + A+ + ++ RV I K Q I KE+ DV Sbjct: 402 LGGASLSAIAD--VFTKAASMRVNGETWPGAIGKSLAQYIQGFSGKEK-----DVARQCG 454 Query: 450 GVVAHGRNMM-----EGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMT 504 + H R + + S + L K+ +WSG ++ ++ + + L + +G ++ Sbjct: 455 AFLDHVRGDIVARWDDASGMPGVLADLQDKLFRWSGLNWITERGKAGYTLWLSEHLGEVS 514 Query: 505 DTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMS--SPDGYLYARTPSTIKNL 562 KA +LD +A Q D + + MS + DG Y TP L Sbjct: 515 G--------KAFDQLDGPRRAML-QYHGVDPERWEAMRKMSHQAEDGKAYF-TPEAAAYL 564 Query: 563 KDADLRDLARMSDKIAYHRKKLKNSKTLSPE-QRQELQQQLADLERKEINILKDKVSNKM 621 DADL L +++K P+ Q +EL + L + +L D+ Sbjct: 565 TDADLAPLLP------------EHAKNAPPDVQARELARIRDSLRFDSMAMLADET---- 608 Query: 622 HALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNIL- 680 A + + R M + GT AGE R QF + P +L Sbjct: 609 -AFAIIEPDDATRAIMRQGT-----------RPGTGAGEVWRAIMQFKSFPIAYMQRVLG 656 Query: 681 -------DLSNSAK-----MPKGASMALNH---VWIQYSATMALAGIGVASIKALLRGED 725 DL + +P AL + + + G ++K L +G + Sbjct: 657 GRRWVRGDLQRGMRYGPRNLPGAVEDALTRDMGGLMGFVLSSVAFGYASMTLKDLAKGRE 716 Query: 726 P-SLP--EVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVEL 782 P SL E + +G + D L V++ + +GP+ ++ + + +L Sbjct: 717 PRSLAHRETWLAAAMQSGGAGIFGDILFGKVNRFGNSFAETAVGPLGGLIGDAATLGGQL 776 Query: 783 ATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 D ++ + + PF+N+WY + + D ++L + E ++PG L R + K KK+ Sbjct: 777 VRGDMADAGEDTLRLAMGNAPFINLWYTRAALDWMLLYHVREMMSPGTLRRTERKMKKE 835 >gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 854 Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 80/349 (22%), Positives = 149/349 (42%), Gaps = 24/349 (6%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLD------GKGLSKAERYRLAGLKA 54 MK EC + GR+L+ KE LE ++A L K +S ER +A Sbjct: 1 MKNECRAAVEGVLGRKLTDKEADLLEQQFIKASRELPQEDIKAWKSMSDEERAEAIADRA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 +++ + I+ V + I++ R L +L +AL KL F S +E Sbjct: 61 IKNYTDQHIKEVTNLINDLEIREALEHEL--TSHSKLNPLEALNRKLVMFTDQSGIQSVE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 I+A E + + + K LG+ +D + E+ GK + + + + L K + Sbjct: 119 HNIQAIEVRYMGALADVFSKTQKGLGYLIDADKVKLLVKEIFGKPSGDAEIAGLAKSVQD 178 Query: 174 TQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 +L + G D K N IPQ S K+ + +++++ +D S+Y+ +G Sbjct: 179 VLEQLRQHYNRYGGDIKKLANYGIPQSHSHYKVIQAGEGEWIKTTFPMVDKSKYRHENGK 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREF------------ERVFHFKD 280 ++ +E+ + V+ + + S S+ + V + + R HFKD Sbjct: 239 LMNDAEVKEVLKAVY-QTIASEGHNKASVQAHAVQSETDLPVGMNMQALHQHHREVHFKD 297 Query: 281 SQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQM 329 + + Y E FG N + +L++ + +S +I + + G N + VKQ+ Sbjct: 298 PDSWVAYQEQFG-EVNFHDLLSNHIRRMSTEIGMMQTFGSNPEKLVKQL 345 Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 52/249 (20%), Positives = 106/249 (42%), Gaps = 23/249 (9%) Query: 600 QQLADLERKEINILKDKVSNKMHALVLDNVQTSVR--GAMHTSLFDRQRLGLLTYKRGTR 657 Q+L+D+ + LK++++NK + +V GA ++ R +RGT Sbjct: 595 QELSDIAFR----LKEQLANKYMNYIYTETNAAVLEVGARESTFMGLGR------ERGTV 644 Query: 658 AGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASI 717 E R F QF P M + + M +G + + A + G V+ I Sbjct: 645 GNELSRFFWQFKQFPLAMIMRQW----TRGMAQGTPQEKFVYFAKLFAYTTVMGALVSQI 700 Query: 718 KALLRGEDPSLPEVI--YDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGP-----VPS 770 + L +G+D P + Y ++ G ++ S ++ + P + S Sbjct: 701 QNLTQGKDLDDPTTLDFYMKSIVKGGSASFLADAISATSDPTERSVKDFIIPAAFKDITS 760 Query: 771 MVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGY 830 + T ++ + T+ + + A ++ +PF N+WY + FD L++ ++ E + GY Sbjct: 761 IGTMVSGAGSAFITERDSSYGAEAVNVVKNNIPFQNLWYSRLVFDRLVIAEMQELFDEGY 820 Query: 831 LDRQQSKKK 839 +R+Q +++ Sbjct: 821 RERKQRRQE 829 >gi|320175032|gb|EFW50145.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 236 Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 73/233 (31%), Positives = 118/233 (50%), Gaps = 13/233 (5%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAER-YRLAGLK 53 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER YR A L Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDTMSWRQLSESERLYRAAQLA 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 +EE ++ ++ A+ A R +L ++ Q G GK AL + F A S + Sbjct: 61 SEELQREAALKKRRVALTIA-ARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLS 118 Query: 112 LEMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 +E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K Sbjct: 119 VESRTKATREYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKA 178 Query: 171 YFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLD 222 + E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Sbjct: 179 WREVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLD 231 >gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 831 Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 77/342 (22%), Positives = 141/342 (41%), Gaps = 35/342 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 M C + + +A GR L K E + D I S + L++ + + + ++ Sbjct: 3 MSANCKREVEQAIGRPLKKSEADAINDKI-----SFHIRDLARTDPTKFNAMTEQQRQLA 57 Query: 61 ELIRSVNDAIDEAYKRHQLRS----------DLDRVQAGVYGKSQALFNKLFFKAGSAEV 110 ++ D + + K+ Q + D +A V G Q + LF + + Sbjct: 58 GAQAAMADHMADVAKKAQRKGLNLLAQTRELDNQTARAAVLGGKQPFTSALFERLRQVDT 117 Query: 111 PLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 ++ + A T ++ + K +G +K D E+ G+ + N A K Sbjct: 118 RIKGERNRAFTSIM---DTIMAAEPKFMGLITNKAVERDFVHEVFGQDSGNAIAKNAAKV 174 Query: 171 YFETQRELHSQAHEAG-----LDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSR 225 + + + + + AG LDY + +PQP S+ K+R ++ +L LD R Sbjct: 175 WRDQMDSIRERQNAAGADIGRLDYGW----LPQPHSLVKVRRAAPQEWASFVLGRLDRRR 230 Query: 226 YKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR-----EFERVFHFKD 280 Y + DGT ++ ++ F+ + A T + P + G R R HFKD Sbjct: 231 YLNEDGTQMNDGQVTDFL--LAAHETLRTDGLNKMTPGTGNGSSRAAKHDNAHRQIHFKD 288 Query: 281 SQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNA 322 ++++YM FG T+V + + + KD V+ +LGPNA Sbjct: 289 GDSYLEYMRDFG-PTSVFEAMNGSVHAQIKDTVLTEQLGPNA 329 Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/108 (28%), Positives = 55/108 (50%), Gaps = 3/108 (2%) Query: 754 SKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNAT--KAIRKTLPFMNMWYLK 811 ++G ++ + GLLGPV ++ + + + E + V A + + PF+ WY K Sbjct: 715 NRGGQSNLTGLLGPVYGTAADVGLTLGSVFKEKTEPADVGANLLRIGYQNTPFIRSWYTK 774 Query: 812 NSFDHLILNQILEELNPGYLDRQQSKKKKKGIELF-QNMDEGLPHRLP 858 +F+H +++ + E L+PGYL R + + KK + F E P R P Sbjct: 775 AAFEHAVMHDMQEMLSPGYLSRMKKRAKKDFNQRFWWEPGETAPSRAP 822 >gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 1175 Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 102/438 (23%), Positives = 190/438 (43%), Gaps = 48/438 (10%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ L+ +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLTAQEATDIEARINETMRNLARKDINNWRNLSDAEKLTEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGK--SQALFNKLFFKAG--SAE 109 D Q++L R A + K+ Q + LD +GK S + +++ G S Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALD------HGKLSSMEVIDRMVAAHGDMSGI 114 Query: 110 VPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRL-- 167 ++ K + + ++ LG D++ + E G+ T + A ++ Sbjct: 115 QSIDSKARGIAAIYRGELVDFYTNIKGGLGIFTDQELVQKIVRERFGESTGDALAKKISD 174 Query: 168 -VKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSR 225 + FET R+ + + G D +N +PQ +++K+ K +V +D + Sbjct: 175 KMGDVFETMRD---RFNRNGGDIGKLDNWGLPQTHNLEKIAQAGKQAWVSKAESLIDTRQ 231 Query: 226 YKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIP-------SSEVGVKREFERVFHF 278 Y +G S+ EI S + E + + S + +S+V + RV HF Sbjct: 232 YVHENGDYYSQQEIRSLL-EYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHF 290 Query: 279 KDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQ 338 KD+++ ++Y FG V+ ++ + + LSKDI + LG N + +K ++ Sbjct: 291 KDAESWLEYQSDFGGMQFVD-LVEAHINGLSKDIAMVENLGSNPKTALKILM-------- 341 Query: 339 EASAGNKVLKDW---LGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGAS 395 +A+A KDW + N+ + ++ M++ + G T ++ AN RS AS Sbjct: 342 DAAAK----KDWEKGIEENQTKSSRKRAQVMFDELSGGNTPQSQVLANLGIAYRSMNVAS 397 Query: 396 MLGQHPIGALLEDGFISR 413 MLG I +L + I++ Sbjct: 398 MLGGTTIASLADQATIAK 415 Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 60/251 (23%), Positives = 105/251 (41%), Gaps = 36/251 (14%) Query: 613 LKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTP 672 ++D+V++++ A +LD +V + L R+R + +GT GE + QF + Sbjct: 918 IRDEVASQLQAHLLDEQGMAV---IEAGL--RERTWMTVGAKGTITGEVFKGLMQFKSFS 972 Query: 673 TGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVI 732 + S M + I +M L G V ++ +L G DP + I Sbjct: 973 ASFLMR----QGSRAMAQEGLKGKAAYAIPLMVSMTLLGGLVVQLREILNGNDP---QTI 1025 Query: 733 YDG------------TLANGALLPYM-DRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSA 779 YD +L G LP + D L R A + GP+ S T+L Sbjct: 1026 YDSNDPKKATSFFMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSGPLGSDFTSLLGLT 1085 Query: 780 VELATKDNENSKVN----ATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGY----- 830 V T+ NE N A K ++ +P N+WY K + + ++ +++ + + PGY Sbjct: 1086 VGNLTQYNEGKDTNFGNEAFKFVKGKIPAQNLWYTKAAINRMVFDEMQDTIAPGYREKAL 1145 Query: 831 --LDRQQSKKK 839 +RQQ +++ Sbjct: 1146 RKAERQQDRER 1156 >gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine microorganism HF4000_48F7] Length = 828 Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 126/608 (20%), Positives = 227/608 (37%), Gaps = 102/608 (16%) Query: 258 DPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARE 317 +P + + K R HF+DS A ++Y + +G S V I+ + LS + + + Sbjct: 277 EPGVGRKSLSTKISQSRQLHFRDSAAWIEYNKKYGHSNAVQAIVQG-VGHLSDSLELIKV 335 Query: 318 LGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAML--QMWEVMRYGET 375 G N D K++ L R + Q ML + +V Sbjct: 336 FGANPDGTFKRL---------------------LERQDFDPGQRTMLRSEYNQVSGAAFE 374 Query: 376 VENTGWANWMAGLRSAAGASMLGQ-------HPIGALLEDGFISRQMLSRV--GIDKEAI 426 V N W W G+++ S LG PI + + + S + Sbjct: 375 VANPAWHKWTQGIQAIQNLSKLGSAIFSSTTDPIYVAFTQHYHGKNIFSAYYNAFLNIGV 434 Query: 427 QRINKMPLKERMELLS-DVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLD 485 R+ + + +E+ + +GL +GV+ S +WSGA+ D Sbjct: 435 GRLLQRGKSKEIEMFARKLGLGFDGVIG-------------------SAASRWSGAK--D 473 Query: 486 KKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMS 545 A+ + + L L A+ D D T + K Sbjct: 474 TTEFMQGAV----------NNFFRLNGLSGWTNFYREGAAYLMASDMADATKLNWDKL-- 521 Query: 546 SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 +P+ Y R + D+D +D+A + +K+ +SP R + +L ++ Sbjct: 522 APN---YRRLLERY-GITDSDWKDIAGLP------FEKINGLDVISP-TRVFDEIELGNI 570 Query: 606 ERKEINILKDKVSNKMHALVLDNVQTSVR-GAMHTSLFDRQRLGLLTYKRGTRAGEALRM 664 I ++ L+ +N ++ GA + R G K GT A ++ Sbjct: 571 TGDAIPRSRELAEKIQQVLITENEFAVLQPGANERAFMGRFFTGEEGIKSGTPMAMANKL 630 Query: 665 FQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGE 724 F QF + M + P+ M L + + M L G ++K +L+G Sbjct: 631 FWQFRSFGLTMLFR--------QWPRAYEMGLPSFY--HLVPMVLMGYVAMAMKDILKGR 680 Query: 725 DPSLPEVIYD-GTLANGALLP------YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLT- 776 + L +V+ D G +A ++L D L + + + L GP S + +L Sbjct: 681 E--LKDVVEDPGKIAVASVLQSGFGGIAGDFLFNDYRQYSTSYVDLLAGPSGSSLNDLAE 738 Query: 777 --SSAVELATK-DNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDR 833 ++ ++AT D ++ +A++ +P+ N W + FD+LI Q+ E LNPG L R Sbjct: 739 FGATTFDVATGGDPVDAAAAGWRAVKGNIPYANWWASRTLFDYLINYQVQEILNPGSLRR 798 Query: 834 QQSKKKKK 841 + + K+K Sbjct: 799 MERRFKQK 806 >gi|226953662|ref|ZP_03824126.1| phage related protein [Acinetobacter sp. ATCC 27244] gi|226835534|gb|EEH67917.1| phage related protein [Acinetobacter sp. ATCC 27244] Length = 842 Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 86/425 (20%), Positives = 173/425 (40%), Gaps = 39/425 (9%) Query: 1 MKPECIQVLNKAAG-RELSKKELRRLEDGIVRAYVSL-----DGKGLSKAERYRLAGLKA 54 M+ EC + + KA G R+LS + R+ +RA +L D S AER K Sbjct: 1 MRAECREQVAKALGKRKLSAADSNRISSLYIRAQNTLARTDPDWMFKSPAERAEAIAQKT 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 D ++ ++ + +A + QL++++ QAL K+ +F S +E Sbjct: 61 ATDLAVQIAKNNQNIARDAIIKAQLQNEI--YNHPKLNPVQALMRKIAYFSDQSGIQSIE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +A ++ +S + + G +++K D+ M G K+ N + + + K+ Sbjct: 119 KQSQALHSRWMSLVADVFTKTQERFGMSVNKAMTDDIIRVMFGGKSDNPEITAMAKEVSA 178 Query: 174 TQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 E+ + AG + K +N K+ T + ++V L +D ++Y G Sbjct: 179 ALEEMRLAFNRAGGNIKKLDNFGFMTSHDQKKVALTDQSEWVNDALAGVDRNQYVKETGE 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKD-------------PSIPSSEVGVKREFERVFHFK 279 + E+ S + E++ + + KD P S++ + + R HFK Sbjct: 239 LMDELELKSMLEEIYKTISTNGANKDLLILNKQAKAGASPVGGRSKMANRHQESRALHFK 298 Query: 280 DSQAHMDYMEHFGV--STNVNTILTSELASLSKDIVIARELGPNA----DSFVKQMIVQT 333 D A + Y + +G + IL + +S ++ + + LG N +S + + ++ Sbjct: 299 DGDAWLAYQKKYGTYDEAGFHEILKNHTHRMSTEVAMMQNLGSNPRNTFESLLDEAKIKL 358 Query: 334 IANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAG 393 A+ Q + +++ + + M+ + ++ N M GLR+ Sbjct: 359 KADPQNG----------MKHGEIDKQAHRAMSMYNTLDANTRAIDSTLGNVMGGLRALMV 408 Query: 394 ASMLG 398 AS LG Sbjct: 409 ASKLG 413 Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust. Identities = 34/154 (22%), Positives = 64/154 (41%), Gaps = 7/154 (4%) Query: 705 ATMALAGIGVASIKALLRGEDPSLPEVI--YDGTLANGALLPYM-DRLTKLVSKGDRAAI 761 A LAG + + L G++P I + +L G L ++ D ++ L R+A Sbjct: 688 AYQTLAGALIVQTQNLANGKNPEPVFTIDFFGKSLLKGGGLSFLGDIMSALSDPTGRSAS 747 Query: 762 ----GGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHL 817 G LLG + LT + + ++ +P N+WY K D + Sbjct: 748 DFISGPLLGQSMKLGMLLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLWYSKLVVDRM 807 Query: 818 ILNQILEELNPGYLDRQQSKKKKKGIELFQNMDE 851 + +++ ++P YL R Q + + G + ++ E Sbjct: 808 LYSKMQNMIDPDYLPRTQQRLENLGNSYWWDLSE 841 >gi|262371858|ref|ZP_06065137.1| predicted protein [Acinetobacter junii SH205] gi|262311883|gb|EEY92968.1| predicted protein [Acinetobacter junii SH205] Length = 841 Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 90/424 (21%), Positives = 175/424 (41%), Gaps = 37/424 (8%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL-----DGKGLSKAERYRLAGLKA 54 M+ EC + + KA G++ L+ + R+ +RA +L D S AER K Sbjct: 1 MRAECREQVAKALGKKRLNAADSNRISSLYIRAQNTLARTDPDWMFKSPAERAEAIAQKT 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 D ++ ++ + +A + QL++++ QAL K+ +F S +E Sbjct: 61 ASDLAVQIAKNNQNIARDAVIKAQLQTEI--YNHPKLNPVQALMRKIAYFSDQSGIQSIE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +A ++ +S + + G +++K D+ M G K+ N + + + K+ Sbjct: 119 KQSQALHSRWMSLVADVFTKTQERFGMSVNKAMTDDIIRVMFGGKSDNPEITAMAKEVSA 178 Query: 174 TQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 E+ + AG + K +N K+ T + ++V LD LD ++Y G Sbjct: 179 ALEEMRLAFNRAGGNIKKLDNFGFMTSHDQKKVALTNQAEWVNDALDGLDRNQYVKDTGE 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKD-------------PSIPSSEVGVKREFERVFHFK 279 + E+ S + +++ + + KD P S++ + + R HFK Sbjct: 239 LMDELELKSMLEDIYKTISTNGANKDLLVLNKQAKAGVSPVGGRSKMANRHQEARALHFK 298 Query: 280 DSQAHMDYMEHFGV--STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAND 337 D A + Y + +G + IL + +S ++ + + LG N + ++ + Sbjct: 299 DGDAWLAYQKKYGTYDEAGFHEILKNHTQRMSTEVAMMQNLGSNPRHTFESLLDE----- 353 Query: 338 QEASAGNKVLKDWL-GRNKLEVRQEA--MLQMWEVMRYGETVENTGWANWMAGLRSAAGA 394 A K+ D L G E+ ++A L M+ + ++ N M GLR+ A Sbjct: 354 ----AKIKLKADPLNGLKHGEIDKQAHRALSMYNTLDANTRAIDSTLGNVMGGLRALMVA 409 Query: 395 SMLG 398 S LG Sbjct: 410 SKLG 413 Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust. Identities = 34/154 (22%), Positives = 64/154 (41%), Gaps = 7/154 (4%) Query: 705 ATMALAGIGVASIKALLRGEDPSLPEVI--YDGTLANGALLPYM-DRLTKLVSKGDRAAI 761 A LAG + + L G++P I + +L G L ++ D ++ L R+A Sbjct: 688 AYQTLAGALIVQTQNLANGKNPEPVFTIDFFGKSLLKGGGLSFLGDIMSALSDPTGRSAS 747 Query: 762 ----GGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHL 817 G LLG + LT + + ++ +P N+WY K D + Sbjct: 748 DFISGPLLGQSMKLGMLLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLWYSKLVVDRM 807 Query: 818 ILNQILEELNPGYLDRQQSKKKKKGIELFQNMDE 851 + +++ ++P YL R Q + + G + ++ E Sbjct: 808 LYSKMQNMIDPDYLPRTQQRLENLGNSYWWDLSE 841 >gi|48697207|ref|YP_024937.1| hypothetical protein BcepC6B_gp17 [Burkholderia phage BcepC6B] gi|47779013|gb|AAT38376.1| gp17 [Burkholderia phage BcepC6B] Length = 864 Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 86/396 (21%), Positives = 156/396 (39%), Gaps = 74/396 (18%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLDGKG---LSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + +R+ D G +S+A+R Sbjct: 1 MHQKCVNAVETAAGRKLTQAEIDGIENRVRAGMRSTARQDPAGWSAMSQADRV----AAG 56 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNK----------- 100 E +++L+ + +D A K+ Q+ + DR+Q +Y + K Sbjct: 57 AEWARQQLVHEAD--LDRARKQLQIAKQIETTDRIQEALYADPENAHRKRARETIVKHDI 114 Query: 101 --LFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM- 154 + AG+ IK+ + + +VG L D D+ E+ Sbjct: 115 EQTYVTAGA--------IKSDYMRQTMGAIDAMKVGQNFLARAFDVDNPAMERDIIREVY 166 Query: 155 KGK--KTQNEQASRLVKQYFETQRELHSQAHEAG-----LDYKFFENRIPQPMSVDKLRA 207 +G T NE A +Q +T + + + AG LDY + R Q + Sbjct: 167 RGADGSTGNEVAKAAAEQIGKTTGAMRERFNRAGGNVGELDYGYVPIRHAQSKVLGNGSD 226 Query: 208 TKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIAS-FVGEVFAERVRSTSFKDPSIPSSEV 266 ++ + +++ LD S+Y D G PL+ +E+ VGE R+ + ++ + Sbjct: 227 AQRHAWADAVMPLLDRSQYLDDAGNPLNDAELRKVLVGEDREAWERANAAARGNVAPRKQ 286 Query: 267 GVKREF-------------------------ERVFHFKDSQAHMDYMEHFGVSTNVNTIL 301 GV RV HF+D+ AHM Y FG + +N L Sbjct: 287 GVWDTIAYGGVNKIVPGETSGGAARANAGSAHRVLHFRDADAHMQYNRQFGEGSLLNA-L 345 Query: 302 TSELASLSKDIVIARELGPNADSFVKQMIVQTIAND 337 + ++K+I + GPN +K + T +D Sbjct: 346 VDHVGGMAKNIALVERYGPNPTRNMKTQMQLTAVHD 381 Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust. Identities = 57/228 (25%), Positives = 99/228 (43%), Gaps = 26/228 (11%) Query: 655 GTRAGEALRMFQQFTTTPTGM----FLNILDLSNSAKMPKGASMALNHVWIQYSATMALA 710 GT GE + F QF + P M + I D+ S + AL + + Y+A + ++ Sbjct: 630 GTVTGELKKSFMQFKSFPMAMISRHWGRIGDMRRSGDFRVDGAPALANP-MAYAAALVVS 688 Query: 711 G--IGVASIKA--LLRGEDPS--LPEVIYDG-------TLANGALLPYMDRLTKLVSKGD 757 IG S +A LL G+DP +V + G ++ GA D L D Sbjct: 689 TTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAGFA-GDMLVAAFQSAD 747 Query: 758 R-----AAIGG-LLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLK 811 +AIGG LL + + ++S+ + A + + + K + P +N+W+ K Sbjct: 748 YGSLLGSAIGGPLLSTLFQPLRAVSSNVQDAAQGKDTHIGADLLKIAQSNTPLVNLWFWK 807 Query: 812 NSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 ++ LI + + E L+PG R ++ + + + F + G P R P Sbjct: 808 TVWNRLIWDNLAENLSPGVTQRNMNRSRTQYHNDYFWSPGTGSPQRSP 855 >gi|48696687|ref|YP_024981.1| hypothetical protein VP5_gp18 [Vibrio phage VP5] gi|40806150|gb|AAR92068.1| hypothetical protein [Vibrio phage VP5] Length = 782 Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 52/194 (26%), Positives = 88/194 (45%), Gaps = 10/194 (5%) Query: 653 KRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGI 712 K G GE R F + P +N + K GA ++ I AT L G+ Sbjct: 566 KSGNFGGELHRSLFMFHSFPITTIMNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVL-GV 624 Query: 713 GVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKG---DRAAIGGLLG 766 G+ K +L G+ P S P++ +G +A G Y+ L + + G D + G G Sbjct: 625 GIIQAKDILNGKKPRSMSDPKLWIEG-MAQGGSFNYIGDLMRNAASGYSHDMTSYVG--G 681 Query: 767 PVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEEL 826 PV + + +A ++A D E++ + +PF N+WY K + D L++++I Sbjct: 682 PVLAYGDWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLS 741 Query: 827 NPGYLDRQQSKKKK 840 +P Y +Q +K +K Sbjct: 742 DPEYDKKQLNKMRK 755 Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust. Identities = 67/292 (22%), Positives = 120/292 (41%), Gaps = 33/292 (11%) Query: 130 YAEVGSKNLGFTLDKQFGLDVF-DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLD 188 +A + + T +Q LD F E+ G++T N A + K + + +L+++ +AG Sbjct: 116 FAGIATGERRLTKSQQRLLDDFVHELYGRQTGNADALKAAKGWKKATEDLNARFGQAGGH 175 Query: 189 YKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLD-------LSRYKDIDGTPLSRSEIA 240 ++ R+PQ + + D +V + D +D L + KD D R + Sbjct: 176 MAELDDWRLPQKHNRMAISKAGADVWVEKVWDLIDRDKMVKKLRKGKDEDNL---REALY 232 Query: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 S + + + S+ ++ + R ER FKDS + + Y FG TNV Sbjct: 233 SVYNNIVTDGMSSSK----TLSKKFTDMMRS-ERFITFKDSDSWLKYQREFG-DTNVYAS 286 Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 + + ++S+ I + GP+ D + T+ + G + R ++ Sbjct: 287 MLGHIDNMSRAIGMMETFGPDPD-----IGFNTLERAVKTKKGLTSRQPTGARPTFDM-- 339 Query: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFIS 412 +M Y E T W N +AGLR+ AS LG + AL + + S Sbjct: 340 --------LMGYNMVEEQTVWGNRVAGLRNLWTASKLGAAVVSALTDSVYAS 383 >gi|48696644|ref|YP_024423.1| hypothetical protein VP2p19 [Vibrio phage VP2] gi|40950042|gb|AAR97633.1| hypothetical protein [Vibrio phage VP2] Length = 782 Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 52/194 (26%), Positives = 88/194 (45%), Gaps = 10/194 (5%) Query: 653 KRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGI 712 K G GE R F + P +N + K GA ++ I AT L G+ Sbjct: 566 KSGNFGGELHRSLFMFHSFPITTIMNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVL-GV 624 Query: 713 GVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKG---DRAAIGGLLG 766 G+ K +L G+ P S P++ +G +A G Y+ L + + G D + G G Sbjct: 625 GIIQAKDILNGKKPRSMSDPKLWIEG-MAQGGSFNYIGDLMRNAASGYSHDMTSYVG--G 681 Query: 767 PVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEEL 826 PV + + +A ++A D E++ + +PF N+WY K + D L++++I Sbjct: 682 PVLAYGDWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLS 741 Query: 827 NPGYLDRQQSKKKK 840 +P Y +Q +K +K Sbjct: 742 DPEYDKKQLNKMRK 755 Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust. Identities = 67/292 (22%), Positives = 120/292 (41%), Gaps = 33/292 (11%) Query: 130 YAEVGSKNLGFTLDKQFGLDVF-DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLD 188 +A + + T +Q LD F E+ G++T N A + K + + +L+++ +AG Sbjct: 116 FAGIATGERRLTKSQQRLLDDFVHELYGRQTGNADALKAAKGWKKATEDLNARFGQAGGH 175 Query: 189 YKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLD-------LSRYKDIDGTPLSRSEIA 240 ++ R+PQ + + D +V + D +D L + KD D R + Sbjct: 176 MAELDDWRLPQKHNRMAISKAGADVWVEKVWDLIDRDKMVKKLRKGKDEDNL---REALY 232 Query: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 S + + + S+ ++ + R ER FKDS + + Y FG TNV Sbjct: 233 SVYNNIVTDGMSSSK----TLSKKFTDMMRS-ERFITFKDSDSWLKYQREFG-DTNVYAS 286 Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 + + ++S+ I + GP+ D + T+ + G + R ++ Sbjct: 287 MLGHIDNMSRAIGMMETFGPDPD-----IGFNTLERAVKTKKGLTSRQPTGARPTFDM-- 339 Query: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFIS 412 +M Y E T W N +AGLR+ AS LG + AL + + S Sbjct: 340 --------LMGYNMVEEQTVWGNRVAGLRNLWTASKLGAAVVSALTDSVYAS 383 >gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1] gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 855 Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 158/743 (21%), Positives = 294/743 (39%), Gaps = 144/743 (19%) Query: 165 SRLVKQYFETQRELHSQAHEAGLDYKFFENRIP-QPMSVDKLRATKKDDFVRSMLDWLDL 223 ++++++Y E R A+ AG I Q +K+ A + + +L LD Sbjct: 180 AKIIQKYQEGAR---IDANRAGASIGKLPGYIARQSHDSEKMGAAGFERWAEEILPRLDT 236 Query: 224 SRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREF--ERVFHFKDS 281 + +++ G P+ + + G V + ++S + + P+ + ++ ERV HFKD Sbjct: 237 ATFRE-GGDPMVFLK-GVYDGLVSGDHLKSPAGQQPNGFRGPANLAKKLSQERVLHFKDG 294 Query: 282 QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS 341 A +Y + FG N+ + L ++ + R LG N ++ + M + I D A Sbjct: 295 VAWHEYNQLFGTG-NLREAVLRGLDLSGQNTALMRRLGTNPEANLN-MAMDVIKEDVRAG 352 Query: 342 A----------------GNKVLKDWLGR-----NKLEVRQEAMLQMWEVMRYGETVENTG 380 GN+ LK+ G+ N + R A ++ W+ ++ G Sbjct: 353 GDPAALANFNTARRGVIGNR-LKEVSGQTRIPGNATQARVAANVRAWQ------SLSKLG 405 Query: 381 WANWMAGLRSAAGASML---GQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPLKER 437 A + AS + GQ +G+L E G G+ K E+ Sbjct: 406 GALLSSFTDLPVAASEMRYQGQSFLGSLAEMG---------AGLMK-------GRGSAEQ 449 Query: 438 MELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKW---SG-AEYLDKKRISSHA 493 ++LS G+YA+ + G M S +G K+ M ++ +G + + D + S+ Sbjct: 450 RQILSAYGVYADSM--RGEIMRRFSADDSVGGKMSRGMSQFFRLNGLSWWTDANKASAGL 507 Query: 494 LIVYNQIGRMTDTYASLK-DLKADPRLDPSIKAF-FKQLDDTDFTVIKRAKAMSSPDGYL 551 ++ +N + SL D K +A LD + +++ + DG Sbjct: 508 MMAHNLAQNKGKAWGSLNGDFK---------RALGLYDLDAGKWELLREMDTRMA-DGRD 557 Query: 552 YARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEIN 611 Y TP I + D R+ +A + PE +++ DLER Sbjct: 558 YM-TPDGIAGISDE------RIGQYLAERNR---------PESAGAIRETRQDLERSLRA 601 Query: 612 ILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTT- 670 + D+V+ +A++ + +T R M+ + GT G+ LR QF + Sbjct: 602 YVNDRVT---YAVLEPDART--RSIMNQGT-----------QPGTVPGDLLRFVTQFKSF 645 Query: 671 ------------------TPTGM---FLNILDLSNSAKMPKGASMALNHVWIQYSATMAL 709 TPT + F DL + + G +AL + + T A Sbjct: 646 PAAYMQKTLGRELYGRGYTPTALGNSFRGGRDLVQALRNGNGERLALAQLMLW---TTAF 702 Query: 710 AGIGVASIKALLRGEDPSL---PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLG 766 + +AS K + +G +P P+ + G L + D L ++ +A+ G Sbjct: 703 GYLSMAS-KDVTKGREPRPADDPKTWLAAMVQGGGLGIFGDYLFGEANRFGNSALESAAG 761 Query: 767 PV---PSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQIL 823 P + V NL + A K+ +++ +A + + PFMN++Y + + DHL L + Sbjct: 762 PTIGTAADVINLWARA-----KEGDDTASSALRLAQNNTPFMNLFYTRIALDHLFLYSVQ 816 Query: 824 EELNPGYLDRQQSKKKKKGIELF 846 E +NPG L R + + +++ + F Sbjct: 817 EAMNPGSLRRTEERIRQQNGQEF 839 >gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] Length = 995 Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 51/203 (25%), Positives = 85/203 (41%), Gaps = 16/203 (7%) Query: 653 KRGTRAG----EALRMFQQFTTTPTGMFLNIL--DLSNSAKMPKGASMALNHVWIQYSAT 706 +RGT+AG EALR QF P + + DL + G + + H + + Sbjct: 778 RRGTQAGTLEGEALRFVGQFKAFPVAVISKVWGRDLYGGER-GWGRAAGIVHTLVATTVM 836 Query: 707 MALAGIGVASIKALLRGE---DPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGG 763 +AG+ +K L +G DP+ P L G Y D L S+ + Sbjct: 837 GYVAGM----LKDLSKGRAPRDPTDPRAWGAAFLQGGGAGIYGDFLLGQYSRFGNRFLES 892 Query: 764 LLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQIL 823 GP S L + + ++ + K + PF+N++Y + + D+L L Q+ Sbjct: 893 AAGPTLSSAGELLN--IWAGAREGNDEKAATLRWTLSNTPFVNLFYTRMALDYLFLYQVQ 950 Query: 824 EELNPGYLDRQQSKKKKKGIELF 846 E +NPG+L R + + K + F Sbjct: 951 EAMNPGFLRRFEQRVAKDNNQRF 973 Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust. Identities = 36/134 (26%), Positives = 55/134 (41%), Gaps = 19/134 (14%) Query: 255 SFKDPSIPSSEVGVKREFE-RVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIV 313 FKDP+ S KR + RV H++D+ A MDY FG V +L L +++ Sbjct: 273 GFKDPAFKGSGNIAKRLSQGRVLHWRDADAWMDYQAAFGHGNLVEAVLRG-LDQAARNTA 331 Query: 314 IARELGPNA----DSFVKQMIVQTIANDQEASAGNKVLKDWLGR-------------NKL 356 + RE G N D+ ++ + D +A + WL N+L Sbjct: 332 LMREFGTNPRGEFDADMQALAESWRDRDPDAVVKLGEARKWLANRFDELDGTSSMPVNRL 391 Query: 357 EVRQEAMLQMWEVM 370 R A ++ WE M Sbjct: 392 GARIGASVRAWESM 405 >gi|221213942|ref|ZP_03586915.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166119|gb|EED98592.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 864 Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 59/231 (25%), Positives = 102/231 (44%), Gaps = 32/231 (13%) Query: 655 GTRAGEALRMFQQFTTTPTGM----FLNILDLSNSAKMPKGASMALNHVWIQYSATMALA 710 GT GE + F QF + P M + I D+ S + AL + + Y+A + ++ Sbjct: 630 GTAMGELKKTFMQFKSFPIAMISRHWGRIGDMRRSGDFRVDGAPALANP-MAYAAALVVS 688 Query: 711 G--IGVAS--IKALLRGEDPSLPEVIYDG------------TLANGALLPYMDRLTKLVS 754 IG S +K LL G+DP E ++D ++ GA D LT Sbjct: 689 TTLIGAISTQVKNLLAGKDP---EPMFDDVKHAAGFWTRAFSVGGGAGFA-GDMLTASFE 744 Query: 755 KGDRAAIGGLL--GPVPS----MVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMW 808 D ++ G + GP+PS +V +S+A + A + + + K + P +N+W Sbjct: 745 STDYGSLLGSVVGGPLPSTIYQVVRAFSSNAQDAAQGKDTHVSADLLKVAQSNTPLVNLW 804 Query: 809 YLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 + K ++ LI + + E L+PG R ++ + + + F + G P R P Sbjct: 805 FWKTVWNRLIWDNLAENLSPGVTQRNINRSRNQYHNDYFWSPGTGSPQRAP 855 Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 81/390 (20%), Positives = 151/390 (38%), Gaps = 62/390 (15%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLDGKG---LSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + +RA D G +S+A+R A Sbjct: 1 MHQKCVNAVEAAAGRKLTQAEIDGIENRVRAGMRATARQDPVGWSAMSQADRVAAGAEWA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNK----------- 100 + + E +D A K+ Q+ + DR+Q +Y + K Sbjct: 61 RKQLEHEA------DLDRARKQLQIAKQIETTDRIQEALYADPENAHRKRARETIVKQDI 114 Query: 101 --LFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKK 158 + AG+ + + A + + N A + +++ +V+ G Sbjct: 115 EQTYVLAGAIKSDYMRQTMGAIDAMKAGQNFLARAFDVD-NPAMERDIIREVYHGADGS- 172 Query: 159 TQNEQASRLVKQYFETQRELHSQAHEAG-----LDYKFFENRIPQPMSVDKLRATKKDDF 213 T NE A +Q +T + + + AG LDY + R Q + + + Sbjct: 173 TGNEVAKAAAEQISKTTAAMRERFNRAGGNVGELDYGYVPIRHSQSKVLGNGSDAARHAW 232 Query: 214 VRSMLDWLDLSRYKDIDGTPLSRSEIAS-FVGEVFAERVRSTSFKDPSIPSSEVGVKREF 272 +++ LD S+Y D G PL+ +++ VGE R+ + +I + GV Sbjct: 233 ADAVMPLLDRSQYLDDAGNPLNDADLRKMLVGEDREPWERANAAARGNIAPRKQGVWDTI 292 Query: 273 -------------------------ERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELAS 307 RV HF+D+ AH+ Y +G + +N L + Sbjct: 293 AYGGVNKIVPGETTGSAARANAGSAHRVLHFRDADAHIQYNRQYGEGSLLNA-LVDHVGG 351 Query: 308 LSKDIVIARELGPNADSFVKQMIVQTIAND 337 ++K+I + GPN +K + T +D Sbjct: 352 MAKNIALVERYGPNPTRNMKTQMQLTAVHD 381 >gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 582 Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 4/87 (4%) Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ + GPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 481 GALASMFGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAAL 540 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 DH+I NQ+ E +PGYL + + + KK+ Sbjct: 541 DHMIFNQMQEYFSPGYLRKMEQRSKKE 567 Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust. Identities = 27/111 (24%), Positives = 51/111 (45%), Gaps = 4/111 (3%) Query: 234 LSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEVGVKR-EFERVFHFKDSQAHMDYMEH 290 ++ +E+++F+GE + D + S R R HFKD+ +++ Y + Sbjct: 1 MNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQYQQL 60 Query: 291 FGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS 341 +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 61 YG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATAN 110 >gi|262043551|ref|ZP_06016664.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039085|gb|EEW40243.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 708 Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 174/746 (23%), Positives = 301/746 (40%), Gaps = 98/746 (13%) Query: 4 ECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRL--AGLKAEEDFQK- 60 +C +N AAGR+LS+ E+ L VR + L+ E L A L+A ++ Sbjct: 8 QCEIAVNTAAGRKLSEDEMESL----VRDMNDTTNRILAGNEALTLEEAALRAAQELGNR 63 Query: 61 ----ELIRSVNDAIDEAYKRHQL----RSDLDR----VQAGVYGKSQALFNKLFFKAGSA 108 ++I + N AI+ +L R+ DR ++A + G++ A ++ S+ Sbjct: 64 DQLAKVIEARNKAINTRIAAQRLGELRRTWKDRPDIGLEAMLVGRNDARTGSR--RSVSS 121 Query: 109 EVP-LEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKT-----QNE 162 EV L K A + F++ V G + D++ ++ +G+KT Q+ Sbjct: 122 EVAQLRGKYHAG---INYDFDQAGLVKFIASG-SNDREIADAMWRIGRGQKTDGMTPQSV 177 Query: 163 QASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLD 222 A++++ ++ ET R ++A K + Q + K+RA + + ++L LD Sbjct: 178 SAAKIIMKWQETARVDENRA--GAWIGKMPGYIVRQSHDILKIRAAGYESWRNAILPRLD 235 Query: 223 LSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDP---SIPSSEVGVKREF-ERVFHF 278 + + I R V + A V TS K S VKR ERV HF Sbjct: 236 DATFDGIS----DREGFLRGVYDGLASGVHLTSEKPDWMNGFKGSANAVKRASQERVLHF 291 Query: 279 KDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQ 338 KD +Y E FG + + L S ++ I R LG N + K + TIA D Sbjct: 292 KDGVNWHEYNEQFGTGSLREAVFGG-LNSAARTTGIMRVLGTNPQNMFK-YLTDTIAKDV 349 Query: 339 EASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLG 398 + L D++ +VR+ M +V + GWAN A +R S LG Sbjct: 350 SKQSNPAALADFM----TKVRRLNRTVMPQVDGSLNIPGSVGWANASANVRGWLRMSQLG 405 Query: 399 QHPIGALLEDGFISRQMLSRVGIDKEAIQ-----RINKMPLKERMELLSDVGLYAEGVVA 453 I + + + +M + +A+ R ++ E+ E+LS +G+Y++ + Sbjct: 406 GAVISSFNDVPISATEMRYQGQNFMQALTGAMKGRFSRYTSDEQKEILSSIGVYSDTMTQ 465 Query: 454 HGRNMMEGSDAF--QIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLK 511 M G+D+ ++G + K++ + + +S+A+++ N + + D Sbjct: 466 EIIRRMSGNDSMSGKMG-RAQQLFFKYNLMNFWTESGRNSNAMMITNWLAKNADQ--QFT 522 Query: 512 DLKADPR--LDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRD 569 L D R LD + D ++ I R M+ +G + T S I+ + D + D Sbjct: 523 ALPEDLRRVLD------LHGIGDAEWN-IYRNMDMADSEGRKFM-TTSGIRAVPDEVIGD 574 Query: 570 LARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALV-LDN 628 K LK ++ + R+ L+ QL +NI + ++ A + + Sbjct: 575 YV--------ASKGLKVTERSIADARETLESQLRGYILDRLNIAMSEPGDRTQAFMKMGT 626 Query: 629 VQTSVRGAM---------HTSLFDRQRLGLLTYKRG-TRAGEALRMFQQFTTTPTGMFLN 678 V +V G T+ F + LG + RG T AG + TG N Sbjct: 627 VPGTVAGEAVRFAGQYKSFTASFMQNVLGREVFGRGYTPAG--------LGESKTGSLTN 678 Query: 679 ILDLSNSAKMPKGASMAL---NHVWI 701 L L N G ++ NHVWI Sbjct: 679 AL-LRNGKGAFLGGCKSVRMGNHVWI 703 >gi|262043648|ref|ZP_06016757.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038986|gb|EEW40148.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 974 Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 126/627 (20%), Positives = 235/627 (37%), Gaps = 98/627 (15%) Query: 254 TSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIV 313 T FK S + V + ERV HFKD + Y + FGV N+ + S L ++ Sbjct: 391 TGFKGGS---TNVARRASQERVLHFKDGLSWYRYNDKFGVG-NLREAVGSGLIHSAETTG 446 Query: 314 IARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAML--QMWEVMR 371 + R +G N ++ ++ I +A+ + L NK ++ L Q+ E+ Sbjct: 447 LMRRMGTNPENMFNEL-ADRIEQRYKAAKDDNAL------NKFRQKRNTSLTSQLKEITG 499 Query: 372 YGETVENTGWANWMAGLRSAAGASMLGQHPIGAL-------LEDGFISRQMLSRVGIDKE 424 N A A R+ LG I + +E + R ML V Sbjct: 500 QTNIPGNAALARVAATTRAIETMMKLGGSMISSFNDIATQAMEMRYQGRNMLGSVWEATA 559 Query: 425 AIQRINKMPLKERMELLSDVGLYAEGVVAH------GRNMMEGSDAFQIGHKLHSKMHKW 478 ++ + ER ++L +GL+A+ + N M G + + + W Sbjct: 560 NKVQLTRWKNAERQQVLKSIGLHADAMKDELIYRFSADNSMPGRVNRAMRNYFRLNLQSW 619 Query: 479 SGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVI 538 + + R S+ ++V +G T S D+ + R S+ +++ ++ + Sbjct: 620 ----WTNSSRYST-GMMVSEWLG--THAGKSFGDVPEELRRVLSMHG----IEENEWAAL 668 Query: 539 KRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQEL 598 + K + + DG Y TP + ++ D+ + L Sbjct: 669 SKMK-LHAADGNAYM-TPDGVADIPRTDIENY---------------------------L 699 Query: 599 QQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLT--YKRGT 656 + + + + ++ +S+K+ +LD V ++ D + + ++ +RGT Sbjct: 700 TNRGIKINDRSVEYARELLSDKVRGYILDRVGVALNEP------DARTMSIMKQGMQRGT 753 Query: 657 RAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAG----- 711 GE LR QF + N + + S++ N+ + + A+ Sbjct: 754 AYGEMLRFAWQFKSFTASFMQNAIGRELYGRGYDFGSLSQNNTFRNNALIRAMRNGNGEL 813 Query: 712 IGVASI--------------KALLRGEDPSLPEVIYDGTLA---NGALLPYMDRLTKLVS 754 +G+A + K +LRG+ P + + T A G L D L + Sbjct: 814 MGIAQLFLWATAFGYLSMQTKLMLRGQTPRPADNVSTWTAAMAQGGGLGILGDFLFGEYN 873 Query: 755 KGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 + L GP S L + L + + + AI T P+MN+ ++ Sbjct: 874 RFGNTPATSLAGPFASDAAQLVN-LFGLTKQGDAKAADYFNFAINHT-PYMNLHVVRPVM 931 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKK 841 D LILNQ+ E ++PG L R Q + K++ Sbjct: 932 DFLILNQMREWMSPGSLQRYQQRVKEE 958 >gi|291336673|gb|ADD96216.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 101 Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust. Identities = 32/97 (32%), Positives = 55/97 (56%), Gaps = 5/97 (5%) Query: 737 LANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN--A 794 L G L Y D L + + +A+ +GP+P+ + S A+ A K E K A Sbjct: 2 LQGGGLGIYTDFLFGNI-QNSTSALATAVGPIPTEAARVLS-ALNYAIK-GEGGKAGKQA 58 Query: 795 TKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYL 831 +I++ +PF+N++Y+K +FD++I Q++E L+PG L Sbjct: 59 YYSIKENIPFLNLFYIKTAFDYMIGYQMMETLSPGSL 95 >gi|157372110|ref|YP_001480099.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] gi|157323874|gb|ABV42971.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] Length = 850 Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust. Identities = 130/622 (20%), Positives = 233/622 (37%), Gaps = 130/622 (20%) Query: 273 ERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQ 332 ERV HFKD A +Y + +GV N+ + S L S ++ + R LG N ++ + Sbjct: 290 ERVLHFKDGVAWHEYNKAYGVG-NLRESVMSGLTSSARTTGVMRVLGTNPENMFGHLFET 348 Query: 333 TIA-----NDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAG 387 A N+ A A D+ GR R+ ++ E++ Y N+ A A Sbjct: 349 QQARLKKLNNPAAEA------DFAGR-----RRALENELSEILGYNSIPANSAIARAGAT 397 Query: 388 LRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLY 447 +R+ G + LG GA++ +DVG Sbjct: 398 IRAVEGMTKLG----GAVISS--------------------------------FNDVGNA 421 Query: 448 AEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQI-GRMTDT 506 A + G N+M+ +G + K+ +S A D+K I + I + + M Sbjct: 422 AMELRYQGMNLMDA-----MGKSIAGKLKGYSAA---DQKEILGYMGIFTDSVRDEMIAK 473 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 ++ D R+ + FFK L+ ++ K+M AR + + + D Sbjct: 474 FSG--DTSVPGRISRLQRTFFK-LNLLNWWTENSRKSMGLVMSNWMARNSKSAWSSMNED 530 Query: 567 LRDLARMS---------------DKIAYHRKKLKNSKTLSPEQR--QELQQQLADLERKE 609 LR + S D + ++ N P++R + + + + Sbjct: 531 LRRVLNSSGITEREWNLYRGMEMDSVRGNQHMTPNGVKYIPDERIAEYVAADGLQVNKAS 590 Query: 610 INILKDKVSNKMHALVLDNVQTSVR--GAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQ 667 I ++ + K+ LD V ++ GA ++ + + GT GEA+R Q Sbjct: 591 IAAARESLEGKLRGYYLDRVLIAMSEPGARTRAMMKQ------GTQPGTPLGEAIRFGGQ 644 Query: 668 FTTTPTGMFLN---------------------ILDLSNSAKMPKGASMALNHVWIQYSAT 706 F + TG F+ L+N+ + G M L ++I +A Sbjct: 645 FKSF-TGSFMQNTIGREIYGRGYTPAELGQSRFTSLANAMRNGNGEKMGLAQLFIWMTA- 702 Query: 707 MALAGIGVASI--KALLRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGG- 763 +G S+ K LL+G+ P + T A + G+ GG Sbjct: 703 -----LGYVSMQTKLLLKGQTPRPADAK---TFLAAAAQGGGLGIMGDFLFGEYNRFGGG 754 Query: 764 ----LLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLIL 819 L GP + + + + L +D + + K PFMN+ ++ + ++LIL Sbjct: 755 LASSLAGPTVGDLDQIRN--LFLRARDGDAKAADLLKFGIDHTPFMNLHVVRPAMNYLIL 812 Query: 820 NQILEELNPGYLDRQQSKKKKK 841 N+ E L+PG L+R + + +K+ Sbjct: 813 NRAQEWLSPGSLERYRQRVEKE 834 >gi|221201510|ref|ZP_03574549.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207934|ref|ZP_03580940.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221172119|gb|EEE04560.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221178778|gb|EEE11186.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 869 Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust. Identities = 82/391 (20%), Positives = 153/391 (39%), Gaps = 64/391 (16%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLD---GKGLSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + +RA D +S+A+R Sbjct: 1 MHQKCVNAVEAAAGRKLTQAEIDGIENRVRAGMRAKARQDPLAWSAMSQADRV----AAG 56 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNKLFFKAGSAEVP 111 E +++L+ +D K+ Q+ + DR+Q +Y + K +A V Sbjct: 57 AEWARQQLVHEAE--LDRMRKQLQIAKQIETTDRIQEALYADPENAHRK---RARETIVK 111 Query: 112 LEMK--------IKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM-KGK-- 157 +++ IK+ + E + G L D D+ E+ +G Sbjct: 112 HDIEQTYVLAGAIKSDYMRQTMGAIEAMKAGQNFLARAFDVDNPAMERDIIREVYRGADG 171 Query: 158 KTQNEQASRLVKQYFETQRELHSQAHEAG-----LDYKFFENRIPQPMSVDKLRATKKDD 212 T NE A +Q +T + + + AG LDY + R Q + + Sbjct: 172 STGNEVAKAAAEQISKTTAAMRERFNRAGGNVGELDYGYVPIRHSQSKVLGNGSDAARHA 231 Query: 213 FVRSMLDWLDLSRYKDIDGTPLSRSEIAS-FVGEVFAERVRSTSFKDPSIPSSEVGVKRE 271 + +++ LD S+Y D G PL+ ++ VGE R+ + +I + GV Sbjct: 232 WADAVMPLLDRSQYLDDAGNPLNDVDLRKMLVGEDREPWERANAAARGNIAPRKQGVWDT 291 Query: 272 F-------------------------ERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELA 306 RV HF+D+ AH+ Y +G + +N ++ + Sbjct: 292 IAYGGINKIVPGETTGSAARANAGSAHRVLHFRDADAHIQYNRQYGEGSLLNALI-DHVG 350 Query: 307 SLSKDIVIARELGPNADSFVKQMIVQTIAND 337 ++K+I + GPN +K + T +D Sbjct: 351 GMAKNIALVERYGPNPTRNMKTQMQLTAVHD 381 Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust. Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 35/235 (14%) Query: 655 GTRAGEALRMFQQFTTTPTGM----FLNILDLSNSAK-----MPKGASMALNHVWIQYSA 705 GT GE + F QF + P M + I ++ S P+ + L + + Y+A Sbjct: 630 GTVTGELKKSFMQFKSFPMAMISRHWGRIGNMRRSGDYLVEGAPRAFGIPLANP-MAYAA 688 Query: 706 TMALAG--IGVASIKA--LLRGEDPSLPEVIYDGTLANGALLPYMDRLTK--------LV 753 + ++ IG S +A LL G+DP E ++D G + LV Sbjct: 689 ALVVSTTLIGAISTQAKNLLAGKDP---EPMFDDVKHAGGFWTRAFSVGGGAGFAGDMLV 745 Query: 754 SKGDRAAIGGLLGPV---PSMVT------NLTSSAVELATKDNENSKVNATKAIRKTLPF 804 + + A G LLG P + T ++S+ + A + + + K + P Sbjct: 746 AAFESADYGSLLGSAVGGPLLSTLFQPLRAISSNVQDAAQGKDTHVGADLLKIAQSNTPL 805 Query: 805 MNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 +N+W+ K ++ LI + + E L+PG R ++ + + E F + G P R P Sbjct: 806 VNLWFWKTVWNRLIWDNLAENLSPGVTQRNMNRSRTQYHNEYFWSPGTGAPQRAP 860 >gi|254251753|ref|ZP_04945071.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] gi|124894362|gb|EAY68242.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] Length = 865 Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust. Identities = 57/232 (24%), Positives = 93/232 (40%), Gaps = 34/232 (14%) Query: 655 GTRAGEALRMFQQFTTTPTGM----FLNILDLSNSAKMPKGASMALNHVWIQYSATM--- 707 GT GE + F QF + P M + I ++ S + L + Y A + Sbjct: 631 GTLQGELQKTFLQFKSFPIAMISRHWGRIGEMRRSGDFRVEGAPTLASP-MAYGAALVVS 689 Query: 708 -ALAGIGVASIKALLRGEDPSLPEVIYDGTLANGALLPYMDRLTK--------------L 752 L G ++ LL G+DP E + D GA + TK L Sbjct: 690 TTLLGALAVQLQNLLLGKDP---EPMGDDVKHGGAF--WFRAFTKGGGAGFAGDMLSAML 744 Query: 753 VSKGDRAAIGGLLG-PVPSM----VTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNM 807 K A+G + G P+ S VT +++A+ A + + + K + +P +N+ Sbjct: 745 TGKNPAEAVGSVFGGPLVSTAIQAVTPFSNNAMAAAEGKDTHLSADLLKFAQSNMPIVNL 804 Query: 808 WYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 WY K ++ LI + I E L+PG R +K +++ + F P R P Sbjct: 805 WYWKTVWNRLIWDNIAENLSPGVTSRNVAKSRQQYHNDYFWEPGTSAPQRAP 856 Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust. Identities = 80/360 (22%), Positives = 137/360 (38%), Gaps = 59/360 (16%) Query: 14 GRELSKKELRRLEDGI---VRAYVSLD---GKGLSKAERYRLAGLKAEEDFQKELIRSVN 67 GR+L K EL +E+ + +RA D + +++AER + A + + E Sbjct: 14 GRDLKKAELDGIENRVRAGMRAVARQDPAAWRSMTEAERVQAGAEWARQQLEAEA----- 68 Query: 68 DAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNKLFF-KAGSAEVP----LEMKIKAA 119 +D+A K+ Q+ + DR+Q ++ + + K KA A++ L IKA Sbjct: 69 -NLDKARKQLQIAKQIETTDRIQEALFADPERAYAKRAREKAVKADIERTYELAGGIKAD 127 Query: 120 ETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM-KGK--KTQNEQASRLVKQYFE 173 + E + G L D D+ E+ +G T NE A +Q Sbjct: 128 YMRQTMDAIEAMKHGQNFLARAFDIDNPAMERDIIREIYRGADGSTGNEVAKAAAQQIGA 187 Query: 174 TQRELHSQAHEAG-----LDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKD 228 T + + + AG LDY + R Q + + + +L LD S+Y D Sbjct: 188 TSNAMRERFNRAGGNVGQLDYGYVPIRHSQAKILGNGSDAARHAWADFVLPRLDRSQYLD 247 Query: 229 IDGTPLSRSEI---------ASFVGEVFAERVRSTSFKDPSI-------------PSSEV 266 G PL + + S+ A R + + P Sbjct: 248 DAGNPLDDAALRRVLTGEDRESWEARNIAARGMGVEPRQQGVWDTIAYGGVNKIVPGETT 307 Query: 267 GVKREF-----ERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPN 321 G RV HFKD+ AH++Y +G + +N ++ + ++K+I + GPN Sbjct: 308 GAAARANAGSQHRVLHFKDADAHIEYNRAYGEGSLLNALI-DHVGGMAKNIALVERYGPN 366 >gi|146276496|ref|YP_001166655.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] gi|145554737|gb|ABP69350.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] Length = 830 Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust. Identities = 57/283 (20%), Positives = 114/283 (40%), Gaps = 29/283 (10%) Query: 133 VGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDY-KF 191 VG +G + + D+ E+ + + N QA + Q+ + + G D + Sbjct: 130 VGLNVIGSSRNPVLLRDLIRELHAEASGNAQAKAMADAVRTVQQRMRRAFNSYGGDIGEI 189 Query: 192 FENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID-GTPLS-------RSEIASFV 243 + +P +R + + + L R D + G P + R+ F+ Sbjct: 190 ADYGVPHSHDAGAMRQAGFEAWAAEIEQRLAWDRIVDFNTGQPFAAPGQVPPRAVSGRFL 249 Query: 244 GEVFAERV-RSTSFKDPS--IPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 +V+ V R +DPS + + +R R+ HF+ ++Y + FG S + + Sbjct: 250 KDVYEGIVTRGWDDRDPSLAVGGKALANQRAERRLLHFRSGSDWIEYNKAFGASDPFSAM 309 Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 + L L++D+ + R LGP+ + ++ +A + A+ GN+ KLE R Sbjct: 310 MNG-LHGLARDVALMRVLGPSPKAGLE--YAAQVAKKRAATIGNQ---------KLEARV 357 Query: 361 EAMLQMWEVMRY-----GETVENTGWANWMAGLRSAAGASMLG 398 + ++ + M + GWA + +G R+ + LG Sbjct: 358 DTQSKVAKAMLMHLDGSANVPDRAGWAAFFSGTRAVLTSIQLG 400 >gi|262043550|ref|ZP_06016663.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039084|gb|EEW40242.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 143 Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust. Identities = 23/81 (28%), Positives = 47/81 (58%), Gaps = 5/81 (6%) Query: 766 GPVPSMVTNLTSSAVELATKDNENSKVNAT-----KAIRKTLPFMNMWYLKNSFDHLILN 820 GPV S++ S+A + T + ++ +A + PF+N+++L+ + + LILN Sbjct: 47 GPVTSLMGPAASNADSIITLLQQTTRGDADLGDWYRTALDNTPFLNVFWLRTAMNGLILN 106 Query: 821 QILEELNPGYLDRQQSKKKKK 841 +I + L+PG L+R Q + +++ Sbjct: 107 RIQDALDPGSLERYQRRVERE 127 >gi|291336683|gb|ADD96225.1| hypothetical protein Rsph17025_0444 [uncultured organism MedDCM-OCT-S08-C1350] Length = 850 Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 88/203 (43%), Gaps = 13/203 (6%) Query: 153 EMKGKKTQNEQASRLVKQYFETQRELHSQAHEAG---LDYKFFENRIPQPMSVDKLRATK 209 E+ G+ T N A +L + ET L + ++ G L K + +PQ +R + Sbjct: 161 ELMGETTGNVNAKQLADAWRETAEHLRKRFNKFGGKVLSRKDWG--LPQIHDSLLVRQSS 218 Query: 210 KDDFVRSMLDWLDLSR-YKDIDGTPLSRSEIASFVGEVFAERVRS--TSFKDPSIPSSEV 266 K D++ +L LDL + + G P + I + EV+ +FK + Sbjct: 219 KADWIDYILPKLDLDKMVNERSGLPFNDKTIREALSEVYDNIATEGMATFKPGTAGYGRA 278 Query: 267 GVKREFE-RVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNAD-- 323 R + R FK++ M+Y FG T++ + ++++DI + + LGPN D Sbjct: 279 LHNRRIDHRFLAFKNADDWMEYQTRFGSPDPFKTMM-EHINAMARDISMLKILGPNPDAT 337 Query: 324 -SFVKQMIVQTIANDQEASAGNK 345 ++ MI + + D A A K Sbjct: 338 HTWALGMIKKQMKIDAAAEAQGK 360 >gi|251799040|ref|YP_003013771.1| NADH:flavin oxidoreductase/NADH oxidase [Paenibacillus sp. JDR-2] gi|247546666|gb|ACT03685.1| NADH:flavin oxidoreductase/NADH oxidase [Paenibacillus sp. JDR-2] Length = 343 Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust. Identities = 39/126 (30%), Positives = 57/126 (45%), Gaps = 24/126 (19%) Query: 56 EDFQKELIRSVNDAID-------EAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSA 108 E F+ + R+V +D Y HQ S L +A YG+ + LF K KA + Sbjct: 143 EKFRLAVRRAVQAGVDTIEIHGAHGYLIHQFVSPLTNKRADKYGQDRTLFGKEVIKAAKS 202 Query: 109 E----VPLEMKIKAAETKVLSKFNEYAEVG---SKNLGFTLD-KQFGLDVFDEMKGKKTQ 160 E +PL M+I A EYAE G ++++ F + K+ G+DVF G + Q Sbjct: 203 EMPAHMPLFMRISA---------REYAEGGYGINESIAFAKEFKEAGVDVFHISAGGEGQ 253 Query: 161 NEQASR 166 A R Sbjct: 254 IAAAGR 259 >gi|253578526|ref|ZP_04855798.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251850844|gb|EES78802.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 393 Score = 40.8 bits (94), Expect = 0.94, Method: Compositional matrix adjust. Identities = 45/162 (27%), Positives = 71/162 (43%), Gaps = 26/162 (16%) Query: 480 GAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIK 539 G + LDK + +IV G + D +A +++ A + S +TDFT+ Sbjct: 169 GIQMLDKTDVD--VIIVGRGGGSIEDLWAFNEEIVARAIFECSTPIISAVGHETDFTIAD 226 Query: 540 RAKAMSSPDGYLYARTPSTIKNLKDADLRDLAR------------MSDKIAYHRKKLKNS 587 A + +P TPS L D R + MS K +R +L++ Sbjct: 227 FAADLRAP-------TPSAAAELAVDDYRSVIEAVSIYRQRLYRAMSGKTDLYRSRLEHF 279 Query: 588 KT----LSPEQR-QELQQQLADLERKEINILKDKVSNKMHAL 624 +T LSPE R +E +Q+LADLE N + K+ ++ H L Sbjct: 280 QTKFAYLSPENRLREQRQRLADLENAVQNGMNRKLQDERHRL 321 >gi|171315464|ref|ZP_02904701.1| ABC transporter related [Burkholderia ambifaria MEX-5] gi|171099464|gb|EDT44199.1| ABC transporter related [Burkholderia ambifaria MEX-5] Length = 510 Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust. Identities = 46/158 (29%), Positives = 77/158 (48%), Gaps = 22/158 (13%) Query: 218 LDWLDLSRYKDID--GTPLSRSEIASFVGEVFAERV---RSTSFKDPSIPSSEVGVKREF 272 L+ LSR + I G L R EI F G + A R R+ DP + + E+ V + Sbjct: 267 LEVSGLSRGRAIRDVGFTLRRGEILGFAGLMGAGRTEVARAVFGADP-VDAGEIRVHGKI 325 Query: 273 ERVFHFKDSQAH-MDYM----EHFGVSTNVNTILTSELASLSKDI-----VIARELGPNA 322 + D+ AH + Y+ +HFG++ ++ L+S+ + + V ARE+ A Sbjct: 326 VTIRTPADAVAHGIGYLSEDRKHFGLAVGMDVQNNIALSSMRRFVRRGMFVDAREMRDIA 385 Query: 323 DSFVKQMIVQTIANDQEA---SAGNK---VLKDWLGRN 354 S+V+Q+ ++T + Q A S GN+ V+ WL R+ Sbjct: 386 QSYVRQLAIRTPSVAQPARLLSGGNQQKIVIAKWLLRD 423 >gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 56 Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust. Identities = 14/40 (35%), Positives = 27/40 (67%) Query: 801 TLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK 840 T+PF N+WY K+ FD+ + ++ + +NPG R ++ ++K Sbjct: 8 TVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRK 47 >gi|159896788|ref|YP_001543035.1| hypothetical protein Haur_0255 [Herpetosiphon aurantiacus ATCC 23779] gi|159889827|gb|ABX02907.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC 23779] Length = 563 Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust. Identities = 26/80 (32%), Positives = 45/80 (56%), Gaps = 7/80 (8%) Query: 549 GYLYARTPSTIKNL---KDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 G A P+ N+ ++A DL R++D+I R+++ +TLSPE+R EL +QLA+L Sbjct: 152 GITSAVLPNAQSNVLAEREAVKTDLERIADRIDQTREEIAKDETLSPEERVELDRQLAEL 211 Query: 606 ERKEINILKDKVSNKMHALV 625 + L++ ++ AL Sbjct: 212 SKD----LRENTGSREDALA 227 >gi|226312537|ref|YP_002772431.1| linear pentadecapeptide gramicidin synthetase LgrD [Brevibacillus brevis NBRC 100599] gi|226095485|dbj|BAH43927.1| linear pentadecapeptide gramicidin synthetase LgrD [Brevibacillus brevis NBRC 100599] Length = 5085 Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust. Identities = 32/112 (28%), Positives = 57/112 (50%), Gaps = 8/112 (7%) Query: 511 KDLKADPRLDPSIKAFFKQL-DDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRD 569 KDLK + LDP+I+A + D + F +A ++ G+L A + + DAD+ Sbjct: 4690 KDLKDEVILDPAIQAEHPYVGDPSQF----QAALLTGATGFLGAFLLRDLLQMTDADIYC 4745 Query: 570 LARMSDK---IAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVS 618 L R SD+ +A R+ L+ + + EQ + + DL + +N+ +D+ S Sbjct: 4746 LVRASDEEEGMARLRQTLELYELWNEEQAHRIIPVIGDLAKPRLNLSEDQFS 4797 >gi|254515568|ref|ZP_05127628.1| methylcrotonoyl-CoA carboxylase beta chain [gamma proteobacterium NOR5-3] gi|219675290|gb|EED31656.1| methylcrotonoyl-CoA carboxylase beta chain [gamma proteobacterium NOR5-3] Length = 544 Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust. Identities = 59/233 (25%), Positives = 96/233 (41%), Gaps = 48/233 (20%) Query: 557 STIKNLKDADLRDLARMSDKIAYHRKK---LKNSKTLSPEQRQELQQQLADLERKEINIL 613 S+I N +A R+ SD +A K L+ S+TLS R +++ L R+ + L Sbjct: 10 SSINNASEAFARN---RSDHLALIEKMNGILERSRTLSDAARPRFEKRGQLLPRERLARL 66 Query: 614 KDKVS-----NKMHALVLD--NVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQ 666 D S M +LD N +TSV GA ++ + Y +GTR M Q Sbjct: 67 LDPGSPFLEIGNMAGYLLDDTNPETSVPGA--------TQIAGIGYVQGTRC-----MIQ 113 Query: 667 ---------QFTTTPTGMFLNILDLSNSAKMP-------KGASMALNHVWIQYSATMALA 710 T T T I+D++ K+P GA++ ++Y M +A Sbjct: 114 VDDSGINAGAMTRTSTRKGCRIMDIALQQKLPFLHLVESAGANL------LEYEVEMWMA 167 Query: 711 GIGVASIKALLRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGG 763 G G+ + A L + V++ + A GA +P + V + +A + G Sbjct: 168 GGGIFARLARLSAAGLPVITVLHGASAAGGAYMPGLSDYVVGVKENGKAYLAG 220 Searching..................................................done Results from round 2 >gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1] gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus] Length = 864 Score = 918 bits (2373), Expect = 0.0, Method: Composition-based stats. Identities = 864/864 (100%), Positives = 864/864 (100%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK Sbjct: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 Query: 61 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE 120 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE Sbjct: 61 ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE 120 Query: 121 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS 180 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS Sbjct: 121 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS 180 Query: 181 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA 240 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA Sbjct: 181 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA 240 Query: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI Sbjct: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ Sbjct: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 Query: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG 420 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG Sbjct: 361 EAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVG 420 Query: 421 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG 480 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG Sbjct: 421 IDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSG 480 Query: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR 540 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR Sbjct: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR 540 Query: 541 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ 600 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ Sbjct: 541 AKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ 600 Query: 601 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE 660 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE Sbjct: 601 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE 660 Query: 661 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL 720 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL Sbjct: 661 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL 720 Query: 721 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV 780 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV Sbjct: 721 LRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAV 780 Query: 781 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK 840 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK Sbjct: 781 ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK 840 Query: 841 KGIELFQNMDEGLPHRLPFPFGED 864 KGIELFQNMDEGLPHRLPFPFGED Sbjct: 841 KGIELFQNMDEGLPHRLPFPFGED 864 >gi|332344341|gb|AEE57675.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 824 Score = 776 bits (2002), Expect = 0.0, Method: Composition-based stats. Identities = 199/883 (22%), Positives = 353/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L + E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDYDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 + +RGT GE R F + P + + + MP A + Sbjct: 606 ERMFVGSGLQRGTWKGELTRSVFLFKSFPISVVMRHWHR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I L+ G +P ++ + + L G Y D L ++ Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 824 Score = 775 bits (2001), Expect = 0.0, Method: Composition-based stats. Identities = 200/883 (22%), Positives = 353/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E++ F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSEFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGS 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L + E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q +RGT GE R F + P + + + MP A + Sbjct: 606 EQMFVGSGLQRGTWKGELTRSVFLFKSFPISVVMRHWHR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I L+ G +P ++ + + L G Y D L ++ Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605] gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605] Length = 824 Score = 775 bits (2000), Expect = 0.0, Method: Composition-based stats. Identities = 199/883 (22%), Positives = 354/883 (40%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E + R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALNKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQGAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMEL--LSDVGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELVRARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + MP A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWHR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I L+ G +P ++ + + L G Y D L ++ Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVVGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 824 Score = 773 bits (1996), Expect = 0.0, Method: Composition-based stats. Identities = 200/883 (22%), Positives = 354/883 (40%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDQMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINNYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGS 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L + E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q +RGT GE R F + P + + + MP A + Sbjct: 606 EQMFVGSGLQRGTWKGELTRSVFLFKSFPISVVMRHWHR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDP------SLPEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I L+ G +P ++ + + L G Y D L ++ Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 825 Score = 769 bits (1985), Expect = 0.0, Method: Composition-based stats. Identities = 204/883 (23%), Positives = 353/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R SL + L+ AER R AG A Sbjct: 2 MRQECIQAVQQAAKRTLTAREIQDIEDRIYRNMRSLARDDPASWRQLTDAERLRRAGQLA 61 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 ++ Q+E R +L + ++ Q G GK AL + F A S + + Sbjct: 62 SDELQREAALKKRRVALTISARQRLDNFINNYQ-GADGKLGALNRTIAFSADGKSNFLSV 120 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 121 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVFEMRGQNTGNAKARKGAKAW 180 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 181 GEVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVTKDKWVSDVIGKLDRKYYTRSD 240 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G +S SE+ +F+GE + K D + S R R HFKD+ +++ Y Sbjct: 241 GQLMSDSELTAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 300 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 301 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 359 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N A W +R+ AS LG + + + Sbjct: 360 VE-----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 412 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELL--SDVGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 413 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELALARRAGLAMESLLGSVNRWAMDNMG 472 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 473 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 531 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 532 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 575 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 576 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 606 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q +RGT GE R F + P + + + MP A + Sbjct: 607 EQMFVGSGLQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--F 662 Query: 704 SATMALAGIGVASIKALLRGEDPS------LPEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I L+ G +P + + + L G Y D L ++ Sbjct: 663 LASTTMLGALSMQITDLINGRNPKEMTGDHMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 722 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 723 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 782 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 783 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 825 >gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14] Length = 824 Score = 737 bits (1903), Expect = 0.0, Method: Composition-based stats. Identities = 200/883 (22%), Positives = 353/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S V R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGVRANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGS 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L + E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANKTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + MP A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G + L G +P + L G L Y D L ++ Sbjct: 662 IASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] Length = 824 Score = 737 bits (1901), Expect = 0.0, Method: Composition-based stats. Identities = 198/883 (22%), Positives = 351/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTDLLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N + W +R+ AS LG + + + Sbjct: 359 VE-----RLANNTENLYNF--ISGKTQPVANPHISRWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + +P A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGIPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G + L G +P + L G L Y D L ++ Sbjct: 662 IASTTILGALSQQLNDLASGRNPREMTGGDAAKFWLGALLKGGGLGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1] gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252] Length = 824 Score = 734 bits (1894), Expect = 0.0, Method: Composition-based stats. Identities = 201/883 (22%), Positives = 350/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRMLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARNGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E++SF+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSSFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K AK +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAKQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + MP A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G + L G +P + L G L Y D L ++ Sbjct: 662 IASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v] Length = 824 Score = 733 bits (1892), Expect = 0.0, Method: Composition-based stats. Identities = 199/883 (22%), Positives = 350/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYIRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K A+ +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + MP A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G + L G + + L G L Y D L ++ Sbjct: 662 IASTTILGALSQQLNDLASGRNHREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + NE + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2] Length = 824 Score = 733 bits (1892), Expect = 0.0, Method: Composition-based stats. Identities = 201/883 (22%), Positives = 350/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRMLTAREIQNIEDRIYRNMRSIARDDPMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A K + Sbjct: 120 ESRTKATRDYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQTTGNAKARNGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGRLDRKYYIRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E++SF+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDAELSSFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ Q A A+ Sbjct: 300 QQLYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 + +L E + + + V N A W +R+ AS LG + + + Sbjct: 359 VE-----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSI 523 + + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + DTD++V K AK +G TP +I + D+ ++ L Sbjct: 531 ----KGITDTDWSVWKLAKQEDWGNGNNTMLTPESIMRIPDSAVKHLG------------ 574 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 E +K + K+ V + V +V T Sbjct: 575 -------------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 Q + +RGT GE R F + P + + + MP A + Sbjct: 606 EQLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--F 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G + L G +P + L G L Y D L ++ Sbjct: 662 IASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V ++ A + +E + + K + +P N+WYLK + Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGLMPGANLWYLKAA 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 782 LDHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str. E2348/69] gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 824 Score = 726 bits (1873), Expect = 0.0, Method: Composition-based stats. Identities = 199/882 (22%), Positives = 347/882 (39%), Gaps = 85/882 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + L+ AER R AG A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDPMSWRQLNDAERLRRAGQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E+ R +L + ++ Q G GK AL + F A S + + Sbjct: 61 AEELQREVALKKRRVALTIAARQRLDNFINSYQ-GADGKLGALNRTIAFSADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+KT N +A + K + Sbjct: 120 ESRTKATRDYALSQLQEVFEAVDPRFFGLFEDEAGVRDLVFEMRGQKTGNAKAMKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Y D Sbjct: 180 GEVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRKYYTRAD 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G ++ +E+++F+GE + K D + S R R HFKD+ +++ Y Sbjct: 240 GQLMNDTELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ L +SKDI + GPN D + ++ QT + A+ Sbjct: 300 QQMYG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQTKSETATANP----- 353 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 +D + E + + + V N A W +R+ AS LG + + + Sbjct: 354 QDTGSIERQANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWMVASRLGSALLSSFSD 411 Query: 408 DGFIS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDA 464 G + ++ + +++ ++ M R EL GL E ++ + Sbjct: 412 LGTMYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIK 524 + + + + SG ++ + + +G + LK L D Sbjct: 472 PSVSRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGDVVTRTPDLKSLSNDDFRILKS- 530 Query: 525 AFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKL 584 K + DTD++V K A+ G TP +I + DA + L Sbjct: 531 ---KGITDTDWSVWKLAQQEDWGKGNDTMLTPESIMRIPDAAVEHLG------------- 574 Query: 585 KNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDR 644 +K + K+ V + V +V T Sbjct: 575 ------------------------SPERVKFEAMRKLLGAVTEEVDMAV----ITPGARE 606 Query: 645 QRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYS 704 Q + +RGT GE R F + P + + + MP A + Sbjct: 607 QMVTGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--FI 662 Query: 705 ATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGDR 758 A+ + G + + G +P + L G L Y D L ++ Sbjct: 663 ASTTILGALSQQLNDMASGRNPRDMVGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A+ +LGPV +V ++ + +E + + K + P N+WYLK + Sbjct: 723 GALASMLGPVAGLVDDVIKIGQGIPLNAVEGKSEQTGGDLVKLGKGLTPGANIWYLKAAL 782 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH+I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 783 DHMIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 824 >gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans'] gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 824 Score = 714 bits (1841), Expect = 0.0, Method: Composition-based stats. Identities = 183/883 (20%), Positives = 345/883 (39%), Gaps = 87/883 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + A+ R L+ E++ +ED IV+ L + LS++ER + AG A Sbjct: 1 MRQECIQAITAASKRTLTSAEIQGIEDRIVKNMRHLARNDPTSWRSLSESERMQRAGHMA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E ++E R +L + + + G GK +AL + F A + + + Sbjct: 61 AEALEREATLKKRRVALTIAARQRLDNFIAGYK-GKGGKLEALNRTIAFHADGKAPFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ +E ++ + + DKQ D+ EM+G+ T N +A + + + Sbjct: 120 ESRTKATRDYALSQLDELFSAIDPRFFQLFEDKQGISDLVYEMRGQDTGNVRAKKGAEAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 L + ++AG D E+ +PQ S++K+ + D+V ++ LD ++Y + Sbjct: 180 KNVSELLRRRFNDAGGDIGHLEDWGMPQHHSMEKVGKATQSDWVGFVMGKLDRNKYVKEN 239 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G +S ++A F+G + K D S R ER HFKD++ ++ Y Sbjct: 240 GELMSDKDVADFLGHAYKTIATGGMNKLGDSGRRLSGARANRGNAERQIHFKDAEGYLAY 299 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + FG ++ IL + L +SKDI + GPN D + ++ + A + + Sbjct: 300 QQRFG-EKSMWDILVNHLDGMSKDIALVETYGPNPDQVFRSLLDELAAKTADETPSRTGK 358 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 KL+ + E + + + + N A W +R+ AS LG I +L + Sbjct: 359 -----IKKLKNKTEDLYNF--IAGKTQPIANPHIARWADHVRNWLVASRLGSALISSLSD 411 Query: 408 DGFISRQM-LSRVGIDKEAIQRINKMPLK--ERMELLSDVGLYAEGVVAHGRNMMEGSDA 464 +G + ++ + + + ++ M + + L GL E ++ + Sbjct: 412 NGTMYLTAKVNNLPMAQLLRNQLAAMNPANKDEIRLARGAGLAMETLLGSVNRWATDNMG 471 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL-KADPRLDPSI 523 + + + + SG ++ + + IG + +A + + D R+ S Sbjct: 472 PSPSRWVANAVMRASGLSAWSDAHKRAYGVTMMGGIGNLVRKHADIAKIADEDARILKS- 530 Query: 524 KAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 K + D+ + K A+ +G TP +I + + L L Sbjct: 531 ----KGISSQDWKIWKLAEQEDWGNGNTTMLTPESIMRIPNEKLAALGN----------- 575 Query: 584 LKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFD 643 +K + K+ V + V +V T Sbjct: 576 --------------------------AERVKFEAMRKLLGAVSEEVDMAV----VTPGAR 605 Query: 644 RQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQY 703 + + +RG GE +R F + P + + + MP A + Sbjct: 606 ERMVTGAAMQRGDWRGELVRSVFLFKSFPIAVMMRHWSR--ALNMPSAGGRAAY--LAAF 661 Query: 704 SATMALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGD 757 A+ + G I ++ G +P + + L G Y D L ++ Sbjct: 662 LASTTVLGAMSQQISEVIAGRNPRDITGDKALQFWVNAFLKGGGAGLYGDFLLSDHTRYG 721 Query: 758 RAAIGGLLGPVPSMVTN----LTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNS 813 A+ +LGPV +V + L + E + + K + +P N+WY K Sbjct: 722 SGALASMLGPVAGVVDDAIKLLQGIPLNAVEGKPEQTGGDLVKFAKGMIPGQNLWYTKAV 781 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 FDH++ NQ+ E +PGYL R + + +K+ + + + LP Sbjct: 782 FDHMVFNQLQEIFSPGYLRRMEKRSRKEFNQTYWWRPQDRLPQ 824 >gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] Length = 823 Score = 667 bits (1720), Expect = 0.0, Method: Composition-based stats. Identities = 174/882 (19%), Positives = 339/882 (38%), Gaps = 87/882 (9%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ CI+ + A+ R+L+ +E++ +ED I+ + +L + LS++ER + AG A Sbjct: 1 MRTACIEAIQNASKRQLTAREVQNIEDRIISSMRNLARNDPASWRLLSESERLQRAGQMA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 + Q+E R +L ++ Q K +AL + F A S + + Sbjct: 61 ATELQREADLKQRRVALTIAARQRLDEHINNFQG---SKLEALNRTIAFSADGKSNFMSV 117 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V K D+ D+ EMKG+ T+N +A + + Sbjct: 118 ETRAKATINYALSQLQEAFEAVDPKFFQLFEDQNGVRDLIFEMKGQDTRNVRAKKGAAAW 177 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 L + + AG D E+ +PQ S+ ++ +D +V ++ LD ++Y D Sbjct: 178 HNVTGMLRNSFNRAGGDIGHLEDWGLPQSHSMQRVGKVTQDKWVSDVIGKLDRNKYIKED 237 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G+ ++ +E+ F+ + K D I S + R R HFKD++++++Y Sbjct: 238 GSVMNDAELKQFLDSAYETIATGGLNKINDRPIGVSGMRANRGNASRQIHFKDAESYLEY 297 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + +G ++ I+ + +SKDI + GPN D + ++ + + + + Sbjct: 298 QQLYG-EKSLWDIMVGHIEGISKDIGLIETYGPNPDHVFQSLLNEVTEIEVKGTPSKTGK 356 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 L R E + + V N A + LR+ AS LG + + + Sbjct: 357 -----IKNLRDRTENLYNF--ISGKTTPVANVHIAKFFDDLRNILIASRLGSALLSSFSD 409 Query: 408 DGFISRQM-LSRVGIDKEAIQRINKMPLK--ERMELLSDVGLYAEGVVAHGRNMMEGSDA 464 G + ++ + + ++ + + + L GL E ++ + Sbjct: 410 LGTMYLTAKVNNLPSAQLLKNQLAALNPANKDELRLARRAGLSMETLLGSINRWANDNMG 469 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIK 524 + + + SG + + + IG + + +A +K + Sbjct: 470 PSFARWSANAVMRASGLSAWSDAHKRAFGVTMMGSIGDVVNRHADIKSIGEHDLAIMKS- 528 Query: 525 AFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKL 584 K + +TD+T+ + A+ +G TP +I ++ + L + Sbjct: 529 ---KGITETDWTIWRLAEQEDWGNGNNTMLTPESIMHIPNERLTEFGN------------ 573 Query: 585 KNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDR 644 PE+ +K + + K+ V + V +V + Sbjct: 574 -------PER------------------VKFEAARKLLGAVTEEVDMAV----ISPGARE 604 Query: 645 QRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYS 704 + + +RG GE +R F F + P + + + + G L + Sbjct: 605 RMMIGAGLQRGDWKGEIVRSFFLFKSFPISVVVRHWKRALGIQSAGGRVAYLA----AFI 660 Query: 705 ATMALAGIGVASIKALLRGEDPSLP------EVIYDGTLANGALLPYMDRLTKLVSKGDR 758 A + G I + G +P + + L G L Y D L +K Sbjct: 661 AGTTVLGAISQQINDISSGRNPRDMADENWHKFWLNALLKGGGLGLYGDFLLSDHTKYGS 720 Query: 759 AAIGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSF 814 A LLGPV +V + A + E + + K ++ +P N+WY K Sbjct: 721 DAFASLLGPVAGVVDDAIKLAQGIPLNAVEGKPEQTGGDTVKFVKGLIPGQNLWYTKAVL 780 Query: 815 DHLILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 DH++ NQ+ E +PGYL R + + KK+ + + + P+ Sbjct: 781 DHMVFNQLQEYFSPGYLRRMEKRSKKEFNQTYWWRPQDITPN 822 >gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15] gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15] Length = 918 Score = 658 bits (1698), Expect = 0.0, Method: Composition-based stats. Identities = 195/939 (20%), Positives = 359/939 (38%), Gaps = 110/939 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGK-------GLSKAERYRLAG-- 51 MK C++ + + GR+ EL+ +ED I A + K G+ A+ Y A Sbjct: 1 MKQACVEAIAQTLGRQPKADELKGIEDRIKEAVRQVHKKNAKEGKTGIPDAQTYMEAADL 60 Query: 52 --LKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE 109 + D K+ R +AI + L +++ Q Q +F G Sbjct: 61 VRQRVVHDVYKKRQRVAQNAIAISRVTDTLDANIPPEQQTPANLQQFIFAGRRTTDGKDI 120 Query: 110 V-----------------PLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGL---- 148 L ++ A V F + +G + D+Q Sbjct: 121 AVTSAEELSTGAYQDWSRQLSAELLKAGDDVRKFFEQSKALGEQRFRSLFDQQAAKSAQF 180 Query: 149 DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRA 207 + E+ G+ T N QA ++ + + + + ++ G D ++ +P D +R Sbjct: 181 QILKELYGEDTGNPQAKKIAQVWNDVTSRARQEMNDNGFDIGLRDDWHLPYVDDADFIRN 240 Query: 208 TKKDDF----------------------------VRSMLDWLDLSRYKDIDGTPLSRSEI 239 +D++ V + + D S Y + DG+P++ E Sbjct: 241 AGRDEWLASLPAAERAKAQLSGRQPPIEFARQAWVDDVYNTQDRSNYVNPDGSPMNDIEY 300 Query: 240 ASFVGEVFAERVRSTSFK-DPSIPSSEVGVKREFE--RVFHFKDSQAHMDYMEHFGVSTN 296 + +F + + K DP G+K RV FKD+Q+H YME + Sbjct: 301 RQALEAIFETKATDGANKIDPGAFMGTGGIKNRGSQNRVMAFKDAQSHFAYMERY-TQQP 359 Query: 297 VNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKL 356 V ++ S L S S+D+ + + GP+A ++ + Q A G K + + Sbjct: 360 VAGVMMSHLQSSSRDLGVVKAFGPDAARNFSLVLDRVY---QRAVTGGKAV------GHM 410 Query: 357 EVRQEAMLQMWEVMRYGETVE-NTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQM 415 ++ + +M+ M ++ + + + GLR+ ++MLG + A + I R Sbjct: 411 NEERKMVERMFNSMAGLNGAATSSVFTSAVGGLRNLMTSAMLGTSVLTATSDQA-IMRAN 469 Query: 416 LSRVGIDKEAI----QRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKL 471 +G ++ + I + + +++GL + A M + I Sbjct: 470 AQALGFTRDGMRLSANTIKNLFSGDAKRANAELGLLVDSHAAVVSKMGGFDLSRGITGWF 529 Query: 472 HSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLD 531 K KWSG +D+ ++ L++Y IG +T + +L D+K + + K Sbjct: 530 AEKTLKWSGLIAMDRANKAAFGLLMYKNIGELTRKFKTLDDVKGSDKTILAN----KGWS 585 Query: 532 DTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA--RMSDKIAYHRKKLKNSKT 589 + D+ ++ A+ TP I + D + + R++ A + L Sbjct: 586 NEDWAIMAAAELQPMTTAGHMGMTPDAIYAVPDNVITGIMADRIAQVRAGSEEVLAALGD 645 Query: 590 LSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLG 648 L PE+ + ++Q A+ E+ ++++ +L + A+ T+ G Sbjct: 646 LPPERLKRMRQAFDAEAEQTITRMVRNARVEAAQK-LLGITHGEMTSAVTTA------TG 698 Query: 649 LLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA-KMPKGASMALNHVWIQYSATM 707 L TY R AG+ ++ F F TTP F +++ +N +P +A Y A Sbjct: 699 LDTYARDD-AGQLIKSFMLFKTTPFAGFRQLVNRANDLDTVPAIKFLA------SYIAGT 751 Query: 708 ALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGL 764 LAG+ + +LL G DP + P L G+ Y D L + ++ + + Sbjct: 752 TLAGMFANQMNSLLTGNDPLDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSSIAATI 811 Query: 765 LGPVPSMVTNLTSSAV----ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILN 820 GPV S LT + + + + +A K R PF N+WY K +HLIL Sbjct: 812 GGPVLSFAEQLTKLLITNPQKALQGEETSFGADALKTARMITPFANLWYAKAITNHLILQ 871 Query: 821 QILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 Q+ E NPGY DR + + +++ + P R P Sbjct: 872 QLQEMANPGYNDRVRDRAQREFNTTSWWEPGSTTPRRAP 910 >gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 918 Score = 646 bits (1665), Expect = 0.0, Method: Composition-based stats. Identities = 193/939 (20%), Positives = 363/939 (38%), Gaps = 110/939 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGK-------GLSKAERYRLAG-- 51 MK C++ + + GR+ EL+ +ED I A + K G+ A+ Y A Sbjct: 1 MKQACVEAIAQTLGRQPKADELKNIEDRIKEAVQHVHRKNAKEGKSGIPDAQTYMDAAEL 60 Query: 52 --LKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE 109 + D K+ R +AI + L +++ Q Q +F + + Sbjct: 61 VRQRVVHDVYKKRQRVAQNAIAISKITDTLDANIPPDQQTPVNLQQFIFAGRRSRDKADI 120 Query: 110 V-----------------PLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGL---- 148 L ++ A V F + +G + D+Q Sbjct: 121 SVTSAEELAIGAYQDWSRQLSAELLKAGDDVRKFFEQSRALGEQRFRSVFDRQAAKSAQL 180 Query: 149 DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRI-PQPMSVDKLRA 207 + E+ G+ T N A ++ + + + + + ++ G D E+ P D +R Sbjct: 181 QILKEIYGEDTGNPLAKKIAQIWKDVTGRVRHEMNDNGFDIGLREDWHTPYVDDADLIRN 240 Query: 208 TKKDDFVRSM---------------------LDWL-------DLSRYKDIDGTPLSRSEI 239 +++++ S+ W+ D S Y + DG+ ++ E Sbjct: 241 AGREEWLASLPVAEQATARLSGRQPPIEFARQKWVDDAYNTQDRSNYVNPDGSIMNDVEY 300 Query: 240 ASFVGEVFAERVRSTSFK-DPSIPSSEVGVKRE--FERVFHFKDSQAHMDYMEHFGVSTN 296 + +F + + K +P G+K RV FKD+Q+H YME + Sbjct: 301 RQALEAIFETKATDGANKIEPGTFMGAGGIKSRGSQHRVMAFKDAQSHFAYMERY-TQQP 359 Query: 297 VNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKL 356 + ++ S L S S+D+ + + GP+A+ ++ + + A G K K+ KL Sbjct: 360 LVGVMMSHLQSSSRDLGVVKAFGPDAERNFSLVLDRIY---KRAVTGGKRKKEMEDEAKL 416 Query: 357 EVRQEAMLQMWEVMRY-GETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQM 415 + +M+ M ++ +++ + GLR+ ++MLG + A + I R Sbjct: 417 ------VARMFNSMAGLNGVASSSVFSSAVGGLRNLMTSAMLGTSVLTATSDQA-IMRAN 469 Query: 416 LSRVGIDKEAI----QRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKL 471 +G + + I + + + +++GL + A M + I Sbjct: 470 AQALGFTRGGMRLSVNTIKNLFSGDAKKANAELGLLVDSHAAVVSKMGGFDLSRGITGWF 529 Query: 472 HSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLD 531 K KWSG +D+ +S L++Y IG +T + +L D+K + + K Sbjct: 530 AEKTLKWSGLIAMDRANKASFGLLMYKNIGELTRKFKTLDDMKGTDKTILAN----KGWS 585 Query: 532 DTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA--RMSDKIAYHRKKLKNSKT 589 + D+ ++ A+ TP I + D + D+ R++ A K L Sbjct: 586 NEDWAIMAAAELRPMTTAGHMGMTPDAIYAVPDNVIADIMADRITRIRAGSEKALAALGD 645 Query: 590 LSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLG 648 L PE+ + +++ A+ E+ ++++ + +L + A+ T+ G Sbjct: 646 LPPERLKRMKEAFDAEAEQTITRMIRNARAEAAQK-LLGITHGEMTNAVTTA------TG 698 Query: 649 LLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA-KMPKGASMALNHVWIQYSATM 707 + TY R AGE ++ F F TTP F +++ + +P +A Y Sbjct: 699 IDTYARDD-AGELMKSFMLFKTTPFAGFRQLVNRTRDLDTVPAIKFLA------SYIGGT 751 Query: 708 ALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGL 764 LAG+ + +LL G DP + P L G+ Y D + + ++ + + Sbjct: 752 TLAGMFAIQMNSLLNGNDPLDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATM 811 Query: 765 LGPVPSMVTNLTSSAV----ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILN 820 GPV S LT + + + + +A K R PF N+WY K +HLIL Sbjct: 812 GGPVLSFAEQLTKLLITNPQKALQGEETSFGADALKTARMITPFANLWYAKAITNHLILQ 871 Query: 821 QILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 Q+ E NPGY DR + + +++ I + P R P Sbjct: 872 QLQEMANPGYNDRVRDRAQREFDITSWWEPGAIAPRRAP 910 >gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] Length = 921 Score = 639 bits (1648), Expect = 0.0, Method: Composition-based stats. Identities = 195/939 (20%), Positives = 366/939 (38%), Gaps = 107/939 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLDGK----GLSKAERYR----L 49 MK C+ + + GR+ EL+ +ED I VR ++ + G A+ Y+ L Sbjct: 1 MKQACVDAITQTLGRQPLASELKNIEDLISDSVRQVSRMNARAGKSGFPDADTYKQAADL 60 Query: 50 AGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE 109 A + D K+ R +AI L ++ + SQ +F+ G Sbjct: 61 AARRVVHDVFKKRQRLAQNAIAINNVTETLNRNVPAPEQTPKNLSQFIFSGRRVADGKEI 120 Query: 110 -----------------VPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGL---- 148 L ++ AA V F + +G + D++ G Sbjct: 121 DVVSAEELATGAFQDWSRQLSAEMTAAGGDVQKFFEQAQALGEQRFRNIFDQRVGKSSQL 180 Query: 149 DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRA 207 + E+ G+ T N A ++ + + + +++G D ++ +P D +RA Sbjct: 181 QLLKEIYGEDTGNPAAKKIASIWSDVTSRARQEMNDSGFDIGQRDDWHLPYVDEADLVRA 240 Query: 208 TKKDD----------------------------FVRSMLDWLDLSRYKDIDGTPLSRSEI 239 +++ +V + + D S++ + DGTP++ + Sbjct: 241 AGREEWLATLPLAERTQARLAGRMPPGDWARRAWVDDIYNTQDRSQFVNPDGTPMNDVQY 300 Query: 240 ASFVGEVFAERVRSTSFK-DPSIPSSEVGVKRE--FERVFHFKDSQAHMDYMEHFGVSTN 296 + +F + + K DP + G+K RV FKD+++H YME + Sbjct: 301 REALEYIFETKATDGAQKLDPGAFAGSGGLKNRGSQSRVLAFKDAESHFGYMEKY-TQQP 359 Query: 297 VNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKL 356 V ++ S L + S+D+ + + GP+A + K + + N + + + + Sbjct: 360 VVGVMMSHLQTASRDLGVVKAFGPDAGTNFKLIADRIYQNAVKVDGAGHPIAE------M 413 Query: 357 EVRQEAMLQMWEVMRYGETVENT-GWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQM 415 +E + +M++ M V +T +++ + GLR+ ++MLG I A + + R Sbjct: 414 NKERELVQRMFDSMAGLNGVNSTSVFSSAVGGLRNLMTSAMLGSSVITATSDQAVM-RAA 472 Query: 416 LSRVGIDKEAIQ----RINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKL 471 +G D+ ++ I + + +++GL + A M I Sbjct: 473 AQALGFDRNGMRLSATTIRNLFSGDAKRANAELGLLVDAHSAVIAKMGGFDLTRGITGWF 532 Query: 472 HSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLD 531 K KWSG +D+ ++ L++Y IG +T YA+L LK + S K Sbjct: 533 AEKTLKWSGLIAMDRANKAAFGLLMYKNIGELTRRYATLDALKGSDKALLSS----KGWS 588 Query: 532 DTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDL--ARMSDKIAYHRKKLKNSKT 589 D+ ++ A+ TP I + D +R + ++ A + L N Sbjct: 589 AEDWAIMNAAELKPLTTSGHMGITPDAIYAVPDEKVRQILAGQIDRVRAGADEALANLGA 648 Query: 590 LSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLG 648 ++ + L+Q A++E+ ++++ + +L + A+ T+ G Sbjct: 649 MTDSRATNLRQAYDAEVEQTISRMVRNARAEAAQK-LLGVTHGEMSQAITTA------TG 701 Query: 649 LLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLS-NSAKMPKGASMALNHVWIQYSATM 707 + TY R + GE + F F TTP F ++ + N ++P +A Y Sbjct: 702 IDTYAR-DQGGELYKSFMLFKTTPFAGFRQMVTRAQNLDRVPALKFLAA------YIGGT 754 Query: 708 ALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGL 764 L G+ + ALL G DP + P TL G Y D L + ++ + L Sbjct: 755 TLTGMFANQLNALLSGNDPIDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATL 814 Query: 765 LGPVPSMVTNLTSSAV----ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILN 820 GP + +L + + + + +A K R PF N+WY K +HLIL Sbjct: 815 GGPSLGLAESLMKLLITNPQKAMQGEETSFGADAIKTARMITPFANLWYTKAVTNHLILQ 874 Query: 821 QILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 Q+ E NPGY DR + + + + + + N + P R P Sbjct: 875 QLQEMANPGYNDRVRDRAQNQFDVTSWWNPGDTEPRRTP 913 >gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] Length = 838 Score = 637 bits (1643), Expect = e-180, Method: Composition-based stats. Identities = 186/893 (20%), Positives = 332/893 (37%), Gaps = 98/893 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL----DGKGLSKAERYRLAGLKAEE 56 MKP CI + +A GR +S EL+ +ED I R L DG L+ +R+ A +A E Sbjct: 1 MKPACIDAVIEAVGRPMSDAELKGIEDRIGRELRRLGNGPDGLRLTGEQRFFEAARRARE 60 Query: 57 DFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE---VPLE 113 F E K Q+ L AG G A +L G A+ + +E Sbjct: 61 SFLGEQELKARRDALAVLKHAQVEQAL----AGFPGDKIAGLRRLLAFHGDAKGSTLSVE 116 Query: 114 MKIKAAETKVLSKFN-EYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYF 172 K +A E + K G + + EM G+ + +A ++ Sbjct: 117 SKAEAIEADAFRQMLGTLEATNPKFFGLFESPEGVRALVREMFGEDSGVREAKEGAAEFK 176 Query: 173 ETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDG 231 + EL + ++AG + E+ +P S +K+ A + +V L+ RY++ DG Sbjct: 177 KVADELLGRFNDAGGKIRPREDWGLPHHHSQNKIAAAGEAVWVEKTFPLLNRDRYRNEDG 236 Query: 232 TPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVK-----REFERVFHFKDSQAHMD 286 + ++ S++ +F+ E + +T + P + G R H++ + ++ Sbjct: 237 SRMNDSQVLAFLRESYQT--LATGGVNTLEPGAGGGETMRANLHAAAREIHYRSADDYLA 294 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQM--IVQTIANDQEASAGN 344 Y + FG + +LT + L+ I + GPN D K + Q + + Sbjct: 295 YQKDFG-ERGLYDVLTGHVRGLADSIAMVETFGPNPDHAFKYFRDLAQREMTVADPTKHG 353 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 K+ K +G + L V V + A LR AS LG I + Sbjct: 354 KIAKQLVGLDNLYNY---------VSGKTLPVASEWLAQGFDSLRKWLVASRLGSAFISS 404 Query: 405 LLEDGFISRQM-LSRVGIDKEAIQRINKMPLKER--MELLSDVGLYAEGVVAHGRNMMEG 461 L ++ + ++ + + + + + + GL + ++ + Sbjct: 405 LPDEATMQLTARVNNIDGMQVFRNELAALNPANQMEKRMAQRAGLALQTMIGSLNRFGDE 464 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDP 521 + + K+ + + SG + + R + + + + +G +T D +A +LDP Sbjct: 465 NMRNTLATKMATFTMRASGLNAITEARRRAFGVTMMSSLGHLTR------DAEAPSKLDP 518 Query: 522 SIKAFF--KQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAY 579 K + D D+ V KRA+ G TP I + D L + + Sbjct: 519 MDHRILLSKGITDADWQVWKRAELEDWGGGNGTMLTPEAIYRIPDEALVGIGNLDAN--- 575 Query: 580 HRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHT 639 + R++ +L + +E N+ + ++ A + N+Q Sbjct: 576 -----------PQQLRRDAATRLLGVVLEEQNMAVVEPGSRERAALYSNLQ--------- 615 Query: 640 SLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHV 699 RGT GE R F T P M + + P S A Sbjct: 616 --------------RGTWKGELTRSVFLFKTMPIAMLMRHWER--GMSGPDARSKAGYIG 659 Query: 700 WIQYSATMALAGIGVASIKALLRGEDPSL---------PEVIYDGTLANGALLPYMDRLT 750 + + + G+ I LL+G DP L G+L Y D L Sbjct: 660 ALM--VSTTVMGMLALQIDELLKGRDPVNMNPFEGKAGARNWVRAFLKGGSLGIYGDFLF 717 Query: 751 KLVSKGDRAAIGGLLGPVPSMVTN----LTSSAVELATKDNENSKVNATKAIRKTLPFMN 806 ++ I LGPV V + V+L + ++ K + P N Sbjct: 718 SEQNQHGGGPIASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELLKFAKGMTPGAN 777 Query: 807 MWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 +WYLK + +HLI NQ+ E ++PGYL R +S+ +++ G + + + +P R P Sbjct: 778 LWYLKAATNHLIFNQLQEMVSPGYLARVKSRAQREFGTTEWWDSRQAVPDRAP 830 >gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] Length = 924 Score = 636 bits (1639), Expect = e-180, Method: Composition-based stats. Identities = 194/941 (20%), Positives = 360/941 (38%), Gaps = 108/941 (11%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGK-------GLSKAERYRLAGLK 53 MK CI + GR+ E++ +ED I A + + G+ AE YR A Sbjct: 1 MKQACIDAVANTLGRQPKADEIKNIEDRIKDAVRVIARRNAREGKTGIPDAETYRQAAEL 60 Query: 54 A----EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKA---- 105 A K+ R +AI A R L + + Q +F+ + Sbjct: 61 AAAQAVHAVFKKRQRVAQNAIAIAKVRDTLNKAIPENEQTPIALQQFIFSGRRGRDKQPD 120 Query: 106 --------------GSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLG--FTLDKQFGL- 148 L ++ AA V F + +G + L D++ Sbjct: 121 INVVSAEEMATGAYQDWTRQLSAELTAAGDDVQKFFYQSQALGEQRLRNLLPFDREASRS 180 Query: 149 ---DVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDK 204 + E+ G+ T N A ++ K + + + +++G D ++ +P + Sbjct: 181 GQLQILKEIYGEDTGNPAAKKIAKVWGDVTSRARQEMNDSGFDIGLRDDWHLPYVDDAEL 240 Query: 205 LRATKKDDFVRSM---------------------LDWL-------DLSRYKDIDGTPLSR 236 +RA +D+++ S+ W+ D S+Y ++DG+P++ Sbjct: 241 IRAAGRDEWLSSLPLNERAAAIAAGRQPPQDFARQAWVDDVWNTQDRSQYVNLDGSPMND 300 Query: 237 SEIASFVGEVFAERVRSTSFK-DPSIPSSEVGVKRE--FERVFHFKDSQAHMDYMEHFGV 293 E + ++ +V + K DP G+K RV FKD+++H YME + Sbjct: 301 IEYRQALEAIYETKVTEGANKIDPGAFMGSGGIKNRGSQSRVMAFKDAKSHFSYMERY-T 359 Query: 294 STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGR 353 V ++ S L S S+D+ + + GP+A S K ++ Q + G + + Sbjct: 360 QQPVVGVMMSHLQSSSRDLGVVKAFGPDAASNFKLLMDQIYQRATSTTGGGHDIGTMNDQ 419 Query: 354 NKLEVRQEAMLQMWEVMRY-GETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFIS 412 +L + +M+ M ++ +++ + GLR+ ++MLG A + I Sbjct: 420 RQL------VERMFNSMAGLNGVASSSVFSSAVGGLRNLMTSAMLGTSVFTAASDQA-IM 472 Query: 413 RQMLSRVGIDKEAI----QRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIG 468 R +G D+ + + + + +++GL + A M + I Sbjct: 473 RANAQALGFDRNGMRLSANTLRNLFNGDAKRANAELGLLVDAHAAVVSKMGGFDLSRGIT 532 Query: 469 HKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFK 528 K KWSG +D+ ++ L+++ IG ++ Y SL L R + K Sbjct: 533 GWFAEKTLKWSGLIAMDRANKAAFGLLMFKNIGELSRKYKSLDALTGSDRTVLAN----K 588 Query: 529 QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLA--RMSDKIAYHRKKLKN 586 D+ ++ A+ TP I ++ D +R++ R+ + L Sbjct: 589 GWTPEDWAIMSAAELRPLTPDGHKGMTPDAIYDVPDETVRNILADRIEKVRVGSDQALAA 648 Query: 587 SKTLSPEQRQELQQQL-ADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQ 645 ++ +R+ L+Q A++E+ ++++ + +L + A+ T+ Sbjct: 649 LGDMTDAKRKTLKQAFDAEVEQTISRMVRNARAEAAQ-HLLGITHGEMTSAVTTA----- 702 Query: 646 RLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSA 705 GL + R T +G+ L+ F F TTP + + +M + Y A Sbjct: 703 -TGLDAFARDT-SGDLLKSFMLFKTTPMAGMRQFVTRLQDLE-----TMPAVKFFAAYVA 755 Query: 706 TMALAGIGVASIKALLRGEDP---SLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIG 762 LAG+ + ALL G DP + P+ L G+ Y D L + ++ + G Sbjct: 756 GTTLAGMFANQMNALLSGNDPLDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYGSSIAG 815 Query: 763 GLLGPVPSMVTNLTSSAV----ELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLI 818 L GPV L+ + + + + +A K R PF N+WY K +HLI Sbjct: 816 ILGGPVLGFAEQLSKTVLTNSQKAMAGEETTFTADALKTARMITPFANLWYTKAITNHLI 875 Query: 819 LNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 L Q+ E NPGY R + + ++ + E P R P Sbjct: 876 LQQLQEMANPGYNARVRDRAMREFNTTSWWEPGEETPRRAP 916 >gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 841 Score = 624 bits (1608), Expect = e-176, Method: Composition-based stats. Identities = 187/898 (20%), Positives = 345/898 (38%), Gaps = 104/898 (11%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ LS +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLSAQEATDIEARINETMRNLARKDINNWRNLSDAEKLTEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG-SAEVPL 112 D Q++L R A + K+ Q + LD G + + + S + Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALD---HGKLSSMEVIDRMVAAHGDMSGIQSI 117 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYF 172 + K + + + ++ LG D++ + E G+ T + A ++ + Sbjct: 118 DSKARGIASIYRGELVDFYTNIKGGLGVFTDQELVQKIVRERFGENTGDALAKKISDKMG 177 Query: 173 ETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDG 231 + + + + G D +N +PQ +++K+ K+ +V +D +Y +G Sbjct: 178 DVFETMRDRFNRNGGDIGKLDNWGLPQTHNLEKIAKAGKEAWVNKAESLIDTRQYVHENG 237 Query: 232 TPLSRSEIASFVGEVFAERVRSTSFK------DPSIPSSEVGVKREFERVFHFKDSQAHM 285 S+ EI S + + + K +S+V + RV HFKD+++ + Sbjct: 238 DYYSQQEIRSLLEYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHFKDAESWL 297 Query: 286 DYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNK 345 +Y FG V ++ + + LSKDI + LG N + +K ++ D E K Sbjct: 298 EYQSEFGGMQFV-DLVEAHINGLSKDIAMVENLGSNPKTALKILMDAAAKKDWEGQIPEK 356 Query: 346 VLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGAL 405 K ++ + M++ + G + ++ AN RS A+MLG I ++ Sbjct: 357 TTKRV---------RKRIETMFDELSGGNSPQSEVLANLGVLYRSMNVAAMLGGTTISSI 407 Query: 406 LEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEG 461 + I++ LS E + ++N +R EL +GL E ++ G Sbjct: 408 TDQAMIAKTANVHGLSYRKTFGELVDQLNPANKADR-ELAHSLGLATEEMI--GSIARWS 464 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRISS-HALIVYNQIG---RMTDTYASLKDLKADP 517 D + K+ + S R+S +AL +++G + + Y L KA Sbjct: 465 DDGLTSTYGKSEKLARISSGIASQVMRVSGLNALTAASKVGFTKLLMEKYGRLSRSKAWN 524 Query: 518 RLDPSIKAFFK--QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSD 575 LD + LD+ + V + A + G + +I + D L Sbjct: 525 DLDAQDRELLSNTGLDERAWQVFQLADPVVDRKGNQLM-SARSIYEIPDEKLTAFG---- 579 Query: 576 KIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRG 635 P+Q +KD+VS+++ A +LD +V Sbjct: 580 ---------------DPKQ------------------VKDQVSSQLQAHLLDEQGLAVVE 606 Query: 636 AMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMA 695 A R++ + RGT GE +R QF + + + + + KG + Sbjct: 607 A-----GLREKTLINVGARGTITGEIVRGLAQFKSFSAAFLMRHGSRAFAQEGIKGKAGY 661 Query: 696 LNHVWIQYSATMALAGIGVASIKALLRGEDPS------LPE----VIYDGTLANGALLPY 745 +++ T+ L G V +K LL G DP P+ + G L Sbjct: 662 AVPLFV----TLTLLGGLVVQLKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLSFL 717 Query: 746 MDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN----ATKAIRKT 801 D L R A + GP+ + T L V T+ NE N A K ++ Sbjct: 718 GDILVAGTDTSGRDANSFVAGPLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAFKFVKGK 777 Query: 802 LPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK-KGIELFQNMDEGLPHRLP 858 +P N+WY K + + ++ +++ + + PGY ++ K ++ + E F D+ R P Sbjct: 778 IPAQNLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRERFWG-DDINDIRAP 834 >gi|332875212|ref|ZP_08443045.1| hypothetical protein HMPREF0022_02678 [Acinetobacter baumannii 6014059] gi|332736656|gb|EGJ67650.1| hypothetical protein HMPREF0022_02678 [Acinetobacter baumannii 6014059] Length = 841 Score = 621 bits (1600), Expect = e-175, Method: Composition-based stats. Identities = 187/899 (20%), Positives = 345/899 (38%), Gaps = 106/899 (11%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ L+ +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLTAQEATDIEARINETMRNLARKDINNWRNLSDAEKLSEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 D Q++L R A + K+ Q + LD + S + +++ G S Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALDHSKLS----SMEVIDRMVAAHGDMSGIQS 116 Query: 112 LEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 ++ K + + + ++ LG D++ + E G+ T + A ++ + Sbjct: 117 IDSKARGIASIYRGELVDFYTNIKGGLGIFTDQELVQKIVRERFGENTGDALAKKISDKM 176 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 + + + + G D +N +PQ +++K+ K+ +V +D +Y + Sbjct: 177 GDVFETMRDRFNRNGGDIGKLDNWGLPQTHNLEKIAKAGKEAWVNKAESLIDTRQYVHEN 236 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK------DPSIPSSEVGVKREFERVFHFKDSQAH 284 G S+ EI S + + + K +S+V + RV HFKD+++ Sbjct: 237 GDYYSQQEIRSLLEYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHFKDAESW 296 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN 344 ++Y FG V ++ + + LSKDI + LG N + +K ++ D E Sbjct: 297 LEYQSEFGGMQFV-DLVEAHINGLSKDIAMVENLGSNPKTALKILMDAAAKKDWEK---- 351 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 + NK + ++ M++ G T ++ AN RS ASMLG I + Sbjct: 352 -----GIDENKTQSSRKRAQVMFDEFSGGNTPQSQVLANLGIAYRSMNVASMLGGTTIAS 406 Query: 405 LLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMME 460 L + I++ LS ++++N +R E +GL E ++ G Sbjct: 407 LADQATIAKTAHVHNLSYRKAFGGIVEQLNPANKADR-EFAHGLGLATEEML--GSIARW 463 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRIS-SHALIVYNQIG---RMTDTYASLKDLKAD 516 D + K+ + S R+S +AL +++G + + Y L KA Sbjct: 464 SDDGLTSTYGKSEKLARISSGVATQVMRVSFLNALTSASKVGFTKLLMEKYGRLSRSKAW 523 Query: 517 PRLDPSIKAFFK--QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMS 574 LD + LD+ + V + A+ + G + +I + D L Sbjct: 524 NELDVQDRELLSNTGLDERAWQVFQLAEPVVDRKGNQLM-SARSIYEIPDEKLTAFG--- 579 Query: 575 DKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVR 634 P+Q +KD+V++++ A +LD +V Sbjct: 580 ----------------DPKQ------------------VKDQVASQLQAHLLDEQGMAVI 605 Query: 635 GAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASM 694 A R+R + +GT GE + QF + + + + + KG + Sbjct: 606 EA-----GLRERTWMTVGAKGTITGEVFKGLMQFKSFSASFLMRQGSRAMAQEGLKGKA- 659 Query: 695 ALNHVWIQYSATMALAGIGVASIKALLRGEDPS------LPE----VIYDGTLANGALLP 744 I +M L G V ++ +L G DP P+ +A G L Sbjct: 660 ---AYAIPLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPV 716 Query: 745 YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN----ATKAIRK 800 D L R A + GP+ S T L V T+ NE N A K ++ Sbjct: 717 LGDILVAGTDTSGRDANSFVSGPLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAFKFVKG 776 Query: 801 TLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK-KGIELFQNMDEGLPHRLP 858 +P N+WY K + + + +++ + + PGY ++ K ++ + E F D+ R P Sbjct: 777 KIPAQNLWYTKAAINRMFFDEVQDTIAPGYREKALRKAERQQDRERFWG-DDINDIRAP 834 >gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 841 Score = 616 bits (1588), Expect = e-174, Method: Composition-based stats. Identities = 190/899 (21%), Positives = 345/899 (38%), Gaps = 106/899 (11%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSLD------GKGLSKAERYRLAGLK 53 MK +C Q + KA G++ LS +E ++E I A ++ + LS +E+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLSAQEAIKIESRINEAMRNMARKDIDKWRNLSDSEKLIEASKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVP 111 D Q++L R A ++ + + + LD + + + +++ G S Sbjct: 61 VAIDIQEQLKRKHKIAANDILTQSKNLAKLDHTRL----LASEVVDRMVAPHGDMSGIQS 116 Query: 112 LEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 + K + ++ LG DK+ + E + T + A ++ + Sbjct: 117 ISSKADGIADIYEGELVDFYTNIKGGLGIFTDKELVHKIVRERFNENTGDPLAKKISNKM 176 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 + + + + +G D +N +PQ +++K+ K +V +D +Y + Sbjct: 177 GDVFETMRDRFNRSGGDIGMLDNWGLPQTHNLEKIAKAGKKAWVNKAESLIDTRQYVHEN 236 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK------DPSIPSSEVGVKREFERVFHFKDSQAH 284 G S+ EI S + + + K +S+V K RV HFKD+++ Sbjct: 237 GDYYSQQEIRSLLEYTYDTLSSDGANKIEVGRQATGAGTSKVTNKHSESRVLHFKDAESW 296 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN 344 ++Y FG V ++ + + LSKDI + LG N + K + + A+ ++ AG Sbjct: 297 LEYQSDFGGMQFV-DLVNAHIKGLSKDIALVENLGSNPKTAFKIL--KNAADKKDREAGR 353 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 KD N+ +V M++ G + ++ AN RS SMLG + + Sbjct: 354 ITTKDNPALNRAQV-------MFDEFSGGNSPQSQVLANLGIAYRSMNIFSMLGGTTVVS 406 Query: 405 LLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMME 460 + I++ LS E I+++N +R EL +GL E ++ G Sbjct: 407 TTDQATIAKTAHVHGLSYRKAFGELIRQLNPANKADR-ELAHSLGLATEEML--GSIARW 463 Query: 461 GSDAFQIGHKLHSKMHKWSGAEYLDKKRIS-SHALIVYNQIG---RMTDTYASLKDLKAD 516 D H K+ + S R+S +AL +++G + + Y L KA Sbjct: 464 SDDGLTSTHGKSEKLARISSGVASLVMRVSLLNALTAASKVGFTKLLMEKYGRLSRSKAW 523 Query: 517 PRLDPSIKAFFK--QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMS 574 LD + LD+ + V + A+ + G + +I + D L Sbjct: 524 GDLDIQDRELLSNTGLDERAWQVFQLAEPVVDRKGNQLM-SARSIYEIPDEKLAAFG--- 579 Query: 575 DKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVR 634 P+Q +KD+V++++ A +LD +V Sbjct: 580 ----------------DPKQ------------------VKDQVASQLQAHLLDEQGMAVI 605 Query: 635 GAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASM 694 A R++ + RGT GE R QF + + + + + KG + Sbjct: 606 EA-----GLREKTLINVGARGTITGEIFRGIVQFKSFSAAFLMRHGSRTMAQEGLKGKA- 659 Query: 695 ALNHVWIQYSATMALAGIGVASIKALLRGEDPS------LPE----VIYDGTLANGALLP 744 I L G V +K LL G DP P+ + G L Sbjct: 660 ---AYAIPLFVMTTLLGGLVVQLKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLSF 716 Query: 745 YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN----ATKAIRK 800 D L R A + GP+ S +L S V T+ NE N A + +++ Sbjct: 717 LGDILVAGTDTSGRDAHSFVAGPLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAFQFVKR 776 Query: 801 TLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKK-KKKGIELFQNMDEGLPHRLP 858 +P N+WY K + + ++ ++I + + PGY ++ K +K+ E F D+ R P Sbjct: 777 KIPAQNLWYTKAAINRMVFDEIQDFIAPGYREKALRKAEEKQDRERFWG-DDINDIRAP 834 >gi|332160979|ref|YP_004297556.1| hypothetical protein YE105_C1357 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665209|gb|ADZ41853.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862135|emb|CBX72299.1| hypothetical protein YEW_AK02360 [Yersinia enterocolitica W22703] Length = 841 Score = 601 bits (1550), Expect = e-169, Method: Composition-based stats. Identities = 180/893 (20%), Positives = 340/893 (38%), Gaps = 95/893 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLD-----GKGLSKAERYRLAGLKAE 55 M+ ECIQ + A GR +++ E++ +E+ I + + L +SKA+R R A A Sbjct: 1 MRAECIQAVVNAIGRSITQAEVKGIENRINQHHKRLAQDTPGWMAMSKADRLREAAKSAA 60 Query: 56 EDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPLE 113 ++ +E +++ + V++ AL + F + S + +E Sbjct: 61 DEITREAKLKKWRTALTILAHDRVK---NYVESSTDTPVNALGRLIAFDSDQKSGVLSVE 117 Query: 114 MKIKAAETKVLSKFNEYAEVGS-KNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYF 172 + KA S+ + K L D + V E+ G+ + N A + K++ Sbjct: 118 SQAKAIRDIAYSQMLTLIDTTKGKFLSLLSDPESSKAVIKELHGEHSGNAAAKQSAKEFK 177 Query: 173 ETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDG 231 + L + + +G E+ +P+ S K+ A ++ +V + W D Y + DG Sbjct: 178 DVAEFLRQRFNNSGGAIGRLESWAMPRSHSQLKV-AKNREAWVDDHVKWADRRSYVNEDG 236 Query: 232 TPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREF-----ERVFHFKDSQAHMD 286 + +S +++ F A R +T + P +G R H+KD+ + + Sbjct: 237 SRMSDAQLREFF--THAARTIATGGINKVEPGRFIGGSLRANHGSESRSIHYKDADSFIL 294 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKV 346 + +G ++ +LT + L++DI + LGPN+D + + + A Sbjct: 295 AQQKYG-DKDLLALLTGHIDRLARDIALTETLGPNSDLQFRTQMDMAQQSMINA------ 347 Query: 347 LKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGW-ANWMAGLRSAAGASMLGQHPIGAL 405 + K+E + ++++ + + T W RS AS LG I A+ Sbjct: 348 --EPAKFKKIESEMLRVERLYKDVAGQNDIPETPWLKEAFDTYRSINVASKLGSAAITAI 405 Query: 406 LEDGFISRQM-LSRVGIDKEAIQRINKMPLKE--RMELLSDVGLYAEGVVAHGRNMMEGS 462 + G + ++ + + + Q + + + E GL + + + Sbjct: 406 TDQGNLMVTAKVNNLPVMQVFAQELKLLNPADSASREAARRAGLGINYYLNGLQRFGAET 465 Query: 463 DA---------FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL 513 K+ + + SG + + +++ + IG MT +A+L L Sbjct: 466 LGSAGDTSGALSSSAQKIAGFVLRASGLNAMTAAGNQAFGMVMLDTIGGMTRKHANLAHL 525 Query: 514 KADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARM 573 A R + + + D+ V ++A D+ DL+ M Sbjct: 526 NAKDRT----RLQGMGVTEADWAVWRKA------------------------DVSDLSGM 557 Query: 574 SDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSV 633 D + H + L LS L +Q A K L++ + K+ +V D Q +V Sbjct: 558 GDTVLTHNEILA----LSDSALTPLAKQFATTPAK----LRNTAATKLLGVVQDEAQMAV 609 Query: 634 RGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGAS 693 + RGT +GE R QF + P M + + + GA Sbjct: 610 ----VEPGARERVTLHRGTTRGTWSGEIWRSATQFKSFPIAMVMRHAHRALAQ---DGAG 662 Query: 694 MALNHVWIQYSATMALAGIGVASIKALLRGEDPSL---PEVIYDGTLANGALLPYMDRLT 750 I A L G + + G DP PE L GAL Y D L Sbjct: 663 KGTYAAAI--IAASTLLGGMAIQLNEIASGRDPRDMTKPEFWGGAFLKGGALGLYGDFLL 720 Query: 751 KLVSKGDRAAIGGLLGPVPSMVTNLT----SSAVELATKDNENSKVNATKAIRKTLPFMN 806 ++G + I + GP+ + ++ +A + + ++ N + I+ P N Sbjct: 721 TNQTQGGNSFIASIGGPLAGDIESVVKMTQGAAFKAIDGKDPHTAANVVRFIKGHTPGAN 780 Query: 807 MWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 +WY K + DH+I + I E+ +PGYL R + + +K+ + + E P R P Sbjct: 781 LWYAKAALDHMIFHDIQEQFSPGYLSRMRQRAQKEYDQQFWWAPGETAPDRAP 833 >gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 854 Score = 546 bits (1407), Expect = e-153, Method: Composition-based stats. Identities = 164/894 (18%), Positives = 335/894 (37%), Gaps = 91/894 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLD------GKGLSKAERYRLAGLKA 54 MK EC + GR+L+ KE LE ++A L K +S ER +A Sbjct: 1 MKNECRAAVEGVLGRKLTDKEADLLEQQFIKASRELPQEDIKAWKSMSDEERAEAIADRA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 +++ + I+ V + I++ R L +L +AL KL F S +E Sbjct: 61 IKNYTDQHIKEVTNLINDLEIREALEHEL--TSHSKLNPLEALNRKLVMFTDQSGIQSVE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 I+A E + + + K LG+ +D + E+ GK + + + + L K + Sbjct: 119 HNIQAIEVRYMGALADVFSKTQKGLGYLIDADKVKLLVKEIFGKPSGDAEIAGLAKSVQD 178 Query: 174 TQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 +L + G D K N IPQ S K+ + +++++ +D S+Y+ +G Sbjct: 179 VLEQLRQHYNRYGGDIKKLANYGIPQSHSHYKVIQAGEGEWIKTTFPMVDKSKYRHENGK 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKDPSIP-----------SSEVGVKREFERVFHFKDS 281 ++ +E+ + V+ K + + R HFKD Sbjct: 239 LMNDAEVKEVLKAVYQTIASEGHNKASVQAHAVQSETDLPVGMNMQALHQHHREVHFKDP 298 Query: 282 QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS 341 + + Y E FG N + +L++ + +S +I + + G N + VKQ+ + + Sbjct: 299 DSWVAYQEQFG-EVNFHDLLSNHIRRMSTEIGMMQTFGSNPEKLVKQLGHDLL------N 351 Query: 342 AGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHP 401 + K K++ + + + + ++ + ++ A LRS A+ +G Sbjct: 352 KMMQDPKYVKDHRKIQKQAKLINKHYDELAGQALPVDSSLAQVGGMLRSWTVATKMGSAF 411 Query: 402 IGALLEDGFISRQMLSR-VGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMME 460 I A + + + K + + + KE + +GL + + Sbjct: 412 ITAFSDQATMKLASEMHGIAYTKVFGKHLKQFKNKEDRDFAISIGLGVREMTNALVRFGD 471 Query: 461 GSDAFQIG---------HKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLK 511 A K+ + + + SG ++ + + + + ++L Sbjct: 472 DDLASASTKLASANTKTRKVANAVIRASGLNHITASAKRAFGASLMHHV-------SNLN 524 Query: 512 DLKADPRLDPSIKAFF--KQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRD 569 KA +L K + + D+T++K+ +P G T I N D D Sbjct: 525 SGKAWDQLGTQDKKMLEGGGIKEDDWTLLKQIDRTEAPSG-EKLVTNKDIFNASDDLFLD 583 Query: 570 LARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNV 629 ++ ++ Q+L+D+ LK++++NK + Sbjct: 584 TFQV-------------------DKTGYTAQELSDIAF----RLKEQLANKYMNYIYTET 620 Query: 630 QTSVR--GAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAK 687 +V GA ++ R +RGT E R F QF P M + + Sbjct: 621 NAAVLEVGARESTFMGLGR------ERGTVGNELSRFFWQFKQFPLAMIMRQWTRGMAQG 674 Query: 688 MPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVI--YDGTLANGALLPY 745 P+ + + A + G V+ I+ L +G+D P + Y ++ G + Sbjct: 675 TPQEKF----VYFAKLFAYTTVMGALVSQIQNLTQGKDLDDPTTLDFYMKSIVKGGSASF 730 Query: 746 MDRLTKLVSKGDRAAIGGLLGP-----VPSMVTNLTSSAVELATKDNENSKVNATKAIRK 800 + S ++ + P + S+ T ++ + T+ + + A ++ Sbjct: 731 LADAISATSDPTERSVKDFIIPAAFKDITSIGTMVSGAGSAFITERDSSYGAEAVNVVKN 790 Query: 801 TLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGI-ELFQNMDEGL 853 +PF N+WY + FD L++ ++ E + GY +R+Q +++ + ++D Sbjct: 791 NIPFQNLWYSRLVFDRLVIAEMQELFDEGYRERKQRRQENNHNMSYWWDLDNDS 844 >gi|262371858|ref|ZP_06065137.1| predicted protein [Acinetobacter junii SH205] gi|262311883|gb|EEY92968.1| predicted protein [Acinetobacter junii SH205] Length = 841 Score = 510 bits (1313), Expect = e-142, Method: Composition-based stats. Identities = 156/892 (17%), Positives = 314/892 (35%), Gaps = 92/892 (10%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL-----DGKGLSKAERYRLAGLKA 54 M+ EC + + KA G++ L+ + R+ +RA +L D S AER K Sbjct: 1 MRAECREQVAKALGKKRLNAADSNRISSLYIRAQNTLARTDPDWMFKSPAERAEAIAQKT 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 D ++ ++ + +A + QL++++ QAL K+ +F S +E Sbjct: 61 ASDLAVQIAKNNQNIARDAVIKAQLQTEI--YNHPKLNPVQALMRKIAYFSDQSGIQSIE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +A ++ +S + + G +++K D+ M G K+ N + + + K+ Sbjct: 119 KQSQALHSRWMSLVADVFTKTQERFGMSVNKAMTDDIIRVMFGGKSDNPEITAMAKEVSA 178 Query: 174 TQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 E+ + AG + K +N K+ T + ++V LD LD ++Y G Sbjct: 179 ALEEMRLAFNRAGGNIKKLDNFGFMTSHDQKKVALTNQAEWVNDALDGLDRNQYVKDTGE 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKD-------------PSIPSSEVGVKREFERVFHFK 279 + E+ S + +++ + + KD P S++ + + R HFK Sbjct: 239 LMDELELKSMLEDIYKTISTNGANKDLLVLNKQAKAGVSPVGGRSKMANRHQEARALHFK 298 Query: 280 DSQAHMDYMEHFGV--STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAND 337 D A + Y + +G + IL + +S ++ + + LG N + ++ + Sbjct: 299 DGDAWLAYQKKYGTYDEAGFHEILKNHTQRMSTEVAMMQNLGSNPRHTFESLLDEAKIKL 358 Query: 338 QEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASML 397 + L +++ + L M+ + ++ N M GLR+ AS L Sbjct: 359 KADPLNG------LKHGEIDKQAHRALSMYNTLDANTRAIDSTLGNVMGGLRALMVASKL 412 Query: 398 GQHPIGALLEDGFISR--QMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHG 455 G + + + + ML + + ++ + GL + Sbjct: 413 GGTTLTTFGDHASMKKVANMLGLSYTKSILPEYMKQLKQGATRDEALRFGLGINEMAGSM 472 Query: 456 RNMMEGSDAFQIGHK---------LHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDT 506 + + K SG + + L+ N++ MT Sbjct: 473 TRFGDADIVSSATKSGRFNARMQAFAAMTMKLSGLNAVTAGAKRALNLVHMNKLAEMTRK 532 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 KDL AD + + D+ + ++ + DG T + N+ D Sbjct: 533 T-DWKDLGADDLKILKGN----GITERDWQLWQQLEPSKREDGTA-VLTQNDFFNVPDDV 586 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVL 626 ++ PE +Q+ LAD + K + K + Sbjct: 587 IKKFL--------------------PEDKQDNANALADF--------RYKAAMKYQTHLF 618 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA 686 + ++ A R+R + + GT GE R QF P + + + Sbjct: 619 NEESVAIIEA-----GVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRMGHRAFA- 672 Query: 687 KMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALL 743 +G + A LAG + + L G++P + L G L Sbjct: 673 ---QGDIKSRVTFLASLLAYQTLAGALIVQTQNLANGKNPEPVFTIDFFGKSLLKGGGLS 729 Query: 744 PYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVEL----ATKDNENSKVNATKAIR 799 D ++ L R+A + GP+ L + + ++ Sbjct: 730 FLGDIMSALSDPTGRSASDFISGPLLGQSMKLGMLLTGMGNNIIEGKESTRMMEVANTLK 789 Query: 800 KTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNMDE 851 +P N+WY K D ++ +++ ++P YL R Q + + G + ++ E Sbjct: 790 SNIPLQNLWYSKLVVDRMLYSKMQNMIDPDYLPRTQQRLENLGNSYWWDLSE 841 >gi|226953662|ref|ZP_03824126.1| phage related protein [Acinetobacter sp. ATCC 27244] gi|226835534|gb|EEH67917.1| phage related protein [Acinetobacter sp. ATCC 27244] Length = 842 Score = 508 bits (1307), Expect = e-141, Method: Composition-based stats. Identities = 148/893 (16%), Positives = 308/893 (34%), Gaps = 92/893 (10%) Query: 1 MKPECIQVLNKAAG-RELSKKELRRLEDGIVRAYVSL-----DGKGLSKAERYRLAGLKA 54 M+ EC + + KA G R+LS + R+ +RA +L D S AER K Sbjct: 1 MRAECREQVAKALGKRKLSAADSNRISSLYIRAQNTLARTDPDWMFKSPAERAEAIAQKT 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKL-FFKAGSAEVPLE 113 D ++ ++ + +A + QL++++ QAL K+ +F S +E Sbjct: 61 ATDLAVQIAKNNQNIARDAIIKAQLQNEI--YNHPKLNPVQALMRKIAYFSDQSGIQSIE 118 Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +A ++ +S + + G +++K D+ M G K+ N + + + K+ Sbjct: 119 KQSQALHSRWMSLVADVFTKTQERFGMSVNKAMTDDIIRVMFGGKSDNPEITAMAKEVSA 178 Query: 174 TQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGT 232 E+ + AG + K +N K+ T + ++V L +D ++Y G Sbjct: 179 ALEEMRLAFNRAGGNIKKLDNFGFMTSHDQKKVALTDQSEWVNDALAGVDRNQYVKETGE 238 Query: 233 PLSRSEIASFVGEVFAERVRSTSFKD-------------PSIPSSEVGVKREFERVFHFK 279 + E+ S + E++ + + KD P S++ + + R HFK Sbjct: 239 LMDELELKSMLEEIYKTISTNGANKDLLILNKQAKAGASPVGGRSKMANRHQESRALHFK 298 Query: 280 DSQAHMDYMEHFGV--STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAND 337 D A + Y + +G + IL + +S ++ + + LG N + + ++ + Sbjct: 299 DGDAWLAYQKKYGTYDEAGFHEILKNHTHRMSTEVAMMQNLGSNPRNTFESLLDEAKIKL 358 Query: 338 QEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASML 397 + + +++ + + M+ + ++ N M GLR+ AS L Sbjct: 359 KADPQNG------MKHGEIDKQAHRAMSMYNTLDANTRAIDSTLGNVMGGLRALMVASKL 412 Query: 398 GQHPIGALLEDGFISR--QMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHG 455 G + + + + ML + + ++ + GL + Sbjct: 413 GGTTLTTFGDHASMKKVANMLGLSYTKSILPEYMKQLKQGATRDEALRFGLGINEMAGSM 472 Query: 456 RNMMEGSDAFQIGHK---------LHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDT 506 + + K SG + + L+ N++ MT Sbjct: 473 TRFGDADIVSSATKSGRFNARMQAFAATTMKLSGLNAVTAGAKRALNLVHMNKLAEMTRK 532 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 +DL AD + + D+ + ++ + DG + + N D Sbjct: 533 T-DWQDLGADDLKILQGN----GITERDWQLWQQLEPSKREDGTA-VLSQNDFFNAPDDV 586 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVL 626 ++ + + + + K + K + Sbjct: 587 IKQFLPLDKQD----------------------------NANALADFRYKAAMKYQTHIF 618 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA 686 + ++ A R+R + + GT GE R QF P I + + Sbjct: 619 NEESVAIIEA-----GVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRIGHRAFA- 672 Query: 687 KMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALL 743 +G + A LAG + + L G++P + L G L Sbjct: 673 ---QGDIKSRVTFLASLLAYQTLAGALIVQTQNLANGKNPEPVFTIDFFGKSLLKGGGLS 729 Query: 744 PYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVEL----ATKDNENSKVNATKAIR 799 D ++ L R+A + GP+ L + + ++ Sbjct: 730 FLGDIMSALSDPTGRSASDFISGPLLGQSMKLGMLLTGMGNNIIEGKESTRMMEVANTLK 789 Query: 800 KTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNMDEG 852 +P N+WY K D ++ +++ ++P YL R Q + + G + ++ E Sbjct: 790 SNIPLQNLWYSKLVVDRMLYSKMQNMIDPDYLPRTQQRLENLGNSYWWDLSEE 842 >gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 582 Score = 486 bits (1249), Expect = e-134, Method: Composition-based stats. Identities = 126/640 (19%), Positives = 235/640 (36%), Gaps = 76/640 (11%) Query: 234 LSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDYMEH 290 ++ +E+++F+GE + K D + S R R HFKD+ +++ Y + Sbjct: 1 MNDAELSAFLGEAYNTIATGGLNKLTDTGMRISGARANRGNASRQIHFKDADSYLQYQQL 60 Query: 291 FGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDW 350 +G ++ I+ L +SKDI + GPN D + ++ Q A A+ + Sbjct: 61 YG-DRSLWEIMVGHLEGISKDIALVETYGPNPDHVFRSLLDQVKAETATANPSKTGKVE- 118 Query: 351 LGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGF 410 +L E + + + V N A W +R+ AS LG + + + G Sbjct: 119 ----RLANNTENLYNF--ISGKTQPVANPHIARWSDNIRNWLVASRLGSALLSSFSDLGT 172 Query: 411 IS-RQMLSRVGIDKEAIQRINKMPLKERMELLSD--VGLYAEGVVAHGRNMMEGSDAFQI 467 + ++ + +++ ++ M R EL GL E ++ + + Sbjct: 173 MYLSAKVTNLPMNQLFRNQLEAMDPTNRTELARARRAGLAMESLLGSVNRWAMDNMGPSV 232 Query: 468 GHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK-ADPRLDPSIKAF 526 + + + SG ++ + + +G + L+ L +D R+ S Sbjct: 233 SRWAATAVMRASGLTAWSDAHKRAYGVTMMGSLGEVVSRTPDLRSLDDSDFRILKS---- 288 Query: 527 FKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKN 586 K + DTD++V K A+ +G TP +I + D ++ L Sbjct: 289 -KGITDTDWSVWKLAQQEDWGNGNNTMLTPESIMRIPDLAVKHLG--------------- 332 Query: 587 SKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQR 646 E +K + K+ V + V +V T Q Sbjct: 333 ----------------------EPERVKFEAMRKLLGAVTEEVDMAV----ITPGAREQL 366 Query: 647 LGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSAT 706 + +RGT GE R F + P + + + MP A + A+ Sbjct: 367 ITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWSR--AMGMPSAGGRAAYIAT--FIAS 422 Query: 707 MALAGIGVASIKALLRGEDPSL------PEVIYDGTLANGALLPYMDRLTKLVSKGDRAA 760 + G + L G +P + L G L Y D L ++ A Sbjct: 423 TTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGSGA 482 Query: 761 IGGLLGPVPSMVTNLTSSA----VELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDH 816 + + GPV +V ++ A + NE + + K + +P N+WYLK + DH Sbjct: 483 LASMFGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGLMPGANLWYLKAALDH 542 Query: 817 LILNQILEELNPGYLDRQQSKKKKKGIE-LFQNMDEGLPH 855 +I NQ+ E +PGYL + + + KK+ + + + P Sbjct: 543 MIFNQMQEYFSPGYLRKMEQRSKKEFNQTYWWRPQDVTPQ 582 >gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 831 Score = 471 bits (1211), Expect = e-130, Method: Composition-based stats. Identities = 148/886 (16%), Positives = 298/886 (33%), Gaps = 94/886 (10%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 M C + + +A GR L K E + D I S + L++ + + + ++ Sbjct: 3 MSANCKREVEQAIGRPLKKSEADAINDKI-----SFHIRDLARTDPTKFNAMTEQQRQLA 57 Query: 61 ELIRSVNDAIDEAYKRHQLRS----------DLDRVQAGVYGKSQALFNKLFFKAGSAEV 110 ++ D + + K+ Q + D +A V G Q + LF + + Sbjct: 58 GAQAAMADHMADVAKKAQRKGLNLLAQTRELDNQTARAAVLGGKQPFTSALFERLRQVDT 117 Query: 111 PLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQ 170 ++ + A T ++ + K +G +K D E+ G+ + N A K Sbjct: 118 RIKGERNRAFTSIM---DTIMAAEPKFMGLITNKAVERDFVHEVFGQDSGNAIAKNAAKV 174 Query: 171 YFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + + + + + AG D + +PQP S+ K+R ++ +L LD RY + Sbjct: 175 WRDQMDSIRERQNAAGADIGRLDYGWLPQPHSLVKVRRAAPQEWASFVLGRLDRRRYLNE 234 Query: 230 DGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR-----EFERVFHFKDSQAH 284 DGT ++ ++ F+ T + P + G R R HFKD ++ Sbjct: 235 DGTQMNDGQVTDFLLAAHET--LRTDGLNKMTPGTGNGSSRAAKHDNAHRQIHFKDGDSY 292 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN 344 ++YM FG T+V + + + KD V+ +LGPNA + + D S Sbjct: 293 LEYMRDFG-PTSVFEAMNGSVHAQIKDTVLTEQLGPNAAQTYRLLHDTAKQKDAGGSGAF 351 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVM-RYGETVENTGWANWMAGLRSAAGASMLGQHPIG 403 + + + W V+ N +A + G+R+ A+ L I Sbjct: 352 AGTEFGATPDMV----------WNVLNGSLGVPVNARFAEFNQGIRNFMVAAKLQATLIA 401 Query: 404 ALLEDGFISRQMLSRVGIDKEAIQRINKMP--LKERMELLSDVGLYAEGVVAHGRNMMEG 461 +++ D S + S ++ + K+ + + + + + + Sbjct: 402 SVIGD-VQSLAITSAYHGLPIGKTLVSALKSVSKDYRTEAGRMSIGMDSITSDMVSFHTD 460 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDP 521 + + KL + K + E ++ + +++ T Sbjct: 461 NLSAGWTSKLANATMKVTLLEGWTNAMRRGFSVEIMSRMAGDTRKAWG-------DDPVL 513 Query: 522 SIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR 581 + + D+ V + A TP ++ ++K Sbjct: 514 QSRLERHGITQDDWAVWQAATPEDWR--GHQMLTPESVASMK------------------ 553 Query: 582 KKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSL 641 S +Q+ + +L ++E A + Sbjct: 554 -------GFSAKQKNDAIGKLLGYIQEESEFTSILPGIMTRATLXXXXXXXXXXXXXXXX 606 Query: 642 FDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWI 701 F + MF + + G V+ Sbjct: 607 XXXXXXXXXXXX------XXXXXXXXFKSFGLAMFERHWKRVSQIESTGGKLAYSASVF- 659 Query: 702 QYSATMALAGIGVASIKALLRGEDPS---LPEVIYDGTLANGALLPYMDRL---TKLVSK 755 + +AG + ++ G DP + L G + + D L ++ Sbjct: 660 ---TGLLMAGAMTNQLMDIMNGRDPRDMKDGKFWLQAMLRGGGVGIFGDILNTGLGGDNR 716 Query: 756 GDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENS--KVNATKAIRKTLPFMNMWYLKNS 813 G ++ + GLLGPV ++ + + + E + N + + PF+ WY K + Sbjct: 717 GGQSNLTGLLGPVYGTAADVGLTLGSVFKEKTEPADVGANLLRIGYQNTPFIRSWYTKAA 776 Query: 814 FDHLILNQILEELNPGYLDRQQSKKKKKGIEL-FQNMDEGLPHRLP 858 F+H +++ + E L+PGYL R + + KK + + E P R P Sbjct: 777 FEHAVMHDMQEMLSPGYLSRMKKRAKKDFNQRFWWEPGETAPSRAP 822 >gi|221213942|ref|ZP_03586915.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166119|gb|EED98592.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 864 Score = 459 bits (1181), Expect = e-127, Method: Composition-based stats. Identities = 174/936 (18%), Positives = 315/936 (33%), Gaps = 159/936 (16%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLD---GKGLSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + +RA D +S+A+R A Sbjct: 1 MHQKCVNAVEAAAGRKLTQAEIDGIENRVRAGMRATARQDPVGWSAMSQADRVAAGAEWA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNKLFFKAGSAEVP 111 + + E +D A K+ Q+ + DR+Q +Y + K + + Sbjct: 61 RKQLEHEA------DLDRARKQLQIAKQIETTDRIQEALYADPENAHRKRA-RETIVKQD 113 Query: 112 LE------MKIKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEMK-GKK--T 159 +E IK+ + + + G L D D+ E+ G T Sbjct: 114 IEQTYVLAGAIKSDYMRQTMGAIDAMKAGQNFLARAFDVDNPAMERDIIREVYHGADGST 173 Query: 160 QNEQASRLVKQYFETQRELHSQAHEAGL-----DYKFFENRIPQPMSVDKLRATKKDDFV 214 NE A +Q +T + + + AG DY + R Q + + + Sbjct: 174 GNEVAKAAAEQISKTTAAMRERFNRAGGNVGELDYGYVPIRHSQSKVLGNGSDAARHAWA 233 Query: 215 RSMLDWLDLSRYKDIDGTPLSRSEIASFV-GEVFAERVRSTSFKDPSIPSSEVGVKR--- 270 +++ LD S+Y D G PL+ +++ + GE R+ + +I + GV Sbjct: 234 DAVMPLLDRSQYLDDAGNPLNDADLRKMLVGEDREPWERANAAARGNIAPRKQGVWDTIA 293 Query: 271 ----------------------EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASL 308 RV HF+D+ AH+ Y +G + +N ++ + + Sbjct: 294 YGGVNKIVPGETTGSAARANAGSAHRVLHFRDADAHIQYNRQYGEGSLLNALV-DHVGGM 352 Query: 309 SKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE 368 +K+I + GPN +K + T +D LE ++ W Sbjct: 353 AKNIALVERYGPNPTRNMKTQMQLTAVHDGT------------EMRTLEGGMTSIGAYWN 400 Query: 369 -VMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFI-SRQMLSRVGIDKEA- 425 V T N A M LR+ A L + AL + G + ++V K Sbjct: 401 YVTGTTNTPVNPALARKMETLRTTVSAVKLQGTILAALGDVGTMFVTAGYNKVPFFKTLG 460 Query: 426 -IQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYL 484 R+ K+ LS GL AE + + A L + K+ G Sbjct: 461 TAARLMAPGSKDFRAWLSSQGLIAESLEHGLNRWGTDNLATTWARNLSAATMKFGGVTGW 520 Query: 485 DKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAM 544 ++ + + + T L R + + D+ V+ +A Sbjct: 521 TDALRTAFQSHMMRGLAGIGRT--DWNSLTEWDRRALTR----AGITADDWAVVNKATPG 574 Query: 545 SSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLAD 604 D TP + DA + A+ Sbjct: 575 RYGDAEY--LTPDALYATGDA-----------------------------------RAAN 597 Query: 605 LERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRM 664 + K + +++++ + D + + + GT GE + Sbjct: 598 VVPKLLGMIREEGEFAVLN------------------PDLRTKVIASATPGTAMGELKKT 639 Query: 665 FQQFTTTPTGMFLNILDL----SNSAKMPKGASMALNHVWIQYSA---TMALAGIGVASI 717 F QF + P M S + AL + +A + L G + Sbjct: 640 FMQFKSFPIAMISRHWGRIGDMRRSGDFRVDGAPALANPMAYAAALVVSTTLIGAISTQV 699 Query: 718 KALLRGEDPSL--------PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGG--LLGP 767 K LL G+DP G D LT D ++ G + GP Sbjct: 700 KNLLAGKDPEPMFDDVKHAAGFWTRAFSVGGGAGFAGDMLTASFESTDYGSLLGSVVGGP 759 Query: 768 VPSM----VTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQIL 823 +PS V +S+A + A + + + K + P +N+W+ K ++ LI + + Sbjct: 760 LPSTIYQVVRAFSSNAQDAAQGKDTHVSADLLKVAQSNTPLVNLWFWKTVWNRLIWDNLA 819 Query: 824 EELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 E L+PG R ++ + + + F + G P R P Sbjct: 820 ENLSPGVTQRNINRSRNQYHNDYFWSPGTGSPQRAP 855 >gi|48697207|ref|YP_024937.1| hypothetical protein BcepC6B_gp17 [Burkholderia phage BcepC6B] gi|47779013|gb|AAT38376.1| gp17 [Burkholderia phage BcepC6B] Length = 864 Score = 451 bits (1159), Expect = e-124, Method: Composition-based stats. Identities = 170/935 (18%), Positives = 306/935 (32%), Gaps = 157/935 (16%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLD------GKGLSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + S +S+A+R A Sbjct: 1 MHQKCVNAVETAAGRKLTQAEIDGIENRVRAGMRSTARQDPAGWSAMSQADRVAAGAEWA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNK-----LFFKAG 106 + E +D A K+ Q+ + DR+Q +Y + K + Sbjct: 61 RQQLVHEA------DLDRARKQLQIAKQIETTDRIQEALYADPENAHRKRARETIVKHDI 114 Query: 107 SAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM-KGKK--TQ 160 IK+ + + +VG L D D+ E+ +G T Sbjct: 115 EQTYVTAGAIKSDYMRQTMGAIDAMKVGQNFLARAFDVDNPAMERDIIREVYRGADGSTG 174 Query: 161 NEQASRLVKQYFETQRELHSQAHEAGL-----DYKFFENRIPQPMSVDKLRATKKDDFVR 215 NE A +Q +T + + + AG DY + R Q + ++ + Sbjct: 175 NEVAKAAAEQIGKTTGAMRERFNRAGGNVGELDYGYVPIRHAQSKVLGNGSDAQRHAWAD 234 Query: 216 SMLDWLDLSRYKDIDGTPLSRSEIAS-FVGEVFAERVRSTSFKDPSIPSSEVGVKR---- 270 +++ LD S+Y D G PL+ +E+ VGE R+ + ++ + GV Sbjct: 235 AVMPLLDRSQYLDDAGNPLNDAELRKVLVGEDREAWERANAAARGNVAPRKQGVWDTIAY 294 Query: 271 ---------------------EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLS 309 RV HF+D+ AHM Y FG + +N ++ + ++ Sbjct: 295 GGVNKIVPGETSGGAARANAGSAHRVLHFRDADAHMQYNRQFGEGSLLNALV-DHVGGMA 353 Query: 310 KDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE- 368 K+I + GPN +K + T +D LE ++ W Sbjct: 354 KNIALVERYGPNPTRNMKTQMQLTAVHDGT------------EMRTLEGGMTSVGAYWNY 401 Query: 369 VMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFI-SRQMLSRVGIDKEA-- 425 V T N A M LR+ A L + AL + G + ++V K Sbjct: 402 VTGATNTPVNPALARKMETLRTTVSAVKLQGTILAALGDVGTMFVTAGYNKVPFFKTLGT 461 Query: 426 IQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLD 485 R+ K+ LS GL AE + + A L + K+ G Sbjct: 462 AARLMAPGSKDFRSWLSSQGLIAESLEHGLNRWGTDNLATTWARNLSAATMKFGGVTGWT 521 Query: 486 KKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMS 545 ++ + + + T L R + L D+ ++ +A Sbjct: 522 DALRTAFQSHMMRGLAGIGRT--DWNSLTEWDRRALTR----AGLTADDWAIVNKATPGK 575 Query: 546 SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 D TP + + + AD+ Sbjct: 576 YGDAEY--LTPDALYA-----------------------------------TGEARAADV 598 Query: 606 ERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMF 665 K + +++++ + D + + + GT GE + F Sbjct: 599 VPKLLGMIREEGEFAVLN------------------PDLRTKVIASATPGTVTGELKKSF 640 Query: 666 QQFTTTPTGMFLNILDL----SNSAKMPKGASMALNHVWIQYSA---TMALAGIGVASIK 718 QF + P M S + AL + +A + L G K Sbjct: 641 MQFKSFPMAMISRHWGRIGDMRRSGDFRVDGAPALANPMAYAAALVVSTTLIGAISTQAK 700 Query: 719 ALLRGEDPSLP--------EVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLL--GPV 768 LL G+DP G D L D ++ G GP+ Sbjct: 701 NLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAGFAGDMLVAAFQSADYGSLLGSAIGGPL 760 Query: 769 PSMV----TNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILE 824 S + ++S+ + A + + + K + P +N+W+ K ++ LI + + E Sbjct: 761 LSTLFQPLRAVSSNVQDAAQGKDTHIGADLLKIAQSNTPLVNLWFWKTVWNRLIWDNLAE 820 Query: 825 ELNPGYLDRQQSKKK-KKGIELFQNMDEGLPHRLP 858 L+PG R ++ + + + F + G P R P Sbjct: 821 NLSPGVTQRNMNRSRTQYHNDYFWSPGTGSPQRSP 855 >gi|221201510|ref|ZP_03574549.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207934|ref|ZP_03580940.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221172119|gb|EEE04560.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221178778|gb|EEE11186.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 869 Score = 450 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 171/940 (18%), Positives = 309/940 (32%), Gaps = 162/940 (17%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M +C+ + AAGR+L++ E+ +E+ + + L +S+A+R A Sbjct: 1 MHQKCVNAVEAAAGRKLTQAEIDGIENRVRAGMRAKARQDPLAWSAMSQADRVAAGAEWA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNK-----LFFKAG 106 + E +D K+ Q+ + DR+Q +Y + K + Sbjct: 61 RQQLVHEA------ELDRMRKQLQIAKQIETTDRIQEALYADPENAHRKRARETIVKHDI 114 Query: 107 SAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM-KGKK--TQ 160 L IK+ + E + G L D D+ E+ +G T Sbjct: 115 EQTYVLAGAIKSDYMRQTMGAIEAMKAGQNFLARAFDVDNPAMERDIIREVYRGADGSTG 174 Query: 161 NEQASRLVKQYFETQRELHSQAHEAGL-----DYKFFENRIPQPMSVDKLRATKKDDFVR 215 NE A +Q +T + + + AG DY + R Q + + + Sbjct: 175 NEVAKAAAEQISKTTAAMRERFNRAGGNVGELDYGYVPIRHSQSKVLGNGSDAARHAWAD 234 Query: 216 SMLDWLDLSRYKDIDGTPLSRSEIASFV-GEVFAERVRSTSFKDPSIPSSEVGVKR---- 270 +++ LD S+Y D G PL+ ++ + GE R+ + +I + GV Sbjct: 235 AVMPLLDRSQYLDDAGNPLNDVDLRKMLVGEDREPWERANAAARGNIAPRKQGVWDTIAY 294 Query: 271 ---------------------EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLS 309 RV HF+D+ AH+ Y +G + +N ++ + ++ Sbjct: 295 GGINKIVPGETTGSAARANAGSAHRVLHFRDADAHIQYNRQYGEGSLLNALI-DHVGGMA 353 Query: 310 KDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE- 368 K+I + GPN +K + T +D LE ++ W Sbjct: 354 KNIALVERYGPNPTRNMKTQMQLTAVHDGT------------EMRTLEGGMTSVGAYWNY 401 Query: 369 VMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFI-SRQMLSRVGIDKEA-- 425 V T N A M LR+ A L + AL + G + ++V K Sbjct: 402 VTGATNTPVNPALARKMETLRTTVSAVKLQGTILAALGDVGTMFVTAGYNKVPFFKTLGT 461 Query: 426 IQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLD 485 R+ E LS GL AE + + A L + K+ G Sbjct: 462 AARLMAPGSSEFRSWLSAQGLIAESLEHGLNRWGTDNLATTWARNLSAATMKFGGVTGWT 521 Query: 486 KKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMS 545 ++ + + + T L R + + D+ V+ +A Sbjct: 522 DALRTAFQSHMMRGLAGIGRT--DWNSLTEWDRRALTR----AGITADDWAVVNKATP-G 574 Query: 546 SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 DG Y TP + DA + AD+ Sbjct: 575 RYDGAEY-LTPDALYATGDA-----------------------------------RAADV 598 Query: 606 ERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMF 665 K + +++++ + D + + + GT GE + F Sbjct: 599 VPKLLGMIREEGEFAVLN------------------PDLRTKVIASATPGTVTGELKKSF 640 Query: 666 QQFTTTPTGMFLNILDLSNSAK---------MPKGASMALNHVWIQYSA---TMALAGIG 713 QF + P M + + P+ + L + +A + L G Sbjct: 641 MQFKSFPMAMISRHWGRIGNMRRSGDYLVEGAPRAFGIPLANPMAYAAALVVSTTLIGAI 700 Query: 714 VASIKALLRGEDPSL--------PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGG-- 763 K LL G+DP G D L D ++ G Sbjct: 701 STQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAGFAGDMLVAAFESADYGSLLGSA 760 Query: 764 LLGPVPSMV----TNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLIL 819 + GP+ S + ++S+ + A + + + K + P +N+W+ K ++ LI Sbjct: 761 VGGPLLSTLFQPLRAISSNVQDAAQGKDTHVGADLLKIAQSNTPLVNLWFWKTVWNRLIW 820 Query: 820 NQILEELNPGYLDRQQSKKK-KKGIELFQNMDEGLPHRLP 858 + + E L+PG R ++ + + E F + G P R P Sbjct: 821 DNLAENLSPGVTQRNMNRSRTQYHNEYFWSPGTGAPQRAP 860 >gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2] gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus] Length = 809 Score = 449 bits (1154), Expect = e-124, Method: Composition-based stats. Identities = 188/877 (21%), Positives = 346/877 (39%), Gaps = 103/877 (11%) Query: 1 MKPECIQVLNKAAGR-ELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQ 59 MK ECI + AAG +LS ++ +E I ++ + +G+ +A A L ++ + Sbjct: 1 MKEECINAVRVAAGELKLSDVDIEHIEHHI---RIAWEQEGVKQAG---FADLPLDQQIK 54 Query: 60 KELIRSVNDAIDEA--YKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIK 117 + ++ + ++ YK ++L S + L ++L A S +EM IK Sbjct: 55 RVSKKAKSSFFSDSDRYKPYELLSTFKG-----ENQVTELGHRLAHHATSGG-SIEMSIK 108 Query: 118 AAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRE 177 +KV +F +Y G+K GF D ++ ++G K N +A +L + ET Sbjct: 109 GLRSKVFDRFKDYHTYGTKAFGFKNDVNAHTELLRALRGDKGVNPEALKLASIFHETMDF 168 Query: 178 LHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRS 237 L +A G+ + +N PQPM K+ KD+FV L LD + Y+ + Sbjct: 169 LVKEAKAVGIKFNPRDNYTPQPMDFRKISLVTKDEFVDRTLPRLDWAEYQKRG--LDNEG 226 Query: 238 EIASFVGEVFAE-------RVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEH 290 + FV +V+ +V ++ KD S +G + R H+ Q ++ M+ Sbjct: 227 SLRQFVEDVYETLASEGRNKVIASGGKDHSGI--SLGGRLRQVRQLHYT-PQGLVEAMKE 283 Query: 291 FGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN-KVLKD 349 FG V +++ +L +DI IARE G NA+ ++ D+E + K Sbjct: 284 FGSDLTVEGMMSRSFDNLIRDIAIAREFGANANENFNFVLASMFERDREDINSRLEGDKK 343 Query: 350 WLGRNKLEVRQEAMLQM-WEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIG---AL 405 NKL+ ++E +QM W+ + G +T + + + LG + + Sbjct: 344 TKALNKLK-KEEMQVQMDWDGLTMGRKQPST-MDKIVDSATAWTVITKLGSQSLYIPKEI 401 Query: 406 LEDGFISRQMLSRVGIDK-EAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDA 464 +E F+ Q + I + + KER E + + + E + +E + Sbjct: 402 IESAFMGSQRMGYTWKTNIANIWNASPVAGKERKEFIKSITVGLEHMATGFTRDLETNSQ 461 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIK 524 +G + K W G LD + + + + +G T + + LK Sbjct: 462 SVLG-VMAKKTMDWQGLTTLDNMMVRGLSATLQDYVGGFTRNFKDMDSLK---------- 510 Query: 525 AFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKL 584 K++ + F I + D +K L AD + Sbjct: 511 ---KKIGEQSFKSIIDEHRFNERD----------LKLLSLADTESFKGKGTYLTDKNIYR 557 Query: 585 KNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDR 644 + L+P ++ D+ R LK ++NK + VQ RG++ +++ D+ Sbjct: 558 IDDTKLTP-----FLKKGEDIYR-----LKSDLANKYRTFIWSTVQEHARGSVGSTIQDK 607 Query: 645 QRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYS 704 + +T K G+ R+ QF P + ++P + V+ + Sbjct: 608 R---WITGKDGSV-NNLARLMGQFLVMPIS-----WSRMHLIEIPSSLVGVSSQVYRAKA 658 Query: 705 ATMALAG--IGVASIKALLRGEDPSL---PEVIYDGTLANGALLPYMDRLTKLVSKGDRA 759 + + G + ++ L+ G++P L Y L NG + + +R + S G Sbjct: 659 LVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--ITHYERFSPFNSSG--- 713 Query: 760 AIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKA------IRKTLPFMNMWYLKNS 813 +LGP S L + E + + +A + +PF N+WY + + Sbjct: 714 --WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771 Query: 814 FDHLILNQILEELNPG-------YLDRQQSKKKKKGI 843 F+H + N I + LNPG Y RQ+ KK++K Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 808 >gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 1175 Score = 440 bits (1131), Expect = e-121, Method: Composition-based stats. Identities = 134/650 (20%), Positives = 257/650 (39%), Gaps = 47/650 (7%) Query: 1 MKPECIQVLNKAAGRE-LSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLK 53 MK +C Q + KA G++ L+ +E +E I +L + + LS AE+ A + Sbjct: 1 MKEQCKQAVAKALGKQSLTAQEATDIEARINETMRNLARKDINNWRNLSDAEKLTEAAKQ 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG-SAEVPL 112 D Q++L R A + K+ Q + LD G + + + S + Sbjct: 61 VAIDIQEQLKRKHKIAAQDILKQSQNIAALD---HGKLSSMEVIDRMVAAHGDMSGIQSI 117 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYF 172 + K + + ++ LG D++ + E G+ T + A ++ + Sbjct: 118 DSKARGIAAIYRGELVDFYTNIKGGLGIFTDQELVQKIVRERFGESTGDALAKKISDKMG 177 Query: 173 ETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDG 231 + + + + G D +N +PQ +++K+ K +V +D +Y +G Sbjct: 178 DVFETMRDRFNRNGGDIGKLDNWGLPQTHNLEKIAQAGKQAWVSKAESLIDTRQYVHENG 237 Query: 232 TPLSRSEIASFVGEVFAERVRSTSFK------DPSIPSSEVGVKREFERVFHFKDSQAHM 285 S+ EI S + + + K +S+V + RV HFKD+++ + Sbjct: 238 DYYSQQEIRSLLEYTYDTLSSDGANKIEVGRQATGGGTSKVTNRHGESRVLHFKDAESWL 297 Query: 286 DYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNK 345 +Y FG V ++ + + LSKDI + LG N + +K ++ D E Sbjct: 298 EYQSDFGGMQFV-DLVEAHINGLSKDIAMVENLGSNPKTALKILMDAAAKKDWEK----- 351 Query: 346 VLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGAL 405 + N+ + ++ M++ + G T ++ AN RS ASMLG I +L Sbjct: 352 ----GIEENQTKSSRKRAQVMFDELSGGNTPQSQVLANLGIAYRSMNVASMLGGTTIASL 407 Query: 406 LEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEG 461 + I++ +S I+++N +R E +GL E ++ G Sbjct: 408 ADQATIAKNASVHNVSYRKAFGGLIEQLNPANKADR-EQAHSLGLATEEML--GSIARWS 464 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRIS-SHALIVYNQIG---RMTDTYASLKDLKADP 517 D + K+ + S R+S +AL +++G + + Y L KA Sbjct: 465 DDGLTSTYGKSEKLARISSGVATQVMRVSFLNALTSASKVGFTKLLMEKYGRLSRSKAWN 524 Query: 518 RLDPSIKAFFK--QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADL-----RDL 570 LD + LD+ + V + A+ + G + +I + D L +D+ Sbjct: 525 DLDVQDRELLSNTGLDERAWQVFQLAEPVVDRKGNQLM-SARSIYEIPDDKLLAAMDKDV 583 Query: 571 ARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNK 620 ++ I K+L + L ++ +Q+L D++R L D + K Sbjct: 584 NQLVSGINDQIKELNDRNALDDQRILNREQKLDDVKRSLSQRLLDYANRK 633 Score = 200 bits (508), Expect = 8e-49, Method: Composition-based stats. Identities = 72/361 (19%), Positives = 139/361 (38%), Gaps = 29/361 (8%) Query: 506 TYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDA 565 D K D + + + +K +D + A+ + S+ + Sbjct: 808 RIKGKTDKKIDSSVARNTRRNYKSGEDLG-RRLGNAERRMTEMRAKMRAADSSANKSINQ 866 Query: 566 DLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINIL----KDKVSNKM 621 +DL + + + + + +RQ + +LA+ E +L +D+V++++ Sbjct: 867 KFKDLDKRVNALDDEFVEYQAKVAERQAKRQYVMDKLANSIDGEKKLLAQKIRDEVASQL 926 Query: 622 HALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILD 681 A +LD +V A R+R + +GT GE + QF + + Sbjct: 927 QAHLLDEQGMAVIEA-----GLRERTWMTVGAKGTITGEVFKGLMQFKSFSASFLMRQGS 981 Query: 682 LSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS------LPE----V 731 + + + KG + I +M L G V ++ +L G DP P+ Sbjct: 982 RAMAQEGLKGKA----AYAIPLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSF 1037 Query: 732 IYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSK 791 +A G L D L R A + GP+ S T+L V T+ NE Sbjct: 1038 FMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSGPLGSDFTSLLGLTVGNLTQYNEGKD 1097 Query: 792 VN----ATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK-KGIELF 846 N A K ++ +P N+WY K + + ++ +++ + + PGY ++ K ++ + E F Sbjct: 1098 TNFGNEAFKFVKGKIPAQNLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRERF 1157 Query: 847 Q 847 Sbjct: 1158 W 1158 >gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1] gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 855 Score = 439 bits (1128), Expect = e-121, Method: Composition-based stats. Identities = 145/882 (16%), Positives = 306/882 (34%), Gaps = 92/882 (10%) Query: 5 CIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGK--GLSKAERYRLAGLKAEEDFQKEL 62 C + AAG ++ E++ + + + + L + A + + Sbjct: 10 CADAVRAAAG-DMESNEIQEIFQLLRGRTQEILAREGALGSEQAALRAADELARQAEHAA 68 Query: 63 IRSVNDAIDEAYKRHQLRSDL-DRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAET 121 I +A+ R +L + + D+ ++ + + + + KA Sbjct: 69 IIERRNALINVRARARLVAFVRDQFADRPDLGIESFLVGTNLARQGSRLSVAAEQKALGD 128 Query: 122 KVLSKFN---EYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQ--NEQASRLVKQYFETQR 176 + + A++ + D+ ++ K + T+ N Q + K + Q Sbjct: 129 AYIGGMLADLDRADLTAVLARGDSDQDIADALWRIGKDQDTKDLNPQVVEIAKIIQKYQE 188 Query: 177 ELHSQAHEAGLDYKFFENRIP-QPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLS 235 A+ AG I Q +K+ A + + +L LD + ++ G P+ Sbjct: 189 GARIDANRAGASIGKLPGYIARQSHDSEKMGAAGFERWAEEILPRLDTATFR-EGGDPMV 247 Query: 236 RSEIASFVGEVFAERVRSTSFKDPSIPSSEV--GVKREFERVFHFKDSQAHMDYMEHFGV 293 + + G V + ++S + + P+ K ERV HFKD A +Y + FG Sbjct: 248 FLK-GVYDGLVSGDHLKSPAGQQPNGFRGPANLAKKLSQERVLHFKDGVAWHEYNQLFGT 306 Query: 294 STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGR 353 N+ + L ++ + R LG N ++ + M + I D A L ++ Sbjct: 307 G-NLREAVLRGLDLSGQNTALMRRLGTNPEANLN-MAMDVIKEDVRAGGDPAALANFNTA 364 Query: 354 NKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISR 413 + + ++ EV N A A +R+ S LG + + + + Sbjct: 365 RRGVIG----NRLKEVSGQTRIPGNATQARVAANVRAWQSLSKLGGALLSSFTDLPVAAS 420 Query: 414 QMLSRVGID-----KEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIG 468 +M + + + E+ ++LS G+YA+ + D+ +G Sbjct: 421 EMRYQGQSFLGSLAEMGAGLMKGRGSAEQRQILSAYGVYADSMRGEIMRRFSADDS--VG 478 Query: 469 HKLHSKMHKW---SGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKA 525 K+ M ++ +G + +S L++ + + + + + L D + Sbjct: 479 GKMSRGMSQFFRLNGLSWWTDANKASAGLMMAHNLAQ--NKGKAWGSLNGDFKRALG--- 533 Query: 526 FFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLK 585 LD + +++ + DG Y TP I + D + ++ Sbjct: 534 -LYDLDAGKWELLREMDTRMA-DGRDYM-TPDGIAGISDERIGQYLAERNR--------- 581 Query: 586 NSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQ 645 PE +++ DLER + D+V+ + R M+ Sbjct: 582 ------PESAGAIRETRQDLERSLRAYVNDRVTYAVL-----EPDARTRSIMNQGT---- 626 Query: 646 RLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNH------- 698 + GT G+ LR QF + P L + ++ + Sbjct: 627 -------QPGTVPGDLLRFVTQFKSFPAAYMQKTLGRELYGRGYTPTALGNSFRGGRDLV 679 Query: 699 -----------VWIQYSATMALAGIGVASIKALLRGEDPS---LPEVIYDGTLANGALLP 744 Q G + K + +G +P P+ + G L Sbjct: 680 QALRNGNGERLALAQLMLWTTAFGYLSMASKDVTKGREPRPADDPKTWLAAMVQGGGLGI 739 Query: 745 YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPF 804 + D L ++ +A+ GP ++ + + K+ +++ +A + + PF Sbjct: 740 FGDYLFGEANRFGNSALESAAGPTIGTAADVIN--LWARAKEGDDTASSALRLAQNNTPF 797 Query: 805 MNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELF 846 MN++Y + + DHL L + E +NPG L R + + +++ + F Sbjct: 798 MNLFYTRIALDHLFLYSVQEAMNPGSLRRTEERIRQQNGQEF 839 >gi|85059662|ref|YP_455364.1| hypothetical protein SG1684 [Sodalis glossinidius str. 'morsitans'] gi|84780182|dbj|BAE74959.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 507 Score = 435 bits (1119), Expect = e-119, Method: Composition-based stats. Identities = 113/506 (22%), Positives = 214/506 (42%), Gaps = 25/506 (4%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAERYRLAGLKA 54 M+ ECIQ + A+ R L+ E++ +ED IV+ L + LS++ER + AG A Sbjct: 6 MRQECIQAITAASKRTLTSAEIQGIEDRIVKNMRHLARNDPTSWRSLSESERMQRAGHMA 65 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E ++E R +L + + + G GK +AL K+ F A + + + Sbjct: 66 AEALEREATLKKRRVALTIAARQRLDNFIAGYK-GKGGKLEALNRKIAFHADGKAPFLSV 124 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ +E ++ + + DKQ+ D+ EM+G+ T N +A + + + Sbjct: 125 ESRTKATRDYALSQLDELFSAIDPRFFQLFEDKQWIRDLVYEMRGQDTGNVRAKKGAEAW 184 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID 230 L + ++AG D E+ +PQ S++K+ + D+V ++ LD ++Y + Sbjct: 185 KNVSELLRRRFNDAGGDIGHLEDWGMPQYHSMEKVGKATQSDWVGFVIGKLDRNKYVKEN 244 Query: 231 GTPLSRSEIASFVGEVFAERVRSTSFK--DPSIPSSEVGVKR-EFERVFHFKDSQAHMDY 287 G +S ++A F+G + K D S R ER HFKD++ ++ Y Sbjct: 245 GELMSDKDVADFLGHAYKTIATGGMNKLGDSGRRLSGARANRGNAERQIHFKDAEGYIAY 304 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 + FG ++ IL + L +SKDI + GPN D + ++ + A + + Sbjct: 305 QQRFG-EKSMWDILVNHLDGISKDIALVETYGPNPDHVFRSLLDELAAKTADETPSRTGK 363 Query: 348 KDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE 407 KL+ + E + + + V N A W +R+ AS LG I +L + Sbjct: 364 -----IKKLKNKTEDLYNF--IAGKTQPVANPHIARWADHVRNWLVASRLGSALISSLSD 416 Query: 408 DGFISRQM-LSRVGIDKEAIQRINKMPLK--ERMELLSDVGLYAEGVVAHGRNMMEGSDA 464 +G + ++ + + + ++ M + + L E ++ + Sbjct: 417 NGTMYLTAKVNNLPMAQLLRNQLAAMNPANKDEIRFARGASLAMETLLGSVNRWATDNMG 476 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRIS 490 + + + + SG Sbjct: 477 PSPSRWVANAVMRASGLSAWSDAHKR 502 >gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 810 Score = 424 bits (1089), Expect = e-116, Method: Composition-based stats. Identities = 188/882 (21%), Positives = 335/882 (37%), Gaps = 136/882 (15%) Query: 1 MKPECIQVLNKAAGR-ELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQ 59 M PECI+ + K AG +L ++L ++E + + L+GL+ E F+ Sbjct: 1 MHPECIERVKKLAGEWKLEPEDLDQIE----------------RVSKQALSGLELNESFK 44 Query: 60 KELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQAL-----------FNKL-FFKAGS 107 +++ + + K H L ++ G + S+ L N L F Sbjct: 45 N--LKTADKVKALSEKAHLLL-----LENGAFAMSETLGGVGRAKHGEQLNTLKNFLRYE 97 Query: 108 AEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRL 167 +E +IK + F+++ ++GSKNLGF+ D + ++G +T + Q ++ Sbjct: 98 TTASIESRIKGEQANARKAFHDFEDLGSKNLGFSADPITNEKITKALRGVETDDPQVNKF 157 Query: 168 VKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYK 227 + Y + + + +QA + GL + PQP K+RA K ++ +++ W+D+ Y Sbjct: 158 GRAYRKIRDRVTAQAEDMGLLHPLDNWGSPQPDDALKIRAKGKKAWIETIMPWVDVEAY- 216 Query: 228 DIDGTPLSRSEIASFVGEVFAERVRSTSFK------DPSIPSSEVGVKREFERVFHFKDS 281 D L + F+G V+ + K + VG R+ R D Sbjct: 217 --DKKGLYGKGLTEFLGHVWDTKSSEGRNKILASGGAEQAGKASVGGSRKQPRHLFLLD- 273 Query: 282 QAHMDYMEHFG-VSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEA 340 + + DY FG N ++ + L +DI IAR G NAD+ + +I Q ND ++ Sbjct: 274 EHYSDYNAAFGKTGLNAEDLVRMTIDPLIRDIEIARTFGSNADNNFRWVITQAYENDLKS 333 Query: 341 SAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSA-------AG 393 + + G +EA + +W+ + + + +N LR Sbjct: 334 AKTASDVTKMGGL-----YKEANI-LWDRLTISSEMLDHELSNAQINLRELKSGFSTFQV 387 Query: 394 ASMLGQHPIGALLE------DGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLY 447 G AL E G + M E + + K + + G Sbjct: 388 VKSFGMQIFSALPETINCVVMGSHRQGMPFWSRALPEFKRHLTNANYKASIRAFAPAG-- 445 Query: 448 AEGVVAHGRNMMEGSDAFQIGHK-LHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDT 506 E + N F G K L K KW G + LD+ + + +G +T Sbjct: 446 -EMAITGMMNEFHNQSKFVSGMKVLAEKTVKWQGLKALDRFQRDLSFGFTSSWMGEVTRG 504 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 + L+D K+ + + T T+IK Y T S + L + Sbjct: 505 FKGLEDFKS------------RYGEQTFKTLIKD-----------YGFTQSDMHALSKVE 541 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ----QLADLERKEINILKDKVSNKMH 622 L + L+P+ +E + LA E K I + +S+KM Sbjct: 542 L-----------------DAGRLLTPDSIRECRHPDLVTLARSENKSIERMMGDLSSKMS 584 Query: 623 ALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDL 682 + Q + RG++ +SL D + T RG G L + QF TTP M L Sbjct: 585 GYIWSQTQDNARGSVGSSLRDTK----YTSSRGGIPG--LSLVTQFLTTPISMAEKHLWA 638 Query: 683 SNSAKMPKGASMALNHVWIQYSA-TMALAGIGVASIKALLRGE---DPSLPEVIYDGTLA 738 + M+ ++ A + L GI + + L G+ D + P+V+ L Sbjct: 639 VPKTLVGGANGMSAWSYRAKFLAFGIVLEGIVANTARKALTGQELDDFTDPKVL---ALM 695 Query: 739 NGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELAT---KDN----ENSK 791 L + DR + + + PV S V L + +E++ ++ + Sbjct: 696 TARTLTHYDRFFNEYHHDFKDLLHSV--PVASTVIGLGDAGLEVSRNIFGEDEEKKAKAN 753 Query: 792 VNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDR 833 K + +P N++Y+K +F ++++ + E N GY DR Sbjct: 754 AKLAKEVANNMPLKNLFYVKAAFQKMVVDNLCEYFNEGYKDR 795 >gi|303328566|ref|ZP_07359001.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861332|gb|EFL84271.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 855 Score = 422 bits (1084), Expect = e-115, Method: Composition-based stats. Identities = 158/893 (17%), Positives = 303/893 (33%), Gaps = 101/893 (11%) Query: 21 ELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLR 80 E + D ++ L G + A E ++ K + Sbjct: 2 EALDIVDMLLEQKARLKASGDLTPQNLSRAWSATAEGLARQRAIQRRRTALGLVKFREAA 61 Query: 81 SDLDRVQA---GVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKN 137 +D +A QAL + + A + + S E Sbjct: 62 GFVDSAKAQGVSAMEGIQALMVGVSRRFDGARRSVSALRQGIFKSWASPMLRELEAVDNG 121 Query: 138 LGFT---LDKQFGLDVFDEMKGKK-TQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFE 193 DK F VF EM+ T ++ A + + + + + AG D + Sbjct: 122 AALRLMREDKAFHDSVFREMREPDSTGDKNARAIADIFSRYTEQSRVRLNAAGADIGKLD 181 Query: 194 NRIPQPMSVDKLRA---TKKDDFVRSMLDWLDLSRYKDIDGTPLSRS-EIASFVGEVFAE 249 PQ KL A + +V ML LDL R DG L + + V+ Sbjct: 182 GWTPQTHDPYKLMAGGEAGRAKWVDFMLPRLDLER--TFDGVGLVDANRARELLNGVYDT 239 Query: 250 RVRSTSFKDPSIPSSEVGVKREF------------ERVFHFKDSQAHMDYMEHFGVSTNV 297 T ++P +P G RV HFKD+Q ++Y + +G N+ Sbjct: 240 ---LTMGRNPHMPGDFTGGGASVPGPRNLASGMGKSRVLHFKDAQGALEYHDAYGRG-NI 295 Query: 298 NTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQE-----ASAGNKVLKDWLG 352 + L ++ + + LGPN +++++ ++ + +++ Sbjct: 296 FDAMLRHLEQDARALALMERLGPNPQYTLERLLAHEKRALKDNAVLTPEEKARQMRELDN 355 Query: 353 RNKLEVRQEAMLQMW--EVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGF 410 + ++ + W E+ + A A LR++ S LG + A+ + Sbjct: 356 AFSGGIIRQGRVSAWLAELTGETSWAVHPTLARVGAVLRASQNLSKLGGASLSAIADVFT 415 Query: 411 ISRQMLSRV----GIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAH-GRNMMEGSDAF 465 + M G +++ + + + ++ G + + V + S Sbjct: 416 KAASMRVNGETWPGAIGKSLAQYIQGFSGKEKDVARQCGAFLDHVRGDIVARWDDASGMP 475 Query: 466 QIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKA 525 + L K+ +WSG ++ ++ + + L + +G ++ KA +LD +A Sbjct: 476 GVLADLQDKLFRWSGLNWITERGKAGYTLWLSEHLGEVSG--------KAFDQLDGPRRA 527 Query: 526 FFKQLDDTDFTVIKRAKAMS--SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKK 583 Q D + + MS + DG Y TP L DADL L Sbjct: 528 ML-QYHGVDPERWEAMRKMSHQAEDGKAY-FTPEAAAYLTDADLAPLLP----------- 574 Query: 584 LKNSKTLSPE-QRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLF 642 +++K P+ Q +EL + L + +L D+ + + + R M Sbjct: 575 -EHAKNAPPDVQARELARIRDSLRFDSMAMLADETA-----FAIIEPDDATRAIMRQGT- 627 Query: 643 DRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILD-------------LSNSAKMP 689 + GT AGE R QF + P +L +P Sbjct: 628 ----------RPGTGAGEVWRAIMQFKSFPIAYMQRVLGGRRWVRGDLQRGMRYGPRNLP 677 Query: 690 KGASMALNH---VWIQYSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALL 743 AL + + + G ++K L +G +P E + +G Sbjct: 678 GAVEDALTRDMGGLMGFVLSSVAFGYASMTLKDLAKGREPRSLAHRETWLAAAMQSGGAG 737 Query: 744 PYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLP 803 + D L V++ + +GP+ ++ + + +L D ++ + + P Sbjct: 738 IFGDILFGKVNRFGNSFAETAVGPLGGLIGDAATLGGQLVRGDMADAGEDTLRLAMGNAP 797 Query: 804 FMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNMDEGLPHR 856 F+N+WY + + D ++L + E ++PG L R + K KK+ + F R Sbjct: 798 FINLWYTRAALDWMLLYHVREMMSPGTLRRTERKMKKEFGQEFLFPPSQFIRR 850 >gi|254251753|ref|ZP_04945071.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] gi|124894362|gb|EAY68242.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] Length = 865 Score = 403 bits (1035), Expect = e-110, Method: Composition-based stats. Identities = 171/937 (18%), Positives = 308/937 (32%), Gaps = 160/937 (17%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGI---VRAYVSLD---GKGLSKAERYRLAGLKA 54 M +C + +AAGR+L K EL +E+ + +RA D + +++AER + A Sbjct: 1 MHAKCAAAVAQAAGRDLKKAELDGIENRVRAGMRAVARQDPAAWRSMTEAERVQAGAEWA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDL---DRVQAGVYGKSQALFNKLFFKAGSAEVP 111 + + E +D+A K+ Q+ + DR+Q ++ + + K + + + Sbjct: 61 RQQLEAEAN------LDKARKQLQIAKQIETTDRIQEALFADPERAYAKRA-REKAVKAD 113 Query: 112 LE------MKIKAAETKVLSKFNEYAEVGSKNLGFTLD---KQFGLDVFDEM-KGKK--T 159 +E IKA + E + G L D D+ E+ +G T Sbjct: 114 IERTYELAGGIKADYMRQTMDAIEAMKHGQNFLARAFDIDNPAMERDIIREIYRGADGST 173 Query: 160 QNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKL----RATKKDDFV 214 NE A +Q T + + + AG + + +P S K+ + + Sbjct: 174 GNEVAKAAAQQIGATSNAMRERFNRAGGNVGQLDYGYVPIRHSQAKILGNGSDAARHAWA 233 Query: 215 RSMLDWLDLSRYKDIDGTPLSRSEIASFV-GEVFAERVRST------------------- 254 +L LD S+Y D G PL + + + GE Sbjct: 234 DFVLPRLDRSQYLDDAGNPLDDAALRRVLTGEDRESWEARNIAARGMGVEPRQQGVWDTI 293 Query: 255 --SFKDPSIPSSEVGVKRE-----FERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELAS 307 + +P G RV HFKD+ AH++Y +G + +N ++ + Sbjct: 294 AYGGVNKIVPGETTGAAARANAGSQHRVLHFKDADAHIEYNRAYGEGSLLNALI-DHVGG 352 Query: 308 LSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMW 367 ++K+I + GPN ++ + T +D LE ++ W Sbjct: 353 MAKNIALVERYGPNPTRNMRTQMQLTALHDNT------------ELRTLEGGMTSVGAYW 400 Query: 368 E-VMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFI-SRQMLSRVGIDKEA 425 V T N AN M +R+ A L + AL + G + +RV K Sbjct: 401 NYVTGATNTPVNPAVANKMETVRTTVSAIKLQLTILAALGDVGTMFVTAGYNRVPFFKTL 460 Query: 426 --IQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEY 483 R+ + L+ GL AE + A L ++ K+ G Sbjct: 461 GTAARLMGPGSGDYRSWLTSQGLIAETLEHGLNRWGTDHLATSWAKWLSAQTMKFGGVTG 520 Query: 484 LDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKA 543 ++ + + ++ T L R + + D+ ++ RA Sbjct: 521 WTDAMRTAFQAQMMRGLAEISGT--EWSKLTEWDRRSLTR----SGITADDWALVNRATP 574 Query: 544 MSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLA 603 +G Y TP + DA D+ Sbjct: 575 -GEYNGSKY-LTPDALYGTGDARAADVVP------------------------------- 601 Query: 604 DLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALR 663 K+ ++ D + +V D + + GT GE + Sbjct: 602 ----------------KLLGMIRDEGEFAVLNP------DLRTKVIAAATPGTLQGELQK 639 Query: 664 MFQQFTTTPTGMFLNILDL----SNSAKMPKGASMALNHVWI---QYSATMALAGIGVAS 716 F QF + P M S + L + L G Sbjct: 640 TFLQFKSFPIAMISRHWGRIGEMRRSGDFRVEGAPTLASPMAYGAALVVSTTLLGALAVQ 699 Query: 717 IKALLRGEDPSLPE--------VIYDGTLANGALLPYMDRLTKLVS--KGDRAAIGGLLG 766 ++ LL G+DP + G D L+ +++ A G Sbjct: 700 LQNLLLGKDPEPMGDDVKHGGAFWFRAFTKGGGAGFAGDMLSAMLTGKNPAEAVGSVFGG 759 Query: 767 PVPSMVTN----LTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQI 822 P+ S +++A+ A + + + K + +P +N+WY K ++ LI + I Sbjct: 760 PLVSTAIQAVTPFSNNAMAAAEGKDTHLSADLLKFAQSNMPIVNLWYWKTVWNRLIWDNI 819 Query: 823 LEELNPGYLDRQQSKKKKK-GIELFQNMDEGLPHRLP 858 E L+PG R +K +++ + F P R P Sbjct: 820 AENLSPGVTSRNVAKSRQQYHNDYFWEPGTSAPQRAP 856 >gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine microorganism HF4000_48F7] Length = 828 Score = 401 bits (1030), Expect = e-109, Method: Composition-based stats. Identities = 145/892 (16%), Positives = 296/892 (33%), Gaps = 132/892 (14%) Query: 9 LNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSK-AERYRLAGLKAEEDFQKELIRSVN 67 + + L E + L D + ++ ++R + +KEL Sbjct: 11 VANSTKFGLKASEAKELVDVLRNEQRNVRATAKGDYTIQFRKTAEELTARQKKELAAKRL 70 Query: 68 DAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEV---PLEMKIKAAETKVL 124 + +K L + +D K L + A + K A + Sbjct: 71 QRKQQVFKNEALDAKMDAGN----NKEATLSRMMVGSAKRGFQALDSIASKQIAMGKLRV 126 Query: 125 SKFNEYAEVGSKNL-------------GFTLDKQFGLDVFDEMKGK--KTQNEQASRLVK 169 + + L G D++F + E+ K+ N +A ++ + Sbjct: 127 GRILSVFGKTNLQLSRPTVSGFYPFGKGLFDDEKFQTALIKELFDGLGKSGNAEARQMAE 186 Query: 170 QYFETQRELHSQAHEAGLDYKFFENRIP-QPMSVDKLRATKKDDFVRSMLDWLD-LSRYK 227 + +RE+ + G+ + ++ + Q + +++ + L+ + Sbjct: 187 AVLKEKREMINALQAEGVPIGWLDDHVTTQTHDSAAIGKAGFKTWLKDIKGLLNHERTFL 246 Query: 228 DIDGTPLSRSEIASFVGEVFAERVRSTSF-----KDPSIPSSEVGVKREFERVFHFKDSQ 282 D F+ +V+ +P + + K R HF+DS Sbjct: 247 SSDPEKQDD-----FLEKVYNNIKSGKRNVVELVSEPGVGRKSLSTKISQSRQLHFRDSA 301 Query: 283 AHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASA 342 A ++Y + +G S V I+ + LS + + + G N D K+++ + Sbjct: 302 AWIEYNKKYGHSNAVQAIVQG-VGHLSDSLELIKVFGANPDGTFKRLLERQ--------- 351 Query: 343 GNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPI 402 D+ + +R E +V V N W W G+++ S LG Sbjct: 352 ------DFDPGQRTMLRSE----YNQVSGAAFEVANPAWHKWTQGIQAIQNLSKLGSAIF 401 Query: 403 GALLE-------DGFISRQMLSRV--GIDKEAIQRINKMPLKERMELL-SDVGLYAEGVV 452 + + + + + S + R+ + + +E+ +GL +GV+ Sbjct: 402 SSTTDPIYVAFTQHYHGKNIFSAYYNAFLNIGVGRLLQRGKSKEIEMFARKLGLGFDGVI 461 Query: 453 AHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKD 512 S +WSGA+ + + + + L Sbjct: 462 G-------------------SAASRWSGAKDTTEF------------MQGAVNNFFRLNG 490 Query: 513 LKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLAR 572 L A+ D D T + K Y R + D+D +D+A Sbjct: 491 LSGWTNFYREGAAYLMASDMADATKLNWDKLAP-----NYRRLLER-YGITDSDWKDIAG 544 Query: 573 MSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTS 632 + +K+ +SP R + +L ++ I ++ L+ +N Sbjct: 545 LP------FEKINGLDVISP-TRVFDEIELGNITGDAIPRSRELAEKIQQVLITENEFAV 597 Query: 633 VR-GAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKG 691 ++ GA + R G K GT A ++F QF + M + +P Sbjct: 598 LQPGANERAFMGRFFTGEEGIKSGTPMAMANKLFWQFRSFGLTMLFRQWPRAYEMGLPS- 656 Query: 692 ASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLP-----EVIYDGTLANGALLPYM 746 + M L G ++K +L+G + ++ L +G Sbjct: 657 ---------FYHLVPMVLMGYVAMAMKDILKGRELKDVVEDPGKIAVASVLQSGFGGIAG 707 Query: 747 DRLTKLVSKGDRAAIGGLLGPVPSMVTNLT---SSAVELATKDNE-NSKVNATKAIRKTL 802 D L + + + L GP S + +L ++ ++AT + ++ +A++ + Sbjct: 708 DFLFNDYRQYSTSYVDLLAGPSGSSLNDLAEFGATTFDVATGGDPVDAAAAGWRAVKGNI 767 Query: 803 PFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELF---QNMDE 851 P+ N W + FD+LI Q+ E LNPG L R + + K+K + + E Sbjct: 768 PYANWWASRTLFDYLINYQVQEILNPGSLRRMERRFKQKNNQDYRAGWAPSE 819 >gi|157372110|ref|YP_001480099.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] gi|157323874|gb|ABV42971.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] Length = 850 Score = 400 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 153/895 (17%), Positives = 314/895 (35%), Gaps = 107/895 (11%) Query: 3 PECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLS---KAERYRLAGLKAEEDFQ 59 +C +++ KAAGR+LS EL+ + + R + S + + A D Sbjct: 7 ADCEKIVIKAAGRDLSDDELQDVFGQLRRNIDRYQAENASMTLEEAALKAADEMVRGDKL 66 Query: 60 KELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGK-SQALFNKLFFKAGSAEVPLEMKIKA 118 +I + N AI+ R +L S L+ + + + L A Sbjct: 67 ARVIEARNKAIN-LKIRTKLESFLNNSKESLGADRPDIALSALLVSRNEASEGFRASASR 125 Query: 119 AETKVLSKFNEYAEVGSKNLGFT-------LDKQFGLDVFDEMKGKKTQ--NEQASRLVK 169 + ++ K+ E G + D++ ++ +G+ T ++++ +L + Sbjct: 126 EQGQLEGKYIAGFEHDLNQSGLSKALSSGEYDQEIADALWKVGRGEPTAGLSKESIKLAE 185 Query: 170 QYFETQRELHSQAHEAGLDYKFFENRI-PQPMSVDKLRATKKDDFVRSMLDWLDLSRYKD 228 + Q ++ AG I Q K+R +D+ ++L LD S + Sbjct: 186 IINKWQEVARLDSNRAGSFIGKLAGYITRQSHDWAKIRGAGYEDWRDTILPRLDHSTFDG 245 Query: 229 IDGTPLSRSEIASFVGEVFAERVRSTSFKDPSI----PSSEVGVKREFERVFHFKDSQAH 284 + +R E V A + + K + S + ERV HFKD A Sbjct: 246 VA----NRDEFLQSVYNGLASGIHLSDQKSDWLSGFKGGSNQAKRASQERVLHFKDGVAW 301 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN 344 +Y + +GV N+ + S L S ++ + R LG N ++ + A ++ + Sbjct: 302 HEYNKAYGVG-NLRESVMSGLTSSARTTGVMRVLGTNPENMFGHLFETQQARLKKLN-NP 359 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGA 404 D+ GR R+ ++ E++ Y N+ A A +R+ G + LG I + Sbjct: 360 AAEADFAGR-----RRALENELSEILGYNSIPANSAIARAGATIRAVEGMTKLGGAVISS 414 Query: 405 LLEDGFISRQMLSRV-----GIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMM 459 + G + ++ + + K ++ ++ E+L +G++ + V Sbjct: 415 FNDVGNAAMELRYQGMNLMDAMGKSIAGKLKGYSAADQKEILGYMGIFTDSVRDEMIAKF 474 Query: 460 EGSDA-FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPR 518 G + +L K + + + S L++ N + R + A Sbjct: 475 SGDTSVPGRISRLQRTFFKLNLLNWWTENSRKSMGLVMSNWMARNSK--------SAWSS 526 Query: 519 LDPSIKAFF--KQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDK 576 ++ ++ + + ++ + + + S TP+ +K + D + Sbjct: 527 MNEDLRRVLNSSGITEREWNLYRGMEMDSVRGNQHM--TPNGVKYIPDERI--------- 575 Query: 577 IAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGA 636 + + + + I ++ + K+ LD V ++ Sbjct: 576 ------------------AEYVAADGLQVNKASIAAARESLEGKLRGYYLDRVLIAMSEP 617 Query: 637 MHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDL-------------- 682 + + + GT GEA+R QF + N + Sbjct: 618 ----GARTRAMMKQGTQPGTPLGEAIRFGGQFKSFTGSFMQNTIGREIYGRGYTPAELGQ 673 Query: 683 ------SNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS--LPEVIYD 734 +N+ + G M L ++I M G K LL+G+ P + Sbjct: 674 SRFTSLANAMRNGNGEKMGLAQLFIW----MTALGYVSMQTKLLLKGQTPRPADAKTFLA 729 Query: 735 GTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNA 794 G L D L ++ L GP + + + + +D + + Sbjct: 730 AAAQGGGLGIMGDFLFGEYNRFGGGLASSLAGPTVGDLDQIRNLFLRA--RDGDAKAADL 787 Query: 795 TKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNM 849 K PFMN+ ++ + ++LILN+ E L+PG L+R + + +K+ F Sbjct: 788 LKFGIDHTPFMNLHVVRPAMNYLILNRAQEWLSPGSLERYRQRVEKEQGNTFIVP 842 >gi|262043648|ref|ZP_06016757.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038986|gb|EEW40148.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 974 Score = 365 bits (936), Expect = 2e-98, Method: Composition-based stats. Identities = 142/887 (16%), Positives = 287/887 (32%), Gaps = 134/887 (15%) Query: 23 RRLEDGIVRAYVSLDGKGLSKA--------ERYRLAGLKAEEDFQKELIRSVNDAIDEAY 74 + + D + R +LD + + E+++ + + +++ Sbjct: 154 QNIADAMWRLGNNLDVGHIPEDAIKIARVLEKWQEKARIDANRAGASIGKLPGYIARQSH 213 Query: 75 KRHQLRSD-LDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEV 133 H++R+ + + + + + G V + E ++ + + Sbjct: 214 DIHKIRTAGFEAWRDAILPELDPRTFEGLDVNGQNGVTVRKATVMTEDQIYGRARPAKPL 273 Query: 134 GSKNLGFTLDKQFGLDVFDEMKGKKT----QNEQASRLVKQYFETQRELHSQAHEAGLDY 189 +N+G + G + + N Q R + + Q + G Sbjct: 274 KPENVGALAQRADGRFYIKGIVSENVDLMRGNGQVMRA--NFRNGDLLANGQDIDLGDIV 331 Query: 190 KFFENRIPQPMSVDKLRATKKDDFVRSM--LDWLDLSRYKDIDGTPLSRSEIASFVGEVF 247 F + ++V + D + G S++ I F+ V+ Sbjct: 332 GFRNDG---------------GEWVSVAGRIPRFDPAA---PGGLSPSQAVIDDFLHNVY 373 Query: 248 AERVRSTSFKDP--------SIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNT 299 + S+ V + ERV HFKD + Y + FGV N+ Sbjct: 374 VGLSSGVHLRTDRPDWMTGFKGGSTNVARRASQERVLHFKDGLSWYRYNDKFGVG-NLRE 432 Query: 300 ILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVR 359 + S L ++ + R +G N ++ ++ + + A KD NK + Sbjct: 433 AVGSGLIHSAETTGLMRRMGTNPENMFNELADRIEQRYKAA-------KDDNALNKFRQK 485 Query: 360 QEAML--QMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLE-------DGF 410 + L Q+ E+ N A A R+ LG I + + + Sbjct: 486 RNTSLTSQLKEITGQTNIPGNAALARVAATTRAIETMMKLGGSMISSFNDIATQAMEMRY 545 Query: 411 ISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAH------GRNMMEGSDA 464 R ML V ++ + ER ++L +GL+A+ + N M G Sbjct: 546 QGRNMLGSVWEATANKVQLTRWKNAERQQVLKSIGLHADAMKDELIYRFSADNSMPGRVN 605 Query: 465 FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIK 524 + + + W S ++V +G T S D+ + R S+ Sbjct: 606 RAMRNYFRLNLQSW-----WTNSSRYSTGMMVSEWLG--THAGKSFGDVPEELRRVLSMH 658 Query: 525 AFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKL 584 +++ ++ + + K + DG Y TP + ++ D+ + Sbjct: 659 ----GIEENEWAALSKMKL-HAADGNAYM-TPDGVADIPRTDIENY-------------- 698 Query: 585 KNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDR 644 L + + + + ++ +S+K+ +LD V ++ ++ Sbjct: 699 -------------LTNRGIKINDRSVEYARELLSDKVRGYILDRVGVALNEPDARTMSIM 745 Query: 645 QRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYS 704 ++ +RGT GE LR QF + N + + S++ N+ + + Sbjct: 746 KQ----GMQRGTAYGEMLRFAWQFKSFTASFMQNAIGRELYGRGYDFGSLSQNNTFRNNA 801 Query: 705 ATMAL-------------------AGIGVASIKALLRGEDPSLPE---VIYDGTLANGAL 742 A+ G K +LRG+ P + G L Sbjct: 802 LIRAMRNGNGELMGIAQLFLWATAFGYLSMQTKLMLRGQTPRPADNVSTWTAAMAQGGGL 861 Query: 743 LPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTL 802 D L ++ L GP S L + + TK + + Sbjct: 862 GILGDFLFGEYNRFGNTPATSLAGPFASDAAQLVN--LFGLTKQGDAKAADYFNFAINHT 919 Query: 803 PFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNM 849 P+MN+ ++ D LILNQ+ E ++PG L R Q + K++ F Sbjct: 920 PYMNLHVVRPVMDFLILNQMREWMSPGSLQRYQQRVKEEQGNDFIIP 966 >gi|262043551|ref|ZP_06016664.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039085|gb|EEW40243.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 708 Score = 353 bits (904), Expect = 1e-94, Method: Composition-based stats. Identities = 148/719 (20%), Positives = 268/719 (37%), Gaps = 84/719 (11%) Query: 4 ECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAE--RYRLAGLKAEEDFQK- 60 +C +N AAGR+LS+ E+ L VR + L+ E A L+A ++ Sbjct: 8 QCEIAVNTAAGRKLSEDEMESL----VRDMNDTTNRILAGNEALTLEEAALRAAQELGNR 63 Query: 61 ----ELIRSVNDAIDEAYKRHQL----RSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPL 112 ++I + N AI+ +L R+ DR G+ + S + Sbjct: 64 DQLAKVIEARNKAINTRIAAQRLGELRRTWKDRPDIGLEAMLVGRNDARTGSRRSVSSEV 123 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQ-----NEQASRL 167 + F++ V G + D++ ++ +G+KT + A+++ Sbjct: 124 AQLRGKYHAGINYDFDQAGLVKFIASG-SNDREIADAMWRIGRGQKTDGMTPQSVSAAKI 182 Query: 168 VKQYFETQRELHSQAHEAGLDYKFFENRI-PQPMSVDKLRATKKDDFVRSMLDWLDLSRY 226 + ++ ET R + AG I Q + K+RA + + ++L LD + + Sbjct: 183 IMKWQETARVDE---NRAGAWIGKMPGYIVRQSHDILKIRAAGYESWRNAILPRLDDATF 239 Query: 227 KDIDGTPLSRSEIASFVGEVFAERVRSTSFKDP---SIPSSEVGVKR-EFERVFHFKDSQ 282 I R V + A V TS K S VKR ERV HFKD Sbjct: 240 DGIS----DREGFLRGVYDGLASGVHLTSEKPDWMNGFKGSANAVKRASQERVLHFKDGV 295 Query: 283 AHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASA 342 +Y E FG ++ + L S ++ I R LG N + K + TIA D + Sbjct: 296 NWHEYNEQFGTG-SLREAVFGGLNSAARTTGIMRVLGTNPQNMFKYL-TDTIAKDVSKQS 353 Query: 343 GNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPI 402 L D++ +VR+ M +V + GWAN A +R S LG I Sbjct: 354 NPAALADFM----TKVRRLNRTVMPQVDGSLNIPGSVGWANASANVRGWLRMSQLGGAVI 409 Query: 403 GALLEDGFISRQMLSRVGIDKEAI-----QRINKMPLKERMELLSDVGLYAEGVVAHGRN 457 + + + +M + +A+ R ++ E+ E+LS +G+Y++ + Sbjct: 410 SSFNDVPISATEMRYQGQNFMQALTGAMKGRFSRYTSDEQKEILSSIGVYSDTMTQEIIR 469 Query: 458 MMEGSDAF-QIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKAD 516 M G+D+ + K++ + + +S+A+++ N + + D L D Sbjct: 470 RMSGNDSMSGKMGRAQQLFFKYNLMNFWTESGRNSNAMMITNWLAKNADQ--QFTALPED 527 Query: 517 PRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDK 576 R + D ++ + + M+ +G + T S I+ + D + D Sbjct: 528 LRRVLD----LHGIGDAEWNIYRNMD-MADSEGRKFM-TTSGIRAVPDEVIGDYV----- 576 Query: 577 IAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGA 636 K LK ++ + R+ L+ QL +NI + ++ A + Sbjct: 577 ---ASKGLKVTERSIADARETLESQLRGYILDRLNIAMSEPGDRTQAFM----------- 622 Query: 637 MHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMA 695 + GT AGEA+R Q+ + N+L + A + Sbjct: 623 ------------KMGTVPGTVAGEAVRFAGQYKSFTASFMQNVLGREVFGRGYTPAGLG 669 >gi|291336683|gb|ADD96225.1| hypothetical protein Rsph17025_0444 [uncultured organism MedDCM-OCT-S08-C1350] Length = 850 Score = 341 bits (873), Expect = 4e-91, Method: Composition-based stats. Identities = 136/860 (15%), Positives = 291/860 (33%), Gaps = 95/860 (11%) Query: 39 KGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQA--GVYGKSQA 96 KGL K+E +LA + + + E + + + K +++ + + G + A Sbjct: 42 KGLGKSEAEKLAAKETLDQAKIEFAEKLRFTLLQKDKFNEITTLFATYRNKNGEIDIANA 101 Query: 97 LFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLG---FTLDKQFGLDVFDE 153 + + +E + K + LG L K + E Sbjct: 102 YRSMQAHDIVANTPNIERTVDIERGKAHQLMAGLLDKMKYKLGGRQSKLQKTNLKLMVKE 161 Query: 154 MKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDY-KFFENRIPQPMSVDKLRATKKDD 212 + G+ T N A +L + ET L + ++ G + +PQ +R + K D Sbjct: 162 LMGETTGNVNAKQLADAWRETAEHLRKRFNKFGGKVLSRKDWGLPQIHDSLLVRQSSKAD 221 Query: 213 FVRSMLDWLDLSRYKDI-DGTPLSRSEIASFVGEVFAERVRSTSFK---DPSIPSSEVGV 268 ++ +L LDL + + G P + I + EV+ + + Sbjct: 222 WIDYILPKLDLDKMVNERSGLPFNDKTIREALSEVYDNIATEGMATFKPGTAGYGRALHN 281 Query: 269 KREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQ 328 +R R FK++ M+Y FG T++ + ++++DI + + LGPN D+ Sbjct: 282 RRIDHRFLAFKNADDWMEYQTRFGSPDPFKTMME-HINAMARDISMLKILGPNPDATHTW 340 Query: 329 ---MIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVM--------RYGETVE 377 MI + + D A A K + + + ++ + E + Sbjct: 341 ALGMIKKQMKIDAAAEAQGKFKRKKVSQKFSGNEEDRSNAIIENINNLYAYHKGTLHKPI 400 Query: 378 NTGWANWMAGLRSAAGASMLGQHPIGALLEDG----FISRQMLSRVGIDKEAIQRINKMP 433 + A LR A+ LG + A+ + L ++EA++ + + Sbjct: 401 DGFMGRTFAALRQILTAAQLGGASVMAITDFHWSRLTSKFNGLPAYKANQEALKLLGEGI 460 Query: 434 LKER--MELLSDVGLYAE---GVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKR 488 K++ +GL AE V + DA ++ + + SG ++ + Sbjct: 461 KKDKAMARTAIRLGLIAEHWSTVAGVAARYLNEVDAPFWSKRISDVVLRGSGLSHITQSG 520 Query: 489 ISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPD 548 + + + + + + K DP L ++ + ++ D+ +I+ K + Sbjct: 521 RWAFGMSIMGTLAEESGKVFN----KLDPNLQKQLQKY--GIEADDWEIIRSTKLYDAGI 574 Query: 549 GYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERK 608 D ++ Sbjct: 575 DEPSMVGKGATFLRPDDIMKRA-------------------------------------D 597 Query: 609 EINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQF 668 ++ ++ ++ V + +V TS + + + GT GE + + Sbjct: 598 LDEATREFLTTRLLTYVTNETNFAVP----TSSAKGRITLSGSAQPGTVKGEIVNSMLMY 653 Query: 669 TTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSL 728 P + + L KG + L + A+ G IK + G+ P+ Sbjct: 654 KNFPITLGMTHLSRGFQQVGLKGKAKYL----VPMIVGGAVMGSIAYEIKQIAAGKTPTK 709 Query: 729 PE-----VIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELA 783 PE + + G L + D L ++ + L GPV S + + + A Sbjct: 710 PEDMGVRYWLNAIIYGGGLGIFGDFLFSDQNRYGGSFSKTLAGPVASFIGDSINLTFGNA 769 Query: 784 ----TKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGY---LDRQQS 836 + + N+ I++ P ++WY + + + ++ + I +NP + R + Sbjct: 770 AQLISGEKTNAGKELAAFIQRYTPGSSLWYARVALERILFDSIERLINPDFDSDNRRNIN 829 Query: 837 KKK-KKGIELFQNMDEGLPH 855 K K + G + + + + P+ Sbjct: 830 KLKSRTGQDYWWSPGDIKPN 849 >gi|48696687|ref|YP_024981.1| hypothetical protein VP5_gp18 [Vibrio phage VP5] gi|40806150|gb|AAR92068.1| hypothetical protein [Vibrio phage VP5] Length = 782 Score = 332 bits (851), Expect = 1e-88, Method: Composition-based stats. Identities = 144/848 (16%), Positives = 265/848 (31%), Gaps = 118/848 (13%) Query: 48 RLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGS 107 L ED + + I YK+ + + V + AL L Sbjct: 21 TDTDLITAEDIADAIKGKKQEKIA-VYKQAEAIKKGNEVLTQSKDPASALLGMLSRDPNE 79 Query: 108 --AEVPLEMKIKAAETKVLSKFNEYAE--------------VGSKNLGFTLDKQFGLDVF 151 + + +I A +K +++ G + L ++ D Sbjct: 80 EVKFLSADQRINAIRAVSKAKISDFMADLAPTTRQIFAGIATGERRL-TKSQQRLLDDFV 138 Query: 152 DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKK 210 E+ G++T N A + K + + +L+++ +AG ++ +PQ + + Sbjct: 139 HELYGRQTGNADALKAAKGWKKATEDLNARFGQAGGHMAELDDWRLPQKHNRMAISKAGA 198 Query: 211 DDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR 270 D +V + D +D + + + V+ V ++ S + Sbjct: 199 DVWVEKVWDLIDRDKMVKKLRKGKDEDNLREALYSVYNNIVTDGMSSSKTL-SKKFTDMM 257 Query: 271 EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMI 330 ER FKDS + + Y FG TNV + + ++S+ I + GP+ D + Sbjct: 258 RSERFITFKDSDSWLKYQREFG-DTNVYASMLGHIDNMSRAIGMMETFGPDPDIGFNTL- 315 Query: 331 VQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRS 390 + G + R ++ +M Y E T W N +AGLR+ Sbjct: 316 ----ERAVKTKKGLTSRQPTGARPTFDM----------LMGYNMVEEQTVWGNRVAGLRN 361 Query: 391 AAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRIN------KMPLKERMELLSDV 444 AS LG + AL + + S ++R+ R D Sbjct: 362 LWTASKLGAAVVSALTDSVYASMAASYNAMSPARVLRRMLSEVMKPSKSEASRKLWAQDF 421 Query: 445 GLYAEGVVAHGRNMMEGSDAFQ--IGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGR 502 G AE + + + +F L + SG + +S Sbjct: 422 GFGAEFALDRMAMTSDYTQSFGGHRSRNLAEAVMVVSGMNQWTQSARASFQF-------- 473 Query: 503 MTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNL 562 T + RA D R + Sbjct: 474 ------------------------------EFATALTRAADSKWSDLPEKMRNSMGRYGI 503 Query: 563 KDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMH 622 ++D +A K + ++ Sbjct: 504 TESDWAAIAAAPRTNYKGNKMIDPRNM----------------------------DAELQ 535 Query: 623 ALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDL 682 ++ V A+ T + K G GE R F + P +N Sbjct: 536 TKLVGMVDGETMMAVPTPDARTRAFMAGGTKSGNFGGELHRSLFMFHSFPITTIMNQWRR 595 Query: 683 SNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS---LPEVIYDGTLAN 739 + K GA ++ I AT L G+G+ K +L G+ P P++ +G Sbjct: 596 VFTGKGYSGAFDRMSAAAIMVGATSVL-GVGIIQAKDILNGKKPRSMSDPKLWIEGMAQG 654 Query: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIR 799 G+ D + S + GPV + + +A ++A D E++ Sbjct: 655 GSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYGDWVAMTAADMAKGDAESAMARTANFAT 714 Query: 800 KTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK----KGIELFQNMDEGLPH 855 + +PF N+WY K + D L++++I +P Y +Q +K +K E + + G Sbjct: 715 QQIPFNNLWYTKIATDRLLMDRIRRLSDPEYDKKQLNKMRKMQRTSQQEYWWSPPIGGQS 774 Query: 856 RLPFPFGE 863 + PF E Sbjct: 775 NIESPFEE 782 >gi|48696644|ref|YP_024423.1| hypothetical protein VP2p19 [Vibrio phage VP2] gi|40950042|gb|AAR97633.1| hypothetical protein [Vibrio phage VP2] Length = 782 Score = 331 bits (849), Expect = 2e-88, Method: Composition-based stats. Identities = 147/848 (17%), Positives = 267/848 (31%), Gaps = 118/848 (13%) Query: 48 RLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGS 107 L ED + + I YK+ + + V + AL L Sbjct: 21 TDTDLITAEDIADAIKGKKQEKIA-VYKQAEAIKKGNEVLTQSKDPASALLGMLSRDPNE 79 Query: 108 --AEVPLEMKIKAAETKVLSKFNEYAE--------------VGSKNLGFTLDKQFGLDVF 151 + + +I A +K +++ G + L ++ D Sbjct: 80 EVKFLSADQRINAIRAVSKAKISDFMADLAPTTRQIFAGIATGERRL-TKSQQRLLDDFV 138 Query: 152 DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKK 210 E+ G++T N A + K + + +L+++ +AG ++ +PQ + + Sbjct: 139 HELYGRQTGNADALKAAKGWKKATEDLNARFGQAGGHMAELDDWRLPQKHNRMAISKAGA 198 Query: 211 DDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR 270 D +V + D +D + + + V+ V ++ S + Sbjct: 199 DVWVEKVWDLIDRDKMVKKLRKGKDEDNLREALYSVYNNIVTDGMSSSKTL-SKKFTDMM 257 Query: 271 EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMI 330 ER FKDS + + Y FG TNV + + ++S+ I + GP+ D + Sbjct: 258 RSERFITFKDSDSWLKYQREFG-DTNVYASMLGHIDNMSRAIGMMETFGPDPDIGFNTL- 315 Query: 331 VQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRS 390 + G + R ++ +M Y E T W N +AGLR+ Sbjct: 316 ----ERAVKTKKGLTSRQPTGARPTFDM----------LMGYNMVEEQTVWGNRVAGLRN 361 Query: 391 AAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRIN------KMPLKERMELLSDV 444 AS LG + AL + + S ++R+ R D Sbjct: 362 LWTASKLGAAVVSALTDSVYASMAASYNAMSPARVLRRMLSEVMKPSKSEASRKLWAQDF 421 Query: 445 GLYAEGVVAHGRNMMEGSDAFQ--IGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGR 502 G AE + + + +F L + SG + +S Sbjct: 422 GFGAEFALDRMAMTSDYTQSFGGHRSRNLAEAVMVVSGMNQWTQSARASFQF-------- 473 Query: 503 MTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNL 562 T + RA D R + Sbjct: 474 ------------------------------EFATALTRAADSRWSDLPEKMRNSMGRYGI 503 Query: 563 KDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMH 622 ++D +A K + ELQ +L + E + + Sbjct: 504 TESDWAAIAAAPRTNYKGNKMID-----PRNMDAELQTKLVGMVDGETMMAVPTPDARTR 558 Query: 623 ALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDL 682 A + K G GE R F + P +N Sbjct: 559 AFMAG-----------------------GTKSGNFGGELHRSLFMFHSFPITTIMNQWRR 595 Query: 683 SNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS---LPEVIYDGTLAN 739 + K GA ++ I AT L G+G+ K +L G+ P P++ +G Sbjct: 596 VFTGKGYSGAFDRMSAAAIMVGATSVL-GVGIIQAKDILNGKKPRSMSDPKLWIEGMAQG 654 Query: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIR 799 G+ D + S + GPV + + +A ++A D E++ Sbjct: 655 GSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYGDWVAMTAADMAKGDAESAMARTANFAT 714 Query: 800 KTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKK----KGIELFQNMDEGLPH 855 + +PF N+WY K + D L++++I +P Y +Q +K +K E + + G Sbjct: 715 QQIPFNNLWYTKIATDRLLMDRIRRLSDPEYDKKQLNKMRKMQRTSQQEYWWSPPIGGQS 774 Query: 856 RLPFPFGE 863 + PF E Sbjct: 775 NIESPFEE 782 >gi|146276496|ref|YP_001166655.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] gi|145554737|gb|ABP69350.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] Length = 830 Score = 317 bits (812), Expect = 5e-84, Method: Composition-based stats. Identities = 118/821 (14%), Positives = 270/821 (32%), Gaps = 95/821 (11%) Query: 57 DFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSA--EVPLEM 114 D ++ ++ + + + Q L + AL N L GS + Sbjct: 51 DLKEAFRKAKTSRLHKVVNQLQAMRRLRAQIEQAPDPAVALRNLLEHSDGSGYTGESVRS 110 Query: 115 KIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 +A E + + + VG +G + + D+ E+ + + N QA + Sbjct: 111 ISEAYEASINAGLRDTLETVGLNVIGSSRNPVLLRDLIRELHAEASGNAQAKAMADAVRT 170 Query: 174 TQRELHSQAHEAGLDYKFF-ENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID-- 230 Q+ + + G D + +P +R + + + L R D + Sbjct: 171 VQQRMRRAFNSYGGDIGEIADYGVPHSHDAGAMRQAGFEAWAAEIEQRLAWDRIVDFNTG 230 Query: 231 ------GTPLSRSEIASFVGEVFAERVRST-SFKDPS--IPSSEVGVKREFERVFHFKDS 281 G R+ F+ +V+ V +DPS + + +R R+ HF+ Sbjct: 231 QPFAAPGQVPPRAVSGRFLKDVYEGIVTRGWDDRDPSLAVGGKALANQRAERRLLHFRSG 290 Query: 282 QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEAS 341 ++Y + FG S + + + L L++D+ + R LGP+ + ++ Q Sbjct: 291 SDWIEYNKAFGASDPF-SAMMNGLHGLARDVALMRVLGPSPKAGLEY-AAQVAKKRAATI 348 Query: 342 AGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHP 401 K+ ++K+ L + GWA + +G R+ + LG Sbjct: 349 GNQKLEARVDTQSKVAKAMLMHLD-----GSANVPDRAGWAAFFSGTRAVLTSIQLGSAV 403 Query: 402 IGALLEDGFISRQMLS-RVGIDKEAIQRINKMPLKERMELLSDVGL---YAEGVVAHGRN 457 + ++ + ++ S + + + M + E + +G Sbjct: 404 LSSVSDVATMTAAAHSVGLSATSVLGRSVQLMASQATRETAARMGYVAGALADAGGGASR 463 Query: 458 MMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADP 517 I ++ + +G ++ R + + + + D+ A Sbjct: 464 YFGQLFGTGIPARMAGFTLRATGLSFVTDMRKLAWQMEFSGYMAENAGR--TFADIDAPL 521 Query: 518 RLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKI 577 R + + D+ +++ Sbjct: 522 RQLFERR----GITAADWDLLRD------------------------------------P 541 Query: 578 AYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAM 637 A+ ++ + +SP Q ++ +E + + + ++ A +L+ ++ ++ Sbjct: 542 AFRFREPGGADFVSPIYWLHAQNRIPHVEAEGLAM-------RLQAAILEELEFAIP--- 591 Query: 638 HTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALN 697 T+ + + L T G+ AGE +R + + + LN S P + Sbjct: 592 -TASIEGRALLQGTAAPGSVAGELMRSSMSYKSFSLSLMLNQYRRFASLPTPWDKAKYAA 650 Query: 698 HVWIQYSATMALAGIGVASIKALLRGEDPSL---PEVIYDGTLANGALLPYMDRLTKLVS 754 + S + + G +K L +G DP + G L + D + S Sbjct: 651 ----KVSTLLLVTGAMAIQLKELAKGNDPRPMDENKFWLAALFQGGGLGIFGDFFSAETS 706 Query: 755 KGDRAAIGGLLGPVPSMVTNL----TSSAVELATKDNENSKVNATKAIRKTLPFMNM-WY 809 + + GPV +L S+ ++ + +R+ PF++ WY Sbjct: 707 RVGGGLAETIAGPVVGAAGDLLKPVASNITRAVQGEDTLVGRDVAALVRRNTPFLSSAWY 766 Query: 810 LKNSFDHLILNQILEELNPG----YLDRQQSKKKKKGIELF 846 + ++ L+ +++ L+P + R + K G + + Sbjct: 767 ARTAYSRLVADELQAFLDPEAEVLFRRRMKKMAKDYGTQPW 807 >gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 530 Score = 282 bits (721), Expect = 2e-73, Method: Composition-based stats. Identities = 117/568 (20%), Positives = 217/568 (38%), Gaps = 85/568 (14%) Query: 310 KDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEV 369 +++ + LG +++ A + G ++ + + + + Sbjct: 5 RNMGMIDSLGTKPKQNFEKI---RYAIQERLIDGERLNAAQSISSYAPFDKYMKVVDGSI 61 Query: 370 MRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGF----ISRQMLSRVGIDKEA 425 G A W A R+ + LG I A + G +S Q S +G E Sbjct: 62 HTIEGGSIGFGVAKWSAITRAVGNTAKLGGAVISAAADLGIYGSEMSFQGRSFLGGMYEG 121 Query: 426 IQRI-NKMPLKERMELLSDVGLYAEGVV-------AHGRNMMEGSDAFQIGHKLHSKMHK 477 + + + +++ +L+ +G A+GVV G N+ +G Q ++ + Sbjct: 122 FKGLARRKNTQDKKDLVEGMGFLADGVVYDVSGRHTVGDNLTKGWTRIQRTFFKYNLLSW 181 Query: 478 WSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFF--KQLDDTDF 535 W+ + N + M + YA K+L D +L+ ++ FF +D + Sbjct: 182 WTNT-------------LKENSMLGMANYYAKQKNLSFD-KLNKPLQEFFGLYNIDSVKW 227 Query: 536 TVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQR 595 VI++ + DG + + + + DAD++ + + + Sbjct: 228 DVIRKNGMAKADDGTEFI-NIANLDQISDADIKKITGIDN-------------------- 266 Query: 596 QELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYK-- 653 L + E+ I KDK + ++LD +V D + G++T Sbjct: 267 ---------LSKTELQIEKDKFKYSVSGILLDRSIYAVIEP------DARVKGIMTQGLL 311 Query: 654 RGTRAGEALRMFQQFTTTPTGMFLNILDLSNS------------AKMPKGASMALNHVWI 701 GT GEA+R QF P + +L + + + Sbjct: 312 AGTGMGEAIRFVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMA 371 Query: 702 QYSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALLPYMDRLTKLVSKGDR 758 T G ++K LL+G++P P + I G L G L Y D L K + Sbjct: 372 ALVITSGFMGYMAMTMKDLLKGKEPRDPTKFKTIMAGFLQGGGLGIYGDVLFKEQ-RDAG 430 Query: 759 AAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLI 818 + I GL+GP P+ V +L + + S A +AI +PF+N++Y+K +FD+LI Sbjct: 431 SVIAGLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRAISSNIPFLNLFYIKIAFDYLI 490 Query: 819 LNQILEELNPGYLDRQQSKKKKKGIELF 846 QI+E +NPG L + + + KK + + Sbjct: 491 GFQIMETVNPGVLKKVERRMKKDYNQEY 518 >gi|320175032|gb|EFW50145.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 236 Score = 241 bits (614), Expect = 4e-61, Method: Composition-based stats. Identities = 70/234 (29%), Positives = 111/234 (47%), Gaps = 11/234 (4%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVS------LDGKGLSKAERYRLAGLKA 54 M+ ECIQ + +AA R L+ +E++ +ED I R S + + LS++ER A A Sbjct: 1 MRQECIQAVQQAAQRTLTAREIQNIEDRIYRNMRSIARDDTMSWRQLSESERLYRAAQLA 60 Query: 55 EEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAG--SAEVPL 112 E+ Q+E R +L ++ Q G GK AL + F A S + + Sbjct: 61 SEELQREAALKKRRVALTIAARQRLDKFINSYQ-GADGKLGALNRTIAFNADGKSNFLSV 119 Query: 113 EMKIKAAETKVLSKFNE-YAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 E + KA LS+ E + V + G D+ D+ EM+G+ T N +A + K + Sbjct: 120 ESRTKATREYALSQLQEAFEAVDPRFFGLFEDEAGVRDLVYEMRGQNTGNAKARKGAKAW 179 Query: 172 FETQRELHSQAHEAGLDYKFFENR-IPQPMSVDKLRATKKDDFVRSMLDWLDLS 224 E L + ++AG D + EN IPQ S++K+ A KD +V ++ LD Sbjct: 180 REVTELLRRRFNDAGGDIGYLENWGIPQHHSMEKVGAVSKDKWVSDVIGKLDRR 233 >gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] Length = 995 Score = 198 bits (502), Expect = 4e-48, Method: Composition-based stats. Identities = 116/680 (17%), Positives = 238/680 (35%), Gaps = 49/680 (7%) Query: 3 PECIQVLNKAAGRELSKKELR-RLEDGIVRAYV-SLDGKGLSKAERYRLAGLKAEEDFQK 60 +C+ + AAGR+LS ++ LED +RA + LS+AE YR A +A + + Sbjct: 4 QDCLGEIRGAAGRDLSDDDIHVMLEDIQLRADRMRRERVDLSQAELYRAAAREAGAEAEM 63 Query: 61 ELIRSVNDAIDEAYKRHQLRSDLDRVQA-----GVYGKSQALFNKLFFKAGSAEVPLEMK 115 +A KR R + A G+ +A + + + + + Sbjct: 64 AARIEARNAKLNLVKRVARREFYEAAPAVGSRPGILIGLEAKLVGVNTPFSGSRLSVAAQ 123 Query: 116 IKAAETKVL----SKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKK-----TQNEQASR 166 A + ++F+ + G +D+Q ++F+ + + T ++ A+ Sbjct: 124 QNALRRDYMVGLTTEFDRAGLYETVRSG-AIDRQIARELFELSRAEGGAPGVTGSKPAAE 182 Query: 167 LVKQYFETQRELHSQAHEAGLDYKFFENRIPQP-MSVDKLRATKKDDFVRSMLDWLDLSR 225 + Q + G ++ I + DK+R + + ++ LD Sbjct: 183 AAGIIAKYQALAREALNREGAWIGQYDGYIARTAHDPDKIRRATFEGWRDQVVKLLDERT 242 Query: 226 YKDI-DGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR-EFERVFHFKDSQA 283 ++ I D R + V V FKDP+ S KR RV H++D+ A Sbjct: 243 FEGIADRERFLRGVYNALVTGVHLTPDGMQGFKDPAFKGSGNIAKRLSQGRVLHWRDADA 302 Query: 284 HMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAG 343 MDY FG V +L L +++ + RE G N + + ++ Sbjct: 303 WMDYQAAFGHGNLVEAVLRG-LDQAARNTALMREFGTNPRGEFDADMQALAESWRD---- 357 Query: 344 NKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVE-NTGWANWMAGLRSAAGASMLGQHPI 402 +D KL ++ + ++ + ++ N A A +R+ S LG + Sbjct: 358 ----RDPDAVVKLGEARKWLANRFDELDGTSSMPVNRLGARIGASVRAWESMSKLGGATL 413 Query: 403 GALLEDGFISRQMLSRVGIDKEAIQ-------RINKMPLKERMELLSDVGLYAEGVVAHG 455 A+ + F + ++ + E R E++ + +EG++ H Sbjct: 414 SAVTDVPFKASELRYQGINLLEGYADGVQSLIRGRGRSDSGTREIIDLLRAGSEGMLGHI 473 Query: 456 RNMMEGSDA-FQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLK 514 + D KL + +WSG Y + + I+ +GR+ T L Sbjct: 474 AGRFDAQDTVPGTLSKLTNVFFRWSGLNYWTDAQRAGAEFIMSRHLGRLQRT--EFAALP 531 Query: 515 ADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMS 574 + + + ++ ++ + + + TP + D + L + Sbjct: 532 RQTQRVLT----LFDIKPEEWDALRAGEWVQADGRAH--LTPDAASRMTDQQVDGL--IG 583 Query: 575 DKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVR 634 K+ R+ + + + L+ +LA E + ++ A + V+ R Sbjct: 584 GKLDGIRQAALDRMEKAVDALDRLESRLAKHE-AAMGKAGPTGADVERATMQATVEGVQR 642 Query: 635 GAMHTSLFDRQRLGLLTYKR 654 + ++ R Sbjct: 643 YQRSIQQLRQDMREMVAGSR 662 Score = 194 bits (492), Expect = 7e-47, Method: Composition-based stats. Identities = 72/418 (17%), Positives = 146/418 (34%), Gaps = 28/418 (6%) Query: 449 EGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYA 508 + + ++ D + H +G D +R + + + Sbjct: 591 QAALDRMEKAVDALDRLESRLAKHEAAMGKAGPTGADVER-----ATMQATVEGVQRYQR 645 Query: 509 SLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLR 568 S++ L+ D R + ++ +++ ++ + L + + LKD + Sbjct: 646 SIQQLRQDMREMVAGSRTQNEVHQ---HLVREIGYLARAERELAVKAERRVARLKDR-VP 701 Query: 569 DLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDN 628 DK A + + ++ L +L + + + + ++ K+H+ D Sbjct: 702 AAEAARDKAAAAIEGIHQDMLRHLDELDSLPVRLDEQMSRARDGARADLALKLHSYFSDR 761 Query: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSN-SAK 687 + +V + + + GT GEALR QF P + + + Sbjct: 762 GEYAVINP----GARERAMLRRGTQAGTLEGEALRFVGQFKAFPVAVISKVWGRDLYGGE 817 Query: 688 MPKGASMALNHVWIQYSATMALAGIGVASIKALLRGE---DPSLPEVIYDGTLANGALLP 744 G + + H + + G +K L +G DP+ P L G Sbjct: 818 RGWGRAAGIVHTLVA----TTVMGYVAGMLKDLSKGRAPRDPTDPRAWGAAFLQGGGAGI 873 Query: 745 YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPF 804 Y D L S+ + GP S L + + ++ + K + PF Sbjct: 874 YGDFLLGQYSRFGNRFLESAAGPTLSSAGELLN--IWAGAREGNDEKAATLRWTLSNTPF 931 Query: 805 MNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIELFQNMDEGLPHRLPFPFG 862 +N++Y + + D+L L Q+ E +NPG+L R + + K + F P R P+G Sbjct: 932 VNLFYTRMALDYLFLYQVQEAMNPGFLRRFEQRVAKDNNQRF----ILSPSRA-IPYG 984 >gi|190893672|ref|YP_001980214.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] gi|190698951|gb|ACE93036.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] Length = 460 Score = 143 bits (359), Expect = 2e-31, Method: Composition-based stats. Identities = 77/494 (15%), Positives = 157/494 (31%), Gaps = 89/494 (18%) Query: 271 EFERVFHFKDSQAHMDYMEHFGVST-NVNTILTSELASLSKDIVIARELGPNADSFVKQM 329 RVF F + + + M+ +GV + + + + +++++I LGPN +++ Sbjct: 43 NQLRVFRFDNPETYKRLMKKYGVGSGGLFNTIMGHVQAMAREIAFTEVLGPN----YQRI 98 Query: 330 IVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLR 389 + G + K R + + ++ A G+R Sbjct: 99 SRSCCRRRAKMMPGARSAKRIGNRITMNSPGAVQRTYDALSGRLGVAQSELIAGIGGGMR 158 Query: 390 SAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAI-QRINKMPLKER---MELLSDVG 445 + A+ LG I AL D + + GI + R+ R EL + Sbjct: 159 NLQTAARLGSATIAALPGDSMTAVLAANYNGIPATNVLARLVTDLTTNREGAEELARQLN 218 Query: 446 LYAEGVVAHGRNMMEGSD---AFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGR 502 L A V+ D + ++ + + +G + + ++ I R Sbjct: 219 LTAATVLDTAIGTKRFEDEVIGQGVTGRIADGLMRVTGINVWTEGLKRAFSMEFMGTIAR 278 Query: 503 MTDTYASLKDLKADPRLDPSIKAFF--KQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIK 560 ++ +LDP + F D+ ++ A + + + Sbjct: 279 QSEHTFE--------KLDPMFQGFLTRYGFTPADWDKLRVAPHIEADGAKFF-------- 322 Query: 561 NLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNK 620 D + + R++D++ ++ + P+ R Sbjct: 323 ---DVNAVEDQRLADRLMSAVIDERHFAVVEPDAR------------------------- 354 Query: 621 MHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNIL 680 +RGAM L +RGT GEA+R QF + P + + Sbjct: 355 ------------IRGAMTGGL-----------QRGTIIGEAVRSATQFKSFPMTYMMTHM 391 Query: 681 DLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS---LPEVIYDGTL 737 + + M Q + TM +AG ++ +++L+ G DP P + Sbjct: 392 MRALTQGMANRTYR-----TTQLALTMTIAGAEMSQMQSLIAGRDPQNMADPRFWEQSFI 446 Query: 738 ANGALLPYMDRLTK 751 G D + Sbjct: 447 RGGGGGMLADFIYS 460 >gi|291336674|gb|ADD96217.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 333 Score = 119 bits (297), Expect = 2e-24, Method: Composition-based stats. Identities = 67/368 (18%), Positives = 130/368 (35%), Gaps = 47/368 (12%) Query: 366 MWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQM----LSRVGI 421 M EV T+ +A W A R+ A + LG I A+ + +++M S VG Sbjct: 4 MAEVDGSVNTINGFAYAKWGAISRAIAAMAKLGGATISAISDIHLYAKEMKWQGRSYVGG 63 Query: 422 DKEAIQRINK-MPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHK-LHSKMHKWS 479 EA+ R+ K ++ + +G + ++ D G + K + Sbjct: 64 LAEAMGRLAKIKNTADKNGIAEQLGFINDNIIYDLAARYSAGDNLNRGFSQVQRTFFKLN 123 Query: 480 GAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIK 539 G + L + + + + T S K+L + +++ + I+ Sbjct: 124 GLAWWTNSLKQGAILGMGSYVAKQTK--VSYKNLSPQFKRLIDH----YGINEKIWNHIR 177 Query: 540 RAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQ 599 + + DG L+ T I +L DA ++D+ + Sbjct: 178 KMDLDKADDGKLFFNTQK-IDDLSDAVIKDIEGKT------------------------- 211 Query: 600 QQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAG 659 + +++I + KD + ++ + LD +V + + + + GT G Sbjct: 212 ----TMSKRQIEVAKDNLKTRVLGMFLDRSTYAVLEPDART----RGWMKMGQQAGTHPG 263 Query: 660 EALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKA 719 EALR QF P + ++ +A G M Q AL G + K Sbjct: 264 EALRFMTQFKAFPFAFYQKMIGRE-TAAWKDGNKMNAALSMAQLVGGSALFGYMAMTAKD 322 Query: 720 LLRGEDPS 727 +L+G++ Sbjct: 323 ILKGKNLR 330 >gi|262043550|ref|ZP_06016663.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039084|gb|EEW40242.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 143 Score = 115 bits (288), Expect = 3e-23, Method: Composition-based stats. Identities = 30/137 (21%), Positives = 60/137 (43%), Gaps = 4/137 (2%) Query: 715 ASIKALLRGEDPS--LPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMV 772 K LL+G+ P + G L D + V++ + L+GP S Sbjct: 1 MQSKLLLKGQTPRPADAKTFLAAASQGGGLGILGDFMFGEVNRMGAGPVTSLMGPAASNA 60 Query: 773 TNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLD 832 ++ + + D + + + PF+N+++L+ + + LILN+I + L+PG L+ Sbjct: 61 DSIITLLQQTTRGDAD--LGDWYRTALDNTPFLNVFWLRTAMNGLILNRIQDALDPGSLE 118 Query: 833 RQQSKKKKKGIELFQNM 849 R Q + +++ F Sbjct: 119 RYQRRVEREQGNDFLIP 135 >gi|291336673|gb|ADD96216.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 101 Score = 111 bits (278), Expect = 4e-22, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 53/101 (52%), Gaps = 1/101 (0%) Query: 736 TLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNAT 795 L G L Y D L + + +A+ +GP+P+ + S+ + + A Sbjct: 1 MLQGGGLGIYTDFLFGNI-QNSTSALATAVGPIPTEAARVLSALNYAIKGEGGKAGKQAY 59 Query: 796 KAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQS 836 +I++ +PF+N++Y+K +FD++I Q++E L+PG L + Sbjct: 60 YSIKENIPFLNLFYIKTAFDYMIGYQMMETLSPGSLKEWRK 100 >gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 137 Score = 91.2 bits (224), Expect = 7e-16, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 45/111 (40%), Gaps = 7/111 (6%) Query: 725 DPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSS--AVEL 782 D + P+ + L L + DR + + + PV S + L + Sbjct: 17 DFTDPKTL---ALLTARTLTHYDRFFNEYHHDFKDLLHAV--PVASTIIGLGDARNIFGE 71 Query: 783 ATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDR 833 + E + N K + +P N++Y K +F +I++ + E N GY +R Sbjct: 72 DEEKREKANANFAKELANNIPLKNLFYAKAAFQKMIVDNLCEYFNEGYKER 122 >gi|216906074|ref|YP_002333630.1| hypothetical protein ASSaV_gp13 [Abalone shriveling syndrome-associated virus] gi|216263167|gb|ACJ71991.1| unknown [Abalone shriveling syndrome-associated virus] Length = 1194 Score = 90.4 bits (222), Expect = 1e-15, Method: Composition-based stats. Identities = 116/820 (14%), Positives = 241/820 (29%), Gaps = 118/820 (14%) Query: 51 GLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQA-------LFNKLFF 103 + AE+ F + + A + K +L + ++AG + K+ Sbjct: 449 AISAEQIFSFKTEEKIRAAANYNKKVAELSTWETLLRAGTMTGKENNLFSGLDSLGKIRN 508 Query: 104 KAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQ 163 + +E + VL + K D+ D + +N Sbjct: 509 VYNATMELVESQSVQPVVSVLEEAQLSLAKLLKVDEGVFQLPAHADIVDGVINPTGKNRY 568 Query: 164 ASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDL 223 S+ + + ++ SQ E GL K + P +++R+ + +F+ M+ +D Sbjct: 569 NSKSA-IFRQAINKIKSQGIEKGLYSKLDDGWFPNMWDKERIRSVGQAEFIEEMIGLVDE 627 Query: 224 SRY---KDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKD 280 SR G S +G+++ + +S +R+ FKD Sbjct: 628 SRMRQAVTASGNIYKNS--TDSLGKIYNNIAAD--QRRVKSDASGTLRTLRGDRLLFFKD 683 Query: 281 SQAHMDYMEHFGVS--TNVNTILTSELASLSKDIVIARELGPNA---DSFVKQMIVQTIA 335 + + FG + + L + + S DIV A G ++ + ++ + Sbjct: 684 GASWYAAHDLFGSEDVPSAFSALRNFAINASDDIVQA-SFGVHSLEDINTFTNVLHNGLG 742 Query: 336 NDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTG-WANWMAGLRSAAGA 394 N +A + LQ E M G + L++ Sbjct: 743 NMAKAQGLSIDSGKLANLE---------LQFKEAMLLHNGYVLPGKLGRLLGFLKNTTLK 793 Query: 395 SMLGQHPIGALLEDG------FISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYA 448 M + A + D + L R+ + A + KM ER E + Sbjct: 794 GMTAGAFVPAAVLDPLGNLPIAGTMFGLDRLTSYRSAKTILKKMTKAERNECFFFLKTSI 853 Query: 449 EGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYA 508 + M+ G ++ + L +K +++ ++ R T Sbjct: 854 NALTTEVNEMLNGPGKPV---FKSLGRKIFNSSHDLTRKISNNNEVMGAALFSRATH--- 907 Query: 509 SLKDLKADPRLDPSIKAFFK--QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 L +L +AF + ++ D+ ++ K+++ I + Sbjct: 908 -LNKSTPWTKLSMDYRAFLERFGINRADWDSYRKKKSVTVGGNIDLMSARYLINQGDRSA 966 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVL 626 + + + ++ ++ +P+ + + +I++ + + Sbjct: 967 VVN--------RFAVAEVGSALFAAPKNTRLGRTAKVRTGATVASIVQSDLVEPFANVAY 1018 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSA 686 +N+ + R F QF Sbjct: 1019 NNILGLGNLQIEHLYAGR--------------------FGQF------------------ 1040 Query: 687 KMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPS-LPEVIYDGTLANGALLPY 745 + SA + G+ +K LLRGE P+ + + G P Sbjct: 1041 --------------VINSAHVLFLGLLAVEVKKLLRGEKPAVDSRSLALAMMYAGFSGPT 1086 Query: 746 MDRLTKLVS-KGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPF 804 D L + + G PV A N + + +R LPF Sbjct: 1087 GDALIEQFMFSSGGINLWGFELPVA---------AGAKLIGKKRNVFLALHRTMRAKLPF 1137 Query: 805 MNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKKGIE 844 N N + + L+P + + +K IE Sbjct: 1138 -NQTLAANILQKYTTDILFALLDPEGAKAYEDRLQKDFIE 1176 >gi|71736491|ref|YP_273928.1| hypothetical protein PSPPH_1691 [Pseudomonas syringae pv. phaseolicola 1448A] gi|71557044|gb|AAZ36255.1| conserved domain protein [Pseudomonas syringae pv. phaseolicola 1448A] Length = 359 Score = 89.3 bits (219), Expect = 2e-15, Method: Composition-based stats. Identities = 33/265 (12%), Positives = 82/265 (30%), Gaps = 23/265 (8%) Query: 301 LTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQ 360 + + + KD V+ +LGPNA + + D S + + + Sbjct: 1 MNGSVHAQIKDTVLTEQLGPNAAQTYRLLHDTAKQKDAGGSGAFAGTEFGATPDMV---- 56 Query: 361 EAMLQMWEVM-RYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRV 419 W V+ N +A + G+R+ A+ L I +++ D S + S Sbjct: 57 ------WNVLNGSLGVPVNARFAEFNQGIRNFMVAAKLQATLIASVIGD-VQSLAITSAY 109 Query: 420 GIDKEAIQRINKMP--LKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHK 477 ++ + K+ + + + + + + + + KL + + K Sbjct: 110 HGLPIGKTLVSALKSVSKDYRTEAGRMSIGMDSITSDMVSFHTDNLSAGWTSKLANAIMK 169 Query: 478 WSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTV 537 + E ++ + +++ T + + D+ V Sbjct: 170 VTLLEGWTNAMRRGFSVEIMSRMAGDTRKAWG-------DDPVLQSRLERHGITQEDWAV 222 Query: 538 IKRAKAMSSPDGYLYARTPSTIKNL 562 + A TP ++ ++ Sbjct: 223 WQAATPEDWR--GHQMLTPESVASM 245 >gi|212710806|ref|ZP_03318934.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] gi|212686503|gb|EEB46031.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] Length = 1122 Score = 83.9 bits (205), Expect = 1e-13, Method: Composition-based stats. Identities = 125/854 (14%), Positives = 281/854 (32%), Gaps = 144/854 (16%) Query: 26 EDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIR-----------SVNDAIDEAY 74 + + A + D + + A E + ++ + Y Sbjct: 359 SEQVGDAMRNNDRHAIPEVAEAARAVRPIVEKTKDRMVELGILREGVTVSTAESYFPRIY 418 Query: 75 KRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKV------LSKFN 128 K ++ +D + + Q + + +KA S+ + I+ A V ++ Sbjct: 419 KFDKILNDRAEFRNIIADWLQEMNQRTVYKAESSLAKADAGIEQARASVPQAEKLNAEIK 478 Query: 129 EYAEVGSKNLGFTLDKQFGLDVF--DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAG 186 E K + + + E + + +A + K+ E + +A Sbjct: 479 EAERWSGKKQLLMNEIEKNRKLVAEKEAVSAEIEMRKAKKPTKKL-EQLERKLMRIEDAE 537 Query: 187 LDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDG------TPLSRSEIA 240 + +DK R ++++ + L+RY + PL+R E+ Sbjct: 538 ---NKLASYQRSLEILDKPRQF-RNEYSQLTRKANSLTRYDNRRHAALRRMEPLAREEVE 593 Query: 241 SFVGEVFAERVRSTSFKDPSIPSSEVGVKREF---ERVFHFKDSQAHMDYMEHFGVSTNV 297 + ++ + + + S PS + KR R + D + ++ F + ++V Sbjct: 594 AAADDIINKIIGAPSGIVPSELIPDGLTKRAGFTKSRTLNIPD-----ERIKDF-LESDV 647 Query: 298 NTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLE 357 N ++ + + ++ +I + + G M Q A + + K R KLE Sbjct: 648 NYVMENYIRQVAPEIELTAQFG------RVDMDAQIKAITNDYNTLISEAKTAKERGKLE 701 Query: 358 VRQEAMLQ----MWEVMRYGETVE---NTGWANWMAGLRSAAGASMLGQHPIGALLEDG- 409 R++A L+ M + + ++ + R +LG I +L + Sbjct: 702 ARRDADLRDIRAMRDRLLGTYGAPKDPSSFFVRAGRIARHVNFLRLLGGMTISSLPDIAR 761 Query: 410 -FISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIG 468 + + S + + + I+ M + + L ++G+ E ++ ++ + Sbjct: 762 PIMQHGLRSALKPLGKMLTDISAMKIAKAD--LREMGVGLEYALSSRSKVIADLNDPYAR 819 Query: 469 HKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFK 528 + +WS ++ + ++ + + G +T + Sbjct: 820 RTFLERGLEWSSQKFGNFTLMNQYTDTMKMWTGVVTQS---------------------- 857 Query: 529 QLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSK 588 +++ A+ +S+ + L +++ LA + Sbjct: 858 -------KILRAAQEVSTGN------------ALSSKEIKKLAHLG-------------- 884 Query: 589 TLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLG 648 + + + QQ + +L H+ + D VR ++ R Sbjct: 885 -VDKNMLERIAQQYSKHGEDLDGMLTG------HSHLWD--DRVVRETFQAAVLKDVRTT 935 Query: 649 LLTYKRGTRA----GEALRMFQQFTTTPTGMFLNILD---LSNSAKMPKGASMALNHVWI 701 ++T G E ++ QF T G L S A GA + ++ + Sbjct: 936 VITPGIGDTPLMMSSELGKIVMQFKTFFFGTHNRALVSGIQSGDASFYYGALLQISLGSL 995 Query: 702 QYSATMALAG--IGVASIKALLRGEDPSLPEVIYDG------TLANGALLPYMDRLTKLV 753 Y +AG I + G D S L+ G+ Sbjct: 996 VYVLKSMMAGREINAEPANLVKEGLDWSGMMGWLGEPNNLLENLSGGSYGMSAMFGGPPA 1055 Query: 754 SKG-DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKN 812 S+ R IG LLGP + ++ + + + ++ + +++RK LPF N++YL Sbjct: 1056 SRYQSRNGIGALLGPTFDLGGDIQNITAGVMNGEFDDRE---VRSVRKLLPFQNLFYLSP 1112 Query: 813 SFDHLILNQILEEL 826 +LNQ+ E+L Sbjct: 1113 -----LLNQVEEQL 1121 >gi|218514216|ref|ZP_03511056.1| hypothetical protein Retl8_11184 [Rhizobium etli 8C-3] Length = 73 Score = 79.6 bits (194), Expect = 2e-12, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 4/61 (6%) Query: 802 LPFMNMWYLKNSFDHLILNQILEELNPGYL---DRQQSKKKKKGIE-LFQNMDEGLPHRL 857 P ++WY K + D LI + I ++P Y DR + + K++ + + +GLP R Sbjct: 10 TPGSSLWYTKIATDRLIFDNIQAMIDPNYRASFDRYERRMKREFGQAFWWGPGDGLPQRP 69 Query: 858 P 858 P Sbjct: 70 P 70 >gi|227355848|ref|ZP_03840241.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164167|gb|EEI49064.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 1127 Score = 78.5 bits (191), Expect = 5e-12, Method: Composition-based stats. Identities = 106/685 (15%), Positives = 228/685 (33%), Gaps = 116/685 (16%) Query: 172 FETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKD-DFVRSMLDWLDLSRYKDID 230 T + + ++A + + + K R + + L D R ++ Sbjct: 528 QATLQRKLQRINDAENKLPALQRSVDILDNPRKFRNEHRRLTRTANSLTRHDRIRQSALN 587 Query: 231 G-TPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREF---ERVFHFKDSQAHMD 286 TPL R E+ + ++ + + + S PS + VKR +R + D + Sbjct: 588 RLTPLEREELDAAADDIINKIIGAPSGIVPSELIPDGLVKRAGFTKDRTLNIPD-----E 642 Query: 287 YMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKV 346 ++ + + ++VN ++ + + ++ +I + + G M Q A +E + Sbjct: 643 RIKDY-LESDVNYVMENYIRQVAPEIELTAKFG------RVDMDNQIKAITEEYNQLIAD 695 Query: 347 LKDWLGRNKLEVRQEAMLQ----MWEVMRYGETVE---NTGWANWMAGLRSAAGASMLGQ 399 R++LE R+EA L+ M + + ++ + R +LG Sbjct: 696 ATTPKERSRLEARREADLRDIRAMRDRLLGTYGAPKDPSSFFVRAGRVARHVNFLRLLGG 755 Query: 400 HPIGALLEDG--FISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRN 457 I +L + + + S + + + I M + + L ++G+ E V++ Sbjct: 756 MTISSLPDMARPIMQHGLRSALKPLSKMLTDIGAMRIAKAD--LREMGIGLEYVLSSRSK 813 Query: 458 MMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADP 517 ++ + +WS ++ + ++ + + G +T + Sbjct: 814 VIADLSDPYSRRSYLERGLQWSSQKFGNFTLMNQYTDTMKMWSGLITQS----------- 862 Query: 518 RLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKI 577 V+K A T +L +++ LA + Sbjct: 863 ------------------KVLKAA------------NTLDAGGSLSKREIKKLAHIG--- 889 Query: 578 AYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAM 637 + + + Q +L H+ + D VR Sbjct: 890 ------------IDESMLKRIADQFKRHGEDLDGMLTG------HSHLWD--DRVVRETF 929 Query: 638 HTSLFDRQRLGLLTYKRGTRA----GEALRMFQQFTTTPTGMFLNILD---LSNSAKMPK 690 ++ R ++T G E ++ QF T L S A Sbjct: 930 QAAVLKDVRTTVITPGIGDTPLMMSSELGKIVMQFKTFFFATHNRALVSGIQSGDASFYY 989 Query: 691 GASMALNHVWIQYSATMALAGIGVAS--IKALLRGEDPSLPEVIY---DGTLANGALLPY 745 GA + + + Y +AG + + + G D S + L N + Y Sbjct: 990 GALLQVALGSLVYVLKAKMAGRDINTEPANLVKEGLDWSGMMGWLGEPNNVLENLSGGTY 1049 Query: 746 MDRLT---KLVSKG-DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKT 801 S+ R IG LLGP + ++ + + + ++ + +++RK Sbjct: 1050 GMSAMFGGPPASRYQSRNGIGALLGPTFDLGGDIKNITSGVLNGEFDDRE---VRSVRKL 1106 Query: 802 LPFMNMWYLKNSFDHLILNQILEEL 826 LPF N++YL +LNQ+ E++ Sbjct: 1107 LPFQNLFYLSP-----LLNQVEEQM 1126 >gi|262043399|ref|ZP_06016524.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039225|gb|EEW40371.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 964 Score = 70.0 bits (169), Expect = 2e-09, Method: Composition-based stats. Identities = 96/735 (13%), Positives = 208/735 (28%), Gaps = 137/735 (18%) Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +AA + + +LG+T ++ + G N + + Sbjct: 327 RREEAAVVTANKQAYTQYKAEGGDLGYTAFREQVGEALRN--GDVHVNTKVQEAAQAMRT 384 Query: 174 TQRELHSQAHEAGLDYKFFE-------NRIPQPMSVDKLRATKKDDFVRSMLDWLDL--S 224 + + E GL E + P+ V K+ +++D F ++DW Sbjct: 385 VINRVKTAQQELGLLPPDAELKAMGQTSYFPRVYKVGKI-VSERDKFRNMLVDWWSRGEK 443 Query: 225 RYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAH 284 D + + I VG + ++ + + R D Sbjct: 444 TMSREDAEIAADTTINRIVGAKIPQEFA-------NVFMVKAPGSTK-SRTLSVPD---- 491 Query: 285 MDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGN 344 M+ + + ++ N +L + S +I + R G + + + I ++ +A Sbjct: 492 -RLMKDY-LESDANYVLQRHIREASAEIELTRTFG---NKSLDSQLA-AIQDEYDALMRL 545 Query: 345 KVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVE---NTGWANWMAGLRSAAGASMLGQHP 401 + + E +L + + + + ++ + A LRSA + LG Sbjct: 546 RPAEQEKLAKAREADLRDILALRDRLVGTYGMPDDPSSFFVRAGAFLRSANFVTKLGGMT 605 Query: 402 IGALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEG 461 + A+ + L +V N M G Sbjct: 606 VSAIPD--------------------------------------LARGMMVNGFSNTMRG 627 Query: 462 SDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDP 521 + + S A + A+ + + T L D + Sbjct: 628 ----------YGALITRSPAYLASRAEQKKMAVGLETILHTRARTMGDLVDSSS---RTT 674 Query: 522 SIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHR 581 + +A +++ D + T I + Sbjct: 675 AAEAGMERITDVFGKLTMMGHFDDMNKSVNGMITSDGILS------------GAFPTKRL 722 Query: 582 KKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSL 641 KL ++ ++ ++E + ++ I + L+ V V + T Sbjct: 723 AKLGINEKMAERIQREFHKHGEVIQGWHIGNFEKWDDQYAAGLLQSAVLKDVNNTVITPG 782 Query: 642 FDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWI 701 L T G + QF + T + G + Sbjct: 783 IGDTPLWAS-----TPLG---KTVFQFKSFATASYNRAT---------LGGLQEGTAQFY 825 Query: 702 QYSATMALAGIGVASIKALLRGE--DPSLPEVIYDGTLANGALLPYMDR----------- 748 +A G ++K G D + +++ +G +G L P M+ Sbjct: 826 YGTAFQIGLGSLTYALKQAANGREVDLTPQKMVLEGIDRSGILGPLMEYNNMAEKASGGM 885 Query: 749 -----LTKLVSKG---DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRK 800 L ++ R IG LGP ++ +T + D + ++R Sbjct: 886 IGLGPLLGTGTQSRYASRGFIGSALGPTFGLLDTVTDVTAGVLNGD---AGDRVLHSVRT 942 Query: 801 TLPFMNMWYLKNSFD 815 LP N++++ + Sbjct: 943 LLPGNNLFWVAPLIN 957 >gi|295096859|emb|CBK85949.1| hypothetical protein ENC_24210 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 963 Score = 70.0 bits (169), Expect = 2e-09, Method: Composition-based stats. Identities = 96/728 (13%), Positives = 200/728 (27%), Gaps = 123/728 (16%) Query: 114 MKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFE 173 + +AA + + + +L F+ ++ + G N + Sbjct: 326 RREEAAVVVTNKQAYSHYKASGGDLSFSRFREEVGNAMRS--GDVHANPVVQEAAQAMRT 383 Query: 174 TQRELHSQAHEAGL--------DYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDL-- 223 + + GL E+ P+ V K+ ++D F ++DW Sbjct: 384 VVNRVKVAQQKLGLLPPDEELKAIGQ-ESYFPRVYKVGKIVN-ERDKFRDMLVDWWSRGE 441 Query: 224 SRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQA 283 + + + I VG + ++ + R D Sbjct: 442 KTMSREEAEITADATINKIVGAKIPQDFA-------NVFMVKAAGSTR-SRTLSVPD--- 490 Query: 284 HMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAG 343 M+ + + ++ N +L + S ++ + R G S KQ+ D Sbjct: 491 --RLMKDY-LESDANYVLQRHIREASAEVELTRAFGN--KSLEKQLKDIQDEYDALMRQN 545 Query: 344 NKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVE---NTGWANWMAGLRSAAGASMLGQH 400 K ++R L+ + + + ++ + A LRSA + LG Sbjct: 546 PKDQAKLAKARDNDIRDITALR--DRLAGTYGMPDDPSSFFVRAGAFLRSANFVTKLGGM 603 Query: 401 PIGALLEDGF-ISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMM 459 + A+ + + A+ + R E L + + E ++ M Sbjct: 604 TVSAIPDLARGVMVNGFGNTMRGYSALITRSPAFKASRAEQLK-MAVGLETILHTRARTM 662 Query: 460 EGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRL 519 D S+ V + R+TD + L + + Sbjct: 663 G------------------------DLVDGSARTTAVEAGMERVTDAFGKLTLMGHFDDM 698 Query: 520 DPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAY 579 + S+ T I + Sbjct: 699 NKSVNG---------------------------MITSDGILS------------GAFAGR 719 Query: 580 HRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHT 639 KL + ++ R E ++ + I + + + V V + T Sbjct: 720 RLAKLGINDNMAARIRSEFEKHGEVINGWHIGNFEKWDDQHVAGVFQSAVLKDVNNTVIT 779 Query: 640 SLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNI---LDLSNSAKMPKGASMAL 696 L T G + QF + T + + + G + + Sbjct: 780 PGIGDTPLWAS-----TPLG---KTIFQFKSFATASYNRATLGGLQEGTGQFYYGTAFQI 831 Query: 697 NHVWIQYSATMALAGIGV--ASIKALLRGED------PSLPEVIYDGTLANGALLPYMDR 748 + Y+ + G V + K +L G D P + + G + Sbjct: 832 GLGALTYALKQSANGKEVDWSPNKLVLEGVDRSGILGPLMEYNNMAEKASGGMVGLGALL 891 Query: 749 LTKLVSKG-DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNM 807 T S+ R IG LGP ++ +T + D + +R LP N+ Sbjct: 892 GTGTQSRYASRGFIGSALGPTFGLLDTITDVTAGVLNGD---AGDRVLHNVRTLLPGNNL 948 Query: 808 WYLKNSFD 815 +++ + Sbjct: 949 FWIAPLIN 956 >gi|119386478|ref|YP_917533.1| hypothetical protein Pden_3771 [Paracoccus denitrificans PD1222] gi|119377073|gb|ABL71837.1| hypothetical protein Pden_3771 [Paracoccus denitrificans PD1222] Length = 1099 Score = 68.1 bits (164), Expect = 6e-09, Method: Composition-based stats. Identities = 112/855 (13%), Positives = 241/855 (28%), Gaps = 140/855 (16%) Query: 29 IVRAYVSLDGKG--LSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRV 86 + AY ++ +G +++ E G + ++ A K D Sbjct: 328 MNEAYKAMRKRGVAMTRTEFNNAVGQAMRRGDRSDIPEVAQAAASIRAKVFDPLKDRAVA 387 Query: 87 QAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQF 146 + + +F +E + + + F+ ++ DK Sbjct: 388 AGLLPEGVSVDTAESYFSRVWNRPVIEANEAEFKQILRNYFDGQVTAAAQRAAAETDKAT 447 Query: 147 G------LDVFDEMKGKKTQ-NEQASRLVKQYFETQRELHSQAHEAGLD------YKFFE 193 + M G++ + + + + + + +A +G+D + Sbjct: 448 ASLRSAREAIERSMAGRQADASALSDGVARGVADVMSDDAMRAFRSGVDTLAGRVVGELD 507 Query: 194 N----RIPQPM-SVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFA 248 ++ + ++ L + D++ D RY D EI V EV Sbjct: 508 EADLAKLAKIDADLEALGRRGEYDWLSDA----DRKRYLD---------EIVDSVYEVVT 554 Query: 249 ERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASL 308 R IP+ + ER FH D + +E F + +N + I+ + Sbjct: 555 GRALDADLPSNIIPTKRGPL---AERTFHIPD-----ELVEKF-LDSNADLIMRRYARVM 605 Query: 309 SKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE 368 S D+ + G V DQ A ++ K+ + +Q A L E Sbjct: 606 SADVELQTRFGS-----VTMKDQIKTIRDQYAQIRAELEKNTELPETAKQKQLAKLAAKE 660 Query: 369 VMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQR 428 + RS A G I+ ++ + Sbjct: 661 KSDIEDIQAVRDMLRGTYNARSQTTA-------------FGRIANAAMTFNYLRTLGGVT 707 Query: 429 INKMPLKERMELLSDVGLYAEGVVAHGRNMMEG----SDAFQIGHKLHSKMHKWSGAEYL 484 I+ + R ++ + Y E + M+G + + K+ A Sbjct: 708 ISSLTDAVRPAMVHGLKSYMEDGLKPLIRNMQGIKLAKKEAKEAGAISEKILHSRLATLA 767 Query: 485 DKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAM 544 D + + + + + L ++ A T ++K A+ + Sbjct: 768 DLTDPYAQGSPFERFLQNASVGFTKMTGLLHWNDFQKTLAA-----TMTQNRILKNAEIV 822 Query: 545 SSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLAD 604 + L A+ +A + + +P + ++ Sbjct: 823 ADR----------GFDALPKAEQAYMAYLG-----------LGRDGAPLLGRLFREHGQV 861 Query: 605 LERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRM 664 + + + +V ++ + + ++ + + + + + + T RM Sbjct: 862 I--DGVRVANSEVWPAEMDHMVRSWRAAINKDVDSIIVTKGVADVPLFASTTVG----RM 915 Query: 665 FQQFTTTPTGMFLNIL--DLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLR 722 QF + +L L G M+ G + +K L Sbjct: 916 ALQFRSFALASNQRVLLRGLQEDQTRFWGG-----------VVGMSAIGAFIYMLKQLES 964 Query: 723 GEDPSL-PEVIYDGTL-----------------ANGALLPY--MDRLTKLVSK------- 755 G + S P L G Y S+ Sbjct: 965 GREISDNPGTWVAEGLDRSGIFSLAFEVNNALEKAGGFGIYNAAAAAFPGKSQKAPASRF 1024 Query: 756 GDRAAIGGLLGPVPSMVTN---LTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKN 812 R + GP + L S + A D + + + +R+ PF ++ Y + Sbjct: 1025 ASRTGYASMFGPTYELGEGAYGLMSMGLRAARGDLDMTAGD-VGTLRRMTPFASLPYWRW 1083 Query: 813 SFDHLILNQILEELN 827 D I+N + E L+ Sbjct: 1084 LIDGQIVNPLKESLS 1098 >gi|301021601|ref|ZP_07185598.1| hypothetical protein HMPREF9551_01224 [Escherichia coli MS 196-1] gi|299881535|gb|EFI89746.1| hypothetical protein HMPREF9551_01224 [Escherichia coli MS 196-1] Length = 614 Score = 68.1 bits (164), Expect = 6e-09, Method: Composition-based stats. Identities = 75/585 (12%), Positives = 154/585 (26%), Gaps = 147/585 (25%) Query: 294 STNVNTILTSELASLSKDIVIARELGPNA------------DSFVKQMIVQTIANDQEAS 341 ++VN +L + + +I + R G DS ++++ + A E+ Sbjct: 107 ESDVNYVLQRHIREAAAEIELTRTFGKRTMTERLQLIEDEYDSLLREVPEKIKAKYDESV 166 Query: 342 AGNKVLKDWLG--------------------RNKLEVRQEAMLQMWEVMRYGETV----- 376 A K + G + + + + + ++ + + Sbjct: 167 ANLKARYESNGEVVPQGKLDSLMRKYEKELRKEQSRLSKSRANDLRDITALRDRLVGTYG 226 Query: 377 ----ENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKM 432 ++ + A LR + LG + A+ + + K +I++ Sbjct: 227 MPDDPSSFFVRAGAFLRDVNFTTKLGGMTVSAIPDLA-RGVMVNGFRNTMKGYASQISQS 285 Query: 433 PL-KERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISS 491 P K E + +G+ E V+ + Sbjct: 286 PAFKASKEEMLKMGIGLETVLHSRSRAIGDLVDSSSRTTAVEA----------------- 328 Query: 492 HALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYL 551 + R+TD + L + ++ S M DG L Sbjct: 329 -------GMERITDAFGKLTLMDRFNDINKS------------------MNGMVISDGIL 363 Query: 552 YARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEIN 611 P A KL + ++ R E ++ ++ I Sbjct: 364 SGAFP---------------------ARRLAKLGINDNMAARIRSEFEKHGEVIDGWHIG 402 Query: 612 ILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTT 671 + + V V + T L T + R QF + Sbjct: 403 NFDKWDDQYVAGVFQSAVLKDVNNTIITPGIGDTPLWAST----SWG----RTIFQFKSF 454 Query: 672 PTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGED------ 725 T + L G + +A G V ++K +G+D Sbjct: 455 TTASYNRAL---------LGGLQEGTAQFYYGTAFQIALGSLVYALKEASKGKDVDWSPE 505 Query: 726 --------------PSLPEVIYDGTLANGALLPYMDRLTKLVSKG-DRAAIGGLLGPVPS 770 P + GA+ T S+ R + L GP S Sbjct: 506 KLVLEGIDRSGILGPLMEYNNMAEKATGGAVGLGALFGTGTQSRYASRGFVSSLFGPSFS 565 Query: 771 MVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFD 815 + ++ + D +R T+P N++++ + Sbjct: 566 LADSIIDVTSGVLNGD---VGDRIVHNVRTTIPGNNLFWIAPLIN 607 >gi|259418630|ref|ZP_05742547.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259344852|gb|EEW56706.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 1302 Score = 64.2 bits (154), Expect = 9e-08, Method: Composition-based stats. Identities = 118/809 (14%), Positives = 243/809 (30%), Gaps = 142/809 (17%) Query: 50 AGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE 109 +G A K+L S D ++ Y R L + Sbjct: 578 SGELAAALDTKKLSISRRDGMEADYMREALEEMGYLPEGSTVNDLYDALRS-AAGGEKIY 636 Query: 110 VPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQA----- 164 E + + + ++F E E ++ +D+ + D+ + +KTQ +A Sbjct: 637 SSRENPFELSRFQAANEFAEAMEEMGIDITEPIDRIIA-QLPDKARNQKTQGAKATEAER 695 Query: 165 ---SRLVKQYFETQRELHS--QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLD 219 + R L + + EA +N I P ++++A + D +RS+L Sbjct: 696 SGKKAGKEDVSADVRALRALDRLDEANARLAELKNDIG-PKVQEEIKAAQAD--LRSILP 752 Query: 220 WLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR-EFERVFHF 278 L ++ + ++ + E + VRS P S E + RV Sbjct: 753 ELRKAKKAQSAEEFYANADDLQ-IEEAVTDTVRSLLNLKPGQHSYEATLSSPTRARVLDV 811 Query: 279 KDS--QAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN 336 D + ++ +N I++ + D+ + R+ G + +Q I + IA Sbjct: 812 DDLVLEPWLE--------SNAEAIMSQYFRQMVPDLELTRQFGDAEMTVARQRITEEIAR 863 Query: 337 DQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASM 396 + + + K + + + R + + M + +R V W+ G R+ S Sbjct: 864 NMQDAKSAKDRVRI--QEEGQERLKDLEGMRDRLRNRYGVPENPRNGWVQGGRALRTVSY 921 Query: 397 ---LGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVA 453 LG + A+ + I + R G++ + + +R + L + + Sbjct: 922 MGYLGGMMLSAIPDIAGI----IGRGGVEGAFGAGVTALTNPKR------MALASRDMAE 971 Query: 454 HGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL 513 G AE+ R + M D Y + Sbjct: 972 IGA-----------------------AAEWWLNSRAL--------SLAEMFDPYGGGTKM 1000 Query: 514 KADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARM 573 + R+ F I + ++ ++ Sbjct: 1001 E---RVLGQGARQFSIATGMIPWNIGWKSVGGAAVASKMSKAADAVRG------------ 1045 Query: 574 SDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSV 633 K + + + P + + QL D+ ++K L L Q Sbjct: 1046 -GKATKKQLRTLAENGIEPWMAERIAAQL------------DEFADKGGTLWLPRGQEWT 1092 Query: 634 RGAMHTSL--FDRQRLGLLTYKRG-----TRAGEALRMFQQFTTTPTGMFLNILDLSNSA 686 + + L+ G + + E + F QF + IL Sbjct: 1093 DPEAFKAFETAMNREFDLMVITPGQDKPLSFSTEMGKFFGQFKSFALSAHHRIL------ 1146 Query: 687 KMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVI-------------- 732 + + + T + G A++KA L G +P + Sbjct: 1147 ---LSGIQRADADVLAQATTALVFGALTANVKAYLGGYEPKEGAAMWEDALDRSGLAGWL 1203 Query: 733 -----YDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDN 787 L+ G + +++ ++ +A+ G LGP V + + N Sbjct: 1204 MEPYNLAAALSGGKTSITGEPVSRYQAR---SALEGALGP---SVDMMKGGVEAINAFSN 1257 Query: 788 ENSKVNATKAIRKTLPFMNMWYLKNSFDH 816 + + + + +P N+WYL F Sbjct: 1258 GKANYRDVRKLMRPIPGNNLWYLLPLFQK 1286 >gi|13186153|emb|CAC33464.1| hypothetical protein [Legionella pneumophila] Length = 504 Score = 63.1 bits (151), Expect = 2e-07, Method: Composition-based stats. Identities = 70/493 (14%), Positives = 138/493 (27%), Gaps = 76/493 (15%) Query: 330 IVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE-VMRYGETVENTGWANWMAGL 388 + + G + K N +A +QM + V G V N+ A + + Sbjct: 66 LRKEFDTQSAGLTGKQAQKLREQYNSNIEDMKAAIQMLQGVYGQGFNVLNSSGAEFFNNV 125 Query: 389 RSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYA 448 + MLG I +L + G + + + I + K + +G Sbjct: 126 MNWNYTRMLGHMTISSLPDLGMLVMRN-GLMATLAHGIGESFSVVKKISKNDIKALGYAI 184 Query: 449 EGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYA 508 E + Y++ +S++ + +T + Sbjct: 185 ETELGTQIK------------------------TYIEHSGLSTNPSPFTKGLNSLTRAFG 220 Query: 509 SLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLR 568 +L + + ++ ++ T+ K S I N +++ Sbjct: 221 NLSLMNPWTDMIQNMAGHIA-INRILTTIHKVVNGESVAKKETTLLARLGISNEYFSEIA 279 Query: 569 DLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDN 628 + + N +P + L+ A + + I Sbjct: 280 KFTKDNVYKGTRYADWTNWDIKTPSELNALKAFQAAVGKSIDEI---------------- 323 Query: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNI----LDLSN 684 + + LL +RG G + QF + I + N Sbjct: 324 ----------SLSPNLGDKPLLLQQRGAF-GHMTNLMFQFKSFLFAATNRIFYSGIQNRN 372 Query: 685 SAKMPKGASMALNHVWIQYSATMALAGIGVASI--KALLRGEDPSLPEVIYDGTLANGAL 742 + GA + + Y + L G + K LL + I G Sbjct: 373 DINLYLGAVSMMGLGMLGYVVSSHLRGNKEIDLSTKNLL--REGVDRSGILAIF---GEG 427 Query: 743 LPYMDRLT--KLVSKG-DRAAIGGLLGPVPSMVTNLTSS-----AVELATKDNENSKVNA 794 + +L VS+ R A G +LGP V+ L S + A + A Sbjct: 428 INIGQKLFQLGEVSRYKSRDAFGSVLGPTGGSVSQLVSLFNKLNPLSTAKGEWTTKDAEA 487 Query: 795 TKAIRKTLPFMNM 807 + + +PF + Sbjct: 488 ---VMRLMPFAKL 497 >gi|291336675|gb|ADD96218.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 106 Score = 62.7 bits (150), Expect = 3e-07, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 40/107 (37%), Gaps = 5/107 (4%) Query: 234 LSRSEIASFVGEVFAERVRSTS----FKDPSIPSSEVGVKREFERVFHFKDSQAHMDYME 289 ++ + F+ + +R+ + + + + + +RV HFK S +Y Sbjct: 1 MTPEAMDRFLSRAYNSLIRNENQIVNGAGDTFGARSMVKQLGAKRVLHFKSSDDWFEYNT 60 Query: 290 HFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN 336 FG N+ + ++I + +LG N +++ N Sbjct: 61 MFG-GRNLKEAIFGGFHVAGQNIGMMSKLGSNPQRNYAKIMDLVKTN 106 >gi|218514496|ref|ZP_03511336.1| hypothetical protein Retl8_12732 [Rhizobium etli 8C-3] Length = 182 Score = 62.3 bits (149), Expect = 4e-07, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 52/143 (36%), Gaps = 5/143 (3%) Query: 271 EFERVFHFKDSQAHMDYMEHFGVSTN-VNTILTSELASLSKDIVIARELGPNADSFVKQM 329 RVF F + + + M+ +GV + + + + +++++I LGPN +++ Sbjct: 43 NQLRVFRFDNPETYKRLMKKYGVGSGGLFNTIMGHVQAMAREIAFTEVLGPN----YQRI 98 Query: 330 IVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLR 389 + G + K R + + ++ A G+R Sbjct: 99 SRSCCRRRAKMMPGARSAKRIGNRITMNSPGAVQRTYDALSGRLGVAQSELIAGIGGGMR 158 Query: 390 SAAGASMLGQHPIGALLEDGFIS 412 + A+ LG I AL D + Sbjct: 159 NLQTAARLGSATIAALPGDSMTA 181 >gi|294490696|gb|ADE89452.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 1129 Score = 53.1 bits (125), Expect = 2e-04, Method: Composition-based stats. Identities = 51/360 (14%), Positives = 109/360 (30%), Gaps = 54/360 (15%) Query: 498 NQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR-----AKAMSSPDGYLY 552 +G M ++ +K R + + T I ++ ++ G + Sbjct: 777 KSLGPMVSMLKNMDSVKIATRDLREMAVGLDYVLSTRTKAIADLTDPYSRRSAAERGLNW 836 Query: 553 ARTPSTIKNLKD---ADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKE 609 L + + L+ + M + + S + + + + + Sbjct: 837 MTQKFGNWTLMNQWNSALKSWSGMIVQSRILDAARQVSAGGTLSKSEMRKMAQVGINEDV 896 Query: 610 INILKDKVS---NKMHALVLDNVQT----SVRGAMHTSLFDRQRLGLLTYKRGT----RA 658 + + ++ M L+ + R +++ ++T G + Sbjct: 897 LRRIGEQFGKHGEDMDGLLTGHSHLWDDRFAREIFQSAVLKDVDSVIVTPGVGDTPLFFS 956 Query: 659 GEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIK 718 E +M QF T +L G ++ T+AL + +K Sbjct: 957 KEGWKMITQFKTFIFAQHNRVLV--------SGIQQGDAAFYLGALGTIALGSMVYM-MK 1007 Query: 719 ALLRGED---------------------PSLPEVIYDGTLANGALLPYMDRLTKLVSKG- 756 L G D S P + ++ G VS+ Sbjct: 1008 QKLSGRDIDYSWNNLVKEGIDRGGMLGWLSEPLNTVE-NISGGRFGLGAMFGAPPVSRFQ 1066 Query: 757 DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDH 816 R AIG LLGP + + + A + + ++ + +A + K LPF N+W + + Sbjct: 1067 SRNAIGALLGPTFDLGGDAATVANGVLNGEFDSQQTHAVR---KMLPFQNLWAISPLLNK 1123 >gi|301046396|ref|ZP_07193556.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301622|gb|EFJ58007.1| conserved domain protein [Escherichia coli MS 185-1] Length = 1129 Score = 53.1 bits (125), Expect = 2e-04, Method: Composition-based stats. Identities = 51/360 (14%), Positives = 109/360 (30%), Gaps = 54/360 (15%) Query: 498 NQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKR-----AKAMSSPDGYLY 552 +G M ++ +K R + + T I ++ ++ G + Sbjct: 777 KSLGPMVSMLKNMDSVKIATRDLREMAVGLDYVLSTRTKAIADLTDPYSRRSAAERGLNW 836 Query: 553 ARTPSTIKNLKD---ADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKE 609 L + + L+ + M + + S + + + + + Sbjct: 837 MTQKFGNWTLMNQWNSALKSWSGMIVQSRILDAARQVSAGGTLSKSEMRKMAQVGINEDV 896 Query: 610 INILKDKVS---NKMHALVLDNVQT----SVRGAMHTSLFDRQRLGLLTYKRGT----RA 658 + + ++ M L+ + R +++ ++T G + Sbjct: 897 LRRIGEQFGKHGEDMDGLLTGHSHLWDDRFAREIFQSAVLKDVDSVIVTPGVGDTPLFFS 956 Query: 659 GEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIK 718 E +M QF T +L G ++ T+AL + +K Sbjct: 957 KEGWKMITQFKTFIFAQHNRVLV--------SGIQQGDAAFYLGALGTIALGSMVYM-MK 1007 Query: 719 ALLRGED---------------------PSLPEVIYDGTLANGALLPYMDRLTKLVSKG- 756 L G D S P + ++ G VS+ Sbjct: 1008 QKLSGRDIDYSWNNLVKEGIDRGGMLGWLSEPLNTVE-NISGGRFGLGAMFGAPPVSRFQ 1066 Query: 757 DRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDH 816 R AIG LLGP + + + A + + ++ + +A + K LPF N+W + + Sbjct: 1067 SRNAIGALLGPTFDLGGDAATVANGVLNGEFDSQQTHAVR---KMLPFQNLWAISPLLNK 1123 >gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 56 Score = 52.7 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 15/52 (28%), Positives = 29/52 (55%), Gaps = 7/52 (13%) Query: 797 AIRKTLPFMNMWYLKNSFDHLILNQILEELNPG-------YLDRQQSKKKKK 841 + T+PF N+WY K+ FD+ + ++ + +NPG Y + ++K+K Sbjct: 4 VLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRK 55 >gi|307180901|gb|EFN68709.1| Laminin subunit beta-1 [Camponotus floridanus] Length = 2183 Score = 51.1 bits (120), Expect = 8e-04, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 65/187 (34%), Gaps = 18/187 (9%) Query: 16 ELSKKELRRLEDGIVRAYVSLDGKGLSKAERY---RLAGLKAEEDFQKELIRSVNDAIDE 72 +L E+ +L D I S L+ +E+ L+ D ++ R+ A+++ Sbjct: 1931 QLEPDEITQLADRIKSIVGS-----LTDSEKILADTKNDLRLAYDLEERANRTKEMALEK 1985 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLF--FKAGSAEVPLEMKIKAAETKVLSKFNEY 130 +++ L+ Q Y A+ K+ + KAA+ + S Sbjct: 1986 QALVNKVNLLLNDAQTAQYLAQSAIDKAEADVSKSQKDLADIADVTKAAQIQANSTTQSV 2045 Query: 131 AEVGSKNLGFTLDKQFGLDVFDEM--KGKKTQN------EQASRLVKQYFETQRELHSQA 182 + ++ V E+ + K N + +L ++Y L+ + Sbjct: 2046 EALDNRLKQLQTQSAKNAFVLKEIAVEANKVGNEAQMIDAKTKKLAEEYKRADESLNQRV 2105 Query: 183 HEAGLDY 189 +++ D Sbjct: 2106 NKSKGDI 2112 >gi|326479584|gb|EGE03594.1| protein kinase C substrate [Trichophyton equinum CBS 127.97] Length = 565 Score = 48.8 bits (114), Expect = 0.004, Method: Composition-based stats. Identities = 46/359 (12%), Positives = 107/359 (29%), Gaps = 43/359 (11%) Query: 26 EDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDR 85 ED K + +R A L+ ++ ++ + D + DL+ Sbjct: 151 EDRCKEIGKQWK-KSEEEKKRSYSAALRKRKELAAHASKTEKEMQDRILALEKEAQDLEG 209 Query: 86 VQAGVYGKSQ---ALFNKLFFKAGSAEVPLEMK--IKAAETKVLSKFNEYAEVGSKNLGF 140 A + + + A E+ KA + + E + + Sbjct: 210 SLADLEAQLETARARNRGKTASGQKQGKAYELAQLAKARTDTLRTVLEEVHLQRDQVVNL 269 Query: 141 TLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPM 200 + + L F E +E R V+ + + ++ D E P+ Sbjct: 270 LREAEGILSKFKEEYNPNFNDEGVKRAVRSWEDYVARKGEHGSDSFGDDALLEALKPEHD 329 Query: 201 SVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSF---- 256 + + L Y ++ASF+ A + Sbjct: 330 EPF----GNPEQWAEEAEPGL---VY-----------KLASFLPAGIANTIEDGLASFRA 371 Query: 257 ---KDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIV 313 + + S + + R +D++ ++ G ++N + SE+ L +D+ Sbjct: 372 VLVSNGLLADSSLDDSSDEPREV--RDAKDKVN-----GAEVSLN-LKKSEIKDLKRDLE 423 Query: 314 IARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRY 372 + G DS +++ + I+ D + + + R + + +E + Sbjct: 424 --EDFGV--DSVFRELKGECISQDSGEYTYELCWMEQTKQKSKKGRADTTMGRFEKISS 478 >gi|326470668|gb|EGD94677.1| hypothetical protein TESG_02185 [Trichophyton tonsurans CBS 112818] Length = 546 Score = 48.8 bits (114), Expect = 0.004, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 54/201 (26%), Gaps = 10/201 (4%) Query: 26 EDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDR 85 ED K + +R A L+ ++ ++ + D + DL+ Sbjct: 151 EDRCKEIGKQWK-KSEEEKKRSYSAALRKRKELAAHASKTEKEMQDRILALEKEAQDLEG 209 Query: 86 VQAGVYGKSQ---ALFNKLFFKAGSAEVPLEMK--IKAAETKVLSKFNEYAEVGSKNLGF 140 A + + + A E+ KA + + E + + Sbjct: 210 SLADLEAQLETARARNRGKTASGQKQGKAYELAQLAKARTDTLRTVLEEVHLQRDQVVNL 269 Query: 141 TLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPM 200 + + L F E +E R V+ + + ++ D E P+ Sbjct: 270 LREAEGILSKFKEEYNPNFNDEGVKRAVRSWEDYVARKGEHGSDSFGDDALLEALKPEHD 329 Query: 201 SVDKLRATKKDDFVRSMLDWL 221 + + L Sbjct: 330 EPF----GNPEQWAEEAEPGL 346 >gi|83312738|ref|YP_423002.1| hypothetical protein amb3639 [Magnetospirillum magneticum AMB-1] gi|82947579|dbj|BAE52443.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 614 Score = 47.7 bits (111), Expect = 0.008, Method: Composition-based stats. Identities = 72/540 (13%), Positives = 162/540 (30%), Gaps = 94/540 (17%) Query: 294 STNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGR 353 ++V +L +++ D+ +A G I A + SA L R Sbjct: 130 ESDVEAVLRVYSRTMAPDVELATAFGRADMQDQLDKIASDYARLRVGSADPATLGQLDKR 189 Query: 354 NKLEVRQEAMLQMWEVMRYGETVENTGW-ANWMAGLRSAAGASMLGQHPIGALLEDGFIS 412 + ++R A ++ Y + +G+ +R+ ++G + +L + G Sbjct: 190 MRADLRDVAAVRDRIRGTYALPADPSGFIVRTGKVVRNWNYLRLMGGMTVASLADAG--- 246 Query: 413 RQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLH 472 + + G+ + A + M L A+ G + D+ + Sbjct: 247 -RAVMVHGMMRVAGDGLVPMVSN-----FRGFRLAAKEAQLAGAALDMVLDSRAM----- 295 Query: 473 SKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDD 532 AE D S + +TD + + + + Sbjct: 296 ------QLAEVWDDYGRLSK---FERGVKALTDRFGMVSLMAPWNTAM-----------E 335 Query: 533 TDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSP 592 V+ +++ + + +G + + K+ + + D A + Sbjct: 336 QFAAVVTQSRILQAVEGMA-----KGMHDPKEVEYLAFLGIDDHKAARIGDQFSRHGERQ 390 Query: 593 EQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTY 652 A ++R+ ++ L+ + +H +++ Q + L + T Sbjct: 391 SGGVMWANTSAWVDREAVDALRAALVKDVHRIIIKPGQD-------------KPLWMST- 436 Query: 653 KRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGI 712 E +M QF T + + + + +L + + + +A +G Sbjct: 437 -------ELGKMIGQFKTFSIASTQRVALAALQQRDAAALNGSLLSLGLGALSYVAYSGA 489 Query: 713 GVASIKALLRGEDPSL-PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPV--- 768 G D S P V + LL V+ G GP Sbjct: 490 ---------SGRDLSDHPAVWAKEAVDRSGLL----FWLSDVNNIGAKVFGYGEGPSRYA 536 Query: 769 -------------PSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFD 815 + + + + + +S A + + +PF N++YL+ FD Sbjct: 537 SRSATEALLGPGLGAGLDTSIQVLGDASRGEWRSSDTRALR---RLVPFQNLFYLRRLFD 593 >gi|302508899|ref|XP_003016410.1| hypothetical protein ARB_05809 [Arthroderma benhamiae CBS 112371] gi|291179979|gb|EFE35765.1| hypothetical protein ARB_05809 [Arthroderma benhamiae CBS 112371] Length = 450 Score = 46.9 bits (109), Expect = 0.016, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 54/201 (26%), Gaps = 10/201 (4%) Query: 26 EDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDR 85 ED K + Y A L+ ++ + ++ + D + DL+ Sbjct: 36 EDRCKEIGKQWKKTEEEKEKSYS-AALRKRKELAAQASKTEKEMQDRILALEKEAEDLEG 94 Query: 86 VQAGVYGKSQ---ALFNKLFFKAGSAEVPLEMK--IKAAETKVLSKFNEYAEVGSKNLGF 140 A + + + A E+ KA + + E + + Sbjct: 95 SLADLEAQLETARARNRGKTASGQKQGKAYELAQLAKARTDTLRTVLEEVHLQRDQVVNL 154 Query: 141 TLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPM 200 + + L F E +E R V+ + + ++ D + P+ Sbjct: 155 LREAEGILSKFKEEYNPNFNDEGVKRAVRSWEDYVARKGEHGSDSFGDDALLDALKPEHD 214 Query: 201 SVDKLRATKKDDFVRSMLDWL 221 + + L Sbjct: 215 EPF----GNPEQWAEEAEPGL 231 >gi|126000002|ref|YP_001039673.1| internal virion-like protein [Erwinia amylovora phage Era103] gi|121621858|gb|ABM63432.1| internal virion-like protein [Enterobacteria phage Era103] Length = 1294 Score = 46.9 bits (109), Expect = 0.017, Method: Composition-based stats. Identities = 94/681 (13%), Positives = 197/681 (28%), Gaps = 68/681 (9%) Query: 160 QNEQASRLVKQYFETQRELHSQAHEAGL-DYKFFENRIPQPMSVDKLRATKKDDFVRSML 218 +A+ + F+ E+ QA EAG + K ++ IP K+ + Sbjct: 630 GVRKAAEGISDRFKKALEIRKQAGEAGFENVKSAQDYIPALFDGPKIASA---------- 679 Query: 219 DWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPS-SEVGVKREFERVFH 277 ++RY + + + + +V + + + S S + + FERV Sbjct: 680 ----VTRYGTENVEAVLANGYRTGKYKVGRKASEAIAKMQVSRALDSTLSSRLSFERVVS 735 Query: 278 FKDSQAHMDYMEHFGVSTNVNT--ILTSELASLSKDIV--IARELGPNADSFVKQMIVQT 333 + Q +D + G+ ++ I EL ++ + R +G N + V + VQ Sbjct: 736 QSERQNFIDGLREAGIPDHIIDDFIEGQELDDVAAAVSSRAMRSMGINTQAEVGGVKVQD 795 Query: 334 IANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAG 393 + A K+ + EVM + E TG + R+ Sbjct: 796 LLKTNIAEIAENYGKEAAAGAAMARMGFRTRN--EVMAAIDAAERTGRNMGIGAKRAGDE 853 Query: 394 ASMLGQHPI----GALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDV---GL 446 A+ML L +D + +R + I R+N+M + E+ + G+ Sbjct: 854 ANMLRDSVRLLYGNTLDDDPNAAIVKATRRLREVTTITRLNQMGFAQAPEISRALVKMGI 913 Query: 447 YAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDT 506 + + G + ++G ++H + + + + N + Sbjct: 914 GP-VMKSVGATKILFGRRGRVGGTAQGELHD----VEMREVEQALGYIGEDNWLHGWATR 968 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 + + + K L + + + G T S LK Sbjct: 969 HDEFN--EDPDNIRKISKVLDNTLAAGSRANLVLSGFKAIQGGSEKIVTRSIAMRLKQHL 1026 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVL 626 + + + L ++ + + + +++ +L Sbjct: 1027 AGERKLPTKDLEEIGLDEATMARL--KRHFDDNPRYDEYNGEQVRMLNFDAMEPDLK--- 1081 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEAL-RMFQQFTTTPTGMFLNILDLSNS 685 + R GT + + QF L Sbjct: 1082 -----EATAIAIRRMQGRLIQRHFVGDEGTWMNKWWGKALTQFKGFSIVSLEKQLIHDIR 1136 Query: 686 AKMPKGA---SMALNHVWIQYSATMALAGIGVASIKALL--RGEDPSLPEVIYDGTLANG 740 + A ++ Y + M + IG A K L + + +L I++ Sbjct: 1137 GDKTQAAMIFGWSVFLAAAAYGSQMQMQSIGRADRKQFLDDKFNNQALAMGIFNKMPQVA 1196 Query: 741 ALLPYMDRLTK---------------LVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATK 785 AL D L + G + + + A+ Sbjct: 1197 ALGLLGDGLASVGAMPDAMLQAPGRTGFRSMGAGDLVAGAG-MVGDYQEVLQALSNYASG 1255 Query: 786 DNENSKVNATKAIRKTLPFMN 806 ++ S IR+ +P N Sbjct: 1256 SDDVSTRQLVDKIRRVVPLAN 1276 >gi|311875242|emb|CBX44501.1| internal virion-like protein [Erwinia phage phiEa1H] gi|311875363|emb|CBX45104.1| putative internal virion-like protein [Erwinia phage phiEa100] Length = 1294 Score = 46.9 bits (109), Expect = 0.017, Method: Composition-based stats. Identities = 91/681 (13%), Positives = 193/681 (28%), Gaps = 68/681 (9%) Query: 160 QNEQASRLVKQYFETQRELHSQAHEAGL-DYKFFENRIPQPMSVDKLRATKKDDFVRSML 218 +A+ + F+ E+ QA EAG + K ++ +P K+ + Sbjct: 630 GVRKAAEGISDRFKKALEIRKQAGEAGFENVKSAQDYLPALFDGPKIASA---------- 679 Query: 219 DWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPS-SEVGVKREFERVFH 277 ++RY + + + + +V + + + S S + + FERV Sbjct: 680 ----VTRYGTENVEAVLANGYRTGKYKVGRKASEAIAKMQVSRALDSTLSSRLSFERVVS 735 Query: 278 FKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVI----ARELGPNADSFVKQMIVQT 333 + Q +D + G+ ++ L + R +G N + V + VQ Sbjct: 736 QSERQNFIDGLREAGIPDHIIDDLIEGQELDDVAAAVSSRAMRSMGINTQAEVGGVKVQD 795 Query: 334 IANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAG 393 + A K+ + EVM + E TG + R+ Sbjct: 796 LLKTNIAEIAENYGKEAAAGAAMARMGFRTRN--EVMAAIDAAERTGRNMGIGAKRAGDE 853 Query: 394 ASMLGQHPI----GALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDV---GL 446 A+ML L +D + +R + I R+N+M + E+ + G+ Sbjct: 854 ANMLRDSVRLLYGNTLDDDPNAAIVKATRRLREVTTITRLNQMGFAQAPEISRALVKMGI 913 Query: 447 YAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDT 506 + + G + ++G ++H + + + + N + Sbjct: 914 GP-VMKSVGATKILFGRRGRVGGTAQGELHD----VEMREVEQALGYIGEDNWLHGWATR 968 Query: 507 YASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDAD 566 + + + K L + + + G T S LK Sbjct: 969 HDEFN--EDPDNIRKISKVLDNTLAAGSRANLVLSGFKAIQGGSEKIVTRSITMRLKQHL 1026 Query: 567 LRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVL 626 + + + L ++ + + + +++ +L Sbjct: 1027 AGERKLPTKDLEEIGLDEATMARL--KRHFDDNPRYDEYNGEQVRMLNFDAMEPDLK--- 1081 Query: 627 DNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEAL-RMFQQFTTTPTGMFLNILDLSNS 685 + R GT + + QF L Sbjct: 1082 -----EATAIAIRRMQGRLIQRHFVGDEGTWMNKWWGKALTQFKGFSIVSLEKQLIHDIR 1136 Query: 686 AKMPKGA---SMALNHVWIQYSATMALAGIGVASIKALL--RGEDPSLPEVIYDGTLANG 740 + A ++ Y + M + IG A K L + + +L I++ Sbjct: 1137 GDKTQAAMIFGWSVFLAAAAYGSQMQMQSIGRADRKQFLDDKFNNQALAMGIFNKMPQVA 1196 Query: 741 ALLPYMDRLTK---------------LVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATK 785 AL D L + G + + + A+ Sbjct: 1197 ALGLLGDGLASVGAMPDAMLQAPGRTGFRSMGAGDLVAGAG-MVGDYQEVLQALSNYASG 1255 Query: 786 DNENSKVNATKAIRKTLPFMN 806 ++ S IR+ +P N Sbjct: 1256 SDDVSTRQLVDKIRRVVPLAN 1276 >gi|295676075|ref|YP_003604599.1| protein of unknown function UPF0118 [Burkholderia sp. CCGE1002] gi|295435918|gb|ADG15088.1| protein of unknown function UPF0118 [Burkholderia sp. CCGE1002] Length = 371 Score = 46.5 bits (108), Expect = 0.020, Method: Composition-based stats. Identities = 42/257 (16%), Positives = 92/257 (35%), Gaps = 27/257 (10%) Query: 491 SHALIVYNQIGRMTDTY-ASLKDLKADPRLDP----SIKAFFKQLDDTDFTVIKRAKAMS 545 + V+ + + + + L DL + P SI+AF+++L ++ +I + + ++ Sbjct: 84 AFGAHVHEIVALVQRLFESGLPDLPPWVQRIPLVGSSIEAFWERLTSSNSELIAQLRTLA 143 Query: 546 SPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADL 605 +P G I A L ++ I + A Sbjct: 144 APAGKW-------ILAAALAVTHGLGLLALSIVLAFFFYTGGEGA------------AAW 184 Query: 606 ERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMF 665 + + + + + AL V+ V G + T+L G + G A L + Sbjct: 185 LNAGMRRVAGERAEYLLALAGSTVKGVVYGILGTALVQGVLAGFGFWVAGVPAPALLGLV 244 Query: 666 QQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGED 725 F + G + + L + + +G S + + + + G+ IK +L G++ Sbjct: 245 TFFLSVVPGGPVVVW-LPAAIWLYQGGSTGWAIFLVVW--GLLVVGMADNVIKPILIGKN 301 Query: 726 PSLPEVIYDGTLANGAL 742 +P ++ + GA Sbjct: 302 SDMPLILVMLGILGGAF 318 >gi|302659279|ref|XP_003021331.1| hypothetical protein TRV_04537 [Trichophyton verrucosum HKI 0517] gi|291185226|gb|EFE40713.1| hypothetical protein TRV_04537 [Trichophyton verrucosum HKI 0517] Length = 450 Score = 45.7 bits (106), Expect = 0.033, Method: Composition-based stats. Identities = 23/201 (11%), Positives = 53/201 (26%), Gaps = 10/201 (4%) Query: 26 EDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDR 85 ED K + Y A L+ ++ + ++ + D + DL+ Sbjct: 36 EDRCKEIGKQWKKTEEEKEKSYS-AALRKRKELAAQASKTEKEMQDRILALEKEAQDLEG 94 Query: 86 VQAGVYGKSQ---ALFNKLFFKAGSAEVPLEMK--IKAAETKVLSKFNEYAEVGSKNLGF 140 + + + A E+ KA + + E + + Sbjct: 95 SLVDLEAQLETARARNRGKTASGQRQGKAYELAQLAKARTDTLRTVLEEVHLQRDQVVNL 154 Query: 141 TLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPM 200 + + L F E +E R V+ + + ++ D + P+ Sbjct: 155 LREAEGILSKFKEEYNPNFNDEGVKRAVRSWEDYVARKGEHGSDSFGDDALLDALKPEHD 214 Query: 201 SVDKLRATKKDDFVRSMLDWL 221 + + L Sbjct: 215 EPF----GNPEQWAEEAEPGL 231 >gi|328792916|ref|XP_001122457.2| PREDICTED: laminin subunit beta-1 [Apis mellifera] Length = 1774 Score = 45.7 bits (106), Expect = 0.037, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 62/185 (33%), Gaps = 13/185 (7%) Query: 16 ELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYK 75 +L E++ L D I SL A+ L E + + DAI++ Sbjct: 1521 QLKPDEIKELADRIKSIVGSLTDSDKILADT--KDDLYLAEQLKNRATKMKEDAIEKQVL 1578 Query: 76 RHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAE--VPLEMKIKAAETKVLSKFNEYAEV 133 + + L+ + +A+ + S + + K A+ + S + Sbjct: 1579 ANVVVVLLNDAKKAQTRAQEAINQAERDVSRSEKDLEEIAEVTKGAQMQANSTTQTVDSL 1638 Query: 134 GSK---------NLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHE 184 ++ F L+++ ++ + + + + L +Y L S+ ++ Sbjct: 1639 DARLKQLQTQSVRNDFVLNQEISVEARKIAEEAQNVDIKTKELAMEYKNADELLDSRMNK 1698 Query: 185 AGLDY 189 + + Sbjct: 1699 SNGNI 1703 >gi|167600423|ref|YP_001671923.1| phage particle protein [Pseudomonas phage LUZ24] gi|161168286|emb|CAP45451.1| phage particle protein [Pseudomonas phage LUZ24] Length = 1055 Score = 45.3 bits (105), Expect = 0.048, Method: Composition-based stats. Identities = 91/755 (12%), Positives = 187/755 (24%), Gaps = 131/755 (17%) Query: 90 VYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLD 149 K+ + + + K + ++ K+ Sbjct: 374 PLAKASPIAREFSETFRADMSGKRASGKTIFEDQELQAGKWNSELDNIFEGKSSKEIDRI 433 Query: 150 VFDEMKGKKTQNEQASRLVKQYFETQRELHSQA-HEAGLDYKFFENRIPQPMSVDKLRAT 208 + D G T +A+RL ++ ++A + G+ N +P +S +K+++ Sbjct: 434 ISDTSAGVNT--PEATRL----RALMDDVRNEAVNRGGMSVGTIPNYMPFGLSPEKVQS- 486 Query: 209 KKDDFVRSMLDWLDLSRY-----------KDIDGTPLSRSEIASFVGE-----VFAERVR 252 +F+ + + + D + E+ V + + R Sbjct: 487 --PEFLNDITPYFQSRQAAEDAVANWLAEVSDDTRGNTAPEVNRLVTQNQQTGAWEVDPR 544 Query: 253 STSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTI-------LTSEL 305 DP + ++S+A + ++N + Sbjct: 545 YRIQGDPDTLRGRFAQSDAVPKYGQLEESRAFGSVPQEILNKYSLNDTPKKRLQEIRDYF 604 Query: 306 ASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQ 365 S I G N + I +A Q A G+ + + M Sbjct: 605 EGASHRIAFTERFGINGEKA-NAKIASAVAEAQRA-----------GKRVTKEEVDRMYD 652 Query: 366 MWEVM-RYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKE 424 + + +++ A A S L L E + K Sbjct: 653 LVDAYNGMHGRIKDPNLKKLAAVTSGALVLSRLPLAGFSTLTE---------FSLPFAKA 703 Query: 425 AIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYL 484 + L E++ R + G + G + + A Sbjct: 704 GVMPTLGAVLPTMGEVVRQA----------ARRIYSGVPKSETGRFMSD--MNHTLASAT 751 Query: 485 DKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAM 544 A + + I + + L ++ + + M Sbjct: 752 SLMADRVGAEVFNSTIQKAIRGQFLINGLSILTHVNRIFAT-------ETAKRVYQNNLM 804 Query: 545 SSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLAD 604 G ++ +K + L M I + LK +P Sbjct: 805 DLAAGLPFSSANGALK------VAQLREMGVNIGSQQDALKLISPATPS----------- 847 Query: 605 LERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRM 664 E+ + + + M V + F + + + ++M Sbjct: 848 ----EVLMANNVKTLAMRRF--------VDQVVLDPTFADKPMWMSNGN--------VQM 887 Query: 665 FQQFTTTPTG-------MFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASI 717 F P MF L + A + M G + Sbjct: 888 FSLLKGYPAAYGNIILPMFRRRLSPHFAGSWTNAGMGAAGIAF--TLGLMMSLGYLQDEL 945 Query: 718 KALLR-----GEDPSLPEVIYDGTLANG---ALLPYMDRLTKLVSKGDRAAIGGLLGPVP 769 + L + ED PE + D LT + +LGPV Sbjct: 946 RQLAKFGGSSREDTRSPEQRMMDAVMQQMPLQASMIYDMLTGY--RRGTTPAEVVLGPVA 1003 Query: 770 SMVTNLTSS-AVELATKDNENSKVNATKAIRKTLP 803 T + +A+ ++ S K + K P Sbjct: 1004 GAATEGAMAVGKTIASFGDDPSAGEIWKFLYKQTP 1038 >gi|94309527|ref|YP_582737.1| hypothetical protein Rmet_0582 [Cupriavidus metallidurans CH34] gi|93353379|gb|ABF07468.1| conserved hypothetical protein; putative membrane protein [Cupriavidus metallidurans CH34] Length = 367 Score = 43.4 bits (100), Expect = 0.16, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 47/144 (32%), Gaps = 13/144 (9%) Query: 604 DLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALR 663 D R + + ++ + +L V+ V G + T+ G+ + G L Sbjct: 183 DWVRGGMRRVSGDRADHLLSLAGSTVKGVVYGVLGTAFVQAVLAGIGFWIAGVPGAAILG 242 Query: 664 MFQQFTTT-----PTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIK 718 F + P L L ++ + M + A+ G+ IK Sbjct: 243 FITFFLSVVPMGPPLAWIPAALWLYHTGETGWAIFMVVW--------GAAVVGMADNVIK 294 Query: 719 ALLRGEDPSLPEVIYDGTLANGAL 742 LL + LP + + GAL Sbjct: 295 PLLISKGTGLPLIWIMMGVLGGAL 318 >gi|31711679|ref|NP_853597.1| internal virion protein [Enterobacteria phage SP6] gi|31505683|gb|AAP48776.1| gp37 [Enterobacteria phage SP6] gi|40787054|gb|AAR90028.1| 36 [Enterobacteria phage SP6] Length = 1270 Score = 43.0 bits (99), Expect = 0.25, Method: Composition-based stats. Identities = 34/296 (11%), Positives = 80/296 (27%), Gaps = 27/296 (9%) Query: 536 TVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQR 595 + I + I+ + + K ++ + L Sbjct: 960 SAIIDNGLAMGSRINTWLSGFKAIQGGSEKIVARSINKRLKQHLMGERELPKRDLEEVGL 1019 Query: 596 QELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLL---TY 652 E + E + D K+ + D ++ +R + ++ + Sbjct: 1020 DEATMKRLKRHFDENPMYADYNGEKVRMMNFDAMEPDLREIVGVAVRRMSGRLIQRNFIG 1079 Query: 653 KRGTRAGEAL-RMFQQFTTTPTGMFLNIL---DLSNSAKMPKGASMALNHVWIQYSATMA 708 G + + QF + L + + + + + + Y+ M Sbjct: 1080 DEGIWMNKWWGKALTQFKSFSIVSIEKQLIHDLRGDKIQAAQIMAWSSLLGFASYATQMQ 1139 Query: 709 LAGIGVASIKALLRGE--DPSLPEVIYDGTLANGALLPYMDRL----------------T 750 + IG LR + ++ +++ D Sbjct: 1140 MQAIGREDRDKFLREKFDTQNIAMGVFNKLPQVAGFGLAGDTFATFGLMPDSMMQAPGRM 1199 Query: 751 KLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMN 806 +G + G V S NL+ + V+ A D++ S +R+ +P N Sbjct: 1200 GFRQQGFGDLVAGAG--VISDAVNLSQALVKYANGDDDVSTRQLVDKVRRLVPLAN 1253 >gi|73542361|ref|YP_296881.1| hypothetical protein Reut_A2676 [Ralstonia eutropha JMP134] gi|72119774|gb|AAZ62037.1| Protein of unknown function UPF0118 [Ralstonia eutropha JMP134] Length = 388 Score = 42.7 bits (98), Expect = 0.25, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 48/141 (34%), Gaps = 13/141 (9%) Query: 607 RKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQ 666 R + + + ++ + AL V+ V G + T+ G+ + G A L Sbjct: 186 RAGMRRIAGERADHLLALAGSTVKGVVYGVLGTAFIQAVLQGIGLWIAGVPAAAILGFVT 245 Query: 667 QFTTT-----PTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALL 721 F + P L L + + + + +A+ G+ +K LL Sbjct: 246 FFLSVIPVGPPLVWLPAALWLYHGGETGWAIFLVVW--------GVAVVGMADNVVKPLL 297 Query: 722 RGEDPSLPEVIYDGTLANGAL 742 + +P + + GAL Sbjct: 298 ISKGTGMPLIWIMMGVLGGAL 318 >gi|312888776|ref|ZP_07748340.1| band 7 protein [Mucilaginibacter paludis DSM 18603] gi|311298776|gb|EFQ75881.1| band 7 protein [Mucilaginibacter paludis DSM 18603] Length = 647 Score = 42.7 bits (98), Expect = 0.26, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 65/178 (36%), Gaps = 28/178 (15%) Query: 11 KAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAI 70 +A + L+ +++ + E+ +++ +R + A + QKE++++ Sbjct: 429 EALMKTLTDRKIAQEEEKTYETQR------MAQVQRQGVEKETAIAEIQKEIVKAQQSV- 481 Query: 71 DEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEY 130 + + + + + G++ +L ++ +A + ++ E + A + ++ Sbjct: 482 --EIAQRTADAAVKKSE----GEATSLKLQVNAEAAATKMRAEAEADATRLRAGAQ---- 531 Query: 131 AEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLD 188 S L + + + KT +A +++ T Q G D Sbjct: 532 --AESTRLNASAEAEKI---------SKTGLAEAEKIMAIGKSTAEAYELQVKAMGGD 578 >gi|313674771|ref|YP_004052767.1| tex-like protein [Marivirga tractuosa DSM 4126] gi|312941469|gb|ADR20659.1| Tex-like protein [Marivirga tractuosa DSM 4126] Length = 748 Score = 42.7 bits (98), Expect = 0.27, Method: Composition-based stats. Identities = 40/318 (12%), Positives = 108/318 (33%), Gaps = 19/318 (5%) Query: 145 QFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDK 204 D +++ + E + +K+ + EL + + A K + +P Sbjct: 51 ADVRDRVQQLRDLDKRREAILKSIKEQEKLTPELEKEINAAETMAKLEDIYLPYKPKRRT 110 Query: 205 LRATKKDDFVRSMLDW----------LDLSRYKDIDGTPLSRSEIASFVGEVFAERVRST 254 ++ + + L+ +Y D + ++ AE Sbjct: 111 KATIAREKGLEPLAKLIFEQANIDLELEAGKYIDEEKAVADIESALHGARDIIAEWANEN 170 Query: 255 SFKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSK---D 311 + I + + +V K+++ Y ++F NV T + + ++ + + Sbjct: 171 AELREDIRELFLENGKFRSKVLSGKETEG-QKYKDYFEWEENVKTAPSHRILAMRRGEKE 229 Query: 312 IVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMR 371 +++ ++ P + + M + D EA+ K+ + L+ E ++++ Sbjct: 230 MILMLDISPEEEDALFIMEKHFVKADNEAAQQVKIALSDAYKRLLKPSMETEIRIF---- 285 Query: 372 YGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINK 431 + + + LR A+ +GQ + A+ + GF + L + + + Sbjct: 286 TKKKADEEAIKVFSDNLRQLLLAAPMGQKNVMAV-DPGFRTGCKLVCLDRQGKLLFNEAI 344 Query: 432 MPLKERMELLSDVGLYAE 449 P + + + L + Sbjct: 345 YPHEPQRQTAKAAALILQ 362 >gi|254480803|ref|ZP_05094049.1| peptidase, M48 family [marine gamma proteobacterium HTCC2148] gi|214038598|gb|EEB79259.1| peptidase, M48 family [marine gamma proteobacterium HTCC2148] Length = 644 Score = 42.7 bits (98), Expect = 0.30, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 59/216 (27%), Gaps = 40/216 (18%) Query: 678 NILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLPEVIYDGTL 737 ++ + +P A +A+ I + L G+ +L G + I + Sbjct: 143 RGINAFAAGIVPADAVVAVTRGTIDHLKRHELQGVIAHEFSHILNG---DMRLNIRLAAM 199 Query: 738 ANGALLP--YMDRLTKLVSK--GDRAAIGGLLGPVPSMVTNLTSSAVELATK-------- 785 G L + ++ R+ P+ + + LA Sbjct: 200 LKGITFIGDVGHILLRSNNRVRTGRSGKNDAALPMLGLALWILGWLGGLAAGFIKAAISR 259 Query: 786 --------------DNENSKVNATKAIRKTLPFMNMWYLKNS-FDHLILNQIL----EEL 826 + +A K I +P + + + H+ QI + Sbjct: 260 QKEYLADAGAVQFTRDSGGIADALKVIGGYIPGSLVHAARAAEMSHIFFGQIEHHLWQLF 319 Query: 827 N--PGYLDRQQSKKKKKGIELFQNMDEGLPHRLPFP 860 + P +R + + + Q P P P Sbjct: 320 STHPSLQERIRRLDARWDGQYIQR----QPKHYPNP 351 >gi|291231741|ref|XP_002735825.1| PREDICTED: vinculin-like [Saccoglossus kowalevskii] Length = 1356 Score = 42.7 bits (98), Expect = 0.31, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 69/215 (32%), Gaps = 13/215 (6%) Query: 15 RELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAY 74 + L+ + + I G+G+ +++ R + E+IR + + Sbjct: 198 KNLTPVLISGI--KIFVTTKQTGGRGVGESQENRNYVVTKMSQEIHEIIRVLQLTTYDEE 255 Query: 75 KRHQLR-SDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEV 133 + + + QA ++GK + L E I+ + Sbjct: 256 GWDVDDITVMKKAQAAIFGKGDLAKDWLSNPHAEPGGLGERSIRQIVDEARK--VGARCE 313 Query: 134 GSKN---LGFTLDKQFGLDVFDEMKGKKTQN-EQASRLVKQYFETQRELHSQAHEAGLDY 189 G + L D D E++ + N QA +L + + L + + A Y Sbjct: 314 GPEKDEILRLCDDITVMTDQLAELRARGEGNTPQAQQLARAIQDRVDYLTGRVNSAVAHY 373 Query: 190 KFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLS 224 R P P K+ ++ ++ S +D Sbjct: 374 AQSGIRKPAPTVSGKVEQAQQ--WLAS--PGVDDR 404 >gi|47208973|emb|CAF99051.1| unnamed protein product [Tetraodon nigroviridis] Length = 1202 Score = 41.9 bits (96), Expect = 0.52, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 40/153 (26%), Gaps = 25/153 (16%) Query: 166 RLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLD--- 222 + F + L A G + P + + T + + M ++ Sbjct: 119 EGKRIIFTGAKMLRKDAFSGGWE-GVTPGFQPYQHGLQSISVTTEKTWASGMTSTMEGDA 177 Query: 223 LSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFK--- 279 Y+ + +S K PS S + + K Sbjct: 178 RRTYQIPEDHGEDD-----------------SSEKTPSKASKSPQKSTKRPKTIPVKVSL 220 Query: 280 -DSQAHMDYMEHFGVSTNVNTILTSELASLSKD 311 D + +E F + ++ L L +D Sbjct: 221 LDGSDYEAAVEKFAKGQTLLDMVCGHLNLLERD 253 >gi|319950560|ref|ZP_08024469.1| hypothetical protein ES5_13288 [Dietzia cinnamea P4] gi|319435754|gb|EFV90965.1| hypothetical protein ES5_13288 [Dietzia cinnamea P4] Length = 498 Score = 41.5 bits (95), Expect = 0.71, Method: Composition-based stats. Identities = 58/361 (16%), Positives = 117/361 (32%), Gaps = 42/361 (11%) Query: 92 GKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVF 151 + + L +L A + ++ ++ A + A V + +L +Q G V Sbjct: 128 SRVETLTRQLEDLARDTDPSVQGRLAALHRERDRIDAAIARVEAGDLELADPEQVGEKVS 187 Query: 152 DEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKD 211 + ++G +R+ ++ ++L + + + + + + + Sbjct: 188 EILRGADDIPADFARVRAEFESLNQDLRRRLLDQDGARGDVLDAVFGGVDLIGDSEAGR- 246 Query: 212 DFVRSMLDWLDLSRYKDID---GTPLSRSEI-------ASFVGEVFAERVRSTSFKDPSI 261 F LD R +D LSR ++ + E+F E + + + Sbjct: 247 SFSSFYSVLLDPERSASVDTWIDDILSRPQVADLPPSARRGLRELFDEMETAGAEVN--- 303 Query: 262 PSSEVGVKREFERVF-HFKDSQAHMDYMEHFG-----VSTNVNTILTSELASLSKDIVIA 315 GV R HF S A++++ + + + S+ V Sbjct: 304 -----GVLTSLSRSLRHFVTSDAYVEHRQMLALIRSARAAAAEASGARAVKPTSQMSVPL 358 Query: 316 RELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGET 375 R +G V+ + + N E +V G LE + Sbjct: 359 RRVG----MSVRSVSALRLRNPGEERVAAEVAHHEEGHADLEA-----------LTALVR 403 Query: 376 VENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKEAIQRINKMPLK 435 A A +R S LG IGA+L + ++ + S VG+ AI + P Sbjct: 404 ASEIDEAELRAHVRD--VVSRLGPSSIGAILREHPATQGVASIVGLLNLAITTQAEAPPD 461 Query: 436 E 436 + Sbjct: 462 D 462 >gi|313904571|ref|ZP_07837946.1| Sigma 54 interacting domain protein [Eubacterium cellulosolvens 6] gi|313470541|gb|EFR65868.1| Sigma 54 interacting domain protein [Eubacterium cellulosolvens 6] Length = 732 Score = 41.1 bits (94), Expect = 0.76, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 73/260 (28%), Gaps = 37/260 (14%) Query: 142 LDKQFGLDV-------FDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN 194 DK + DEMK + + N +A+ E + E Sbjct: 307 EDKDTLRQIRDAVTVRLDEMKSEASGNGEAAASGDTVRAKSGEEKPEKDGEMAVVGQEEP 366 Query: 195 RIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKD------IDGTPLSRSEIASFVGEVFA 248 P+ + + ++ L LD + K DG+ I E Sbjct: 367 SRPEI--SLIVPVYNMEKWLSDFLTGLDAQKCKSLEVIFVDDGSTDGSGGILEEYQEGCK 424 Query: 249 ERV-----RSTSFKDPSIPSSEVGVKREFERVFHFKDSQAHMDY---MEHFGV-STNVNT 299 R T + G+ R F D M+ E +GV Sbjct: 425 SRKGWSVRILTQENQGVSAAKNAGLDAAKGRWLAFADPDDWMEADYLQEMYGVAMREDVD 484 Query: 300 ILTSELASLSKD-------IVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLKDWLG 352 ++ ++ D + ++ P+ +S + A +Q+ + K Sbjct: 485 VVICHERAVDADEHPSPEALGAISKMRPSPES------AEVDAEEQQRADAPKDDSVPGE 538 Query: 353 RNKLEVRQEAMLQMWEVMRY 372 ++E R+E + + Sbjct: 539 PLRIEERKELLRHFQDDFAG 558 >gi|332142305|ref|YP_004428043.1| hypothetical protein MADE_1014555 [Alteromonas macleodii str. 'Deep ecotype'] gi|327552327|gb|AEA99045.1| hypothetical protein MADE_1014555 [Alteromonas macleodii str. 'Deep ecotype'] Length = 2149 Score = 41.1 bits (94), Expect = 0.78, Method: Composition-based stats. Identities = 77/637 (12%), Positives = 180/637 (28%), Gaps = 81/637 (12%) Query: 170 QYFETQREL-HSQAHEAGLDYKFFENRIPQPMSVDKLRA--TKKDDFVRSMLDWLDLSRY 226 ++ E ++ A G ++ +P+ + + +R+ + D+S Sbjct: 1531 EFKEIMDDMWKYAAERMGGKLGKIDDYMPRIYDPEAIINDIEGFKAVLRNAMP--DISNA 1588 Query: 227 KDIDGTPLSRSEIASFVGEVFAE-RVRSTSFKDPSIPSSEVGVKREFERVFH------FK 279 K + +E + E+F + +R+ + S + + ER FK Sbjct: 1589 KMEEIIRTIIAEEGAISEELFEDSGLRAPGNDNVSTRMLKDIPESALERFMATPSHRLFK 1648 Query: 280 ---DSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN 336 + + +Y G V L + L ++ + N ++++ Sbjct: 1649 YIHKTTSRAEYETRAGAYNTVED-LENRLKRQAQTQYV------NPK-TLERVSEIAKNF 1700 Query: 337 DQEASAGNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASM 396 +E N+++ + + + + Y ++ W R Sbjct: 1701 REEVQNHNEMIASLEEQLLTHPDLSFKAALQDQIDYLKSNPPKPPEYWNPNGRIDEAIEK 1760 Query: 397 LGQHPIGAL--LEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYA-EGVVA 453 L + + +G++ R +S ++ Q + M + + + V Sbjct: 1761 LPEDRQKEARHIIEGYMGRLGISISPESRKLQQWMMAM------QYYTTLAFATISSVTD 1814 Query: 454 HGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDL 513 M G SK+ D + ++ IG + Sbjct: 1815 IANIMARGKVDSFGSMVKQSKVL-------FDAFKNRDDLELIARTIGVIQH-------- 1859 Query: 514 KADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARM 573 SI TD TV K G + + I + Sbjct: 1860 ----DTVTSIINQQYGGTFTDPTVQKWNDRFFRAIGLEWFTKTTRIMAMS--AGFHFIEE 1913 Query: 574 SDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSV 633 S H + N L+ + + Q++ + + DK ++ V Sbjct: 1914 SANNQRHGARFLNELGLTRDDVKYWQRKGSPKVSDGKDPGIDK--------IVAAVNQFA 1965 Query: 634 RGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNIL-DLSNSAKMPKGA 692 ++ ++ G+ G ++ Q + ++ ++ K + Sbjct: 1966 DESILRPNAAQRPTW------GSDLGFFHQLVWQLKSFYWAFGTTVIKGMAREIKARQRR 2019 Query: 693 SMALN-HVWIQYSATMALAGIGVA--SIKALLRGEDPSLPEVIYDGTL-------ANGAL 742 ++ + A + L G+ +K ++ + P G L Sbjct: 2020 GDSIPKSLTPLLFAGVPLMGLAAIGLELKEFIKYGNFEGPSAKMGAAAYTFELFDRAGGL 2079 Query: 743 LPYMDRLTKLVS--KGDRAAIGGLLGPVPSMVTNLTS 777 P L + + K + + LLGP + + Sbjct: 2080 GPA-SLLVGMYNAPKYGDSPLASLLGPTAEHIDSFFG 2115 >gi|315578927|gb|EFU91118.1| phage tail tape measure protein, TP901 family, core region [Enterococcus faecalis TX0630] Length = 767 Score = 41.1 bits (94), Expect = 0.88, Method: Composition-based stats. Identities = 72/506 (14%), Positives = 153/506 (30%), Gaps = 46/506 (9%) Query: 365 QMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQMLSRVGIDKE 424 Q+++ + G + G I + G++++ + +K Sbjct: 291 QLFDAVNRGAPQLKAMGLGFSESTTLIGQMEKAG---IDSAGTLGYLAKASVVYAKDNKT 347 Query: 425 AIQRINKM--------PLKERMELLSDV--GLYAEGVVAHGRNMMEGSDAFQIGHKLHSK 474 ++ +E++ + S+V A +V + D K + Sbjct: 348 MQDGLSGTIESIKGATTEQEKLTIASEVFGTKAASKMVEAIDSGALSMDGLADSAKNAAG 407 Query: 475 MHKWSG---AEYLDKKRISSH-ALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQL 530 + + +D+ +I+ + I ++G A L +A + +F L Sbjct: 408 TVDQTFNDILDPIDQAKIAQNQFKIAMGELGEQV-QIALLPAFEAASNAIQKVSTWFSGL 466 Query: 531 DDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTL 590 D I + + G + + ++ + + + ++ I L Sbjct: 467 TDNQKQTIITIAGVVAAIGPVLVVLGTLASSIS-SLIPVITFIASPIGIVIAALAAFVAG 525 Query: 591 SPEQRQELQQQLADLERKEINILKDKVS--NKMHALVLDNVQTSVRGAMHTSLFDRQRLG 648 ++ D N++KD V K+ A + + G + ++ ++ Sbjct: 526 IVIAYNKVGW-FRDFINASFNVIKDIVVGVFKVLADTTKSTFDFITGFIGGAMDGAAKII 584 Query: 649 LLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMA 708 G R+F TG+F D S + + + + A Sbjct: 585 ------GDYVNAIKRIFGGIVDFVTGVFT--GDWSRAWQGVVDIFGGIFEGIAAVAK--A 634 Query: 709 LAGIGVASIKALLRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPV 768 + I L G + G G + + L + + AI G GP Sbjct: 635 PINAMITLINGFLGGLNNIKIPKWVPGVGGKGFSIAQIPYLAEGGHMINGQAIVGEAGPE 694 Query: 769 PSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNP 828 N ++ L+ ++ A K K H+ Q+ + NP Sbjct: 695 LLTAKNGKTTVTPLSQEEKARGIGGALKGG------------KTIEQHVYFGQV-DANNP 741 Query: 829 GYLDRQQSKKKKKGIELFQNMDEGLP 854 LDR K K + F ++ G+P Sbjct: 742 SELDRMNRKLYKASAQAFYDLG-GVP 766 >gi|332530570|ref|ZP_08406507.1| 2-isopropylmalate synthase [Hylemonella gracilis ATCC 19624] gi|332039976|gb|EGI76365.1| 2-isopropylmalate synthase [Hylemonella gracilis ATCC 19624] Length = 565 Score = 41.1 bits (94), Expect = 0.93, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 57/171 (33%), Gaps = 22/171 (12%) Query: 4 ECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELI 63 + +Q + A+G+ELS +++ ++ + ++ E AG++ ++ Sbjct: 410 QVVQAVMDASGKELSARDIHQVFLREYGLNEVSAPRYRAQEEGENAAGVRTTTLQADVVL 469 Query: 64 RSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKV 123 AI + G +A L AG + L+ A + Sbjct: 470 EGKALAI----------------EGAGNGPIEAFVEGLATAAGESIRVLDYHEHAVGSGA 513 Query: 124 LSKFNEYAE--VGSKN-LGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQY 171 ++ Y E VG + G +D + + +A R +Q Sbjct: 514 NAQAVAYLELRVGERTLFGVGMDANIVSASLKAIV---SGLLRARRGAEQV 561 >gi|134287713|ref|YP_001109879.1| transposase Tn3 family protein [Burkholderia vietnamiensis G4] gi|134132363|gb|ABO60098.1| transposase Tn3 family protein [Burkholderia vietnamiensis G4] Length = 989 Score = 40.7 bits (93), Expect = 1.0, Method: Composition-based stats. Identities = 53/332 (15%), Positives = 94/332 (28%), Gaps = 50/332 (15%) Query: 12 AAGRELSKKELRRLEDGIVRAYVS----LDGKGLSKAERYRLAGLKAEEDFQKELIRSVN 67 A L+ RRL+D + R L S + L+ E + + Sbjct: 180 ALAEPLTDVHRRRLDDLLKRRDNGKTTWLAWLRQSPVKPNSRHMLEHIERLKAWQALDLP 239 Query: 68 DAIDEAYKRHQLRSDLDRVQAGVYGK-----SQALFNKLFFKAGSAEVPLEMKIKAAETK 122 I+ +++L Q + L A + +I + Sbjct: 240 SGIERLVHQNRLLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDR 299 Query: 123 VLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKG----------KKTQNEQAS------- 165 +L K A+ + K V M G + + A+ Sbjct: 300 ILGKLFNAAKNKHQQQFQASGKAINDKV--RMYGRIGQALLEAKQSGGDPFAAIEAVMPW 357 Query: 166 -RLVKQYFETQRELHSQ----AHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDW 220 E Q+ + H G +Y PQ + V KLRA +LD Sbjct: 358 DTFAASVTEAQKLAQPESFDFLHRIGENYTTLRRYAPQFLDVLKLRAAPAAK---GVLDA 414 Query: 221 LDLSRYKDIDGTPLSRSEI-ASFVGEVFAERVRSTSFKDP-----------SIPSSEVGV 268 +D+ R + D ++ +F+ +A+ V + D V Sbjct: 415 IDVLRDMNNDNARKVPADAPTAFIKPRWAKLVLTDDGIDRRYYELCALSELKNALRSGDV 474 Query: 269 KREFERVFHFKDSQAHMDYMEHFGVSTNVNTI 300 + R FKD ++ E F + + Sbjct: 475 WVQGSRQ--FKDFDEYLVPAEKFATLKLASEL 504 >gi|269115086|ref|YP_003302849.1| Lmp related protein [Mycoplasma hominis] gi|268322711|emb|CAX37446.1| Lmp related protein [Mycoplasma hominis ATCC 23114] Length = 1366 Score = 40.7 bits (93), Expect = 1.0, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 10/165 (6%) Query: 52 LKAEEDFQKELIRSVNDAIDE-AYKRHQLRSDLDRVQAGVYGKSQA--LFNKLFFKAGSA 108 KA +D QK + + A + K+ QL + +A K Q +FN Sbjct: 172 KKATQDLQKLIDAAKEKAKQDFNSKKQQLDDLIKSNEAKDVDKQQETGIFNNTNLSGNDL 231 Query: 109 EVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLD-KQFGLDVFDEMKGKKTQNEQASRL 167 +E K K E + S + + L D K+ D+ D G+K +A++ Sbjct: 232 IKDIESKTKTIEDAIKSLTKKINDKKDSLLNDFNDAKKKLQDLIDSQDGQKVDTSKANQS 291 Query: 168 VKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDD 212 ++ Q +A + K + KL K+ Sbjct: 292 LQNNNVDASSTTDQIVDATTEIKKA------TQDLQKLIDAAKEK 330 Score = 39.6 bits (90), Expect = 2.3, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 56/164 (34%), Gaps = 10/164 (6%) Query: 52 LKAEEDFQKELIRSVNDAIDE-AYKRHQLRSDLDRVQAGVYGKSQA--LFNKLFFKAGSA 108 KA +D QK + + A + K+ QL + +A K Q +FN Sbjct: 314 KKATQDLQKLIDAAKEKAKQDFNSKKQQLDDLIKSNEAKDVDKQQETGIFNNTNLSGNDL 373 Query: 109 EVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLD-KQFGLDVFDEMKGKKTQNEQASRL 167 +E K K E + S + + L D K+ D+ D G+K +A++ Sbjct: 374 IKDIESKTKTIEDAIKSLTKKINDKKDNLLKDFNDAKKQLEDLIDSQDGQKVDTSKANQS 433 Query: 168 VKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKD 211 ++ Q A + K + KL K+ Sbjct: 434 LQNNNADASSTTDQIVNATNEIKKA------TQDLQKLIDAAKE 471 >gi|187476939|ref|YP_784963.1| hypothetical protein BAV0432 [Bordetella avium 197N] gi|115421525|emb|CAJ48034.1| phage-related protein [Bordetella avium 197N] Length = 1129 Score = 40.7 bits (93), Expect = 1.1, Method: Composition-based stats. Identities = 66/508 (12%), Positives = 149/508 (29%), Gaps = 52/508 (10%) Query: 312 IVIARELGPNADSF----VKQMIVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMW 367 +A+ N ++M+ A E A L ++ M + Sbjct: 419 TEVAKIAADNPSEANLFAFRKMLATHYAIQNEVIAARTETARALASWRIPAGS-GMERFA 477 Query: 368 EVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQ-MLSRVGIDKEAI 426 ++ + + + MA + L Q + L+ SR + + Sbjct: 478 QIENALRSSGDLDLSREMAT-----RIAALSQAGMHRELDQIVRGSVWARSRDAFLEAWV 532 Query: 427 QRINKMPLKERMELLSDVGLYAEGV-----VAHGRNMMEGSDAFQIGHKLHSKMHKWSGA 481 + P + ++S+ + + + A ++ Q+G SG Sbjct: 533 NGLLSSPPTHLVNMMSNTSVIFQQMYERAAAAQISRILGVDGGVQLGEATAQLFGMLSGF 592 Query: 482 EYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRA 541 + D R S+ + + M ++D + + + K + Sbjct: 593 K--DALRYSAKSFLTNETGYGM-------------GKIDLPRA---RAISAEAWGQAKDS 634 Query: 542 KAMSSPDGYLYART-PSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQ 600 S D T P +D + L + A ++ + + ++++ Sbjct: 635 PLGRSLDVLGAVVTMPGRALGAEDEFFKTLGYRMELNALAVRRATHEVNSGIIRSDQVKE 694 Query: 601 QLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGE 660 ++A + L+ + ++ + + A+ + G L Sbjct: 695 RVAAIVSDPPTDLRLEAIDQATYQTFTSAPGELTKAITRGVNSVPLAGRLILPFVRTPAN 754 Query: 661 ALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKAL 720 L+ F TP + + A + G + + + ++ +A ++ + Sbjct: 755 ILKY--SFERTPLAPLMAHVR----ADIAAGGARRDIALARITTGSLLMATAADMAMSGV 808 Query: 721 LRGEDPSLPEVIYDGTLANGALLPY----MDRLTKLVSKGDRAAIGGLLGPVPSMVTNLT 776 L G PS + PY DR ++ IG LG MV L Sbjct: 809 LTGRGPSDRR--ERQAMERSGWQPYSIKVGDRYFA-YNR--LDPIGTSLGLSADMVEILA 863 Query: 777 SSAVELATKDN--ENSKVNATKAIRKTL 802 + + A D E ++ +I + Sbjct: 864 NMDDDEALGDAEVERTQAAIVMSIANNV 891 >gi|4262427|gb|AAD14632.1| putative transmembrane protein MttP [Methanosarcina barkeri] Length = 353 Score = 40.7 bits (93), Expect = 1.2, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 64/187 (34%), Gaps = 19/187 (10%) Query: 615 DKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRA--GEALRMFQQFTTTP 672 D++S+ + D + V + T+ + L G GE +R ++F P Sbjct: 51 DEMSSAIAKTSGDGISLVVTAVLITAFNALAVMLALMVWNGVLGKYGELVRTLKEFH--P 108 Query: 673 TGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLR----GEDPSL 728 + + + G+ +A+ + ++A +AG+ + ++L GE S Sbjct: 109 CSKWFFLASIFGGPMAILGSFIAMGFIGGSFAA---VAGLLYPVVGSILAYYWYGEKISK 165 Query: 729 PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNE 788 I + G + Y L +S G+ P + L ++A Sbjct: 166 RAAIGIAVIVLGGISIYGGGLFTELSSGNV--------PWIGYLGGLMAAAGWGIEGAIA 217 Query: 789 NSKVNAT 795 ++ Sbjct: 218 GKGLDIA 224 >gi|307313333|ref|ZP_07592956.1| conserved hypothetical protein [Escherichia coli W] gi|306906755|gb|EFN37265.1| conserved hypothetical protein [Escherichia coli W] gi|315063816|gb|ADT78142.1| conserved hypothetical protein [Escherichia coli W] gi|323380955|gb|ADX53222.1| hypothetical protein EKO11_4671 [Escherichia coli KO11] Length = 258 Score = 40.3 bits (92), Expect = 1.4, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 47/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A+R R A + R Sbjct: 107 AGEWLTEDEIRAVLDAVRDAVRSVSCRVAEDAQRIRAALTTTGQTLLTRQTRR----FRL 162 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 163 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSAVEMYLVSECVEHILSSGLACD 217 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 218 VLRIPDEPPRRWFDRGVLREVVREARAE 245 >gi|73853259|ref|YP_308755.1| hypothetical protein LH0091 [Escherichia coli] gi|73476843|gb|AAZ76458.1| hypothetical protein LH0091 [Escherichia coli] Length = 256 Score = 40.3 bits (92), Expect = 1.5, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 45/148 (30%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A R R A + R Sbjct: 105 AGEWLTEDEIRAVLDAVRDAVRSVSCRVAEDARRIRAALTTTGQTLLTRQTRR----FRL 160 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIK---AAETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + Sbjct: 161 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSSVEMYLVCECVEHILASGLVCD 215 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 216 VLRIPDEPSRRWFDRDILREVVLEARDE 243 >gi|323158249|gb|EFZ44335.1| ychA ta [Escherichia coli E128010] Length = 256 Score = 40.3 bits (92), Expect = 1.5, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 46/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A R R A + R Sbjct: 105 AGEWLTEDEIRAVLDAVHDAVRSVSCRVAEDARRIRAALTTTGQTLLTRQTRR----FRL 160 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 161 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSSVEMYLVSECVEHILSSGLACD 215 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 216 VLRIPDEPSRRWFDRDILREVVMEARNE 243 >gi|113475196|ref|YP_721257.1| chromosome segregation ATPase-like protein [Trichodesmium erythraeum IMS101] gi|110166244|gb|ABG50784.1| Chromosome segregation ATPase-like protein [Trichodesmium erythraeum IMS101] Length = 1209 Score = 40.3 bits (92), Expect = 1.5, Method: Composition-based stats. Identities = 79/629 (12%), Positives = 200/629 (31%), Gaps = 52/629 (8%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQK 60 M E KA L +++ R + D LS + L E ++ Sbjct: 240 MYQELEARSWKAEDETLDFSWQEKIKSSPYRNWAFQDWMNLSSLGKQNKILLVELEKYKN 299 Query: 61 ELIRS-------VNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFN--KLFFKAGSAEVP 111 + +S + I + + + LD +A + Q L N K++ K+ Sbjct: 300 QDEKSQLELTEVKSQLIQIQDELEKYITQLDGTEAKLSESQQQLHNKEKVYEKSQLELTE 359 Query: 112 LEMKIKAAETK---VLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLV 168 ++ ++ + +S+ N S++ +K+ +K+Q + + + Sbjct: 360 VKSQLTKTQDDLEKYVSQLNGTEAKLSESQQQLHNKEKVY--------EKSQ-LELTEVK 410 Query: 169 KQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDF--VRSMLDWLDLSRY 226 Q +TQ +L + Q + +K+ +D+F V+ + D ++ Sbjct: 411 SQLTKTQDDLEKYVSQLNGTEAKLSESQQQLHNKEKVLEKTQDEFQKVQQIQTKFDQTKN 470 Query: 227 KDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKREFERVFHFKDSQA-HM 285 + + + + + + K E ++Q + Sbjct: 471 ELATAKSQLNETKTELIQCQSELKEKEGELQK-----YQGTQKELLETQSKLDETQGELV 525 Query: 286 DYMEHFGVSTNVNTILTSELASLSKDIVIAR---ELGPNADSFVKQMIVQTIANDQEASA 342 Y + N+ + + ++ +L N + K + Sbjct: 526 QYQSQ--LHQNLEELEKNICKLQEAELAWKELKFQLETNEELLDKFKFQDKQNQAELGQT 583 Query: 343 GNKVLKDWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPI 402 + + + + + + + WE + + + L+ A A + Sbjct: 584 KHSLYETKIKLKTSQNQLHKTQEFWESSQSQLVAKEVVLKKYQQDLQDAEKALEDTYSQL 643 Query: 403 GALLEDGFISRQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGS 462 + ++RQ LS + I + +E E E ++ Sbjct: 644 QRTQIELGVTRQNLSESK-GELFIYKYQLHQSQEEWEKYQSQLAGTEVLLEE-------- 694 Query: 463 DAFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPS 522 HS++ + + + + +++ I+ + +T++ + L+ +K + S Sbjct: 695 --------YHSQLKQATEQKQQTQSKLTETEAILQAKEAELTESNSELEKIKLELERSGS 746 Query: 523 IKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRK 582 Q + + + +K+A+ T I K+A+L + +KI + Sbjct: 747 DLQKTHQEVEKNQSQLKQAEEQKQQTQSKLTET-EAILQAKEAELTESNSELEKIKLELE 805 Query: 583 KLKNSKTLSPEQRQELQQQLADLERKEIN 611 + + + ++ Q++Q QL + Sbjct: 806 RSGSDLQKTHQELQQIQSQLNQTQADLTE 834 >gi|58000337|ref|YP_190172.1| YchA [Escherichia coli] gi|57903237|gb|AAW58867.1| YchA [Escherichia coli] Length = 258 Score = 40.0 bits (91), Expect = 1.7, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 47/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A+R R A + R Sbjct: 107 AGEWLTEDEIRAVLDVVRDAVRSVSCRVAEDAQRIRAALTTTGQTLLTRQTRR----FRL 162 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 163 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSAVEMYLVSECVEHILSSGLACD 217 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 218 VLRIPDEPPRRWFDRGVLREVVREARAE 245 >gi|288957672|ref|YP_003448013.1| hypothetical protein AZL_008310 [Azospirillum sp. B510] gi|288958885|ref|YP_003449226.1| lytic transglycosylase [Azospirillum sp. B510] gi|288909980|dbj|BAI71469.1| hypothetical protein AZL_008310 [Azospirillum sp. B510] gi|288911193|dbj|BAI72682.1| lytic transglycosylase [Azospirillum sp. B510] Length = 2889 Score = 40.0 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 91/834 (10%), Positives = 200/834 (23%), Gaps = 55/834 (6%) Query: 14 GRELSK------KELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVN 67 GR+ ++ E + D L AER E +S Sbjct: 949 GRKPTQRDGLPLDEALTISDLTAANDRLLAVMQKVGAERVIETARVQAEMAATNAGKSAK 1008 Query: 68 DA-----IDEAYKRHQLRSDLDRVQAGVYGKSQA--LFNKLFFKAGSAEVPLEMKIKAAE 120 A R QL +D + L + + ++ +A E+ A Sbjct: 1009 AAELEGIQAARMARAQLVQAVDDSNRAAAAEVAGADLVARAYGQSTAAVREAEIHQNALA 1068 Query: 121 TKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHS 180 Y + S L D Q + K Q + A RL + Sbjct: 1069 EVARGTIEPYDAIVS-RLRAVDDAQRKVQAAQFDATLKQQTDDALRLADAWGRGANAARE 1127 Query: 181 QAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIA 240 + + + +++ + R + + E+A Sbjct: 1128 AGLANEALAEARKRGLAPTKDAGQIQDIGRGILARDAARR--SQEFAQMAAEQRRAVELA 1185 Query: 241 S----FVGEVFAERVRSTS-FKDPSIPSSEVGVKREFERVFHFKDSQAHMDYMEHFGVST 295 + +G+ AER ++ + + + + + + + + + Sbjct: 1186 NAEFGMLGQSNAERAKAVAILQTTNTLRDKGVDLTDAGTQAYIRQAGELARVNSQLQDAA 1245 Query: 296 NVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIAN---DQEASAGNKVLKDWLG 352 L + + +D+V+ + +A + + + + + A L Sbjct: 1246 QNAANLAQPITTAFEDVVVGAKKAGDAGKALAEDLKRVFFRATVTKPAETWLTGTLTKLM 1305 Query: 353 RNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFIS 412 + + + + + + +A G AL Sbjct: 1306 SGPIGAANDNAPRPANDPGSLSRIVTSVSGGLGSSPSNAMWVQQAGSAAAVALDPQALTG 1365 Query: 413 RQMLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLH 472 R L D ++ + + + V L + + R + Sbjct: 1366 RAPLPVAIRDGGQVEDLLR-SEARAQGVPEAVALAIGKLESGFRQHRDDGRLLTSSAGAQ 1424 Query: 473 S------KMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAF 526 KW G + D + + + + + ++ A + Sbjct: 1425 GVMQLMPATAKWLGVDATDTRENVRGGI---KYLAMLGRQFGGDWNMVAAAYNAGPTRVQ 1481 Query: 527 FKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKN 586 A T + + + ++ Sbjct: 1482 QYLTQGRALPTETVTYVERFGKSVQTANTAVEGMAARAEGAATSQAATTQNLTTAQRDAV 1541 Query: 587 SKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQR 646 S LS + + AD + + D VS + + + +GA+ + +Q Sbjct: 1542 SAALSTVKSMDSVSTAADDIDARLGMASDGVSTAAEK-LTAAQKEAAQGAVTFAQSSQQA 1600 Query: 647 LGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDLSNSAKMPKGASMALNHVWIQYSAT 706 + G L + + + + G + + + Q + Sbjct: 1601 GDFMVDGTQQALGALLSVIG------AASGIKGAGIP-GQVVQAGGPQGIANSFSQLGSL 1653 Query: 707 MALAGIG-----VASIKALLRGEDPSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAI 761 + GI ++S+K L P L + Y A L Sbjct: 1654 LKTDGIFGSNSAISSVKGFLNTPIPGLSNIGYTAPQAAAVAKTSTGTLAGGEGASTGTGA 1713 Query: 762 GGLLGPVPSMVTNLTSSAVELAT--------KDNENSKVNATKAIRKTLPFMNM 807 GP N+ A I PF + Sbjct: 1714 QAAGGPTWGNALGAVGYGFNAFQNFRSGNVIGGIGNTAATAMMFIPGAQPFAPL 1767 >gi|73669023|ref|YP_305038.1| trimethylamine permease [Methanosarcina barkeri str. Fusaro] gi|72396185|gb|AAZ70458.1| trimethylamine permease [Methanosarcina barkeri str. Fusaro] Length = 348 Score = 40.0 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 64/187 (34%), Gaps = 19/187 (10%) Query: 615 DKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRA--GEALRMFQQFTTTP 672 D++S+ + D + V + T+ + L G GE +R ++F P Sbjct: 46 DEMSSAIAKTSGDGISLVVTAVLITAFNALAVMLALMVWNGVLGKYGELVRTLKEFH--P 103 Query: 673 TGMFLNILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGVASIKALLR----GEDPSL 728 + + + G+ +A+ + ++A +AG+ + ++L GE S Sbjct: 104 CSKWFFLASIFGGPMAILGSFIAMGFIGGSFAA---VAGLLYPVVGSILAYYWYGEKISK 160 Query: 729 PEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNE 788 + + G + Y L +S G+ P + L ++A Sbjct: 161 RAAVGIAVIVLGGISIYGGGLFTELSSGNV--------PWIGYLGGLMAAAGWGIEGAIA 212 Query: 789 NSKVNAT 795 ++ Sbjct: 213 GKGLDIA 219 >gi|308044467|ref|NP_001183573.1| hypothetical protein LOC100502166 [Zea mays] gi|238013152|gb|ACR37611.1| unknown [Zea mays] Length = 239 Score = 40.0 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 38/132 (28%), Gaps = 1/132 (0%) Query: 56 EDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMK 115 + F +E I+S+ K R L + Q +A LE Sbjct: 33 KRFSEEQIKSLESMFATQTKLEP-RQKLQLARELGLQPRQVAIWFQNKRARWKSKQLERD 91 Query: 116 IKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQ 175 A + Y + + + ++ E +GK + N A+ Sbjct: 92 YSALRDDYDALLCSYESLKKEKHTLLKQLEKLAEMLHEPRGKYSGNADAAGAGDDVRSGV 151 Query: 176 RELHSQAHEAGL 187 + + +AG Sbjct: 152 GGMKDEFADAGA 163 >gi|289976628|gb|ADD21673.1| internal virion protein [Caulobacter phage Cd1] Length = 1333 Score = 40.0 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 55/170 (32%), Gaps = 16/170 (9%) Query: 661 ALRMFQQFTTTPTGMFLNILDL----SNSAKMPKGASMALNHVWIQYSATMALAGIGVAS 716 L+M QF T + K A++ + + A M + +G+ Sbjct: 1147 LLKMLFQFRTFSLTSVEKQWGRNMANHGALKSFGILVAAMSFAFPIHYARMQIKMLGMNE 1206 Query: 717 I-KALLRGEDPSLPEVIYDGTLANGALLPYMDRL----------TKLVSKGDRAAIGGLL 765 + ++ S + A D + AIG Sbjct: 1207 EDREKFAEKNLSAAALWRSTINYASASGLLGDLADVGGGFVAGWGGDNGELFADAIGARG 1266 Query: 766 GPVPSMVTNLTSSAVELATKDNENSKVNATKAIRKTLPFMNMWYLKNSFD 815 G ++ + + ++ L + E + + KAI+ +PF N+ YL+ + Sbjct: 1267 GNQNQLLGGVLAPSLGLVQQAWEAANGDPHKAIKA-MPFANLPYLQPLVN 1315 >gi|260751943|ref|YP_003232481.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257757306|dbj|BAI28806.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] Length = 256 Score = 40.0 bits (91), Expect = 2.1, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 46/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A ++ + A R R A + R Sbjct: 105 AGEWLTEDEIRAVLDAVRDAVRTVSCRVAEDARRIRAALTTTGQTLLTRQTRR----FRL 160 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 161 VVKESDHPCWLDEDDENLPVVLDAIVNR-----GARFSSVEMYLVSECVEHILSSGLACD 215 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 216 VLRIPDEPPRRWFDRGVLREVVREARAE 243 >gi|209922002|ref|YP_002296075.1| hypothetical protein ECSE_P1-0050 [Escherichia coli SE11] gi|209915180|dbj|BAG80253.1| conserved hypothetical protein [Escherichia coli SE11] Length = 258 Score = 39.6 bits (90), Expect = 2.3, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 47/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A+R R A + R Sbjct: 107 AGEWLTEDEIRAVLDAVRDAVRSVSCRVAEDAQRIRAALTTTGQTLLTRQTRR----FRL 162 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 163 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSAVEMYLVSECVEHILSSGLACD 217 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 218 VLRIPDEPPRRWFDRGVLREVVREARTE 245 >gi|283784782|ref|YP_003364647.1| helicase IV [Citrobacter rodentium ICC168] gi|282948236|emb|CBG87803.1| helicase IV [Citrobacter rodentium ICC168] Length = 684 Score = 39.6 bits (90), Expect = 2.3, Method: Composition-based stats. Identities = 31/259 (11%), Positives = 83/259 (32%), Gaps = 28/259 (10%) Query: 3 PECIQVLNKAAGRE--LSKKELRRLEDGIVRAYVSLDGK--GLSKAERYRLAGLKAEEDF 58 + ++ + G+ L++++ ++ I+RA+ +L L + + R A + + Sbjct: 104 QQQLEAIAARTGQHAWLTREQTAGVQQQILRAFAALPLPLNRLLELDNCREALKQCQAWL 163 Query: 59 QK-ELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALF-----NKLFFKAGSAEVPL 112 + + R ++ ++ +V++ +QA L AG+ Sbjct: 164 KDIDACRLAHNQAYTDAMLNEYAEFFRQVESSPLNPAQARAVVNGERALLVLAGAG---- 219 Query: 113 EMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDV---FDEMKGKK-----TQNEQA 164 K + + L +Q ++ E T + A Sbjct: 220 SGKTSVLVARAGWLLARGEAAAEQILLLAFGRQAAQEIDERIRERLHTDAITARTFHALA 279 Query: 165 SRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLS 224 +++Q + + +++ +K F + Q S K A ++ + W Sbjct: 280 LHIIRQGSKKAPTVSKLENDSAARHKLFISAWRQQCSEKKAHAKGWRQWLEEEMQW---- 335 Query: 225 RYKDIDGTPLSRSEIASFV 243 +G ++ + Sbjct: 336 --TVAEGNFWDDEKLQRRL 352 >gi|257789840|ref|YP_003180446.1| putative ATP-binding protein [Eggerthella lenta DSM 2243] gi|257473737|gb|ACV54057.1| putative ATP-binding protein [Eggerthella lenta DSM 2243] Length = 1136 Score = 39.6 bits (90), Expect = 2.6, Method: Composition-based stats. Identities = 59/367 (16%), Positives = 111/367 (30%), Gaps = 33/367 (8%) Query: 8 VLNKAAGRELSKKELR-RLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRS- 65 + A E++ KE + R+ + A+ L+ + F EL +S Sbjct: 663 AVATNATAEITAKEQECQALCRTERSLRDKHWEDYDNAQAAF--DLERAQAFYDELAQSD 720 Query: 66 --VNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIK------ 117 A + +L VQ + Q + S +E +I Sbjct: 721 AFREAESRRATAQGRLDEANKAVQKALVN--QQTNEERIQDTRSDIAEVERRINKRNPSG 778 Query: 118 -AAETKVLSKFNEYAEVGSKNLGFTLD--KQFGLDVFDEMKGKKTQNEQASRLVKQYFET 174 A + + ++F + + Q DV + + +A+R + Sbjct: 779 IAMDDETRAQFIDLFSSANDRFDSDTSLVYQTSNDVQRIL---DARVAKAARAQQDARRR 835 Query: 175 QRELHSQAHE-----AGLDYKFFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDI 229 + Q A FE+R ++RA+ + R LD L + + Sbjct: 836 TELVLQQYKSTWKLLAADLSASFEDRDAYIGRYRQIRASGLPQYERKFLDVL--NSFSQD 893 Query: 230 DGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEV--GVKREFERVFHFKDSQAHMDY 287 T +S SEI + EV V S SS + ++ + R + + Sbjct: 894 QITAIS-SEIRNAFREVRDRLVPVNRSLLLSEFSSGIHLQIEVKEHRSLRVNE---FLAD 949 Query: 288 MEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVL 347 ++ + L + ++ I + LG N S + D +V Sbjct: 950 LKEITRGSWEEDDLEAAERRYARTAAIMKRLGSNDRSDQTWRMACLNTPDHMKFIAKEVA 1009 Query: 348 KDWLGRN 354 D N Sbjct: 1010 GDGAVVN 1016 >gi|13449152|ref|NP_085368.1| hypothetical protein pWR501_0214 [Shigella flexneri 5a] gi|31983666|ref|NP_858335.1| hypothetical protein CP0202 [Shigella flexneri 2a str. 301] gi|13310700|gb|AAK18524.1|AF348706_213 orf, hypothetical [Shigella flexneri 5a] gi|12329116|emb|CAC05847.1| unnamed protein product [Shigella flexneri] gi|18462658|gb|AAL72430.1| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301] gi|281603961|gb|ADA76944.1| hypothetical protein SFxv_5049 [Shigella flexneri 2002017] gi|333006543|gb|EGK26044.1| hypothetical protein SFK218_1316 [Shigella flexneri K-218] Length = 256 Score = 39.6 bits (90), Expect = 2.7, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 51/148 (34%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ +G A R R A + + R Sbjct: 105 AGEWLTEDEIRAVLDAVRDAVCSVSCRGAEDARRIRAALTTSGQTLLTRQTRR----FRL 160 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA--AETKVLSKFN-E 129 K LD + A+ N+ G+ +EM + + E + S + Sbjct: 161 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSAVEMYLVSDCIEHILSSGLACD 215 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 216 VLRIPDEPPRRWFDRGVLREVVREARAE 243 >gi|260779191|ref|ZP_05888083.1| chromosome partition protein MukB [Vibrio coralliilyticus ATCC BAA-450] gi|260605355|gb|EEX31650.1| chromosome partition protein MukB [Vibrio coralliilyticus ATCC BAA-450] Length = 1486 Score = 39.2 bits (89), Expect = 2.9, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 57/158 (36%), Gaps = 2/158 (1%) Query: 43 KAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLF 102 + + +LA + D Q+ A+ K L D D + L N+ Sbjct: 393 DSLKTQLADYQQALDVQQTRALQYQQAVQALEKAKTLLGDEDLTAERAHSLVSELKNQES 452 Query: 103 FKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNE 162 +A + ++ K+ + + +F + K G +++++ +V E + E Sbjct: 453 EST-AALLSVKHKLD-MSSAAVEQFETALTLVRKIAGDSVERKNAAEVAKESIRQARDAE 510 Query: 163 QASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPM 200 Q ++ +Q+ R+L ++ + + Q Sbjct: 511 QIAQNEQQWRAQHRDLERNLNQQRQACELVDAYQKQHH 548 >gi|308477855|ref|XP_003101140.1| CRE-MYO-2 protein [Caenorhabditis remanei] gi|308264068|gb|EFP08021.1| CRE-MYO-2 protein [Caenorhabditis remanei] Length = 1960 Score = 39.2 bits (89), Expect = 2.9, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 62/183 (33%), Gaps = 14/183 (7%) Query: 10 NKAAGRELSKKELRRLEDGIVRAYVSLDGK-GLSKAERYRLAGLKAEEDFQKELIRSVND 68 A +L K ++ LED + + K +R LK ++ +EL +S +D Sbjct: 1031 QNLAANKLKAKLMQSLEDSEQTMEREKRNRADMDKNKRKAEGELKIAQETLEELNKSKSD 1090 Query: 69 AIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFN 128 A + ++ +L QA KL E ++K + Sbjct: 1091 AENALRRKETELHNL----GMKLEDEQAAVAKL----QKGIQQDEARVKDLHD----QLA 1138 Query: 129 EYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQAS-RLVKQYFETQRELHSQAHEAGL 187 + + + D+Q D E +++ A L K+ +L E+GL Sbjct: 1139 DEKDARQRADRSRADQQAEYDELTEQLEDQSRATAAQIELGKKKDAELTKLRRDLEESGL 1198 Query: 188 DYK 190 + Sbjct: 1199 KFG 1201 >gi|226201026|ref|YP_002756638.1| hypothetical protein p026VIR_p087 [Escherichia coli] gi|219881655|gb|ACL52025.1| hypothetical protein [Escherichia coli] Length = 256 Score = 39.2 bits (89), Expect = 2.9, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 49/148 (33%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A R R A + R Sbjct: 105 AGEWLTEDEIRAVLDAVRDAVCSVSCRVAEDARRIRAALTTTGQTLLTRQTRR----FRL 160 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKI--KAAETKVLSKFN-E 129 K LD + A+ N+ G+ +EM + + E + S + Sbjct: 161 VVKESDHPCWLDEDDENLPVVLDAILNR-----GARFSSVEMYLVCECVEHILSSGLACD 215 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 216 VLRIPDEPSRRWFDRDILREVVREARAE 243 >gi|239907128|ref|YP_002953869.1| hypothetical protein DMR_24920 [Desulfovibrio magneticus RS-1] gi|239796994|dbj|BAH75983.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 3195 Score = 39.2 bits (89), Expect = 3.2, Method: Composition-based stats. Identities = 54/482 (11%), Positives = 139/482 (28%), Gaps = 50/482 (10%) Query: 38 GKGLSKA-ERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQL-----------RSDLDR 85 + + A +R A L A + A+ + + DR Sbjct: 2293 WRAMRAAYDRLLDARLAAYRRLVDKARAGYARAVTRRLIEAGIPVEAARAFDAAAYNADR 2352 Query: 86 VQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE-----TKVLSKFNEYAEVGS-KNLG 139 + A + A V ++ ++ S +G + Sbjct: 2353 ILADALAPYADKVKAVVAAARKDGVTIKGRLADVRLTDENGTTFSFAEMVERMGQLRGFY 2412 Query: 140 FTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQP 199 ++ G V + +E+ R K++ + +L + AG + Sbjct: 2413 APRLREAGDFVVRGRRTGTDGSEERFRAHKEWRRSAEKLRLEMARAGWA-------MDTV 2465 Query: 200 MSVDKLRATKKDDFVRSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDP 259 ++KL + ++ L+L++ + + A VGE+ + Sbjct: 2466 TRLEKLPEATQG-----VIKTLELAKTVETAVNQVGEDVEAGLVGEILEALADEVKARGF 2520 Query: 260 SIPSSEVGVKREFERVFHFKDSQ---AHMDYMEHFGVSTNVNTILTSELASLSKDIVI-A 315 S + +FKD+ + +G++ + + + Sbjct: 2521 RSQSIRRSGRHGEVVQGYFKDAVERFSRYAGSTAYGLAKAEAAQKAATALFATDGQGLDI 2580 Query: 316 RELGPNADSFVKQMIVQTIANDQEASAGNKV---LKDWLGRNKLEVRQEAMLQMWEVMRY 372 R+ G + + N + A AG++V K L ++ L M Sbjct: 2581 RKEG----EVYRLAVDYLAENLRNAEAGDRVFALAKSMASLKYLGFNAKSALVNLTSMAT 2636 Query: 373 GETVENTGWANWMAGLRSAAGASMLGQHPIGALLE-DGFISRQMLSRVGIDKEAIQRINK 431 +A G G + +G+ + A+ + G ++ + ++ + + + Sbjct: 2637 SVPAALHAYAMAGKG-----GWARIGREIVRAMGDYLGLMAGRSGRLTAGERAFMAQARR 2691 Query: 432 MPLKERMELLSDVGLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAEYLDKKRISS 491 L + + +Y + G+ + ++++ + + ++ Sbjct: 2692 ESLDDPQFAREALSVYRD---TAGQAWTWAMGKALLLFGATERLNRGATLLAGYRLARAA 2748 Query: 492 HA 493 Sbjct: 2749 GT 2750 >gi|187933425|ref|YP_001886912.1| hypothetical protein CLL_A2724 [Clostridium botulinum B str. Eklund 17B] gi|187721578|gb|ACD22799.1| phage tail tape measure protein, TP901 family [Clostridium botulinum B str. Eklund 17B] Length = 1019 Score = 39.2 bits (89), Expect = 3.4, Method: Composition-based stats. Identities = 48/365 (13%), Positives = 101/365 (27%), Gaps = 13/365 (3%) Query: 355 KLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDGFISRQ 414 ++ A+ + + G T LR+ + H L + Sbjct: 186 NVKETTSALPGLLSLASAGSLDLATATDIASGTLRAFNIDAAQTSHVADVLALSAAATNS 245 Query: 415 MLSRVGIDKEAIQRINKMPLKERMELLSDVGLYAEGVVAH---GRNMMEGSDAFQIGHKL 471 ++ +G + + + + + GL + + G + + K Sbjct: 246 DVTDLGETMKYAAPVAQALGISFEDTAAASGLLSNANIKGSQAGTILRQTMARLASPTKE 305 Query: 472 HSKMHKWSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLD 531 +K+ K G D + + G + + +SL L + R D F + Sbjct: 306 AAKVMKAYGINAFDAQGN------MKPLNGVINNLNSSLGKLTSQKRADIISTVFDTESM 359 Query: 532 DTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLS 591 ++ + T + ++ L +LA + + +K Sbjct: 360 SGVLALMNQGGQSLGDLSKKLTETKGSADEMEKTKLDNLAGQWTILKSAVEGMKIELG-- 417 Query: 592 PEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLT 651 E+ +Q +I + D V + + + G L Sbjct: 418 -EKLAPYAKQFVTWFTAKIPSITDSVVKFVDTISNNIGTIKAAGGAFLGLTGAFVGMFAI 476 Query: 652 YKRGTRAGEALRMFQQFTTTPTG-MFLNILDLSNSAKMPKGASMALNHVWIQYSATMALA 710 K GT G ++ F T T + + AL A + +A Sbjct: 477 NKIGTTVGTFGKLLGGFKTATTADALVKTTSAMQGLGLASKIIPALLSPTGLAIAGIGIA 536 Query: 711 GIGVA 715 G+ A Sbjct: 537 GLVAA 541 >gi|51892253|ref|YP_074944.1| DNA mismatch repair protein [Symbiobacterium thermophilum IAM 14863] gi|81692142|sp|Q67QE3|MUTS2_SYMTH RecName: Full=MutS2 protein gi|51855942|dbj|BAD40100.1| DNA mismatch repair protein [Symbiobacterium thermophilum IAM 14863] Length = 793 Score = 38.8 bits (88), Expect = 3.7, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 44/116 (37%), Gaps = 7/116 (6%) Query: 15 RELSKKELRRLEDGI--VRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 R+ +E R+ED I + A + K ++A R R + E++++ + A + Sbjct: 503 RQFLTQEQERVEDLIQGIHATRAELEKERAEAHRLRAEAQRMREEYERRYGDAQRKAAET 562 Query: 73 AYK-----RHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKV 123 K + L + +A + QAL + + A ++ A V Sbjct: 563 VEKARAQAQQILATARREAEAVIAELKQALREQREAERMQAIQSARSRLARARQAV 618 >gi|116006854|ref|YP_788037.1| hypothetical protein pO86A1_p071 [Escherichia coli] gi|115500709|dbj|BAF33940.1| hypothetical protein [Escherichia coli] Length = 275 Score = 38.8 bits (88), Expect = 4.0, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 46/148 (31%), Gaps = 12/148 (8%) Query: 13 AGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDE 72 AG L++ E+R + D + A S+ + A R R A + R Sbjct: 124 AGEWLTEDEIRAVLDAVRDAVCSVSYQVAEDARRIRAALTTTGQTLLTRQTRR----FRL 179 Query: 73 AYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKA---AETKVLSKFNE 129 K LD + A+ N+ G+ +EM + + + Sbjct: 180 VVKESDHPCWLDEYDENLPVVLDAILNR-----GARFSSVEMYLVSECVEHILSSGLACD 234 Query: 130 YAEVGSKNLGFTLDKQFGLDVFDEMKGK 157 + + D+ +V E + + Sbjct: 235 VLRIPDEPPRRWFDRGVLREVVREARNE 262 >gi|294895705|ref|XP_002775265.1| troponin T, skeletal muscle, putative [Perkinsus marinus ATCC 50983] gi|239881339|gb|EER07081.1| troponin T, skeletal muscle, putative [Perkinsus marinus ATCC 50983] Length = 705 Score = 38.8 bits (88), Expect = 4.2, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 66/169 (39%), Gaps = 9/169 (5%) Query: 20 KELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQL 79 +EL+ LE I+ + K E+ + Q+ +R +A + +L Sbjct: 357 RELKALESRILWNMRREEKSAERKMEKEAQKDITQWRREQETSLREGIEAFRRTTHQREL 416 Query: 80 RSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAE-TKVLSKFNEYAEVGSKNL 138 + +L+ VQ K +A +L S + E I+ K +E + + Sbjct: 417 KENLEFVQFKRERKRRAREAELELITQSYDAQREKSIQRENCEKERITADEVFKAEQRK- 475 Query: 139 GFTLDKQFGLDVFDEMKGKKTQNEQASRL---VKQYFETQRELHSQAHE 184 D++ + +K ++ + EQA RL ++ +R+L ++ + Sbjct: 476 ----DREETRKLVRALKEEEARKEQAERLYNSAEKMEYEKRQLLAEKNR 520 >gi|134297319|ref|YP_001121054.1| hypothetical protein Bcep1808_3229 [Burkholderia vietnamiensis G4] gi|134140476|gb|ABO56219.1| hypothetical protein Bcep1808_3229 [Burkholderia vietnamiensis G4] Length = 875 Score = 38.8 bits (88), Expect = 4.3, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 46/136 (33%), Gaps = 5/136 (3%) Query: 10 NKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDA 69 +AA R++++ + I + + LS +ER A L+A + +K + +A Sbjct: 541 EEAAARKITEGLIGGNRQRIEALQLQREMLDLSASER---AVLQARNELEKSATAARKEA 597 Query: 70 --IDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKF 127 I +A R Q ++ A + L S E + ++ + Sbjct: 598 SQIQDADLRAQTIEAINDALARQLPIVENLIRANAEYQRSTEFGAKAALRTYIEDATNAA 657 Query: 128 NEYAEVGSKNLGFTLD 143 + + D Sbjct: 658 KQAERAVTGAFKSMED 673 >gi|329938772|ref|ZP_08288168.1| integral membrane [Streptomyces griseoaurantiacus M045] gi|329302263|gb|EGG46155.1| integral membrane [Streptomyces griseoaurantiacus M045] Length = 852 Score = 38.8 bits (88), Expect = 4.4, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 97/381 (25%), Gaps = 19/381 (4%) Query: 293 VSTNVNTILTSELASLSKDIVIARELGPNADSFVKQMIVQTIANDQEASAGNKVLK---- 348 V LT + +D+ + R +G + Q + A L Sbjct: 284 TGFVVAGALTVAVNGQRRDLALMRAVGATPKQIRRLAAAQAMVVTAMAYVPGAALGYLLA 343 Query: 349 ---DWLGRNKLEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGAL 405 L ++ V L + + V A + ++ + Sbjct: 344 DRLRDLLVDRGAVPSALPLTVSPLPALATAVLLAAAVQLAARGAAWRTSTRPATEAVAES 403 Query: 406 LEDGFISRQMLSRVGIDKE-AIQRINKMPLKERMELLSDVGLYAEGVVAHGRNMMEGSDA 464 + ++ + G+ A ++ PL R +G A + + Sbjct: 404 RTEPREPARLRTYGGLLVIVAATTLSAAPLLSRT----AIGAAATQMAGIVGAIGLAMAG 459 Query: 465 FQIGHKLHSKMHK-----WSGAEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRL 519 + + + + + +L + +AL V + + A + Sbjct: 460 PALTRWAGTALARRLRPGTTAPTWLAVANVRGYALRVAGVVSTLAMAVAFVLTYAFTLTT 519 Query: 520 DPSIKAF-FKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIA 578 A + + V R +K + ++ Sbjct: 520 VAEATAQDTRAGTLAQYRVSAPGLGGLPTGLLDDVRDTPGVKEAAPVTTTTVVYSYRELG 579 Query: 579 YHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMH 638 + + L+P+ + L + + + +S + + V + + Sbjct: 580 DVTTESAGATILTPDAPRVLDLDVREGGLDRLRGATAAISEETARSLDAAVGDRITLTLG 639 Query: 639 TSLFDRQRLGLLTYKRGTRAG 659 R+ + Y RG G Sbjct: 640 DGTTAHPRV-VAVYGRGLGFG 659 >gi|223994133|ref|XP_002286750.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220978065|gb|EED96391.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1284 Score = 38.4 bits (87), Expect = 4.9, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 92/318 (28%), Gaps = 24/318 (7%) Query: 19 KKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAY---- 74 ++EL +E + + + + +R K E+ + S A +A Sbjct: 315 ERELEVVETEMNKDRGVPEMGARKEVDRVSEVAAKLVEELPRTQALSRGSAGADAGAGVE 374 Query: 75 --KRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAE 132 K ++ + + F S + KAA S + Sbjct: 375 RGKHQSAHRQYMNLEEYADALHAFDSSGVLFDDESHPSSGMWEDKAALELYSSSLQSRLK 434 Query: 133 VGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFF 192 + L ++T +E S L + E +++A Y Sbjct: 435 DAMDRTRSLEKRLVVL--------ERTGDEIVSSLCEDLVEVTG----HSNKAEARYVKK 482 Query: 193 ENRIPQPMSVDKLRATKKDDFVRSMLDWLDLS--RYKDIDGTPLSRSEIASFVGEVFAER 250 + + +++R K+ + L+ G+ + + A Sbjct: 483 GKELQRKRRREEVRLRNKERQAERRVRKLEERLLPISGEAGSQFNHKDFADSDSSDGNTT 542 Query: 251 VRSTSFKDPSIPS----SEVGVKREFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELA 306 +D I + + K E E+ H + + E F + +V ++ + Sbjct: 543 DEDDEEEDDEIRLEKKLASIKSKNEQEKAAHESEVDSIRRQCEQFKLRLSVVRLVMAGDD 602 Query: 307 SLSKDIVIARELGPNADS 324 +L I I L P+ Sbjct: 603 NLRDYIAILDRLNPSVQH 620 >gi|320100517|ref|YP_004176109.1| hypothetical protein Desmu_0308 [Desulfurococcus mucosus DSM 2162] gi|319752869|gb|ADV64627.1| hypothetical protein Desmu_0308 [Desulfurococcus mucosus DSM 2162] Length = 546 Score = 38.4 bits (87), Expect = 5.1, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 48/109 (44%), Gaps = 6/109 (5%) Query: 80 RSDLDRVQAGVYG-KSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNL 138 R L+ ++ G+ G +AL +L AGS L + + ++L + + G K L Sbjct: 91 RRILEALEKGLLGAGREALVRELVENAGSVVDELAARRRGLSKRLLLQLLDSPIPGYKYL 150 Query: 139 GFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQ---AHE 184 D ++ ++G ++E+ +R+V+ +T RE ++ Sbjct: 151 RDYFDP--YRELCGVIRGAPCRDEEVARVVEYVRQTLRETGRDMVSFND 197 >gi|163792657|ref|ZP_02186634.1| Helicase-like protein [alpha proteobacterium BAL199] gi|159182362|gb|EDP66871.1| Helicase-like protein [alpha proteobacterium BAL199] Length = 936 Score = 38.4 bits (87), Expect = 5.6, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 77/272 (28%), Gaps = 21/272 (7%) Query: 15 RELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRS---VNDAID 71 R+L+ EL ++ R + A E+ + E IR N + Sbjct: 251 RKLTPAELAQIAGRAGRHMNDGSFGVTMDCRPLDEEIVSAIEEHRFESIRQLHWRNADLS 310 Query: 72 EAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAA-ETKVLSKFNEY 130 A R LR+ +R L K A L + T S+ Sbjct: 311 FATVRDLLRTLDERPPH------DFLIRKRDADDQRALEALSRLPEVTDRTTASSRIRLL 364 Query: 131 AEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYK 190 +V + + + ++ T+ + RL + E + D Sbjct: 365 WDVCQIPDFQKIMSESHARLLVQVFTHLTEGSE--RLPSNW---VDEQMDRFDRTDGDID 419 Query: 191 FFENRIPQPMSVDKLRATKKDDFVRSMLDWLDLSRYKDID-GTPLSRSEIASFVGEVFAE 249 RI + + T + +++ LD +R + L FV A Sbjct: 420 TLAGRIAHVRTWTYI--TNRGEWIDDPLDRQQRARAIEDRLSDALHDRLTQRFVDRRAAT 477 Query: 250 RVRSTSFKDPSIPSSEVGVKR---EFERVFHF 278 R D + ++ E RV H Sbjct: 478 LSRRLQDDDAELIAAVAADGAVLVEGHRVGHL 509 >gi|288960723|ref|YP_003451063.1| hypothetical protein AZL_a09880 [Azospirillum sp. B510] gi|288913031|dbj|BAI74519.1| hypothetical protein AZL_a09880 [Azospirillum sp. B510] Length = 426 Score = 38.4 bits (87), Expect = 5.6, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 48/143 (33%), Gaps = 16/143 (11%) Query: 10 NKAAGRELSKKELRRLEDGIVRA---YVSLDGKGLSK--AERYRLAGLKAEEDF-----Q 59 +A GR+L+ +E E + R S+ AE E+ Sbjct: 170 EQALGRKLADEEALLTERVVTRQSVLQTRQAWNQASQEVAEIANQIAQLDNEELDLRFRA 229 Query: 60 KELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFK------AGSAEVPLE 113 + +R +A+ EA +R + R+Q + + N++ G + +E Sbjct: 230 DQRVRDAENALGEAERRLAQIGETRRMQTDIRAPASGRVNEIQANAGALVQHGENILSIE 289 Query: 114 MKIKAAETKVLSKFNEYAEVGSK 136 + + + + N+ + Sbjct: 290 TQGNGLQLLMFADQNQGDRLKPG 312 >gi|300726651|ref|ZP_07060086.1| putative exonuclease sbcCD, C subunit [Prevotella bryantii B14] gi|299776069|gb|EFI72644.1| putative exonuclease sbcCD, C subunit [Prevotella bryantii B14] Length = 1032 Score = 38.0 bits (86), Expect = 6.2, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 58/176 (32%), Gaps = 9/176 (5%) Query: 22 LRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRS 81 ++ L D +++ K KAE+ + +L EA KR QL Sbjct: 218 IKNLYDQAKANRKTVEVKL--KAEKEHTMAEDEVKQLNDQLKLLTEQQKAEAEKRQQLEG 275 Query: 82 DLDRVQAGVYGK--SQALFN-----KLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVG 134 + ++ + K + L +L FK SA++ + + + + + Sbjct: 276 QITVLKNYLAAKDEKEQLNKQLTEYRLHFKILSADILYRQQQLQIADSYIQELQTWLKNH 335 Query: 135 SKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYK 190 + G + + + KK + + ++ + L + A D K Sbjct: 336 EEREGVYSQVDLISERLKQWRKKKDDGRRLAEGLEAENKKTTLLERTKNVADSDLK 391 >gi|303289573|ref|XP_003064074.1| kinesin-II motor subunit protein [Micromonas pusilla CCMP1545] gi|226454390|gb|EEH51696.1| kinesin-II motor subunit protein [Micromonas pusilla CCMP1545] Length = 897 Score = 38.0 bits (86), Expect = 6.4, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 65/190 (34%), Gaps = 18/190 (9%) Query: 2 KPECIQVLNKAAGRELSKKELRRL----EDGIVRAYVS-LDGKGLSKAERYRLAGLKAEE 56 K + L ++ ++ R+ E+ R + +D + ++ E+ R+A + Sbjct: 501 KRQVQDELAGKLKSATTQADIDRIHRDAEERTQREMRAIMDDRATTEEEKRRIASEMEAQ 560 Query: 57 DFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMKI 116 + E A E K+ L++ + ++ + + L + + E Sbjct: 561 RLEIESQTEA--ASREREKKEALQAQIKAIEGKLLHGADDLEAR-------NKQLEEAAA 611 Query: 117 KAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQN--EQASRLVKQYFET 174 K + + + + +K D K ++ + + ++ +Y Sbjct: 612 KGVRDIADRERLKLER--QRAVAAMEEKALLSDEKFASKKEEVADKTRKLKKMFSKYQTA 669 Query: 175 QRELHSQAHE 184 +++L A E Sbjct: 670 KQDLEEHADE 679 >gi|125654623|ref|YP_001033817.1| ICE nucleation protein [Rhodobacter sphaeroides 2.4.1] gi|77386283|gb|ABA81712.1| Ice nucleation protein [Rhodobacter sphaeroides 2.4.1] Length = 1561 Score = 38.0 bits (86), Expect = 7.3, Method: Composition-based stats. Identities = 49/371 (13%), Positives = 105/371 (28%), Gaps = 17/371 (4%) Query: 370 MRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDG-----FISRQMLSRVGIDKE 424 + ET + + +A A+ LG + AL + + V Sbjct: 740 VAALETADVAALSTAGVKGVGSAQAAALGSAQVAALTTAQVGQLSTTALKGFGSVQASGL 799 Query: 425 AIQRINKMPLKERMELLSDV--GLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAE 482 ++ + + +L + GL +VA + Q+G +++ + A+ Sbjct: 800 TTAQVAALTTAQLSQLSTAAVKGLGTAQIVALTTGQTAALGSAQLGALSTAQVAAFETAD 859 Query: 483 YLDKKRISSHALIVYNQIGRMTDTYASLKDLK----ADPRLDPSIKAFFKQLDDTDFTVI 538 + L + T A+L + + ++ A L T + Sbjct: 860 AAALTTTALKGLTTAQVVALTTGQAAALGSAQVAGLSSTQIAALETADLAALTTTAVKGL 919 Query: 539 KRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQEL 598 + S G + A T + L A ++ + + + + + Sbjct: 920 GSTQVSSLTTGQVAALTTVQVAALSTAAVKGVGSVQASGLTTAQVAALTTAQVAQLSTAA 979 Query: 599 QQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRA 658 + L + + + L Q + + + + Sbjct: 980 LKGLGTAQIVALTTAQAAKLGSDQVAALSTAQVAALETADLATLSATGVKGFGSAQAAAL 1039 Query: 659 GEALRMFQQFTTTPTGMFL----NILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGV 714 G A FTT ++ + AL + +T A+ G+G Sbjct: 1040 GSAQ--VAAFTTAQVAALTTAAVKGFGSVQASGLTTAQVAALTTAQLSQLSTAAVKGLGT 1097 Query: 715 ASIKALLRGED 725 A I AL G+ Sbjct: 1098 AQIVALTTGQT 1108 >gi|126464806|ref|YP_001041782.1| large exoprotein involved in heme utilization or adhesion [Rhodobacter sphaeroides ATCC 17029] gi|126106621|gb|ABN79146.1| large exoprotein involved in heme utilization or adhesion [Rhodobacter sphaeroides ATCC 17029] Length = 1561 Score = 38.0 bits (86), Expect = 7.3, Method: Composition-based stats. Identities = 49/371 (13%), Positives = 106/371 (28%), Gaps = 17/371 (4%) Query: 370 MRYGETVENTGWANWMAGLRSAAGASMLGQHPIGALLEDG-----FISRQMLSRVGIDKE 424 + ET + + +A A+ LG + AL + + V Sbjct: 740 VAALETADVAALSTAGVKGLGSAQAAALGSAQVAALTTTQVGQLSTTALKGFGSVQASGL 799 Query: 425 AIQRINKMPLKERMELLSDV--GLYAEGVVAHGRNMMEGSDAFQIGHKLHSKMHKWSGAE 482 ++ + + +L + GL +VA + Q+G +++ + A+ Sbjct: 800 TTAQVAALTTTQLSQLSTAAVKGLGTAQIVALTTGQTAALGSAQLGALSTAQVAAFETAD 859 Query: 483 YLDKKRISSHALIVYNQIGRMTDTYASLKDLK----ADPRLDPSIKAFFKQLDDTDFTVI 538 + L + T A+L + + ++ A L T + Sbjct: 860 AAALTTTALKGLTTAQVVALTTGQAAALGSAQVAGLSSTQIAALETADLAALTTTAVKGL 919 Query: 539 KRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQEL 598 + S G + A T + + L A ++ + + + + + Sbjct: 920 GSTQVSSLTTGQVAALTTAQVAALSTAAVKGVGSVQASGLTTAQVAALTTAQVAQLSTAA 979 Query: 599 QQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRA 658 + L + + + L Q + + + + Sbjct: 980 LKGLGTAQIVALTTAQAAKLGSDQVAALSTAQVAALETADLATLSATGVKGFGSAQAAAL 1039 Query: 659 GEALRMFQQFTTTPTGMFL----NILDLSNSAKMPKGASMALNHVWIQYSATMALAGIGV 714 G A FTT ++ + AL + +T A+ G+G Sbjct: 1040 GSAQ--VAAFTTAQVAALTTAAVKGFGSVQASGLTTAQVAALTTAQLSQLSTAAVKGLGT 1097 Query: 715 ASIKALLRGED 725 A I AL G+ Sbjct: 1098 AQIVALTTGQT 1108 >gi|24641597|ref|NP_536797.2| smrter, isoform A [Drosophila melanogaster] gi|24641599|ref|NP_727634.1| smrter, isoform B [Drosophila melanogaster] gi|24641601|ref|NP_727635.1| smrter, isoform C [Drosophila melanogaster] gi|22832155|gb|AAF48195.2| smrter, isoform A [Drosophila melanogaster] gi|22832156|gb|AAN09315.1| smrter, isoform B [Drosophila melanogaster] gi|22832157|gb|AAF48196.2| smrter, isoform C [Drosophila melanogaster] Length = 3604 Score = 38.0 bits (86), Expect = 7.4, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 50/195 (25%), Gaps = 33/195 (16%) Query: 10 NKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDA 69 A + + KEL + V L + AE+ A K + L + D Sbjct: 766 AALAKEQRAAKELND-NNNDQEPMVELSWRSQMLAEKIYAANRKTAQAQHSMLQNAAADE 824 Query: 70 IDEAYKR--------------HQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMK 115 L + + Q+ + KL + + L K Sbjct: 825 SSPGSVAGRPWLPLYNQPLDVEALAMLIRQHQSQIRAPLLLHIRKLKAERWAHNQGLVEK 884 Query: 116 IKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQ 175 + + + +++F VF E++ + Sbjct: 885 YTKDQADWQRRCERMEASAKRKAREAKNREFFEKVFTELRKQ------------------ 926 Query: 176 RELHSQAHEAGLDYK 190 RE + + G K Sbjct: 927 REDKERFNRVGSRIK 941 >gi|147676372|ref|YP_001210587.1| membrane-fusion protein [Pelotomaculum thermopropionicum SI] gi|146272469|dbj|BAF58218.1| membrane-fusion protein [Pelotomaculum thermopropionicum SI] Length = 468 Score = 38.0 bits (86), Expect = 7.8, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 12 AAGRELSKKELRRLEDGIVRAYVSLDG--------------KGLSKAERYRLAGLKAEED 57 AG+ L+++E LE ++++ SL G + +++AE + A + Sbjct: 69 TAGQLLAEQESDNLEAQVIQSSASLKGALAKLELLKNGSTAEEIAQAEANVVMAQAAYDL 128 Query: 58 FQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMK-I 116 + L R + A R L S + QA + G+ +E + Sbjct: 129 TKTNLERYQALFQEGAVSRADLDSASNEYVNAEAKLKQAQESLKALLNGNRREDIEAQAA 188 Query: 117 KAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASR-----LVKQY 171 + ++ + + A G+K V E+ G + Q A+ Sbjct: 189 QVESSRAQLQIAQKALAGTKLFSPIN------GVVSEVNGGEGQRAAANNNSTSSGTGFI 242 Query: 172 FETQRELHSQAHEAGLDYKFFENR 195 L +A D E Sbjct: 243 VVISDALQVRAQVNEADIGRLETG 266 >gi|39937742|ref|NP_950018.1| methyl-accepting chemotaxis receptor/sensory transducer [Rhodopseudomonas palustris CGA009] gi|39651602|emb|CAE30124.1| methyl-accepting chemotaxis receptor/sensory transducer [Rhodopseudomonas palustris CGA009] Length = 559 Score = 38.0 bits (86), Expect = 7.8, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 60/192 (31%), Gaps = 5/192 (2%) Query: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTD---FTV 537 E + ++GR+ Y + L+P+IKA D + F Sbjct: 66 LEATLALEDPGSLALHRERLGRLKKEYQERHAFWSKAPLEPAIKARLIDDSDREVQKFWR 125 Query: 538 IKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQE 597 I A + + + + K+L A A + D + ++ + + + Sbjct: 126 IVDASLLPAIEAKDPDTSMQAYKDLTAAYTAHRAIIDDIVKRTNDLNAATEAATAVRVTD 185 Query: 598 LQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTR 657 L L + + ++ + +++ + M + + + +RG Sbjct: 186 LNYLLWGVSGTVFLLFVAGLTAIVKGVIVPITG--MTEVMRRLASGDRAVAIPAIERGDE 243 Query: 658 AGEALRMFQQFT 669 G R Q F Sbjct: 244 VGAMARAVQVFK 255 >gi|329889871|ref|ZP_08268214.1| parB-like nuclease domain protein [Brevundimonas diminuta ATCC 11568] gi|328845172|gb|EGF94736.1| parB-like nuclease domain protein [Brevundimonas diminuta ATCC 11568] Length = 593 Score = 37.6 bits (85), Expect = 8.2, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 63/204 (30%), Gaps = 19/204 (9%) Query: 27 DGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYKRHQLRSD--LD 84 D VR D + AE A +A DA DE+ R Q R ++ Sbjct: 384 DRRVRNEAMADSVETASAETVFDAKRRAVLALLG------FDAEDESVARAQARETPVVE 437 Query: 85 RVQAGVYGKSQALFNKLFFKAGSAEVPLEMKIKAAETKVLSKFNEYAEVGSKNLGFTLDK 144 V +F G ++A + E+ G D+ Sbjct: 438 LFARMVKLSDDEVFAVAAVIMGETLAVGGPLVEAVGAYLKVDMAEWWTPDDAFFGLLRDR 497 Query: 145 QFGLDVFDEMKGKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFENRIPQPMSVDK 204 + ++ GK+ + + VK TQ+ + + P+ M+ Sbjct: 498 AVANALLRDVGGKRIADANVAEKVK----TQKTILRDFLAGSGGRAKVDGWTPKWMAFPP 553 Query: 205 LRATKK-----DDF--VRSMLDWL 221 R T + D + V++++ L Sbjct: 554 ARYTDRIYAPVDRWGAVKTVMRRL 577 >gi|192293523|ref|YP_001994128.1| methyl-accepting chemotaxis sensory transducer [Rhodopseudomonas palustris TIE-1] gi|192287272|gb|ACF03653.1| methyl-accepting chemotaxis sensory transducer [Rhodopseudomonas palustris TIE-1] Length = 559 Score = 37.6 bits (85), Expect = 8.2, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 60/192 (31%), Gaps = 5/192 (2%) Query: 481 AEYLDKKRISSHALIVYNQIGRMTDTYASLKDLKADPRLDPSIKAFFKQLDDTD---FTV 537 E + ++GR+ Y + L+P+IKA D + F Sbjct: 66 LEATLALEDPGSLALHRERLGRLKKEYQERHAFWSKAPLEPAIKARLIDDSDREVQKFWR 125 Query: 538 IKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQE 597 I A + + + + K+L A A + D + ++ + + + Sbjct: 126 IVDASLLPAIEAKDPDTSMQAYKDLTAAYTAHRAIIDDIVKRTNDLNAATEAATAVRVTD 185 Query: 598 LQQQLADLERKEINILKDKVSNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTR 657 L L + + ++ + +++ + M + + + +RG Sbjct: 186 LNYLLWGVSGTVFLLFVAGLTAIVKGVIVPITG--MTEVMRRLASGDRAVAIPAIERGDE 243 Query: 658 AGEALRMFQQFT 669 G R Q F Sbjct: 244 VGAMARAVQVFK 255 >gi|5815245|gb|AAD52614.1|AF175223_1 SANT domain protein SMRTER [Drosophila melanogaster] Length = 3469 Score = 37.6 bits (85), Expect = 8.6, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 50/195 (25%), Gaps = 33/195 (16%) Query: 10 NKAAGRELSKKELRRLEDGIVRAYVSLDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDA 69 A + + KEL + V L + AE+ A K + L + D Sbjct: 631 AALAKEQRAAKELND-NNNDQEPMVELSWRSQMLAEKIYAANRKTAQAQHSMLQNAAADE 689 Query: 70 IDEAYKR--------------HQLRSDLDRVQAGVYGKSQALFNKLFFKAGSAEVPLEMK 115 L + + Q+ + KL + + L K Sbjct: 690 SSPGSVAGRPWLPLYNQPLDVEALAMLIRQHQSQIRAPLLLHIRKLKAERWAHNQGLVEK 749 Query: 116 IKAAETKVLSKFNEYAEVGSKNLGFTLDKQFGLDVFDEMKGKKTQNEQASRLVKQYFETQ 175 + + + +++F VF E++ + Sbjct: 750 YTKDQADWQRRCERMEASAKRKAREAKNREFFEKVFTELRKQ------------------ 791 Query: 176 RELHSQAHEAGLDYK 190 RE + + G K Sbjct: 792 REDKERFNRVGSRIK 806 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.305 0.109 0.253 Lambda K H 0.267 0.0337 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 11,124,902,010 Number of Sequences: 14124377 Number of extensions: 403096336 Number of successful extensions: 1347164 Number of sequences better than 10.0: 473 Number of HSP's better than 10.0 without gapping: 112 Number of HSP's successfully gapped in prelim test: 488 Number of HSP's that attempted gapping in prelim test: 1345834 Number of HSP's gapped (non-prelim): 1100 length of query: 864 length of database: 4,842,793,630 effective HSP length: 148 effective length of query: 716 effective length of database: 2,752,385,834 effective search space: 1970708257144 effective search space used: 1970708257144 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 85 (37.6 bits)