BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781188|ref|YP_003065601.1| hypothetical protein CLIBASIA_05475 [Candidatus Liberibacter asiaticus str. psy62] (185 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781188|ref|YP_003065601.1| hypothetical protein CLIBASIA_05475 [Candidatus Liberibacter asiaticus str. psy62] gi|254040865|gb|ACT57661.1| hypothetical protein CLIBASIA_05475 [Candidatus Liberibacter asiaticus str. psy62] Length = 185 Score = 380 bits (976), Expect = e-104, Method: Compositional matrix adjust. Identities = 185/185 (100%), Positives = 185/185 (100%) Query: 1 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK Sbjct: 1 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 Query: 61 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN Sbjct: 61 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 Query: 121 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI 180 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI Sbjct: 121 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI 180 Query: 181 IEHNN 185 IEHNN Sbjct: 181 IEHNN 185 >gi|315122903|ref|YP_004063392.1| hypothetical protein CKC_05795 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496305|gb|ADR52904.1| hypothetical protein CKC_05795 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 180 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 73/177 (41%), Positives = 112/177 (63%), Gaps = 1/177 (0%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 +K K + +K + YS L I+ ++ANG+ + VL+ MP FY W+ + + Sbjct: 1 MKANSKKRKPQKPTVKYSQSLTKEIITRIANGETMQAVLKYPNMPTADAFYDWLARYPEH 60 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 +E+Y +A +L+L+ EE+ EP PT ELANPVFYSKMRDRKQ+ +L E+L+ Q YG Sbjct: 61 RESYNQARVKKLELMVEEVTNEPEPTEHELANPVFYSKMRDRKQKSVLWLAERLNRQIYG 120 Query: 125 PRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSII 181 ++VE KHTIDL+P + ++ E KH + + R+ ++TEKPLE+ + S A+QS + Sbjct: 121 NHMTVEQKHTIDLKPVLAKIEERRKHQQANKAMRV-LEATEKPLELTKLSKAKQSTM 176 >gi|254781217|ref|YP_003065630.1| hypothetical protein CLIBASIA_05620 [Candidatus Liberibacter asiaticus str. psy62] gi|254040894|gb|ACT57690.1| hypothetical protein CLIBASIA_05620 [Candidatus Liberibacter asiaticus str. psy62] Length = 162 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 8/154 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK-LQEAYTEALQCRLDLL 79 Y+ E IL + G +L +K G+ S F+ W+K+D + L++ Y +A Q ++ L Sbjct: 6 YTKERIDNILASFSGGLSLSQSCKKHGVTVVS-FHGWVKQDREDLEKRYEQAKQSHMEHL 64 Query: 80 AEELLEE-PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLR 138 +E L AP EE + K+R+ + + + LEK YG VSVE KHTIDL+ Sbjct: 65 SENLASVVEAPLTEEERDHPQAIKLRELRMKRLQWELEKRYRNVYGNHVSVEQKHTIDLK 124 Query: 139 PAIERLRE--HYKHLKPIDSERIPHKSTEKPLEI 170 P ++R++ K LKP+ + K TEKPLE+ Sbjct: 125 PLMDRVQHSIQSKGLKPV---KALDKQTEKPLEL 155 >gi|317120681|gb|ADV02504.1| hypothetical protein SC1_gp120 [Liberibacter phage SC1] gi|317120723|gb|ADV02545.1| hypothetical protein SC2_gp120 [Liberibacter phage SC2] gi|317120784|gb|ADV02605.1| hypothetical protein SC2_gp120 [Candidatus Liberibacter asiaticus] gi|317120825|gb|ADV02646.1| hypothetical protein SC1_gp120 [Candidatus Liberibacter asiaticus] Length = 171 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 8/154 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK-LQEAYTEALQCRLDLL 79 Y+ E IL + G +L +K G+ S F+ W+K+D + L++ Y +A Q ++ L Sbjct: 15 YTKERIDNILASFSGGLSLSQSCKKHGVTVVS-FHGWVKQDREDLEKRYEQAKQSHMEHL 73 Query: 80 AEELLEE-PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLR 138 +E L AP EE + K+R+ + + + LEK YG VSVE KHTIDL+ Sbjct: 74 SENLASVVEAPLTEEERDHPQAIKLRELRMKRLQWELEKRYRNVYGNHVSVEQKHTIDLK 133 Query: 139 PAIERLRE--HYKHLKPIDSERIPHKSTEKPLEI 170 P ++R++ K LKP+ + K TEKPLE+ Sbjct: 134 PLMDRVQHSIQSKGLKPV---KALDKQTEKPLEL 164 >gi|260596961|ref|YP_003209532.1| hypothetical protein CTU_11690 [Cronobacter turicensis z3032] gi|260216138|emb|CBA28941.1| hypothetical protein CTU_11690 [Cronobacter turicensis z3032] Length = 164 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 37/113 (32%), Positives = 65/113 (57%), Gaps = 4/113 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ AGI +A+G++L V + GMP ST +RW+ + + ++ Y +A + R D + Sbjct: 13 YLPEVAAGICSLLADGESLRKVCDRPGMPNKSTVFRWLAQHEEFRDQYAKATETRADAIF 72 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKH 133 EE+ + AEE A V +++R ++ + L +++ +KYG +VS E H Sbjct: 73 EEMFDIADTVAEEAAA-VGKARLRIDTRK---WALARMNPKKYGDKVSQEIDH 121 >gi|315121941|ref|YP_004062430.1| hypothetical protein CKC_00955 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495343|gb|ADR51942.1| hypothetical protein CKC_00955 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 52/164 (31%), Positives = 90/164 (54%), Gaps = 6/164 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK-LQEAYTEALQCRLDLL 79 Y+ E IL +A+G L + G+ TF+ W+++D L++ Y +A ++ L Sbjct: 8 YTLERTNKILASIASGMTLAKSCKLHGVL-VMTFFAWLERDFDDLKKRYEQAKISHMEHL 66 Query: 80 AEELLEE-PAPTAEELAN-PVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 ++++ + AP +EE PV + K R+ + + + LEK YG ++VE KHTIDL Sbjct: 67 SDKIADTIEAPLSEEEKKYPVSF-KRRELQMKRLQWELEKRYRNVYGNHMTVEQKHTIDL 125 Query: 138 RPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSII 181 +P + ++ E KH + + R+ ++TEKPLE + S EQS + Sbjct: 126 KPVLAKIEERRKHQQANKAMRVI-EATEKPLECRKLSKDEQSTM 168 >gi|156934896|ref|YP_001438812.1| hypothetical protein ESA_02744 [Cronobacter sakazakii ATCC BAA-894] gi|156533150|gb|ABU77976.1| hypothetical protein ESA_02744 [Cronobacter sakazakii ATCC BAA-894] Length = 150 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 40/129 (31%), Positives = 70/129 (54%), Gaps = 9/129 (6%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ A I +A+G++L V + GMP ST +RW+ + + ++ Y +A + R D + Sbjct: 15 YLPEVAADICSLLADGESLRKVCERPGMPNKSTVFRWLAQHEEFRDQYAKATETRADAIF 74 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKH-----TI 135 EE+ + AEE A V +++R ++ + L +++ +KYG +VS E H T+ Sbjct: 75 EEMFDIADTVAEEAA-AVGKARLRIDTRK---WALARMNPKKYGDKVSQEIDHKSSDGTM 130 Query: 136 DLRPAIERL 144 +P RL Sbjct: 131 ATKPTTIRL 139 >gi|251791567|ref|YP_003006288.1| hypothetical protein Dd1591_4010 [Dickeya zeae Ech1591] gi|247540188|gb|ACT08809.1| hypothetical protein Dd1591_4010 [Dickeya zeae Ech1591] Length = 245 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 31/110 (28%), Positives = 64/110 (58%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +A+G++L V R GMP +T +RW+ ++ ++ Y +A R D++ Sbjct: 106 YMPEVADDICTLLADGESLRDVCRHPGMPNKATVFRWLAENEVFRDQYAKATDVRADVIF 165 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 +E++ + A + E A V S+++ ++ ++L +++ +KYG R++ E Sbjct: 166 DEII-DIADSTEADAAEVAKSRLKIDARK---WVLSRMAPKKYGERITQE 211 >gi|268589861|ref|ZP_06124082.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291314844|gb|EFE55297.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 140 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/113 (30%), Positives = 64/113 (56%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE I ++A G++L V + GMP +T +RW+ ++ + ++ Y +A + R D L Sbjct: 13 YMPETAHDICAKLAEGESLRSVCNRQGMPSKATVFRWLSENAEFRDQYAKATEQRADALF 72 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+LE + P + E+A +K+R ++ + L ++S +KYG +V+ E Sbjct: 73 EEMLEIADDVLPDSAEVAK----AKLRIDTRK---WSLARMSPKKYGDKVTQE 118 >gi|329119588|ref|ZP_08248269.1| putative phage associated protein [Neisseria bacilliformis ATCC BAA-1200] gi|327464185|gb|EGF10489.1| putative phage associated protein [Neisseria bacilliformis ATCC BAA-1200] Length = 135 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 4/103 (3%) Query: 29 ILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPA 88 I + +A G+++ + R MP ST YRWI + + E Y A + + D+LA+E++E Sbjct: 12 ICELLAAGQSMAQICRSKAMPAQSTVYRWIAESAEFSERYAHAREMQADMLADEIIEIAD 71 Query: 89 PTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVES 131 E A D ++ +L +L+ +KYG RV V + Sbjct: 72 SCEPEAAAVAKAKARIDARK----WLAARLAPKKYGDRVDVSA 110 >gi|254240164|ref|ZP_04933486.1| hypothetical protein PA2G_00805 [Pseudomonas aeruginosa 2192] gi|126193542|gb|EAZ57605.1| hypothetical protein PA2G_00805 [Pseudomonas aeruginosa 2192] Length = 188 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 12/132 (9%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD--LKLQEAYTEALQCRLDL 78 Y+P + + +A G +L V + GMP STF RW+ + L+E Y A + + D Sbjct: 42 YTPTVGLAVCTALAEGMSLRKVCQLPGMPAMSTFLRWLADEQHADLREQYARAREAQADT 101 Query: 79 LAEELLE------EPAPTAEELANPVFYSKMRDRKQRMGT--FLLEKLSNQKYGPRVSVE 130 LAEE+L+ T +E V Y + K R+ +L K++ +KYG R++ E Sbjct: 102 LAEEILQIADDGSNDTYTDDEGRTHVDYDHISRSKLRVDARKWLASKMAPKKYGDRITNE 161 Query: 131 SKHTIDLRPAIE 142 HT AIE Sbjct: 162 --HTGANGGAIE 171 >gi|218695969|ref|YP_002403636.1| putative terminase small subunit [Escherichia coli 55989] gi|218352701|emb|CAU98483.1| putative terminase small subunit [Escherichia coli 55989] Length = 141 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 63/110 (57%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Y +A + R D + Sbjct: 14 YLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDEFRDKYAKATEARADSIF 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E A TA A V +++R ++ + L +++ +KYG +V+ E Sbjct: 74 EEIF-EIADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKVTNE 119 >gi|53793590|ref|YP_112490.1| terminase small subunit [Flavobacterium phage 11b] gi|53748180|emb|CAH56641.1| terminase small subunit [Flavobacterium phage 11b] Length = 160 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 35/114 (30%), Positives = 55/114 (48%), Gaps = 4/114 (3%) Query: 26 FAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEEL-- 83 F+ IL + +G++L VL++ GMP +TFY+WI D YT A + R D+ E + Sbjct: 33 FSSILSFIEDGQSLRSVLKREGMPDSTTFYKWIDADENKSIQYTRACETRADVKFESIEK 92 Query: 84 --LEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI 135 LEEP + +++ K + L KL +KYG ++ E I Sbjct: 93 DYLEEPQRDPVSGKIDAGWVQLQRLKIDSKKWELGKLQPKKYGDKIQNEISGAI 146 >gi|298381519|ref|ZP_06991118.1| terminase small subunit [Escherichia coli FVEC1302] gi|298278961|gb|EFI20475.1| terminase small subunit [Escherichia coli FVEC1302] Length = 141 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 63/110 (57%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Y +A + R D + Sbjct: 14 YLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHEEFRDKYAKATEARADSIF 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E A TA A V +++R ++ + L +++ +KYG +V+ E Sbjct: 74 EEIF-EIADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKVTNE 119 >gi|325497785|gb|EGC95644.1| putative terminase small subunit [Escherichia fergusonii ECD227] Length = 141 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 62/110 (56%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +A+G++L V ++ GMP +T +RW+ + ++ Y +A + R D + Sbjct: 14 YLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDDFRDKYAKATEARADSVF 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E A TA A V +++R ++ + L +++ +KYG +V+ E Sbjct: 74 EEIF-EIADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKVTNE 119 >gi|300922508|ref|ZP_07138620.1| hypothetical protein HMPREF9548_00763 [Escherichia coli MS 182-1] gi|300421166|gb|EFK04477.1| hypothetical protein HMPREF9548_00763 [Escherichia coli MS 182-1] gi|324114095|gb|EGC08068.1| ubiquitin carboxyl-terminal hydrolase 35 [Escherichia fergusonii B253] Length = 141 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Y +A + R D + Sbjct: 14 YLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDEFRDKYAKATEARADSIF 73 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E P A E+A R + L +++ +KYG +V+ E Sbjct: 74 EEIFEIADNAIPDAAEVAKARLRVDTR-------KWALARMNPRKYGDKVTNE 119 >gi|240080383|ref|ZP_04724926.1| putative phage associated protein [Neisseria gonorrhoeae FA19] gi|268596526|ref|ZP_06130693.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] gi|268550314|gb|EEZ45333.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] Length = 149 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 32/132 (24%), Positives = 71/132 (53%), Gaps = 5/132 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y A + + D A Sbjct: 14 YTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYARAREKQADYFA 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI-DLRP 139 EE++ E A +A+ + V +K++ ++ + K++ +KYG + ++ K +R Sbjct: 74 EEII-EIADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKTELDVKSGDGSMRA 129 Query: 140 AIERLREHYKHL 151 A+ E Y+ + Sbjct: 130 AVRLDAEEYRKI 141 >gi|240016263|ref|ZP_04722803.1| putative phage associated protein [Neisseria gonorrhoeae FA6140] gi|240113248|ref|ZP_04727738.1| putative phage associated protein [Neisseria gonorrhoeae MS11] gi|240116003|ref|ZP_04730065.1| putative phage associated protein [Neisseria gonorrhoeae PID18] gi|240118307|ref|ZP_04732369.1| putative phage associated protein [Neisseria gonorrhoeae PID1] gi|240126039|ref|ZP_04738925.1| putative phage associated protein [Neisseria gonorrhoeae SK-92-679] gi|268599328|ref|ZP_06133495.1| terminase small subunit [Neisseria gonorrhoeae MS11] gi|268601670|ref|ZP_06135837.1| gene 1 protein [Neisseria gonorrhoeae PID18] gi|268604019|ref|ZP_06138186.1| terminase small subunit [Neisseria gonorrhoeae PID1] gi|268684632|ref|ZP_06151494.1| terminase small subunit [Neisseria gonorrhoeae SK-92-679] gi|268583459|gb|EEZ48135.1| terminase small subunit [Neisseria gonorrhoeae MS11] gi|268585801|gb|EEZ50477.1| gene 1 protein [Neisseria gonorrhoeae PID18] gi|268588150|gb|EEZ52826.1| terminase small subunit [Neisseria gonorrhoeae PID1] gi|268624916|gb|EEZ57316.1| terminase small subunit [Neisseria gonorrhoeae SK-92-679] Length = 149 Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 32/132 (24%), Positives = 71/132 (53%), Gaps = 5/132 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y A + + D A Sbjct: 14 YTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYARAREKQADYFA 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI-DLRP 139 EE++ E A +A+ + V +K++ ++ + K++ +KYG + ++ K +R Sbjct: 74 EEII-EIADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKTELDVKSGDGSMRA 129 Query: 140 AIERLREHYKHL 151 A+ E Y+ + Sbjct: 130 AVRLDAEEYRKI 141 >gi|59800932|ref|YP_207644.1| putative phage associated protein [Neisseria gonorrhoeae FA 1090] gi|239999272|ref|ZP_04719196.1| putative phage associated protein [Neisseria gonorrhoeae 35/02] gi|240013814|ref|ZP_04720727.1| putative phage associated protein [Neisseria gonorrhoeae DGI18] gi|240121387|ref|ZP_04734349.1| putative phage associated protein [Neisseria gonorrhoeae PID24-1] gi|240123851|ref|ZP_04736807.1| putative phage associated protein [Neisseria gonorrhoeae PID332] gi|254494034|ref|ZP_05107205.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|260440181|ref|ZP_05793997.1| putative phage associated protein [Neisseria gonorrhoeae DGI2] gi|268595095|ref|ZP_06129262.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268682475|ref|ZP_06149337.1| terminase small subunit [Neisseria gonorrhoeae PID332] gi|291043472|ref|ZP_06569188.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|293398784|ref|ZP_06642952.1| phage associated protein [Neisseria gonorrhoeae F62] gi|59717827|gb|AAW89232.1| hypothetical protein, putative phage associated protein [Neisseria gonorrhoeae FA 1090] gi|226513074|gb|EEH62419.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|268548484|gb|EEZ43902.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268622759|gb|EEZ55159.1| terminase small subunit [Neisseria gonorrhoeae PID332] gi|291011935|gb|EFE03924.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|291610786|gb|EFF39893.1| phage associated protein [Neisseria gonorrhoeae F62] Length = 149 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 32/132 (24%), Positives = 71/132 (53%), Gaps = 5/132 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y A + + D A Sbjct: 14 YTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYARAREKQADYFA 73 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI-DLRP 139 EE++ E A +A+ + V +K++ ++ + K++ +KYG + ++ K +R Sbjct: 74 EEII-EIADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKSELDVKSGDGSMRA 129 Query: 140 AIERLREHYKHL 151 A+ E Y+ + Sbjct: 130 AVRLDAEEYRKI 141 >gi|145631713|ref|ZP_01787475.1| hypothetical protein CGSHi22421_00952 [Haemophilus influenzae R3021] gi|144982656|gb|EDJ90198.1| hypothetical protein CGSHi22421_00952 [Haemophilus influenzae R3021] Length = 173 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/138 (26%), Positives = 71/138 (51%), Gaps = 21/138 (15%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y E+ I +A G++L + + GMP T +RW+ ++ + + Y A + + D L Sbjct: 24 YVKEVADDICQLIAQGESLRKICERPGMPNQRTIFRWLNENDEFCQQYARARENQADFLL 83 Query: 81 EELLEEPAPTAEELANP--VFYSKMRDRKQRMGTFLLEKLSNQKYGPRVS---------- 128 EE+LE +LA P V +K+R ++ + + K++ +KYG +V+ Sbjct: 84 EEMLE-----ISDLATPENVSVAKLRVDARK---WYITKVAPKKYGDKVTQEITGVNGAP 135 Query: 129 VESKHTIDL-RPAIERLR 145 +E K ++DL R +++ L+ Sbjct: 136 IEQKISMDLSRLSVDELK 153 >gi|260856408|ref|YP_003230299.1| putative terminase small subunit [Escherichia coli O26:H11 str. 11368] gi|331678315|ref|ZP_08378990.1| conserved hypothetical protein [Escherichia coli H591] gi|257755057|dbj|BAI26559.1| probable terminase small subunit [Escherichia coli O26:H11 str. 11368] gi|331074775|gb|EGI46095.1| conserved hypothetical protein [Escherichia coli H591] Length = 141 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 30/113 (26%), Positives = 58/113 (51%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V + GMP ++ +RW+ + + ++ Y +A + R D + Sbjct: 14 YMPEVANDICALLSSGESLRKVCERPGMPSKTSVFRWLAEHQEFRDQYAKATETRADSIF 73 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E + P A E+A R + L +++ +KYG +V+ E Sbjct: 74 EEIFEIADDVIPDAAEVAKARLRVDTR-------KWALARMNPRKYGDKVTNE 119 >gi|261381053|ref|ZP_05985626.1| conserved hypothetical protein [Neisseria subflava NJ9703] gi|284796086|gb|EFC51433.1| conserved hypothetical protein [Neisseria subflava NJ9703] Length = 141 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 22/65 (33%), Positives = 38/65 (58%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 YS E+ I +++ANG++L + + GMP ST +W+ ++ + E Y A + D A Sbjct: 14 YSDEMAENICEKIANGRSLRSICAEDGMPTTSTVCKWLIENKEFSEQYARARDKQADYFA 73 Query: 81 EELLE 85 EE++E Sbjct: 74 EEIIE 78 >gi|126443704|ref|YP_001063349.1| hypothetical protein BURPS668_A2355 [Burkholderia pseudomallei 668] gi|167907352|ref|ZP_02494557.1| hypothetical protein BpseN_34300 [Burkholderia pseudomallei NCTC 13177] gi|126223195|gb|ABN86700.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 147 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 8/112 (7%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 + +S LF I + +GK++ V + GMP TF +W K+ +LQ+ Y +A R D Sbjct: 16 VEFSQALFDRICALIGDGKSVREVCKGPGMPDRMTFLKWAKRTPELQKQYDDACIDRQDA 75 Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 + +++L A+ + +P M D ++ + L +++ +++G VS E Sbjct: 76 IFDDVLY----IADTVRDPKRAKVMVDARE----WTLARMNRKRFGNHVSNE 119 >gi|312831397|emb|CBY17578.1| unnamed protein product [Escherichia coli LF82] Length = 140 Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V ++ GMP ST +RW+ K ++ Y +A + R D + Sbjct: 13 YMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDFRDKYAKATEARADSIF 72 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E A TA A V +++R ++ + L +++ +KYG +V+ E Sbjct: 73 EEIF-EIADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKVTNE 118 >gi|205360376|ref|ZP_02682926.2| terminase small subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205349763|gb|EDZ36394.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 142 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 4/110 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V ++ GMP ST +RW+ K ++ Y +A + R D + Sbjct: 15 YMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDFRDKYAKATEARADSIF 74 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E A TA A V +++R ++ + L +++ +KYG +V+ E Sbjct: 75 EEIF-EIADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKVTNE 120 >gi|288562970|pdb|3HEF|A Chain A, Crystal Structure Of The Bacteriophage Sf6 Terminase Small Subunit gi|288562971|pdb|3HEF|B Chain B, Crystal Structure Of The Bacteriophage Sf6 Terminase Small Subunit Length = 143 Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V ++ GMP ST +RW+ K ++ Y +A + R D + Sbjct: 16 YMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDFRDKYAKATEARADSIF 75 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E P A E+A R + L +++ +KYG +V+ E Sbjct: 76 EEIFEIADNAIPDAAEVAKARLRVDTR-------KWALARMNPRKYGDKVTNE 121 >gi|13559865|ref|NP_112075.1| terminase small subunit [Enterobacteria phage HK620] gi|41057279|ref|NP_958177.1| gene 1 protein [Enterobacteria phage Sf6] gi|168239625|ref|ZP_02664683.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|260868684|ref|YP_003235086.1| putative terminase small subunit [Escherichia coli O111:H- str. 11128] gi|13517601|gb|AAK28890.1|AF335538_42 terminase small subunit [Salmonella phage HK620] gi|33334158|gb|AAQ12191.1| gene 1 protein [Shigella phage Sf6] gi|197287688|gb|EDY27079.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|257765040|dbj|BAI36535.1| putative terminase small subunit [Escherichia coli O111:H- str. 11128] Length = 140 Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V ++ GMP ST +RW+ K ++ Y +A + R D + Sbjct: 13 YMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDFRDKYAKATEARADSIF 72 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E P A E+A R + L +++ +KYG +V+ E Sbjct: 73 EEIFEIADNAIPDAAEVAKARLRVDTR-------KWALARMNPRKYGDKVTNE 118 >gi|119386462|ref|YP_917517.1| hypothetical protein Pden_3755 [Paracoccus denitrificans PD1222] gi|119377057|gb|ABL71821.1| hypothetical protein Pden_3755 [Paracoccus denitrificans PD1222] Length = 133 Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 20/134 (14%) Query: 19 IYYSPELFAG----ILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQC 74 I E AG I + +A G +L VLRK GMP YST RW++ D + Y A + Sbjct: 3 IKRDAETIAGYKDAICEAIACGDSLAKVLRKKGMPGYSTVIRWLRDDEDFRVDYARAREA 62 Query: 75 RLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN----QKYGPRVSVE 130 + D A+++ ++ + V K+ + R L+ + +KYG ++ ++ Sbjct: 63 QADADADKI--------GDIVDRVLAGKLDPQAARAAIDALKWTAGKRQPKKYGDKLDLD 114 Query: 131 SKHTIDLRPAIERL 144 I+ R IER+ Sbjct: 115 ----INGRVQIERV 124 >gi|300935793|ref|ZP_07150756.1| conserved hypothetical protein [Escherichia coli MS 21-1] gi|300459026|gb|EFK22519.1| conserved hypothetical protein [Escherichia coli MS 21-1] gi|323177129|gb|EFZ62719.1| hypothetical protein ECOK1180_4497 [Escherichia coli 1180] Length = 142 Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 10/113 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y PE+ I +++G++L V ++ GMP ST +RW+ K ++ Y +A + R D + Sbjct: 15 YMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDFRDKYAKATEARADSIF 74 Query: 81 EELLE---EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE+ E P A E+A R + L +++ +KYG +V+ E Sbjct: 75 EEIFEIADNAIPDAAEVAKARLRVDTR-------KWALARMNPRKYGDKVTNE 120 >gi|332343006|gb|AEE56340.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 161 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 33/112 (29%), Positives = 60/112 (53%), Gaps = 7/112 (6%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ E I + VA+G+++ + + GMP ST +W + + + Y A + ++LA Sbjct: 25 YTEEKALEICELVADGQSVNKISKMPGMPTRSTILKWFRDVPEFSDMYMRAKEIGFEVLA 84 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGT--FLLEKLSNQKYGPRVSVE 130 +E+L+ A++ N V ++R + + T +LL KL +KYG RV+ E Sbjct: 85 DEILD----IADDAKN-VDKDQLRRHQLMIETRKWLLAKLQPRKYGERVTQE 131 >gi|300935518|ref|ZP_07150512.1| helix-turn-helix protein [Escherichia coli MS 21-1] gi|300459317|gb|EFK22810.1| helix-turn-helix protein [Escherichia coli MS 21-1] Length = 171 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 33/112 (29%), Positives = 60/112 (53%), Gaps = 7/112 (6%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ E I + VA+G+++ + + GMP ST +W + + + Y A + ++LA Sbjct: 25 YTEEKALEICELVADGQSVNKISKMPGMPTRSTILKWFRDVPEFSDMYMRAKEIGFEVLA 84 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGT--FLLEKLSNQKYGPRVSVE 130 +E+L+ A++ N V ++R + + T +LL KL +KYG RV+ E Sbjct: 85 DEILD----IADDAKN-VDKDQLRRHQLMIETRKWLLAKLQPRKYGERVTQE 131 >gi|41179387|ref|NP_958695.1| Bbp26 [Bordetella phage BPP-1] gi|45569519|ref|NP_996588.1| hypothetical protein BMP-1p25 [Bordetella phage BMP-1] gi|45580770|ref|NP_996636.1| hypothetical protein BIP-1p25 [Bordetella phage BIP-1] gi|40950126|gb|AAR97692.1| Bbp26 [Bordetella phage BPP-1] Length = 194 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 41/133 (30%), Positives = 61/133 (45%), Gaps = 11/133 (8%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A+K R K Y + + I D++A+G+ L + R MP + T Y WI+ + E Sbjct: 16 ARKTARMKNASTY-TEAIATEICDRIADGEPLRQICRDEHMPAWRTVYHWIEARPEFAER 74 Query: 68 YTEALQCRLDLLAEELLE--EPAPTAEEL--ANPVFYSKMRD----RKQRMGTF--LLEK 117 A + D +AEE LE T EE ++ + K D RK ++ T LL K Sbjct: 75 MERARRAGFDAIAEEALEIANTPQTGEETEESDDGYKVKRGDMLGHRKLQVETRLKLLAK 134 Query: 118 LSNQKYGPRVSVE 130 KYG + S+E Sbjct: 135 WHPTKYGDKTSME 147 >gi|218782398|ref|YP_002433716.1| hypothetical protein Dalk_4570 [Desulfatibacillum alkenivorans AK-01] gi|218763782|gb|ACL06248.1| hypothetical protein Dalk_4570 [Desulfatibacillum alkenivorans AK-01] Length = 196 Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust. Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 10/117 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD--LKLQEAYTEALQCRLDL 78 Y+ E+ I +++A+G+++ + MP + YRW+ K +E YT A + + D Sbjct: 9 YTDEIAETICERIASGESVQSITSDEDMPSSAMVYRWLADARYSKFREMYTRARERQADF 68 Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI 135 A+E++ A+ NP + +Q + KL +KYG R+ V + T+ Sbjct: 69 FADEII----TIADTEDNPKKAAVRVQARQ----WAASKLKPRKYGDRIDVHQQGTV 117 >gi|187476924|ref|YP_784948.1| phage terminase small subunit [Bordetella avium 197N] gi|115421510|emb|CAJ48019.1| Putative phage terminase small subunit [Bordetella avium 197N] Length = 170 Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust. Identities = 39/120 (32%), Positives = 54/120 (45%), Gaps = 10/120 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ + I D++A+G+ L + R MP + T Y WI+ K E A + D +A Sbjct: 7 YTEAIATEICDRIADGEPLRQICRDQHMPAWRTVYHWIEARSKFAERMERARRVGFDAIA 66 Query: 81 EELLE---EPAPTAE-ELANPVFYSKMRD----RKQRMGTF--LLEKLSNQKYGPRVSVE 130 EE LE P E E F K D RK ++ T LL K KYG + S+E Sbjct: 67 EEALEIANTPQIGEETEQEGSKFRVKRGDMLGHRKLQVETRLKLLAKWHPTKYGDKTSME 126 >gi|254497230|ref|ZP_05110039.1| putative phage associated protein [Legionella drancourtii LLAP12] gi|254353554|gb|EET12280.1| putative phage associated protein [Legionella drancourtii LLAP12] Length = 157 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 35/125 (28%), Positives = 63/125 (50%), Gaps = 10/125 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGM--PKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ EL I + +A+ VL K M P T + W+KK + E Y +A C+++L Sbjct: 14 YNSELAEEICNVIASTSKGTKVLCKENMHWPCQDTLFAWLKKYPEFSEQYAQAKICQIEL 73 Query: 79 LAEELLEEPAPTA-----EELANPVFYSKMRDRKQ-RMGT--FLLEKLSNQKYGPRVSVE 130 L +E+LE + +E PV + + +R + ++ T +L KL + YG ++ +E Sbjct: 74 LIDEILEIADDESNDCFIDEDGKPVSNNSVVNRARLKIDTRKWLASKLVPKVYGNKIGIE 133 Query: 131 SKHTI 135 S ++ Sbjct: 134 SDSSM 138 >gi|239835387|ref|YP_002956059.1| hypothetical protein DMR_p1_00530 [Desulfovibrio magneticus RS-1] gi|239794478|dbj|BAH73469.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 162 Score = 43.1 bits (100), Expect = 0.016, Method: Compositional matrix adjust. Identities = 27/117 (23%), Positives = 52/117 (44%), Gaps = 8/117 (6%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 +S E+ + + G+ L + GMP +T Y W+ K QE Y A + + D+ Sbjct: 20 FSQEVADAVCQGIMEGRGLRSICADPGMPGPATVYSWLTKYASFQEQYALAREVQADVYF 79 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 +E++ A+ +P D ++ + KL ++YG R+ VE + + + Sbjct: 80 DEVVS----IADNEPDPQIARNRIDARK----WAASKLRPKRYGDRLEVEHQGGVGM 128 >gi|116253679|ref|YP_769517.1| hypothetical protein RL3939 [Rhizobium leguminosarum bv. viciae 3841] gi|115258327|emb|CAK09429.1| putative phage-related protein [Rhizobium leguminosarum bv. viciae 3841] Length = 157 Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust. Identities = 37/142 (26%), Positives = 65/142 (45%), Gaps = 23/142 (16%) Query: 29 ILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLDLLAEELLEE 86 I +++A+G++L + R MP ST W+ D K + Y +A + + D +E++E Sbjct: 16 ICERIADGESLRSICRDDAMPAKSTVLAWLADDDKTAFRTKYAQAREIQADGFVDEMVE- 74 Query: 87 PAPTAEELANPVFYSKMRD--------------RKQ-RMGT--FLLEKLSNQKYGPRVSV 129 A++ N K D R Q R+GT ++ EKL +KYG +V + Sbjct: 75 ---IADDGTNDWMEKKNADGETTGWQENGEAIRRSQLRIGTRQWIAEKLKPKKYGAKVEL 131 Query: 130 ESKHTIDLRPAIERLREHYKHL 151 E T ++ +E + + L Sbjct: 132 EHGVTGEVSQLLEAINGKTRGL 153 >gi|309379924|emb|CBX21335.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 136 Score = 42.7 bits (99), Expect = 0.024, Method: Compositional matrix adjust. Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 14/125 (11%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 YS E I + +A G +L + MP T +RW+ + QE Y A + + D A Sbjct: 14 YSQETADKICELIARGMSLRAICTSADMPAGGTVHRWLAEHQDFQEQYARAREEQADGFA 73 Query: 81 EELL---EEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 +E++ + AP E+A R K KL+ +KYG ++ +++ D+ Sbjct: 74 DEIIDIADSVAPETGEVAKAKLQIDARKWK-------AAKLAPKKYGEKLELDA----DM 122 Query: 138 RPAIE 142 R +E Sbjct: 123 RVKVE 127 >gi|197104489|ref|YP_002129866.1| hypothetical protein PHZ_c1023 [Phenylobacterium zucineum HLK1] gi|196477909|gb|ACG77437.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 221 Score = 42.0 bits (97), Expect = 0.037, Method: Compositional matrix adjust. Identities = 28/91 (30%), Positives = 42/91 (46%), Gaps = 9/91 (9%) Query: 3 NLVKKAKKAVRAKKG--------CIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTF 54 L ++A +A R + G C Y E I ++ G+AL + R MP ST Sbjct: 5 GLFRRAIEAARIEAGRGSAIGRPCGYCR-ETAEAIFVRLCGGEALSAICRDPQMPAASTV 63 Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLE 85 Y W+ ++ + EAY A + + D LA E E Sbjct: 64 YAWLAREPQFAEAYRLAREVQADALAAEGWE 94 >gi|329119004|ref|ZP_08247698.1| terminase small subunit [Neisseria bacilliformis ATCC BAA-1200] gi|327464877|gb|EGF11168.1| terminase small subunit [Neisseria bacilliformis ATCC BAA-1200] Length = 153 Score = 41.2 bits (95), Expect = 0.063, Method: Compositional matrix adjust. Identities = 30/125 (24%), Positives = 58/125 (46%), Gaps = 14/125 (11%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 YS L I +++A G++L V MP ST ++W+ ++ + E Y A + + D A Sbjct: 31 YSKGLADRICERIARGESLRAVCSDGDMPSASTVHKWLAENQEFSEQYAHAREEQADHFA 90 Query: 81 EELL---EEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 ++++ +E P + +A R + KL+ +KYG ++ + D+ Sbjct: 91 QDIIRIADEVEPESAAVAKAKLQIDAR-------KWTAAKLAPKKYGDKIDLSG----DM 139 Query: 138 RPAIE 142 R +E Sbjct: 140 RVQVE 144 >gi|326564855|gb|EGE15061.1| putative phage associated protein [Moraxella catarrhalis 103P14B1] Length = 139 Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust. Identities = 19/64 (29%), Positives = 32/64 (50%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ +L A L +VA G+++ V MP +T Y W+ + + Y A + R D Sbjct: 14 YNEDLVAEFLYRVAKGRSVASVCSDDDMPHRATIYEWLAHNSDFSDRYARASEQRADQYF 73 Query: 81 EELL 84 +E+L Sbjct: 74 DEML 77 >gi|167750592|ref|ZP_02422719.1| hypothetical protein EUBSIR_01569 [Eubacterium siraeum DSM 15702] gi|167656518|gb|EDS00648.1| hypothetical protein EUBSIR_01569 [Eubacterium siraeum DSM 15702] Length = 379 Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust. Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 4/75 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ E ++D+V +G+ H+L G+PK STFY W++ QE A + ++D+ Sbjct: 4 YTQEEKQAVIDRVISGEPSAHILADTGIPK-STFYSWLR---IYQEEQNAANRRKVDIRN 59 Query: 81 EELLEEPAPTAEELA 95 LLE E + Sbjct: 60 FHLLENKVARLESIV 74 >gi|160897384|ref|YP_001562966.1| putative terminase small subunit protein [Delftia acidovorans SPH-1] gi|160362968|gb|ABX34581.1| putative terminase small subunit protein [Delftia acidovorans SPH-1] Length = 157 Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust. Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 2/67 (2%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK--DLKLQEAYTEALQCRLDL 78 ++ ++ + D++A G++L + MP +T RW+ K + +E Y A + D Sbjct: 7 FNEDIATKLCDRIAAGESLRKICTDPKMPSMATVMRWLGKPENAAFREQYACARDAQADK 66 Query: 79 LAEELLE 85 L EE+LE Sbjct: 67 LVEEMLE 73 >gi|90416803|ref|ZP_01224733.1| hypothetical protein GB2207_04109 [marine gamma proteobacterium HTCC2207] gi|90331556|gb|EAS46792.1| hypothetical protein GB2207_04109 [marine gamma proteobacterium HTCC2207] Length = 248 Score = 36.6 bits (83), Expect = 1.5, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 56/114 (49%), Gaps = 5/114 (4%) Query: 69 TEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLS--NQKYGPR 126 T A+Q +L L + + + PTA+EL FY+ + R + TF LE +S K+G + Sbjct: 82 TLAMQAKLAGLTKNVTTQRLPTADELK--TFYTLNQQRYRNPSTFWLEVISFATAKHGGQ 139 Query: 127 VSVESKHTIDL-RPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQS 179 V ++ + +P + LR+ Y + I+ E + +S L +E I+ S Sbjct: 140 VFERAEQELKTGQPPVGDLRDQYTAISTIELETLYGQSFVDSLMEIEQPISALS 193 >gi|154412501|ref|XP_001579283.1| hypothetical protein [Trichomonas vaginalis G3] gi|121913488|gb|EAY18297.1| hypothetical protein TVAG_254090 [Trichomonas vaginalis G3] Length = 807 Score = 35.8 bits (81), Expect = 2.6, Method: Compositional matrix adjust. Identities = 33/119 (27%), Positives = 57/119 (47%), Gaps = 16/119 (13%) Query: 13 RAKKGCIYYSPELFAGILDQVANG-KALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEA 71 RA +G I P+ A +D V G A+G L K + +TFYR+++ + + + Y + Sbjct: 434 RALEGAILKQPQKKA--VDIVRTGLSAVGDYLIKKKSSEVATFYRYVQDNCHVCDGYVQ- 490 Query: 72 LQCRLDLLAEELLEE------PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 L +ELLEE P EE A P+ ++++ KQ++ + + L + G Sbjct: 491 ------FLVKELLEEREKYTKQTPKQEEKAKPIQKQRIQEIKQKVPEYDIASLEEEIEG 543 >gi|260949161|ref|XP_002618877.1| hypothetical protein CLUG_00036 [Clavispora lusitaniae ATCC 42720] gi|238846449|gb|EEQ35913.1| hypothetical protein CLUG_00036 [Clavispora lusitaniae ATCC 42720] Length = 1889 Score = 35.8 bits (81), Expect = 2.7, Method: Compositional matrix adjust. Identities = 20/100 (20%), Positives = 46/100 (46%), Gaps = 3/100 (3%) Query: 32 QVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTA 91 Q +N + G + + +FY+ I +K ++ A++C+++ L E L + P+ + Sbjct: 689 QSSNTLSCGEIQGTIDFKTLLSFYQGITSFVK---QFSAAIKCKVNALPESLADSPSQSP 745 Query: 92 EELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVES 131 + + S + + R G F + + +G + S+ S Sbjct: 746 KAFSTAGLASSIYSKHSRFGRFKAKNRDHFHFGEQRSISS 785 >gi|224438834|ref|ZP_03659668.1| hypothetical protein HcinC1_12185 [Helicobacter cinaedi CCUG 18818] Length = 196 Score = 35.0 bits (79), Expect = 5.3, Method: Compositional matrix adjust. Identities = 26/111 (23%), Positives = 54/111 (48%), Gaps = 7/111 (6%) Query: 33 VANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR-LDLLAEE--LLEEPAP 89 V++G+++ ++ + G Y++ W+ L + + + +QC+ L EE ++E A Sbjct: 89 VSDGESIKYIYIQ-GFEYYNS---WVTSSLSMHQDVKQHIQCKPFKDLKEEYAIIENIAF 144 Query: 90 TAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPA 140 +E K R + RMG +K +Q + ++ +S H IDL+ + Sbjct: 145 NNKEYIGDEMIVKERRKLNRMGYATSQKYISQVFKEAINAKSHHYIDLKES 195 >gi|313145155|ref|ZP_07807348.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] gi|313130186|gb|EFR47803.1| conserved hypothetical protein [Helicobacter cinaedi CCUG 18818] Length = 203 Score = 34.7 bits (78), Expect = 5.5, Method: Compositional matrix adjust. Identities = 26/111 (23%), Positives = 54/111 (48%), Gaps = 7/111 (6%) Query: 33 VANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR-LDLLAEE--LLEEPAP 89 V++G+++ ++ + G Y++ W+ L + + + +QC+ L EE ++E A Sbjct: 96 VSDGESIKYIYIQ-GFEYYNS---WVTSSLSMHQDVKQHIQCKPFKDLKEEYAIIENIAF 151 Query: 90 TAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPA 140 +E K R + RMG +K +Q + ++ +S H IDL+ + Sbjct: 152 NNKEYIGDEMIVKERRKLNRMGYATSQKYISQVFKEAINAKSHHYIDLKES 202 >gi|188575220|ref|YP_001912149.1| isrso16-transposase orfa protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188519672|gb|ACD57617.1| isrso16-transposase orfa protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 88 Score = 34.3 bits (77), Expect = 7.3, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 5/62 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 ++ E A L Q G A+G + RK+G+ + +TFY W KKD L + + RL LL Sbjct: 6 FTEEQIAYALKQAKLGMAVGEICRKLGIAE-ATFYVWRKKDGGLGPSELK----RLRLLE 60 Query: 81 EE 82 EE Sbjct: 61 EE 62 >gi|322488501|emb|CBZ23747.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103] Length = 739 Score = 34.3 bits (77), Expect = 7.5, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 46/96 (47%), Gaps = 5/96 (5%) Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLE-KLSNQKY 123 +EA + + R ++ E + E A TA + A +F R + R+ + E K+ Sbjct: 598 EEARVVSEKLRREIADLERVFEKASTASDAAETLF----RQTEDRVTNSMTELKMKRTNM 653 Query: 124 GPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERI 159 R+ +E T L +E++R KHL+ ++ER+ Sbjct: 654 DARIQLEKDRTDRLNRELEKIRRQIKHLEEAENERL 689 >gi|91202301|emb|CAJ75361.1| strongly similar to ATPA gene encoding subunit alpha of ATP synthase [Candidatus Kuenenia stuttgartiensis] Length = 513 Score = 34.3 bits (77), Expect = 7.6, Method: Compositional matrix adjust. Identities = 30/90 (33%), Positives = 40/90 (44%), Gaps = 11/90 (12%) Query: 17 GCIYYSPELFA-GILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQ-- 73 G IY SPEL GI+ V G+++ V K P Y R + DL+L A E L+ Sbjct: 349 GQIYLSPELAQKGIMPAVNVGRSVSRVGGKAQFPSY----RMVASDLRLSYAQFEELERF 404 Query: 74 ----CRLDLLAEELLEEPAPTAEELANPVF 99 RLD +++E E L P F Sbjct: 405 ARFSTRLDESTRKIIERGKRVREILKQPQF 434 >gi|58583518|ref|YP_202534.1| transposase [Xanthomonas oryzae pv. oryzae KACC10331] gi|58428112|gb|AAW77149.1| putative truncated transposase [Xanthomonas oryzae pv. oryzae KACC10331] Length = 89 Score = 34.3 bits (77), Expect = 8.6, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 5/62 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 ++ E A L Q G A+G + RK+G+ + +TFY W KKD L + + RL LL Sbjct: 6 FTEEQIAYALKQAKLGMAVGEICRKLGIAE-ATFYVWHKKDGGLGPSELK----RLRLLE 60 Query: 81 EE 82 EE Sbjct: 61 EE 62 >gi|218461767|ref|ZP_03501858.1| putative terminase small subunit protein [Rhizobium etli Kim 5] Length = 146 Score = 34.3 bits (77), Expect = 8.7, Method: Compositional matrix adjust. Identities = 31/127 (24%), Positives = 56/127 (44%), Gaps = 19/127 (14%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 +S L I D++A+ ++L + R MP ST W+ + K A+ ++LA Sbjct: 7 FSQALAEKICDRIADRESLRSICRDDAMPAKSTVLSWLADEDKA--AFRARYALAREILA 64 Query: 81 EELLEEPAPTAEELANPVFYSK---------------MRDRKQRMGT--FLLEKLSNQKY 123 + ++E A++ +N K +R + R+ T ++ EKL +KY Sbjct: 65 DGFVDELVEIADDSSNDWIEKKNAAGETTGWQENGEAIRRSQLRIATRQWVAEKLRPKKY 124 Query: 124 GPRVSVE 130 G +V E Sbjct: 125 GAKVEPE 131 >gi|84625334|ref|YP_452706.1| transposase [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84369274|dbj|BAE70432.1| transposase [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 88 Score = 34.3 bits (77), Expect = 8.8, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 5/62 (8%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 ++ E A L Q G A+G + RK+G+ + +TFY W KKD L + + RL LL Sbjct: 6 FTEEQIAYALKQAKLGMAVGEICRKLGIAE-ATFYVWHKKDGGLGPSELK----RLRLLE 60 Query: 81 EE 82 EE Sbjct: 61 EE 62 Searching..................................................done Results from round 2 >gi|254781188|ref|YP_003065601.1| hypothetical protein CLIBASIA_05475 [Candidatus Liberibacter asiaticus str. psy62] gi|254040865|gb|ACT57661.1| hypothetical protein CLIBASIA_05475 [Candidatus Liberibacter asiaticus str. psy62] Length = 185 Score = 244 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 185/185 (100%), Positives = 185/185 (100%) Query: 1 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK Sbjct: 1 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 Query: 61 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN Sbjct: 61 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 Query: 121 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI 180 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI Sbjct: 121 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSI 180 Query: 181 IEHNN 185 IEHNN Sbjct: 181 IEHNN 185 >gi|315122903|ref|YP_004063392.1| hypothetical protein CKC_05795 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496305|gb|ADR52904.1| hypothetical protein CKC_05795 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 180 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 73/178 (41%), Positives = 113/178 (63%), Gaps = 1/178 (0%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 +K K + +K + YS L I+ ++ANG+ + VL+ MP FY W+ + + Sbjct: 1 MKANSKKRKPQKPTVKYSQSLTKEIITRIANGETMQAVLKYPNMPTADAFYDWLARYPEH 60 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 +E+Y +A +L+L+ EE+ EP PT ELANPVFYSKMRDRKQ+ +L E+L+ Q YG Sbjct: 61 RESYNQARVKKLELMVEEVTNEPEPTEHELANPVFYSKMRDRKQKSVLWLAERLNRQIYG 120 Query: 125 PRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSIIE 182 ++VE KHTIDL+P + ++ E KH + + R+ ++TEKPLE+ + S A+QS ++ Sbjct: 121 NHMTVEQKHTIDLKPVLAKIEERRKHQQANKAMRVL-EATEKPLELTKLSKAKQSTMK 177 >gi|315121941|ref|YP_004062430.1| hypothetical protein CKC_00955 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495343|gb|ADR51942.1| hypothetical protein CKC_00955 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 172 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 50/169 (29%), Positives = 89/169 (52%), Gaps = 4/169 (2%) Query: 16 KGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD-LKLQEAYTEALQC 74 + Y+ E IL +A+G L + G+ TF+ W+++D L++ Y +A Sbjct: 3 RPFTTYTLERTNKILASIASGMTLAKSCKLHGVL-VMTFFAWLERDFDDLKKRYEQAKIS 61 Query: 75 RLDLLAEELLE-EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKH 133 ++ L++++ + AP +EE K R+ + + + LEK YG ++VE KH Sbjct: 62 HMEHLSDKIADTIEAPLSEEEKKYPVSFKRRELQMKRLQWELEKRYRNVYGNHMTVEQKH 121 Query: 134 TIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSIIE 182 TIDL+P + ++ E KH + + R+ ++TEKPLE + S EQS ++ Sbjct: 122 TIDLKPVLAKIEERRKHQQANKAMRVI-EATEKPLECRKLSKDEQSTMK 169 >gi|156934896|ref|YP_001438812.1| hypothetical protein ESA_02744 [Cronobacter sakazakii ATCC BAA-894] gi|156533150|gb|ABU77976.1| hypothetical protein ESA_02744 [Cronobacter sakazakii ATCC BAA-894] Length = 150 Score = 160 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 42/145 (28%), Positives = 73/145 (50%), Gaps = 13/145 (8%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + K R Y PE+ A I +A+G++L V + GMP ST +RW+ + + Sbjct: 3 MATESKTGRPS----DYLPEVAADICSLLADGESLRKVCERPGMPNKSTVFRWLAQHEEF 58 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ + AEE A V +++R ++ + L +++ +KYG Sbjct: 59 RDQYAKATETRADAIFEEMFDIADTVAEE-AAAVGKARLRIDTRK---WALARMNPKKYG 114 Query: 125 PRVSVESKH-----TIDLRPAIERL 144 +VS E H T+ +P RL Sbjct: 115 DKVSQEIDHKSSDGTMATKPTTIRL 139 >gi|260596961|ref|YP_003209532.1| hypothetical protein CTU_11690 [Cronobacter turicensis z3032] gi|260216138|emb|CBA28941.1| hypothetical protein CTU_11690 [Cronobacter turicensis z3032] Length = 164 Score = 158 bits (399), Expect = 4e-37, Method: Composition-based stats. Identities = 43/163 (26%), Positives = 79/163 (48%), Gaps = 17/163 (10%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + K R Y PE+ AGI +A+G++L V + GMP ST +RW+ + + Sbjct: 1 MATESKTGRPS----EYLPEVAAGICSLLADGESLRKVCDRPGMPNKSTVFRWLAQHEEF 56 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ + AEE A V +++R ++ + L +++ +KYG Sbjct: 57 RDQYAKATETRADAIFEEMFDIADTVAEE-AAAVGKARLRIDTRK---WALARMNPKKYG 112 Query: 125 PRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKP 167 +VS E H + + +P + +P + T +P Sbjct: 113 DKVSQEIDH---------KSSDGSMTPQPTTIQLVPVEPTHEP 146 >gi|317120681|gb|ADV02504.1| hypothetical protein SC1_gp120 [Liberibacter phage SC1] gi|317120723|gb|ADV02545.1| hypothetical protein SC2_gp120 [Liberibacter phage SC2] gi|317120784|gb|ADV02605.1| hypothetical protein SC2_gp120 [Candidatus Liberibacter asiaticus] gi|317120825|gb|ADV02646.1| hypothetical protein SC1_gp120 [Candidatus Liberibacter asiaticus] Length = 171 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 54/160 (33%), Positives = 84/160 (52%), Gaps = 8/160 (5%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD-LKLQEAYTEALQCRLDL 78 Y+ E IL + G +L +K G+ S F+ W+K+D L++ Y +A Q ++ Sbjct: 14 KYTKERIDNILASFSGGLSLSQSCKKHGVTVVS-FHGWVKQDREDLEKRYEQAKQSHMEH 72 Query: 79 LAEELLEE-PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 L+E L AP EE + K+R+ + + + LEK YG VSVE KHTIDL Sbjct: 73 LSENLASVVEAPLTEEERDHPQAIKLRELRMKRLQWELEKRYRNVYGNHVSVEQKHTIDL 132 Query: 138 RPAIERLRE--HYKHLKPIDSERIPHKSTEKPLEIVESSI 175 +P ++R++ K LKP+ + K TEKPLE+ + + Sbjct: 133 KPLMDRVQHSIQSKGLKPVKA---LDKQTEKPLELPKLTK 169 >gi|218695969|ref|YP_002403636.1| putative terminase small subunit [Escherichia coli 55989] gi|218352701|emb|CAU98483.1| putative terminase small subunit [Escherichia coli 55989] Length = 141 Score = 150 bits (379), Expect = 6e-35, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 68/135 (50%), Gaps = 5/135 (3%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A + + Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Sbjct: 2 ATEKKNVGRPS-DYLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDEFRDK 60 Query: 68 YTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 Y +A + R D + EE+ E A TA A V +++R ++ + L +++ +KYG +V Sbjct: 61 YAKATEARADSIFEEIFE-IADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKV 116 Query: 128 SVESKHTIDLRPAIE 142 + E IE Sbjct: 117 TNELVGKDGGAIQIE 131 >gi|254781217|ref|YP_003065630.1| hypothetical protein CLIBASIA_05620 [Candidatus Liberibacter asiaticus str. psy62] gi|254040894|gb|ACT57690.1| hypothetical protein CLIBASIA_05620 [Candidatus Liberibacter asiaticus str. psy62] Length = 162 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 54/160 (33%), Positives = 84/160 (52%), Gaps = 8/160 (5%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD-LKLQEAYTEALQCRLDL 78 Y+ E IL + G +L +K G+ S F+ W+K+D L++ Y +A Q ++ Sbjct: 5 KYTKERIDNILASFSGGLSLSQSCKKHGVTVVS-FHGWVKQDREDLEKRYEQAKQSHMEH 63 Query: 79 LAEELLEE-PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDL 137 L+E L AP EE + K+R+ + + + LEK YG VSVE KHTIDL Sbjct: 64 LSENLASVVEAPLTEEERDHPQAIKLRELRMKRLQWELEKRYRNVYGNHVSVEQKHTIDL 123 Query: 138 RPAIERLRE--HYKHLKPIDSERIPHKSTEKPLEIVESSI 175 +P ++R++ K LKP+ + K TEKPLE+ + + Sbjct: 124 KPLMDRVQHSIQSKGLKPVKA---LDKQTEKPLELPKLTK 160 >gi|260856408|ref|YP_003230299.1| putative terminase small subunit [Escherichia coli O26:H11 str. 11368] gi|331678315|ref|ZP_08378990.1| conserved hypothetical protein [Escherichia coli H591] gi|257755057|dbj|BAI26559.1| probable terminase small subunit [Escherichia coli O26:H11 str. 11368] gi|331074775|gb|EGI46095.1| conserved hypothetical protein [Escherichia coli H591] Length = 141 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 66/135 (48%), Gaps = 5/135 (3%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A + + + Y PE+ I +++G++L V + GMP ++ +RW+ + + ++ Sbjct: 2 ATEKKKGGRPS-DYMPEVANDICALLSSGESLRKVCERPGMPSKTSVFRWLAEHQEFRDQ 60 Query: 68 YTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 Y +A + R D + EE+ E + A V +++R ++ + L +++ +KYG +V Sbjct: 61 YAKATETRADSIFEEIFEIADDVIPD-AAEVAKARLRVDTRK---WALARMNPRKYGDKV 116 Query: 128 SVESKHTIDLRPAIE 142 + E IE Sbjct: 117 TNELVGKDGGAIQIE 131 >gi|298381519|ref|ZP_06991118.1| terminase small subunit [Escherichia coli FVEC1302] gi|298278961|gb|EFI20475.1| terminase small subunit [Escherichia coli FVEC1302] Length = 141 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 68/135 (50%), Gaps = 5/135 (3%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A + + Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Sbjct: 2 ATEKKNVGRPS-DYLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHEEFRDK 60 Query: 68 YTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 Y +A + R D + EE+ E A TA A V +++R ++ + L +++ +KYG +V Sbjct: 61 YAKATEARADSIFEEIFE-IADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKV 116 Query: 128 SVESKHTIDLRPAIE 142 + E IE Sbjct: 117 TNELVGKDGGAIQIE 131 >gi|300922508|ref|ZP_07138620.1| hypothetical protein HMPREF9548_00763 [Escherichia coli MS 182-1] gi|300421166|gb|EFK04477.1| hypothetical protein HMPREF9548_00763 [Escherichia coli MS 182-1] gi|324114095|gb|EGC08068.1| ubiquitin carboxyl-terminal hydrolase 35 [Escherichia fergusonii B253] Length = 141 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 35/135 (25%), Positives = 67/135 (49%), Gaps = 5/135 (3%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A + + Y PE+ I +A+G++L V ++ GMP +T +RW+ + + ++ Sbjct: 2 ATEKKNVGRPS-DYLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDEFRDK 60 Query: 68 YTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 Y +A + R D + EE+ E A A A V +++R ++ + L +++ +KYG +V Sbjct: 61 YAKATEARADSIFEEIFE-IADNAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKV 116 Query: 128 SVESKHTIDLRPAIE 142 + E IE Sbjct: 117 TNELVGKDGGAIQIE 131 >gi|325497785|gb|EGC95644.1| putative terminase small subunit [Escherichia fergusonii ECD227] Length = 141 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 67/135 (49%), Gaps = 5/135 (3%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 A + + Y PE+ I +A+G++L V ++ GMP +T +RW+ + ++ Sbjct: 2 ATEKKNVGRPS-DYLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHDDFRDK 60 Query: 68 YTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 Y +A + R D + EE+ E A TA A V +++R ++ + L +++ +KYG +V Sbjct: 61 YAKATEARADSVFEEIFE-IADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYGDKV 116 Query: 128 SVESKHTIDLRPAIE 142 + E IE Sbjct: 117 TNELVGKDGGAIQIE 131 >gi|251791567|ref|YP_003006288.1| hypothetical protein Dd1591_4010 [Dickeya zeae Ech1591] gi|247540188|gb|ACT08809.1| hypothetical protein Dd1591_4010 [Dickeya zeae Ech1591] Length = 245 Score = 144 bits (362), Expect = 6e-33, Method: Composition-based stats. Identities = 35/133 (26%), Positives = 71/133 (53%), Gaps = 6/133 (4%) Query: 10 KAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYT 69 K R + Y PE+ I +A+G++L V R GMP +T +RW+ ++ ++ Y Sbjct: 96 KQNRTGRPS-DYMPEVADDICTLLADGESLRDVCRHPGMPNKATVFRWLAENEVFRDQYA 154 Query: 70 EALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSV 129 +A R D++ +E+++ A + E A V S+++ ++ ++L +++ +KYG R++ Sbjct: 155 KATDVRADVIFDEIID-IADSTEADAAEVAKSRLKIDARK---WVLSRMAPKKYGERITQ 210 Query: 130 ESKHTIDLRPAIE 142 E D P ++ Sbjct: 211 EVTGK-DGGPVVQ 222 >gi|268589861|ref|ZP_06124082.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291314844|gb|EFE55297.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 140 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 37/134 (27%), Positives = 66/134 (49%), Gaps = 5/134 (3%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAY 68 K + + Y PE I ++A G++L V + GMP +T +RW+ ++ + ++ Y Sbjct: 2 TKKNKGGRPS-DYMPETAHDICAKLAEGESLRSVCNRQGMPSKATVFRWLSENAEFRDQY 60 Query: 69 TEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVS 128 +A + R D L EE+LE + A V +K+R ++ + L ++S +KYG +V+ Sbjct: 61 AKATEQRADALFEEMLEIADDVLPDSA-EVAKAKLRIDTRK---WSLARMSPKKYGDKVT 116 Query: 129 VESKHTIDLRPAIE 142 E IE Sbjct: 117 QEITGADGGAIQIE 130 >gi|254240164|ref|ZP_04933486.1| hypothetical protein PA2G_00805 [Pseudomonas aeruginosa 2192] gi|126193542|gb|EAZ57605.1| hypothetical protein PA2G_00805 [Pseudomonas aeruginosa 2192] Length = 188 Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats. Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 13/146 (8%) Query: 7 KAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIK--KDLKL 64 K + Y+P + + +A G +L V + GMP STF RW+ + L Sbjct: 29 KTAAKRPPGRPS-RYTPTVGLAVCTALAEGMSLRKVCQLPGMPAMSTFLRWLADEQHADL 87 Query: 65 QEAYTEALQCRLDLLAEELLEEPAP------TAEELANPVFYSKMRDRKQRMG--TFLLE 116 +E Y A + + D LAEE+L+ T +E V Y + K R+ +L Sbjct: 88 REQYARAREAQADTLAEEILQIADDGSNDTYTDDEGRTHVDYDHISRSKLRVDARKWLAS 147 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIE 142 K++ +KYG R++ E HT AIE Sbjct: 148 KMAPKKYGDRITNE--HTGANGGAIE 171 >gi|145631713|ref|ZP_01787475.1| hypothetical protein CGSHi22421_00952 [Haemophilus influenzae R3021] gi|144982656|gb|EDJ90198.1| hypothetical protein CGSHi22421_00952 [Haemophilus influenzae R3021] Length = 173 Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 74/166 (44%), Gaps = 9/166 (5%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 + + Y E+ I +A G++L + + GMP T +RW+ ++ + + Y Sbjct: 15 KKKTGRPS-SYVKEVADDICQLIAQGESLRKICERPGMPNQRTIFRWLNENDEFCQQYAR 73 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 A + + D L EE+LE E V +K+R ++ + + K++ +KYG +V+ E Sbjct: 74 ARENQADFLLEEMLEISDLATPE---NVSVAKLRVDARK---WYITKVAPKKYGDKVTQE 127 Query: 131 SKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIA 176 ++ P +++ L +D ++ + K +SS Sbjct: 128 ITG-VNGAPIEQKISMDLSRL-SVDELKLLRELKSKATGCDDSSKE 171 >gi|288562970|pdb|3HEF|A Chain A, Crystal Structure Of The Bacteriophage Sf6 Terminase Small Subunit gi|288562971|pdb|3HEF|B Chain B, Crystal Structure Of The Bacteriophage Sf6 Terminase Small Subunit Length = 143 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 67/138 (48%), Gaps = 8/138 (5%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + KA R Y PE+ I +++G++L V ++ GMP ST +RW+ K Sbjct: 4 MATEPKAGRPS----DYMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDF 59 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ E A A A V +++R ++ + L +++ +KYG Sbjct: 60 RDKYAKATEARADSIFEEIFE-IADNAIPDAAEVAKARLRVDTRK---WALARMNPRKYG 115 Query: 125 PRVSVESKHTIDLRPAIE 142 +V+ E IE Sbjct: 116 DKVTNELVGKDGGAIQIE 133 >gi|205360376|ref|ZP_02682926.2| terminase small subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205349763|gb|EDZ36394.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 142 Score = 135 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 39/138 (28%), Positives = 68/138 (49%), Gaps = 8/138 (5%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + KA R Y PE+ I +++G++L V ++ GMP ST +RW+ K Sbjct: 3 MATEPKAGRPS----DYMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDF 58 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ E A TA A V +++R ++ + L +++ +KYG Sbjct: 59 RDKYAKATEARADSIFEEIFE-IADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYG 114 Query: 125 PRVSVESKHTIDLRPAIE 142 +V+ E IE Sbjct: 115 DKVTNELVGKDGGAIQIE 132 >gi|312831397|emb|CBY17578.1| unnamed protein product [Escherichia coli LF82] Length = 140 Score = 134 bits (338), Expect = 4e-30, Method: Composition-based stats. Identities = 39/138 (28%), Positives = 68/138 (49%), Gaps = 8/138 (5%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + KA R Y PE+ I +++G++L V ++ GMP ST +RW+ K Sbjct: 1 MATEPKAGRPS----DYMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDF 56 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ E A TA A V +++R ++ + L +++ +KYG Sbjct: 57 RDKYAKATEARADSIFEEIFE-IADTAIPDAAEVAKARLRVDTRK---WALARMNPRKYG 112 Query: 125 PRVSVESKHTIDLRPAIE 142 +V+ E IE Sbjct: 113 DKVTNELVGKDGGAIQIE 130 >gi|300935793|ref|ZP_07150756.1| conserved hypothetical protein [Escherichia coli MS 21-1] gi|300459026|gb|EFK22519.1| conserved hypothetical protein [Escherichia coli MS 21-1] gi|323177129|gb|EFZ62719.1| hypothetical protein ECOK1180_4497 [Escherichia coli 1180] Length = 142 Score = 134 bits (337), Expect = 6e-30, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 67/138 (48%), Gaps = 8/138 (5%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + KA R Y PE+ I +++G++L V ++ GMP ST +RW+ K Sbjct: 3 MATEPKAGRPS----DYMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDF 58 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ E A A A V +++R ++ + L +++ +KYG Sbjct: 59 RDKYAKATEARADSIFEEIFE-IADNAIPDAAEVAKARLRVDTRK---WALARMNPRKYG 114 Query: 125 PRVSVESKHTIDLRPAIE 142 +V+ E IE Sbjct: 115 DKVTNELVGKDGGAIQIE 132 >gi|13559865|ref|NP_112075.1| terminase small subunit [Enterobacteria phage HK620] gi|41057279|ref|NP_958177.1| gene 1 protein [Enterobacteria phage Sf6] gi|168239625|ref|ZP_02664683.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|260868684|ref|YP_003235086.1| putative terminase small subunit [Escherichia coli O111:H- str. 11128] gi|13517601|gb|AAK28890.1|AF335538_42 terminase small subunit [Salmonella phage HK620] gi|33334158|gb|AAQ12191.1| gene 1 protein [Shigella phage Sf6] gi|197287688|gb|EDY27079.1| terminase small subunit [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|257765040|dbj|BAI36535.1| putative terminase small subunit [Escherichia coli O111:H- str. 11128] Length = 140 Score = 134 bits (336), Expect = 8e-30, Method: Composition-based stats. Identities = 38/138 (27%), Positives = 67/138 (48%), Gaps = 8/138 (5%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + KA R Y PE+ I +++G++L V ++ GMP ST +RW+ K Sbjct: 1 MATEPKAGRPS----DYMPEVADDICSLLSSGESLLKVCKRPGMPDKSTVFRWLAKHEDF 56 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ Y +A + R D + EE+ E A A A V +++R ++ + L +++ +KYG Sbjct: 57 RDKYAKATEARADSIFEEIFE-IADNAIPDAAEVAKARLRVDTRK---WALARMNPRKYG 112 Query: 125 PRVSVESKHTIDLRPAIE 142 +V+ E IE Sbjct: 113 DKVTNELVGKDGGAIQIE 130 >gi|53793590|ref|YP_112490.1| terminase small subunit [Flavobacterium phage 11b] gi|53748180|emb|CAH56641.1| terminase small subunit [Flavobacterium phage 11b] Length = 160 Score = 131 bits (328), Expect = 6e-29, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 4/117 (3%) Query: 23 PELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEE 82 + F+ IL + +G++L VL++ GMP +TFY+WI D YT A + R D+ E Sbjct: 30 EKTFSSILSFIEDGQSLRSVLKREGMPDSTTFYKWIDADENKSIQYTRACETRADVKFES 89 Query: 83 L----LEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTI 135 + LEEP + +++ K + L KL +KYG ++ E I Sbjct: 90 IEKDYLEEPQRDPVSGKIDAGWVQLQRLKIDSKKWELGKLQPKKYGDKIQNEISGAI 146 >gi|126443704|ref|YP_001063349.1| hypothetical protein BURPS668_A2355 [Burkholderia pseudomallei 668] gi|167907352|ref|ZP_02494557.1| hypothetical protein BpseN_34300 [Burkholderia pseudomallei NCTC 13177] gi|126223195|gb|ABN86700.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 147 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 33/139 (23%), Positives = 66/139 (47%), Gaps = 9/139 (6%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 + +S LF I + +GK++ V + GMP TF +W K+ +LQ+ Y +A R D Sbjct: 16 VEFSQALFDRICALIGDGKSVREVCKGPGMPDRMTFLKWAKRTPELQKQYDDACIDRQDA 75 Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLR 138 + +++L A+ + +P M D ++ + L +++ +++G VS E D Sbjct: 76 IFDDVLY----IADTVRDPKRAKVMVDARE----WTLARMNRKRFGNHVSNEHSGP-DGG 126 Query: 139 PAIERLREHYKHLKPIDSE 157 P + + + P++ Sbjct: 127 PIQTKTQVVRLRMSPVEEL 145 >gi|240016263|ref|ZP_04722803.1| putative phage associated protein [Neisseria gonorrhoeae FA6140] gi|240113248|ref|ZP_04727738.1| putative phage associated protein [Neisseria gonorrhoeae MS11] gi|240116003|ref|ZP_04730065.1| putative phage associated protein [Neisseria gonorrhoeae PID18] gi|240118307|ref|ZP_04732369.1| putative phage associated protein [Neisseria gonorrhoeae PID1] gi|240126039|ref|ZP_04738925.1| putative phage associated protein [Neisseria gonorrhoeae SK-92-679] gi|268599328|ref|ZP_06133495.1| terminase small subunit [Neisseria gonorrhoeae MS11] gi|268601670|ref|ZP_06135837.1| gene 1 protein [Neisseria gonorrhoeae PID18] gi|268604019|ref|ZP_06138186.1| terminase small subunit [Neisseria gonorrhoeae PID1] gi|268684632|ref|ZP_06151494.1| terminase small subunit [Neisseria gonorrhoeae SK-92-679] gi|268583459|gb|EEZ48135.1| terminase small subunit [Neisseria gonorrhoeae MS11] gi|268585801|gb|EEZ50477.1| gene 1 protein [Neisseria gonorrhoeae PID18] gi|268588150|gb|EEZ52826.1| terminase small subunit [Neisseria gonorrhoeae PID1] gi|268624916|gb|EEZ57316.1| terminase small subunit [Neisseria gonorrhoeae SK-92-679] Length = 149 Score = 127 bits (318), Expect = 9e-28, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 75/150 (50%), Gaps = 6/150 (4%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 + + Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y Sbjct: 5 KRKLGRPT-DYTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYAR 63 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 A + + D AEE++E A +A+ + V +K++ ++ + K++ +KYG + ++ Sbjct: 64 AREKQADYFAEEIIE-IADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKTELD 119 Query: 131 SKHTI-DLRPAIERLREHYKHLKPIDSERI 159 K +R A+ E Y+ + R+ Sbjct: 120 VKSGDGSMRAAVRLDAEEYRKIAEDVLRRV 149 >gi|59800932|ref|YP_207644.1| putative phage associated protein [Neisseria gonorrhoeae FA 1090] gi|239999272|ref|ZP_04719196.1| putative phage associated protein [Neisseria gonorrhoeae 35/02] gi|240013814|ref|ZP_04720727.1| putative phage associated protein [Neisseria gonorrhoeae DGI18] gi|240121387|ref|ZP_04734349.1| putative phage associated protein [Neisseria gonorrhoeae PID24-1] gi|240123851|ref|ZP_04736807.1| putative phage associated protein [Neisseria gonorrhoeae PID332] gi|254494034|ref|ZP_05107205.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|260440181|ref|ZP_05793997.1| putative phage associated protein [Neisseria gonorrhoeae DGI2] gi|268595095|ref|ZP_06129262.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268682475|ref|ZP_06149337.1| terminase small subunit [Neisseria gonorrhoeae PID332] gi|291043472|ref|ZP_06569188.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|293398784|ref|ZP_06642952.1| phage associated protein [Neisseria gonorrhoeae F62] gi|59717827|gb|AAW89232.1| hypothetical protein, putative phage associated protein [Neisseria gonorrhoeae FA 1090] gi|226513074|gb|EEH62419.1| conserved hypothetical protein [Neisseria gonorrhoeae 1291] gi|268548484|gb|EEZ43902.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|268622759|gb|EEZ55159.1| terminase small subunit [Neisseria gonorrhoeae PID332] gi|291011935|gb|EFE03924.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|291610786|gb|EFF39893.1| phage associated protein [Neisseria gonorrhoeae F62] Length = 149 Score = 126 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 75/150 (50%), Gaps = 6/150 (4%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 + + Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y Sbjct: 5 KRKLGRPT-DYTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYAR 63 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 A + + D AEE++E A +A+ + V +K++ ++ + K++ +KYG + ++ Sbjct: 64 AREKQADYFAEEIIE-IADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKSELD 119 Query: 131 SKHTI-DLRPAIERLREHYKHLKPIDSERI 159 K +R A+ E Y+ + R+ Sbjct: 120 VKSGDGSMRAAVRLDAEEYRKIAEDVLRRV 149 >gi|240080383|ref|ZP_04724926.1| putative phage associated protein [Neisseria gonorrhoeae FA19] gi|268596526|ref|ZP_06130693.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] gi|268550314|gb|EEZ45333.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] Length = 149 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 33/150 (22%), Positives = 75/150 (50%), Gaps = 6/150 (4%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 + + Y+ ++ I +++ANG++L + + G+P T YRW++ + + + Y Sbjct: 5 KRKLGRPT-DYTKDMADKICEKIANGRSLRSICAEDGVPPMKTIYRWLEANEEFRHQYAR 63 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 A + + D AEE++E A +A+ + V +K++ ++ + K++ +KYG + ++ Sbjct: 64 AREKQADYFAEEIIE-IADSAQAESAAVSKAKLQIDARK---WAASKIAPKKYGDKTELD 119 Query: 131 SKHTI-DLRPAIERLREHYKHLKPIDSERI 159 K +R A+ E Y+ + R+ Sbjct: 120 VKSGDGSMRAAVRLDAEEYRKIAEDVLRRV 149 >gi|300935518|ref|ZP_07150512.1| helix-turn-helix protein [Escherichia coli MS 21-1] gi|300459317|gb|EFK22810.1| helix-turn-helix protein [Escherichia coli MS 21-1] Length = 171 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 32/128 (25%), Positives = 61/128 (47%), Gaps = 7/128 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ E I + VA+G+++ + + GMP ST +W + + + Y A + ++LA Sbjct: 25 YTEEKALEICELVADGQSVNKISKMPGMPTRSTILKWFRDVPEFSDMYMRAKEIGFEVLA 84 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMG--TFLLEKLSNQKYGPRVSVESKHTIDLR 138 +E+L+ + A V ++R + + +LL KL +KYG RV+ E + Sbjct: 85 DEILDIA-----DDAKNVDKDQLRRHQLMIETRKWLLAKLQPRKYGERVTQEIVGNKEEA 139 Query: 139 PAIERLRE 146 P + + Sbjct: 140 PVQVEVNQ 147 >gi|332343006|gb|AEE56340.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 161 Score = 117 bits (293), Expect = 6e-25, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 61/129 (47%), Gaps = 7/129 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ E I + VA+G+++ + + GMP ST +W + + + Y A + ++LA Sbjct: 25 YTEEKALEICELVADGQSVNKISKMPGMPTRSTILKWFRDVPEFSDMYMRAKEIGFEVLA 84 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMG--TFLLEKLSNQKYGPRVSVESKHTIDLR 138 +E+L+ + A V ++R + + +LL KL +KYG RV+ E + Sbjct: 85 DEILDIA-----DDAKNVDKDQLRRHQLMIETRKWLLAKLQPRKYGERVTQEIVGNKEEA 139 Query: 139 PAIERLREH 147 P + + Sbjct: 140 PVQVEVTKE 148 >gi|261381053|ref|ZP_05985626.1| conserved hypothetical protein [Neisseria subflava NJ9703] gi|284796086|gb|EFC51433.1| conserved hypothetical protein [Neisseria subflava NJ9703] Length = 141 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 63/131 (48%), Gaps = 5/131 (3%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 + + YS E+ I +++ANG++L + + GMP ST +W+ ++ + E Y Sbjct: 5 KRKTGRPS-KYSDEMAENICEKIANGRSLRSICAEDGMPTTSTVCKWLIENKEFSEQYAR 63 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVE 130 A + D AEE++E E A +K++ ++ + K++ +KYG +V + Sbjct: 64 ARDKQADYFAEEIIEIADSAEAESAAV-AKAKLQIDARK---WAASKIAPKKYGDKVEQQ 119 Query: 131 SKHTIDLRPAI 141 + ++ + Sbjct: 120 ITGNLAIQADV 130 >gi|329119004|ref|ZP_08247698.1| terminase small subunit [Neisseria bacilliformis ATCC BAA-1200] gi|327464877|gb|EGF11168.1| terminase small subunit [Neisseria bacilliformis ATCC BAA-1200] Length = 153 Score = 116 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 33/133 (24%), Positives = 64/133 (48%), Gaps = 8/133 (6%) Query: 10 KAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYT 69 KA R YS L I +++A G++L V MP ST ++W+ ++ + E Y Sbjct: 20 KAKRPIGRPNTYSKGLADRICERIARGESLRAVCSDGDMPSASTVHKWLAENQEFSEQYA 79 Query: 70 EALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSV 129 A + + D A++++ E A V +K++ ++ + KL+ +KYG ++ + Sbjct: 80 HAREEQADHFAQDIIRIADEVEPESAA-VAKAKLQIDARK---WTAAKLAPKKYGDKIDL 135 Query: 130 ESKHTIDLRPAIE 142 + D+R +E Sbjct: 136 ----SGDMRVQVE 144 >gi|239835387|ref|YP_002956059.1| hypothetical protein DMR_p1_00530 [Desulfovibrio magneticus RS-1] gi|239794478|dbj|BAH73469.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 162 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 66/173 (38%), Gaps = 13/173 (7%) Query: 1 MSNLVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 MS A + C +S E+ + + G+ L + GMP +T Y W+ K Sbjct: 1 MSGKKSGDVAKKVAGRPC-SFSQEVADAVCQGIMEGRGLRSICADPGMPGPATVYSWLTK 59 Query: 61 DLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 QE Y A + + D+ +E++ + ++ R ++ + KL Sbjct: 60 YASFQEQYALAREVQADVYFDEVVSIADNEPD-----PQIARNRIDARK---WAASKLRP 111 Query: 121 QKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKP-LEIVE 172 ++YG R+ VE + + + + L P + T KP L VE Sbjct: 112 KRYGDRLEVEHQGGVGMNAVQITISPATP-LAP--AMPAVLDVTPKPALPCVE 161 >gi|309379924|emb|CBX21335.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 136 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 33/141 (23%), Positives = 64/141 (45%), Gaps = 6/141 (4%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAY 68 KA R YS E I + +A G +L + MP T +RW+ + QE Y Sbjct: 2 AKAKRPIGRPTTYSQETADKICELIARGMSLRAICTSADMPAGGTVHRWLAEHQDFQEQY 61 Query: 69 TEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVS 128 A + + D A+E+++ A E V +K++ ++ + KL+ +KYG ++ Sbjct: 62 ARAREEQADGFADEIIDIADSVAPE-TGEVAKAKLQIDARK---WKAAKLAPKKYGEKLE 117 Query: 129 VESKHTIDLRPAIERLREHYK 149 +++ + ++ L + +K Sbjct: 118 LDAD--MRVKVETRSLEDIFK 136 >gi|119386462|ref|YP_917517.1| hypothetical protein Pden_3755 [Paracoccus denitrificans PD1222] gi|119377057|gb|ABL71821.1| hypothetical protein Pden_3755 [Paracoccus denitrificans PD1222] Length = 133 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 52/119 (43%), Gaps = 8/119 (6%) Query: 27 AGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEE 86 I + +A G +L VLRK GMP YST RW++ D + Y A + + D A+++ + Sbjct: 15 DAICEAIACGDSLAKVLRKKGMPGYSTVIRWLRDDEDFRVDYARAREAQADADADKIGDI 74 Query: 87 PAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLR 145 +P + K +KYG ++ ++ I+ R IER+ Sbjct: 75 VDRVLAGKLDPQAA----RAAIDALKWTAGKRQPKKYGDKLDLD----INGRVQIERVE 125 >gi|329119588|ref|ZP_08248269.1| putative phage associated protein [Neisseria bacilliformis ATCC BAA-1200] gi|327464185|gb|EGF10489.1| putative phage associated protein [Neisseria bacilliformis ATCC BAA-1200] Length = 135 Score = 108 bits (268), Expect = 6e-22, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 4/108 (3%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEEL 83 ++ I + +A G+++ + R MP ST YRWI + + E Y A + + D+LA+E+ Sbjct: 7 DIMERICELLAAGQSMAQICRSKAMPAQSTVYRWIAESAEFSERYAHAREMQADMLADEI 66 Query: 84 LEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVES 131 +E E A D + +L +L+ +KYG RV V + Sbjct: 67 IEIADSCEPEAAAVAKAKARIDAR----KWLAARLAPKKYGDRVDVSA 110 >gi|116253679|ref|YP_769517.1| hypothetical protein RL3939 [Rhizobium leguminosarum bv. viciae 3841] gi|115258327|emb|CAK09429.1| putative phage-related protein [Rhizobium leguminosarum bv. viciae 3841] Length = 157 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 32/150 (21%), Positives = 64/150 (42%), Gaps = 21/150 (14%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 ++ L I +++A+G++L + R MP ST W+ D K + Y +A + + D Sbjct: 7 KFTTALADIICERIADGESLRSICRDDAMPAKSTVLAWLADDDKTAFRTKYAQAREIQAD 66 Query: 78 LLAEELLEEPAPTA----------------EELANPVFYSKMRDRKQRMGTFLLEKLSNQ 121 +E++E +E + S++R + ++ EKL + Sbjct: 67 GFVDEMVEIADDGTNDWMEKKNADGETTGWQENGEAIRRSQLRIGTR---QWIAEKLKPK 123 Query: 122 KYGPRVSVESKHTIDLRPAIERLREHYKHL 151 KYG +V +E T ++ +E + + L Sbjct: 124 KYGAKVELEHGVTGEVSQLLEAINGKTRGL 153 >gi|218782398|ref|YP_002433716.1| hypothetical protein Dalk_4570 [Desulfatibacillum alkenivorans AK-01] gi|218763782|gb|ACL06248.1| hypothetical protein Dalk_4570 [Desulfatibacillum alkenivorans AK-01] Length = 196 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 28/127 (22%), Positives = 56/127 (44%), Gaps = 10/127 (7%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWI--KKDLKLQEAYTEALQCRLDL 78 Y+ E+ I +++A+G+++ + MP + YRW+ + K +E YT A + + D Sbjct: 9 YTDEIAETICERIASGESVQSITSDEDMPSSAMVYRWLADARYSKFREMYTRARERQADF 68 Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLR 138 A+E++ V + + + KL +KYG R+ V + T+ Sbjct: 69 FADEIITIADTEDNPKKAAV--------RVQARQWAASKLKPRKYGDRIDVHQQGTVTHT 120 Query: 139 PAIERLR 145 A + + Sbjct: 121 IAPDMIE 127 >gi|160897384|ref|YP_001562966.1| putative terminase small subunit protein [Delftia acidovorans SPH-1] gi|160362968|gb|ABX34581.1| putative terminase small subunit protein [Delftia acidovorans SPH-1] Length = 157 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 57/134 (42%), Gaps = 26/134 (19%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIK--KDLKLQEAYTEALQCRLDL 78 ++ ++ + D++A G++L + MP +T RW+ ++ +E Y A + D Sbjct: 7 FNEDIATKLCDRIAAGESLRKICTDPKMPSMATVMRWLGKPENAAFREQYACARDAQADK 66 Query: 79 LAEELLEEPAPTAEEL---------------------ANPVFYSKMRDRKQRMGTFLLEK 117 L EE+LE + A V +K+R ++ ++ K Sbjct: 67 LVEEMLEIADEECTMVRSDKHGSKDDDGQGNTEVVFDATAVARNKLRVDARK---WVASK 123 Query: 118 LSNQKYGPRVSVES 131 ++ +KYG +V+ E Sbjct: 124 MAPKKYGDKVTQEV 137 >gi|241203595|ref|YP_002974691.1| terminase small subunit protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240857485|gb|ACS55152.1| putative terminase small subunit protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 157 Score = 98.0 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 21/128 (16%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRL 76 I ++ L I +++A+ ++L + R GMP S+ W+ D K + Y A + + Sbjct: 5 IKFTQALADRICERIADRESLRSICRDEGMPAMSSVLSWLADDDKAGFRAKYALAREIQA 64 Query: 77 DLLAEELLEEPAPTAE----------------ELANPVFYSKMRDRKQRMGTFLLEKLSN 120 D +E++E A+ E + S++R + ++ KL Sbjct: 65 DGFVDEMVEIADDRADDWIEKKNASGETTGWQENGEAIRRSQLRIATR---QWVAAKLKP 121 Query: 121 QKYGPRVS 128 +KYG +V Sbjct: 122 KKYGSKVE 129 >gi|319783527|ref|YP_004143003.1| hypothetical protein Mesci_3836 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169415|gb|ADV12953.1| putative phage-related protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 158 Score = 97.2 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 56/145 (38%), Gaps = 20/145 (13%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK--DLKLQEAYTEALQCRLD 77 ++ E+ I +A G +L + MP ST RW+ + Y A + + D Sbjct: 6 EFTQEIADTICQSLAEGNSLRSICLADEMPDKSTVLRWLLDPVRAAFCDQYARARELQAD 65 Query: 78 LLAEELLEEPAPTAEELANPV---------------FYSKMRDRKQRMGTFLLEKLSNQK 122 + +E+ + + + V S++R ++ ++ E++ +K Sbjct: 66 IHVDEMPDIADDSRNDYMTKVNGDGTTTEQLNSENIQRSRLRIDTRK---WIAERMRPKK 122 Query: 123 YGPRVSVESKHTIDLRPAIERLREH 147 YG +V++ L + +L + Sbjct: 123 YGAKVALTDGEGGPLVVQVLKLADQ 147 >gi|254497230|ref|ZP_05110039.1| putative phage associated protein [Legionella drancourtii LLAP12] gi|254353554|gb|EET12280.1| putative phage associated protein [Legionella drancourtii LLAP12] Length = 157 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 42/157 (26%), Positives = 66/157 (42%), Gaps = 16/157 (10%) Query: 4 LVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGM--PKYSTFYRWIKKD 61 + K + K R K Y+ EL I + +A+ VL K M P T + W+KK Sbjct: 1 MAKTSSKIGRPTK----YNSELAEEICNVIASTSKGTKVLCKENMHWPCQDTLFAWLKKY 56 Query: 62 LKLQEAYTEALQCRLDLLAEELLEEPAPTA-----EELANPV---FYSKMRDRKQRMGTF 113 + E Y +A C+++LL +E+LE + +E PV K + Sbjct: 57 PEFSEQYAQAKICQIELLIDEILEIADDESNDCFIDEDGKPVSNNSVVNRARLKIDTRKW 116 Query: 114 LLEKLSNQKYGPRVSVESKHTID--LRPAIERLREHY 148 L KL + YG ++ +ES ++ LR L Y Sbjct: 117 LASKLVPKVYGNKIGIESDSSMSEELRKLSADLEVKY 153 >gi|86356735|ref|YP_468627.1| putative terminase small subunit protein [Rhizobium etli CFN 42] gi|86280837|gb|ABC89900.1| putative terminase small subunit protein [Rhizobium etli CFN 42] Length = 156 Score = 96.1 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 58/147 (39%), Gaps = 15/147 (10%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 +S L I D++A+ ++L + R MP ST W+ + K + Y A + D Sbjct: 6 KFSQALAEKICDRIADRESLRSICRDEDMPSKSTVLSWLADEEKAAFRARYALAREIMAD 65 Query: 78 LLAEELLEEPAPTAEELANPVFYS-------------KMRDRKQRMGTFLLEKLSNQKYG 124 +EL+E +++ + + + ++ EKL +KYG Sbjct: 66 SFVDELVEIADDRSDDWVEKKNAAGETTGWQENGEAIRRSQLRIATRQWVAEKLRPKKYG 125 Query: 125 PRVSVESKHTIDLRPAIERLREHYKHL 151 +V E T ++ +E + + L Sbjct: 126 TKVEPEQSVTGEVSQLLEVVNGRTRGL 152 >gi|326564855|gb|EGE15061.1| putative phage associated protein [Moraxella catarrhalis 103P14B1] Length = 139 Score = 94.5 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 55/122 (45%), Gaps = 7/122 (5%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAY 68 K + + I Y+ +L A L +VA G+++ V MP +T Y W+ + + Y Sbjct: 3 AKKPKMGRPTI-YNEDLVAEFLYRVAKGRSVASVCSDDDMPHRATIYEWLAHNSDFSDRY 61 Query: 69 TEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVS 128 A + R D +E+L E V +K++ ++ ++L +++ +KY + Sbjct: 62 ARASEQRADQYFDEMLHIADNALPEE---VQKAKLQIDTRK---WVLARMNPKKYSGKSQ 115 Query: 129 VE 130 + Sbjct: 116 DD 117 >gi|209548330|ref|YP_002280247.1| terminase small subunit [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534086|gb|ACI54021.1| putative terminase small subunit protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 156 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 60/150 (40%), Gaps = 21/150 (14%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 ++ L I +++A+ ++L + R MP ST W+ D K + Y A + D Sbjct: 6 KFTQALAEKICERIADRESLRAICRDEDMPAKSTVLSWLADDGKAAFRARYALAREILAD 65 Query: 78 LLAEELLEEPAPTA----------------EELANPVFYSKMRDRKQRMGTFLLEKLSNQ 121 +EL+E + +E + S++R + ++ EKL + Sbjct: 66 GFVDELVEIADDRSNDWIEKKNAAGETTGWQENGEAIRRSQLRIATR---QWVAEKLRPK 122 Query: 122 KYGPRVSVESKHTIDLRPAIERLREHYKHL 151 KYG + E T ++ +E + + L Sbjct: 123 KYGSKAEPEQGVTGEVSQLLEDINGKTRGL 152 >gi|116251009|ref|YP_766847.1| hypothetical protein RL1242 [Rhizobium leguminosarum bv. viciae 3841] gi|115255657|emb|CAK06738.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 157 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 57/135 (42%), Gaps = 23/135 (17%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRL 76 I ++ + I +++AN ++L + R +P S+ W+ D K + Y A + + Sbjct: 5 IKFTQAIADRICERIANRESLRSICRDEEVPAMSSVLSWLADDDKAGFRAKYALAREIQA 64 Query: 77 DLLAEELLEEPAPTAE----------------ELANPVFYSKMRDRKQRMGTFLLEKLSN 120 D +E++E A+ E + S++R + ++ EKL Sbjct: 65 DGFVDEMVEIADDRADDWIEKKNASGETTGWQENGEAIRRSQLRIATR---QWVAEKLKP 121 Query: 121 QKYGPRVSVESKHTI 135 +KYG +V E H + Sbjct: 122 KKYGSKV--EPDHGV 134 >gi|327189475|gb|EGE56635.1| putative terminase small subunit protein [Rhizobium etli CNPAF512] Length = 156 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 61/150 (40%), Gaps = 21/150 (14%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 +S L I D++A+ ++L + R MP ST W+ + K + Y A + D Sbjct: 6 KFSQALAEKICDRIADRESLRSICRDETMPAKSTVLSWLADEDKAAFRARYALAREILAD 65 Query: 78 LLAEELLEEPAPTAE----------------ELANPVFYSKMRDRKQRMGTFLLEKLSNQ 121 +EL+E +++ E + S++R + ++ EKL + Sbjct: 66 GFVDELVEIADNSSDDWIEKKNASGETTGWQENGEAIRRSQLRIATR---QWVAEKLRPK 122 Query: 122 KYGPRVSVESKHTIDLRPAIERLREHYKHL 151 KYG ++ E ++ +E + + L Sbjct: 123 KYGAKIEPEQGVAGEVSQLLEDINGKTRGL 152 >gi|218461767|ref|ZP_03501858.1| putative terminase small subunit protein [Rhizobium etli Kim 5] Length = 146 Score = 89.9 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 60/143 (41%), Gaps = 21/143 (14%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 +S L I D++A+ ++L + R MP ST W+ + K + Y A + D Sbjct: 6 KFSQALAEKICDRIADRESLRSICRDDAMPAKSTVLSWLADEDKAAFRARYALAREILAD 65 Query: 78 LLAEELLEEPAPTA----------------EELANPVFYSKMRDRKQRMGTFLLEKLSNQ 121 +EL+E ++ +E + S++R + ++ EKL + Sbjct: 66 GFVDELVEIADDSSNDWIEKKNAAGETTGWQENGEAIRRSQLRIATR---QWVAEKLRPK 122 Query: 122 KYGPRVSVESKHTIDLRPAIERL 144 KYG +V E + ++ +E + Sbjct: 123 KYGAKVEPEQGVSGEVSQLLEDI 145 >gi|41179387|ref|NP_958695.1| Bbp26 [Bordetella phage BPP-1] gi|45569519|ref|NP_996588.1| hypothetical protein BMP-1p25 [Bordetella phage BMP-1] gi|45580770|ref|NP_996636.1| hypothetical protein BIP-1p25 [Bordetella phage BIP-1] gi|40950126|gb|AAR97692.1| Bbp26 [Bordetella phage BPP-1] Length = 194 Score = 86.8 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 35/149 (23%), Positives = 55/149 (36%), Gaps = 11/149 (7%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEA 67 + K Y+ + I D++A+G+ L + R MP + T Y WI+ + E Sbjct: 15 DARKTARMKNASTYTEAIATEICDRIADGEPLRQICRDEHMPAWRTVYHWIEARPEFAER 74 Query: 68 YTEALQCRLDLLAEELLEEPAP----TAEELANPVFYSKM------RDRKQRMGTFLLEK 117 A + D +AEE LE E ++ + K R + LL K Sbjct: 75 MERARRAGFDAIAEEALEIANTPQTGEETEESDDGYKVKRGDMLGHRKLQVETRLKLLAK 134 Query: 118 LSNQKYGPRVSVESKHTIDLRPAIERLRE 146 KYG + S+E + P E Sbjct: 135 WHPTKYGDKTSMELTGA-NGGPVQINDTE 162 >gi|254497365|ref|ZP_05110168.1| hypothetical protein LDG_1611 [Legionella drancourtii LLAP12] gi|254353418|gb|EET12150.1| hypothetical protein LDG_1611 [Legionella drancourtii LLAP12] Length = 171 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 58/129 (44%), Gaps = 17/129 (13%) Query: 10 KAVRAKKG-CIYYSPELFAGILDQVANG-KALGHVLR-KVGMPKYSTFYRWIKKDLKLQE 66 KA+++K G Y+ L I D +A+ K + + + P T + W+K++ + Sbjct: 24 KALKSKNGRPTTYTLALAKEICDAIASSSKGIKALCQGHPHWPNKDTIFTWLKENQDFSD 83 Query: 67 AYTEALQCRLDLLAEELLEEPAPTAEELA-----------NPVFYSKMRDRKQRMGTFLL 115 Y A Q +++ + +ELLE ++++ + +K+R ++ +L Sbjct: 84 QYARAKQAQIECVVDELLEIADDSSQDQVFDSEGQIHFNGQAIQRAKLRIDTRK---WLA 140 Query: 116 EKLSNQKYG 124 KL + YG Sbjct: 141 CKLVPKVYG 149 >gi|187476924|ref|YP_784948.1| phage terminase small subunit [Bordetella avium 197N] gi|115421510|emb|CAJ48019.1| Putative phage terminase small subunit [Bordetella avium 197N] Length = 170 Score = 84.9 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 10/135 (7%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ + I D++A+G+ L + R MP + T Y WI+ K E A + D +A Sbjct: 7 YTEAIATEICDRIADGEPLRQICRDQHMPAWRTVYHWIEARSKFAERMERARRVGFDAIA 66 Query: 81 EELLEEP------APTAEELAN-PVFYSKM---RDRKQRMGTFLLEKLSNQKYGPRVSVE 130 EE LE T +E + V M R + LL K KYG + S+E Sbjct: 67 EEALEIANTPQIGEETEQEGSKFRVKRGDMLGHRKLQVETRLKLLAKWHPTKYGDKTSME 126 Query: 131 SKHTIDLRPAIERLR 145 I+ + Sbjct: 127 LTGAGGGPVRIDETQ 141 >gi|194098999|ref|YP_002002066.1| putative phage associated protein [Neisseria gonorrhoeae NCCP11945] gi|193934289|gb|ACF30113.1| putative phage associated protein [Neisseria gonorrhoeae NCCP11945] gi|317164558|gb|ADV08099.1| putative phage associated protein [Neisseria gonorrhoeae TCDC-NG08107] Length = 106 Score = 83.7 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 25/110 (22%), Positives = 54/110 (49%), Gaps = 5/110 (4%) Query: 51 YSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRM 110 T YRW++ + + + Y A + + D AEE++E A +A+ + V +K++ ++ Sbjct: 1 MKTIYRWLEANEEFRHQYARAREKQADYFAEEIIE-IADSAQAESAAVSKAKLQIDARK- 58 Query: 111 GTFLLEKLSNQKYGPRVSVESKHTI-DLRPAIERLREHYKHLKPIDSERI 159 + K++ +KYG + ++ K +R A+ E Y+ + R+ Sbjct: 59 --WAASKIAPKKYGDKTELDVKSGDGSMRAAVRLDAEEYRKIAEDVLRRV 106 >gi|197105030|ref|YP_002130407.1| hypothetical protein PHZ_c1567 [Phenylobacterium zucineum HLK1] gi|196478450|gb|ACG77978.1| hypothetical protein PHZ_c1567 [Phenylobacterium zucineum HLK1] Length = 289 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 55/115 (47%), Gaps = 9/115 (7%) Query: 11 AVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 RA +S L IL ++A G++L V MP +++ RW+ + ++ Y + Sbjct: 81 RRRANHP---WSRVLEDEILRRIAAGRSLVEVCEAPEMPHHTSVTRWLDEKPGFRDRYLD 137 Query: 71 ALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGP 125 A + D L + L A A+E + V ++++ + + + +LS ++YGP Sbjct: 138 ARLAQADGLFD-LAWRIAREADE--DEVRTARLKVDVIK---WRIGRLSPKRYGP 186 Score = 39.1 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 11/51 (21%), Positives = 21/51 (41%) Query: 28 GILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 + +++ G++L V R +P+ RW + E EA + D Sbjct: 27 QVCERLEAGESLCGVCRDPLLPQRYQLDRWRAAEPLFDEMVREARRRAQDA 77 >gi|254497906|ref|ZP_05110671.1| putative phage associated protein [Legionella drancourtii LLAP12] gi|254352904|gb|EET11674.1| putative phage associated protein [Legionella drancourtii LLAP12] Length = 155 Score = 81.8 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 55/134 (41%), Gaps = 11/134 (8%) Query: 13 RAKKGCIYYSPELFAGILDQVAN-GKALGHVLRKVG-MPKYSTFYRWIKKDLKLQEAYTE 70 + + Y+ +L I D +A+ K + + P T + W++ + E Y + Sbjct: 5 KQGRPT-KYNLKLGQEICDAIASTSKGTKKLCAEHSHWPCQDTLFTWLRTYPEFSEQYVK 63 Query: 71 ALQCRLDLLAEELLEEPAPTAEE--------LANPVFYSKMRDRKQRMGTFLLEKLSNQK 122 A C+++LL +++LE +++ L + ++ KL + Sbjct: 64 AKICQVELLVDDILEISDDASQDQYVNERGVLVSNTQSINRARLMIDARKWVACKLVPKV 123 Query: 123 YGPRVSVESKHTID 136 YG ++ +ES ++ Sbjct: 124 YGNKIDIESNSSMS 137 >gi|197104489|ref|YP_002129866.1| hypothetical protein PHZ_c1023 [Phenylobacterium zucineum HLK1] gi|196477909|gb|ACG77437.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 221 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 31/109 (28%), Positives = 48/109 (44%), Gaps = 7/109 (6%) Query: 16 KGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR 75 + C Y E I ++ G+AL + R MP ST Y W+ ++ + EAY A + + Sbjct: 26 RPC-GYCRETAEAIFVRLCGGEALSAICRDPQMPAASTVYAWLAREPQFAEAYRLAREVQ 84 Query: 76 LDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 D LA E E + A F +K+R R + KL + +G Sbjct: 85 ADALAAEGWELAKAATPDTA---FLAKVRLEHLR---WYAAKLGPRTWG 127 >gi|253699635|ref|YP_003020824.1| hypothetical protein GM21_1002 [Geobacter sp. M21] gi|251774485|gb|ACT17066.1| hypothetical protein GM21_1002 [Geobacter sp. M21] Length = 191 Score = 78.7 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 66/155 (42%), Gaps = 17/155 (10%) Query: 6 KKAKKAVRAKKGCIYYSPELFAGILDQVAN-GKALGHVLRKVG-MPKYSTFYRWIKKDLK 63 ++ + + + + Y+ E+ I ++A K+L V + P + T RW++++ Sbjct: 12 EEETEGKKQGRPTL-YTAEIAEEICYRIATTAKSLKRVCDEDERFPSHRTVLRWVQENSD 70 Query: 64 LQEAYTEALQCRLDLLAEELLEEPAPTAEELA----------NPVFYSKMRDRKQRMG-- 111 Y A + + + LA E LE ++ ++A V + ++R Sbjct: 71 FCHQYRIAREEQGNFLAFEGLEIIDDSSLDIAFKDDGKGGATAFVDREVIARSRERAAYR 130 Query: 112 TFLLEKLSNQKYGPRVSVESKH--TIDLRPAIERL 144 +L KL+ ++YG ++ V + L I+++ Sbjct: 131 KWLASKLAPKRYGEKLDVNHDGPVSDALAALIQKV 165 >gi|330992640|ref|ZP_08316585.1| putative phage associated protein [Gluconacetobacter sp. SXCC-1] gi|329760294|gb|EGG76793.1| putative phage associated protein [Gluconacetobacter sp. SXCC-1] Length = 131 Score = 77.6 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 46/107 (42%), Gaps = 7/107 (6%) Query: 14 AKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQ 73 ++G YS L I +++NG++L + MP +T RW+ + + + Y A + Sbjct: 9 PRRGA--YSRRLADEICARLSNGQSLRAICTDPAMPHRATVLRWLHANPEFRSLYALARE 66 Query: 74 CRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 D LAEE++ + V K+R + ++ +L Sbjct: 67 AAADALAEEIVSIADR--ATGRDDVPAIKLRMEARM---WVATRLRP 108 >gi|319786768|ref|YP_004146243.1| hypothetical protein Psesu_1162 [Pseudoxanthomonas suwonensis 11-1] gi|317465280|gb|ADV27012.1| putative phage-related protein [Pseudoxanthomonas suwonensis 11-1] Length = 152 Score = 75.3 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 33/145 (22%), Positives = 55/145 (37%), Gaps = 21/145 (14%) Query: 22 SPELFAGILDQVANGKALGHVLRKVG--MPKYSTFYRWIKKDLKLQEAYTEALQCRLDLL 79 +PE+ +L +++ G+ L +LR P S +Y W+ +D L + +A + D + Sbjct: 6 TPEIADEVLARLSEGEPLRAILRSDPERFPGKSQWYAWMAEDEDLAARFRQAREEGADAI 65 Query: 80 AEELLEEPAPTAEE----------------LANPVFYSKMRDRKQRMGTFLLEKLSNQKY 123 AEE LE + V SK+R + LL K Q+Y Sbjct: 66 AEECLEIADDARNDWMERQGEDGQGEGFQLNGEHVQRSKLRIHTRMQ---LLAKWFPQRY 122 Query: 124 GPRVSVESKHTIDLRPAIERLREHY 148 G +V++E E Sbjct: 123 GDKVAMEHTGPGGGPVQTVTRIERR 147 >gi|82702593|ref|YP_412159.1| hypothetical protein Nmul_A1464 [Nitrosospira multiformis ATCC 25196] gi|82410658|gb|ABB74767.1| hypothetical protein Nmul_A1464 [Nitrosospira multiformis ATCC 25196] Length = 190 Score = 71.8 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 67/166 (40%), Gaps = 27/166 (16%) Query: 6 KKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQ 65 KK+ +A++ + E I ++ GK+ + +VG+ + W+ D +L Sbjct: 36 KKSSRAIKRR--------EALDAIFAGISLGKSARAMCVEVGI-SQRVLWNWLASDEELM 86 Query: 66 EAYTEALQCRLDLLAEELLEEPAPTAEEL-----------ANPVFYSKMRDRKQRMGTFL 114 Y A + +D AEE++E + + + +++R ++ + Sbjct: 87 RQYQRAKELCVDAYAEEIIEISDEGSRDTYVDEKGREVINREVIARAQLRIDARK---WY 143 Query: 115 LEKLSNQKYGPRVSVESKHTIDLRP----AIERLREHYKHLKPIDS 156 +L+ +KYG ++ V ++ + AI + + +P S Sbjct: 144 AARLAPRKYGDKLPVTNEGADARKAAHNIAIAFVAPEDRGNEPNVS 189 >gi|218671080|ref|ZP_03520751.1| putative terminase small subunit protein [Rhizobium etli GR56] Length = 106 Score = 71.0 bits (172), Expect = 8e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 35/76 (46%), Gaps = 2/76 (2%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK--LQEAYTEALQCRLD 77 +S L I D++A+ ++L + R MP ST W+ + K + Y A + D Sbjct: 6 KFSQALAEKICDRIADRESLRSICRDETMPAKSTVLSWLADEDKAAFRARYALAREILAD 65 Query: 78 LLAEELLEEPAPTAEE 93 +EL+E ++ + Sbjct: 66 GFVDELVEIADDSSND 81 >gi|296535572|ref|ZP_06897753.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296264088|gb|EFH10532.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 137 Score = 70.3 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 45/124 (36%), Gaps = 6/124 (4%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEEL 83 E A + D + G ++ + P + Y + D + A + + + +E Sbjct: 5 EAVARLCDLLICGMSMSEACARDDSPSRTAVYTKMAADAAFRATIARAREAQQHAIIDET 64 Query: 84 LEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIER 143 +E E V K+R + + KL+ + YG +V + + L I+ Sbjct: 65 IEMADAATPEDWQVV---KLRIWAR---QWRAAKLAPRVYGNKVEMAGSNENPLTLLIQE 118 Query: 144 LREH 147 ++ Sbjct: 119 VQGR 122 >gi|87201017|ref|YP_498274.1| putative phage associated protein [Novosphingobium aromaticivorans DSM 12444] gi|87136698|gb|ABD27440.1| conserved hypothetical protein, putative phage associated protein [Novosphingobium aromaticivorans DSM 12444] Length = 144 Score = 68.7 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 51/111 (45%), Gaps = 8/111 (7%) Query: 16 KGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR 75 + + + E+ I++++++G+ L + R +P + T RW ++D + A Sbjct: 3 RPSLK-TDEVVEEIIERLSHGEPLARICRDAHLPSFRTVLRWEEEDEAFRHLSARARSYG 61 Query: 76 LDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPR 126 D +A++ ++ E A+ ++R + L+ K + +KYG + Sbjct: 62 TDYIADDCMDIADNPEIEPAD----KRIRIDTRLR---LIGKWNARKYGEK 105 >gi|264678786|ref|YP_003278693.1| terminase small subunit protein [Comamonas testosteroni CNB-2] gi|262209299|gb|ACY33397.1| putative terminase small subunit protein [Comamonas testosteroni CNB-2] Length = 142 Score = 67.9 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 52/143 (36%), Gaps = 22/143 (15%) Query: 39 LGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEE----- 93 + R+ Y+T WI D E Y A + R D LA+E++ Sbjct: 1 MADFCRQHEF-AYTTMLDWINADSSRAEMYARAREDRSDKLADEIVAISDECEYAPITDM 59 Query: 94 ----------LANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIER 143 V +K+R ++ ++ KL + YG +V VE T+D + + Sbjct: 60 STGETIAVAFDKTAVARNKLRVDARK---WIAAKLKPRVYGDKVQVE--GTVDHKVMSDD 114 Query: 144 -LREHYKHLKPIDSERIPHKSTE 165 L + L + +P TE Sbjct: 115 ALLQRLAKLGVSVASIVPVGPTE 137 >gi|300928613|ref|ZP_07144136.1| conserved hypothetical protein [Escherichia coli MS 187-1] gi|300463413|gb|EFK26906.1| conserved hypothetical protein [Escherichia coli MS 187-1] Length = 113 Score = 67.2 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 39/103 (37%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLL 79 +P L I + G++L + + GMP +T RW+ +D ++A + L Sbjct: 7 KLTPRLAEDICLLILEGRSLLAISKMPGMPSIATLNRWMCEDEDFRQAVNDCRDVYAGDL 66 Query: 80 AEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQK 122 + E A + + F + ++ + S +K Sbjct: 67 FRQATELIASVPHDTEDRAFLLEQAKQQAASMRAEATRKSPRK 109 >gi|197106240|ref|YP_002131617.1| hypothetical protein PHZ_c2779 [Phenylobacterium zucineum HLK1] gi|196479660|gb|ACG79188.1| hypothetical protein PHZ_c2779 [Phenylobacterium zucineum HLK1] Length = 265 Score = 66.4 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 4/105 (3%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 Y+ I ++A G+A+ + MP YRW++ + + EA + L Sbjct: 156 YTRAAGEAICARLARGEAMSAINADPAMPSSKVVYRWLRTEPEFARMVAEARAQQRRALK 215 Query: 81 EEL-LEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG 124 ++ L P + +K R + L +L+ + +G Sbjct: 216 LKIHLASDLPIQGTTREMLARAKARVARLEGR---LGRLTPKVWG 257 Score = 42.5 bits (98), Expect = 0.028, Method: Composition-based stats. Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 3/53 (5%) Query: 28 GILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 IL ++A G + + MP Y+T + W++ + EAY R +L A Sbjct: 70 EILRRMARGMTIKEITADPDMPSYATLFHWVRTFAEFGEAY---RALRAELAA 119 >gi|323941301|gb|EGB37486.1| hypothetical protein ERDG_02215 [Escherichia coli E482] Length = 112 Score = 65.6 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 39/103 (37%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLL 79 +P L I + GK+L V + GMP +T RW+ +D ++A + L Sbjct: 7 KLTPRLAEEICLLILEGKSLLAVSKMPGMPSIATLNRWMCEDEDFRQAVNDCRDVYAGDL 66 Query: 80 AEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQK 122 + E A + + F + ++ + + +K Sbjct: 67 FRQATELIAAVPSDTEDRAFLLEQAKQQAASMRAEANRKAPRK 109 >gi|197103879|ref|YP_002129256.1| hypothetical protein PHZ_c0413 [Phenylobacterium zucineum HLK1] gi|196477299|gb|ACG76827.1| hypothetical protein PHZ_c0413 [Phenylobacterium zucineum HLK1] Length = 266 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 35/84 (41%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLA 80 +S +L +L ++ G + + R GMP T +W++ +A + R D+ A Sbjct: 169 FSEDLQEAVLARIVEGHSFNSLSRLPGMPCRKTLGKWVRTRPDFAREVAQACEDREDIFA 228 Query: 81 EELLEEPAPTAEELANPVFYSKMR 104 ++ LE + V + + Sbjct: 229 DQALEIALAGGPDAGRRVGRLRRQ 252 Score = 37.1 bits (84), Expect = 1.2, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 19/45 (42%), Gaps = 1/45 (2%) Query: 21 YSPELFAGILDQVANGKALGHVLRK-VGMPKYSTFYRWIKKDLKL 64 + I+ + G+ L +L +P +T RW +++ + Sbjct: 104 FDQAAADRIIVGLNRGEGLRALLNGDPSLPSAATVARWRRENREF 148 >gi|197104059|ref|YP_002129436.1| hypothetical protein PHZ_c0593 [Phenylobacterium zucineum HLK1] gi|196477479|gb|ACG77007.1| hypothetical protein PHZ_c0593 [Phenylobacterium zucineum HLK1] Length = 247 Score = 61.8 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 50/146 (34%), Gaps = 12/146 (8%) Query: 17 GCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRL 76 G Y P + I ++ G++L + R MP ST W + + EA A Q + Sbjct: 35 GMTTYCPVVANEIYTRLCAGESLSAICRDRDMPANSTVNLWRRDVPEFAEAMRMARQAQA 94 Query: 77 DLLAEELLEEPAPTAEELA--NPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHT 134 + + E E A V ++R ++ S + + E + Sbjct: 95 EAFCDTGWELAMGATPETAYLTHVRLGQLR--------WMAAVWSPRTHARLKPAEPEEA 146 Query: 135 IDLRPAIERLREHYKHLKPIDSERIP 160 + + I L+ + + R+ Sbjct: 147 PETQRLI--LKHFRLEEREDGAMRVV 170 >gi|323967888|gb|EGB63300.1| hypothetical protein ERJG_00918 [Escherichia coli M863] Length = 133 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 29/93 (31%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLL 79 Y+ L I VA G +L + + MP T RW + +E Sbjct: 24 KYTIALAEEICLLVAEGHSLREIAKMPDMPSLRTLMRWQYEHPDFREHIGIFKWIHAQDA 83 Query: 80 AEELLEEPAPTAEELANPVFYSKMRDRKQRMGT 112 AE+ +E E + K + R Sbjct: 84 AEQAVEAIRDVDLEAEDAPLKLKQAEALARTLL 116 >gi|294010852|ref|YP_003544312.1| hypothetical protein SJA_C1-08660 [Sphingobium japonicum UT26S] gi|292674182|dbj|BAI95700.1| hypothetical protein SJA_C1-08660 [Sphingobium japonicum UT26S] Length = 168 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 48/126 (38%), Gaps = 7/126 (5%) Query: 16 KGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR 75 + + SPE+ + I ++ +G+ + + + MP ST W++ D + +A + Sbjct: 32 RPSVR-SPEVESEIFRRLTDGEDMVTICQDAHMPGRSTVCAWMENDPAFRAKIKDA--FK 88 Query: 76 LDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGP--RVSVESKH 133 L ++ ++ + + FLL K +G +V E K Sbjct: 89 LQVIF--MIGMGWDILRGGEYSTGNVERDKAQVGYLRFLLGKFDRDWFGEQIKVEQEIKS 146 Query: 134 TIDLRP 139 I +P Sbjct: 147 VIINQP 152 >gi|301644429|ref|ZP_07244427.1| conserved hypothetical protein [Escherichia coli MS 146-1] gi|301077237|gb|EFK92043.1| conserved hypothetical protein [Escherichia coli MS 146-1] Length = 138 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 4/58 (6%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQE 66 KK R K Y+ ++ I+ V+ G +L + GMP + RW + + +E Sbjct: 21 KKRGRPTK----YTEKIATQIVLLVSEGYSLRKISMMPGMPSHRQMMRWQLEHMDFRE 74 >gi|240119352|dbj|BAH79216.1| predicted protein [Escherichia coli O157:H7] Length = 140 Score = 49.5 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 3/115 (2%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAY 68 + R+ Y+ + I VA +L + R+ GMP + T W+ + +L++ Sbjct: 15 SPSKRSAGRKTKYNDRIAEQICLYVARNMSLREISRQPGMPHWDTLKIWMLQHPELRQTI 74 Query: 69 TEAL-QCRLDLLAE--ELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSN 120 + +AE E E +E+ + +K R L+E Sbjct: 75 EAIRWLNGTEFIAEAMECYAEVETNSEDFPAQLALAKARAATLLGAARLMELRKP 129 >gi|16130371|ref|NP_416941.1| CPZ-55 prophage; predicted protein [Escherichia coli str. K-12 substr. MG1655] gi|170082055|ref|YP_001731375.1| CPZ-55 prophage; hypothetical protein [Escherichia coli str. K-12 substr. DH10B] gi|238901609|ref|YP_002927405.1| CPZ-55 prophage; hypothetical protein [Escherichia coli BW2952] gi|7388521|sp|P76546|YFFO_ECOLI RecName: Full=Uncharacterized protein yffO gi|1788787|gb|AAC75499.1| CPZ-55 prophage; predicted protein [Escherichia coli str. K-12 substr. MG1655] gi|169889890|gb|ACB03597.1| CPZ-55 prophage; predicted protein [Escherichia coli str. K-12 substr. DH10B] gi|238860668|gb|ACR62666.1| CPZ-55 prophage; predicted protein [Escherichia coli BW2952] Length = 138 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 4/58 (6%) Query: 9 KKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQE 66 KK R K Y+ ++ I+ V+ G +L + GMP + RW + + +E Sbjct: 21 KKRGRPAK----YNEKIATQIVLLVSEGYSLRKISMMPGMPSHRQMMRWQLEHMDFRE 74 >gi|297528635|ref|YP_003669910.1| hypothetical protein GC56T3_0266 [Geobacillus sp. C56-T3] gi|297251887|gb|ADI25333.1| hypothetical protein GC56T3_0266 [Geobacillus sp. C56-T3] Length = 177 Score = 47.5 bits (111), Expect = 9e-04, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 56/138 (40%), Gaps = 13/138 (9%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 + K KK I +PEL ++ + G + VG+ +TFY W+K+ + Sbjct: 1 MTKNKKGRP-----IKLTPELQEELVKVIRAGNYIETACAYVGI-NKTTFYDWLKRGARE 54 Query: 65 QEAYTE---ALQCRLDLLAEELLE--EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLS 119 +E + A + + EL E A E+ + K + + + LE+ Sbjct: 55 KERLAKNPNARPKKSEQPFVELSNAVEKALAQAEIRDVAIIGKAAEENWQAAAWRLERKF 114 Query: 120 NQKYG--PRVSVESKHTI 135 +++G R++ + H+ Sbjct: 115 PERWGRRDRLTADLNHSG 132 >gi|196249501|ref|ZP_03148198.1| conserved hypothetical protein [Geobacillus sp. G11MC16] gi|196210795|gb|EDY05557.1| conserved hypothetical protein [Geobacillus sp. G11MC16] Length = 177 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 8/124 (6%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE---ALQCR 75 I +PEL ++ + G + VG+ +TFY W+K+ + +E + A + Sbjct: 10 IKLTPELQEELVKVIRAGNYIETACAYVGI-NKTTFYDWLKRGAREKERLAKNPNARPKK 68 Query: 76 LDLLAEELLE--EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG--PRVSVES 131 + EL E A E+ + K + + + LE+ +++G R++ + Sbjct: 69 SEAPFVELSNAVEKALAQAEIRDVAIIGKAAEENWQAAAWRLERKFPERWGRKERLTADL 128 Query: 132 KHTI 135 H+ Sbjct: 129 NHSG 132 >gi|330718174|ref|ZP_08312774.1| integral membrane protein [Leuconostoc fallax KCTC 3537] Length = 641 Score = 46.0 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 39/111 (35%), Gaps = 2/111 (1%) Query: 58 IKKDLKLQEAYTEALQCRLD--LLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLL 115 + + + ++ Y D L +EL E ++ M+ +++ Sbjct: 470 LNQGKQWRDNYQTKHNSEFDDNALRDELRSEQQAQRKKDRKEYLAKDMQKHARKIRKQRA 529 Query: 116 EKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEK 166 E+L+ K + E KH D P +R ++ P+ S EK Sbjct: 530 ERLNKMKQQKHQTNEPKHEPDNTPRYQRSKKLKADQVPVQEYVAQEGSPEK 580 >gi|138896467|ref|YP_001126920.1| hypothetical protein GTNG_2830 [Geobacillus thermodenitrificans NG80-2] gi|196251050|ref|ZP_03149731.1| conserved hypothetical protein [Geobacillus sp. G11MC16] gi|134267980|gb|ABO68175.1| Putative phage protein [Geobacillus thermodenitrificans NG80-2] gi|196209422|gb|EDY04200.1| conserved hypothetical protein [Geobacillus sp. G11MC16] Length = 177 Score = 45.2 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 8/124 (6%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE---ALQCR 75 I +PEL ++ + G + VG+ +TFY W+K+ + +E + A + Sbjct: 10 IKLTPELQEELVKVIRAGNYIETACAYVGI-NKTTFYDWLKRGAREKERLAKNPNARPKK 68 Query: 76 LDLLAEELLE--EPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG--PRVSVES 131 + EL E A E+ + K + + + LE+ +++G R++ + Sbjct: 69 SEQPFVELSNAVEKALAQAEIRDVAIIGKAAEENWQAAAWRLERKFPERWGRKERLTADL 128 Query: 132 KHTI 135 H+ Sbjct: 129 NHSG 132 >gi|326790074|ref|YP_004307895.1| Fis family transcriptional regulator [Clostridium lentocellum DSM 5427] gi|326540838|gb|ADZ82697.1| helix-turn-helix Fis-type [Clostridium lentocellum DSM 5427] Length = 139 Score = 44.1 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 12/97 (12%) Query: 1 MSNLVKKAKKAVRAKK--GCIYYSPEL----FAGILDQVANGKALGHVLRKVGMPKYSTF 54 M+ K+ K + + G Y EL I + ++NG +G+P ST Sbjct: 1 MAREYKRGYKQDKPRNADGTFAYCEELEQRQIDMIHELISNGGHKTKACETLGIP-RSTL 59 Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTA 91 Y+W+ + K ++AY A + L EE L + T Sbjct: 60 YKWL-DNEKFKKAYQTACER----LYEEGLSDAIKTT 91 >gi|255009806|ref|ZP_05281932.1| hypothetical protein Bfra3_11766 [Bacteroides fragilis 3_1_12] gi|313147593|ref|ZP_07809786.1| predicted protein [Bacteroides fragilis 3_1_12] gi|313136360|gb|EFR53720.1| predicted protein [Bacteroides fragilis 3_1_12] Length = 176 Score = 43.7 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 20 YYSPELFAGILDQV-ANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ ++ I + A+ + V R G+ STF+ W + E +A R++ Sbjct: 3 KYNTKIVETICSLIRADSYTIAEVCRLSGI-SESTFHEWKASKPEFSECIKKAEAARMEF 61 Query: 79 LAEE 82 E Sbjct: 62 FVTE 65 >gi|218290758|ref|ZP_03494840.1| conserved hypothetical protein [Alicyclobacillus acidocaldarius LAA1] gi|218239296|gb|EED06495.1| conserved hypothetical protein [Alicyclobacillus acidocaldarius LAA1] Length = 146 Score = 43.7 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 51/130 (39%), Gaps = 14/130 (10%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK-DLKLQEAYTEALQCRLDL 78 +PE+ I D + G + G+ T Y W+K+ + + Y L Sbjct: 13 KLTPEVEKKICDAIRAGAYIETAAAYAGV-SKQTLYTWMKRGNAQKSGIYRRFLDAIEKA 71 Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYG--PRVSVESKHTID 136 LAE E+ + + + + + + LE+ +++ R++V++ H+ Sbjct: 72 LAES----------EMRDLIIIGRAAEENWQAAAWRLERKFPERWARKDRMTVDTHHSGS 121 Query: 137 LRPAIERLRE 146 + ++ + E Sbjct: 122 VNLNVQYVAE 131 >gi|291513660|emb|CBK62870.1| hypothetical protein AL1_01560 [Alistipes shahii WAL 8301] Length = 172 Score = 43.3 bits (100), Expect = 0.015, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%) Query: 20 YYSPELFAGILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ ++ I +A + + V R V + ST+Y WI + + E +A R+ Sbjct: 6 KYNKKIAEKICSLIATDTYTVAEVCRMVKISD-STYYDWITRFPEFSENIKKAEAERMAF 64 Query: 79 LAEE 82 E Sbjct: 65 FVAE 68 >gi|296169537|ref|ZP_06851157.1| cytochrome P450 family protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295895803|gb|EFG75498.1| cytochrome P450 family protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 447 Score = 41.0 bits (94), Expect = 0.079, Method: Composition-based stats. Identities = 10/84 (11%), Positives = 29/84 (34%), Gaps = 7/84 (8%) Query: 57 WIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELAN-----PVFYSKMRDRKQRMG 111 W+ + ++ + LD EE L P + V M++ ++ Sbjct: 260 WLSEHPDERDRLSRDRAVLLDPATEEFLRFFTPAPGDGRTIADEIEVDGVTMQEGQRLWL 319 Query: 112 TFLLEKLSNQKYG--PRVSVESKH 133 ++ + + + ++ ++ K Sbjct: 320 SWAMANRDPRLFDQPDQIVLDRKG 343 >gi|240170404|ref|ZP_04749063.1| cytochrome P450 superfamily protein [Mycobacterium kansasii ATCC 12478] Length = 448 Score = 41.0 bits (94), Expect = 0.081, Method: Composition-based stats. Identities = 10/86 (11%), Positives = 31/86 (36%), Gaps = 7/86 (8%) Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKM-----RDRKQR 109 W+ ++ + +E + + LD EE L P + ++ ++ ++ Sbjct: 260 LEWLSENPEQRELLSRQRETLLDSATEEFLRYFTPAPGDGRTFADDVELDGLHFKEGERL 319 Query: 110 MGTFLLEKLSNQKY--GPRVSVESKH 133 ++ + + +V ++ K Sbjct: 320 WISWAMANRDPAVFPDPDKVILDRKG 345 >gi|108759650|ref|YP_629470.1| hypothetical protein MXAN_1211 [Myxococcus xanthus DK 1622] gi|108463530|gb|ABF88715.1| hypothetical protein MXAN_1211 [Myxococcus xanthus DK 1622] Length = 158 Score = 40.6 bits (93), Expect = 0.11, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 46/130 (35%), Gaps = 16/130 (12%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDL-KLQEAYTE--ALQCRL 76 +PEL I + + +G VG+ T RW + + + Y E R Sbjct: 6 KLTPELQQQICEHLRSGLFRRAAAGLVGV-DEQTVSRWFHRGASEARGLYREFFVAVNRA 64 Query: 77 DLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTID 136 + E ++ T + A + +LL + + YG R +VE+K D Sbjct: 65 EA---EFMQGATETLQAAAT---------TNPKHVQWLLSRRFPELYGRRDNVEAKSPED 112 Query: 137 LRPAIERLRE 146 LRE Sbjct: 113 QAADTAALRE 122 >gi|126437880|ref|YP_001073571.1| cytochrome P450 [Mycobacterium sp. JLS] gi|126237680|gb|ABO01081.1| cytochrome P450 [Mycobacterium sp. JLS] Length = 451 Score = 40.2 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 10/86 (11%), Positives = 29/86 (33%), Gaps = 7/86 (8%) Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELAN-----PVFYSKMRDRKQR 109 W+ + + + L+ EE L P + + +++R+ ++ Sbjct: 260 LEWLSEHPDERTRLSRERDTLLNPATEEFLRYFTPAPGDGRTIAEDMALGDAELREGQRL 319 Query: 110 MGTFLLEKLSNQKYG--PRVSVESKH 133 ++ + + RV ++ K Sbjct: 320 WLSWAMANRDPAVFTEPDRVVLDRKG 345 >gi|120405830|ref|YP_955659.1| cytochrome P450 [Mycobacterium vanbaalenii PYR-1] gi|119958648|gb|ABM15653.1| cytochrome P450 [Mycobacterium vanbaalenii PYR-1] Length = 450 Score = 40.2 bits (92), Expect = 0.15, Method: Composition-based stats. Identities = 9/76 (11%), Positives = 23/76 (30%), Gaps = 5/76 (6%) Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELAN-----PVFYSKMRDRKQR 109 W+ + +E + LD EE L P + V +++ ++ Sbjct: 263 LEWLSQHPDERERLSRDRDTLLDPATEEFLRYFTPAPGDGRTVAEDIEVDGVVLKEGERL 322 Query: 110 MGTFLLEKLSNQKYGP 125 ++ + + Sbjct: 323 WLSWAMANRDPALFQN 338 >gi|108801899|ref|YP_642096.1| cytochrome P450 [Mycobacterium sp. MCS] gi|119871051|ref|YP_941003.1| cytochrome P450 [Mycobacterium sp. KMS] gi|108772318|gb|ABG11040.1| cytochrome P450 [Mycobacterium sp. MCS] gi|119697140|gb|ABL94213.1| cytochrome P450 [Mycobacterium sp. KMS] Length = 451 Score = 39.8 bits (91), Expect = 0.18, Method: Composition-based stats. Identities = 9/86 (10%), Positives = 29/86 (33%), Gaps = 7/86 (8%) Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELAN-----PVFYSKMRDRKQR 109 W+ + + + L+ EE L P + + +++R+ ++ Sbjct: 260 LEWLSEHPDERTRLSRERDTLLNPATEEFLRYFTPAPGDGRTIAEDMALGDAELREGQRL 319 Query: 110 MGTFLLEKLSNQKYG--PRVSVESKH 133 ++ + + R+ ++ K Sbjct: 320 WLSWAMANRDPAVFTEPDRIVLDRKG 345 >gi|310818179|ref|YP_003950537.1| hypothetical protein STAUR_0906 [Stigmatella aurantiaca DW4/3-1] gi|309391251|gb|ADO68710.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 158 Score = 39.4 bits (90), Expect = 0.22, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 46/134 (34%), Gaps = 24/134 (17%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK-----LQEAYTEALQC 74 +PE+ I D + +G VG+ T RW + +E Y + Sbjct: 6 KLNPEIQKQICDHLRSGLFRRAAAGLVGV-DEQTISRWYHRGASEARGLFREFYVAVNRA 64 Query: 75 RLDLL--AEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESK 132 + + A E L+ + + + +LL + + YG R +VE+K Sbjct: 65 EAEFMQGATETLQAASTSNP----------------KHVQWLLSRRFPELYGRRDNVEAK 108 Query: 133 HTIDLRPAIERLRE 146 D LRE Sbjct: 109 SPEDQAADTAALRE 122 >gi|71754745|ref|XP_828287.1| hypothetical protein [Trypanosoma brucei TREU927] gi|70833673|gb|EAN79175.1| hypothetical protein, conserved [Trypanosoma brucei] Length = 568 Score = 39.1 bits (89), Expect = 0.28, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 51/140 (36%), Gaps = 11/140 (7%) Query: 44 RKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKM 103 ++ F R ++++ + EA + L E + E ++ LA+ + Sbjct: 277 KRHHQANMEAFCRQLRQEEESFRQRMEARESERLQLLEHIEAEEQWLSQRLADRERQRAL 336 Query: 104 RDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKS 163 + E+ + Y V E + + + Y+ + +++E+ K Sbjct: 337 EIEAKER----AEREQRELYREHVEAEEE-------LLRNRLQQYEASRKMEAEKARQKE 385 Query: 164 TEKPLEIVESSIAEQSIIEH 183 E E+ E +AE+ +I Sbjct: 386 KEDREELYEHVLAEEQLIRQ 405 >gi|183984685|ref|YP_001852976.1| cytochrome P450 188A3 Cyp188A3 [Mycobacterium marinum M] gi|183178011|gb|ACC43121.1| cytochrome P450 188A3 Cyp188A3 [Mycobacterium marinum M] Length = 454 Score = 39.1 bits (89), Expect = 0.31, Method: Composition-based stats. Identities = 8/76 (10%), Positives = 21/76 (27%), Gaps = 5/76 (6%) Query: 55 YRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELAN-----PVFYSKMRDRKQR 109 W+ + +E +D EE L P + ++ ++ Sbjct: 266 LEWLSEHPDERERLWRNRDTLIDSATEEFLRYFTPAPGDGRTFSDDVEFDGIHFKEGERL 325 Query: 110 MGTFLLEKLSNQKYGP 125 ++ + Q + Sbjct: 326 WISWAMANRDPQVFPD 341 >gi|114571303|ref|YP_757983.1| transposase IS3/IS911 family protein [Maricaulis maris MCS10] gi|114341765|gb|ABI67045.1| transposase IS3/IS911 family protein [Maricaulis maris MCS10] Length = 79 Score = 39.1 bits (89), Expect = 0.33, Method: Composition-based stats. Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 1/51 (1%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTE 70 +S E IL + G A V R+ G+ +TFY+W K L +Y Sbjct: 5 KFSEEQIIAILAEQERGMATSGVCRRHGV-SSATFYKWKAKFGGLDVSYAR 54 >gi|331701777|ref|YP_004398736.1| hypothetical protein Lbuc_1419 [Lactobacillus buchneri NRRL B-30929] gi|329129120|gb|AEB73673.1| hypothetical protein Lbuc_1419 [Lactobacillus buchneri NRRL B-30929] Length = 162 Score = 38.7 bits (88), Expect = 0.37, Method: Composition-based stats. Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 13/119 (10%) Query: 28 GILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEE 86 I+ +A +G + +K+G+ STFY+W+ ++ L EA + D E L + Sbjct: 19 SIISSMARDGMTSEEIAKKIGI-APSTFYKWMHENQDLSEAVMANKEM-ADRTVEAALYQ 76 Query: 87 PAPTAEELANPVFYSKMR---------DRKQRMGTFLLEKLSNQKYGPRVSVESKHTID 136 A + + V S R + + L +K+ + +E +D Sbjct: 77 LATGFDYYEDKVSASGKRLKNVVQHESPNIAAIKMW-LNNRQPKKWRDKREIELSGRVD 134 >gi|150375754|ref|YP_001312350.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium medicae WSM419] gi|150030301|gb|ABR62417.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium medicae WSM419] Length = 342 Score = 38.7 bits (88), Expect = 0.39, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYR-WIKKDLKLQEAYTEALQCRLDLLAEE 82 ++ A ILD + G+ G + GM Y F W+ K +L + +A + + ++ Sbjct: 250 DVNANILDMIKAGEVFGSINPNQGMQGYMGFLMLWLAKHPELIDPMNDAKRSGFNPMSIP 309 Query: 83 LLEEPAPT-AEELANPVFYSKMRDRK 107 +++ E A+ ++ K R+ Sbjct: 310 VVDNGLSIVTAENADDFYWDKYLKRR 335 >gi|332828192|gb|EGK00904.1| hypothetical protein HMPREF9455_02693 [Dysgonomonas gadei ATCC BAA-286] Length = 218 Score = 38.7 bits (88), Expect = 0.43, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 2/61 (3%) Query: 20 YYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ EL I +A ++ + + G+ +TFY+W +++ + +A+ R ++ Sbjct: 3 KYNQELAERITSLMAEELCSVSDICKATGI-SRNTFYKWKRENPDFDKDVEDAMSYRNEV 61 Query: 79 L 79 L Sbjct: 62 L 62 >gi|295691410|ref|YP_003595103.1| hypothetical protein Cseg_4074 [Caulobacter segnis ATCC 21756] gi|295433313|gb|ADG12485.1| conserved hypothetical protein [Caulobacter segnis ATCC 21756] Length = 658 Score = 38.7 bits (88), Expect = 0.43, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 28/66 (42%), Gaps = 1/66 (1%) Query: 45 KVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD-LLAEELLEEPAPTAEELANPVFYSKM 103 + GM + +T+ W ++ Y +A + D L E+ + A +A+ ++ Sbjct: 250 RPGMLEATTYTGWDDISRRMAPLYAKASELSADSSLRPEIAQIAAASADPKVRAFKALQL 309 Query: 104 RDRKQR 109 + K R Sbjct: 310 VEDKTR 315 >gi|153010395|ref|YP_001371609.1| putative ABC transporter periplasmic sugar-binding protein [Ochrobactrum anthropi ATCC 49188] gi|151562283|gb|ABS15780.1| putative ABC transporter periplasmic sugar-binding protein [Ochrobactrum anthropi ATCC 49188] Length = 343 Score = 38.3 bits (87), Expect = 0.50, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTF-YRWIKKDLKLQEAYTEALQCRLDLLAEE 82 ++ A ILD + G+ G + GM Y F W+ K +L + +A + + ++ Sbjct: 251 DVNANILDMIKAGEVFGSINPNQGMQGYMGFMLLWLAKHPELIDPMNDAKRSGFNPMSIP 310 Query: 83 LLEEPAPT-AEELANPVFYSKMRDRK 107 +++ E A+ ++ K R+ Sbjct: 311 VVDNGLSIVTAENADDFYWDKYLKRR 336 >gi|229495719|ref|ZP_04389447.1| helicase conserved domain protein [Porphyromonas endodontalis ATCC 35406] gi|229317293|gb|EEN83198.1| helicase conserved domain protein [Porphyromonas endodontalis ATCC 35406] Length = 1828 Score = 37.9 bits (86), Expect = 0.60, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 55/162 (33%), Gaps = 40/162 (24%) Query: 55 YRWIKKDLKLQEAYTEALQCR--LDLLAE-----ELLEEPAPTAEEL------------- 94 +R+ K +L Y + R D+ + E++ PT E+ Sbjct: 1214 FRYFIKVPELAAFYAQITDYRTAQDIGIDRPEKNEIMHNIPPTPEQEEFITKLVEFAKTG 1273 Query: 95 -ANPVFYSKMRDRKQRMGTFLLEKLS-----------NQKYGPRVSVESKHTIDLRPAIE 142 A + K+ +R+++ + ++ +YG V ++ H Sbjct: 1274 NAELLGRPKLSEREEKAKMLIATDMARKMSLDLRLIDPNRYGDHVDNKASH------CAA 1327 Query: 143 RLREHYKHLKPIDSERIP--HKSTEKPLEIVESSIAEQSIIE 182 ++ E+Y+ + T KP E S ++ ++E Sbjct: 1328 KIAEYYRKFNEQKGTQFVFSDLGTYKPGEWNPYSEIKRKLVE 1369 >gi|296444686|ref|ZP_06886650.1| hypothetical protein MettrDRAFT_0366 [Methylosinus trichosporium OB3b] gi|296257954|gb|EFH05017.1| hypothetical protein MettrDRAFT_0366 [Methylosinus trichosporium OB3b] Length = 328 Score = 37.9 bits (86), Expect = 0.74, Method: Composition-based stats. Identities = 10/59 (16%), Positives = 23/59 (38%), Gaps = 2/59 (3%) Query: 14 AKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEAL 72 + + Y P + ANG + + ++G+ T Y W ++ + +A + Sbjct: 84 PGRPTL-YRPTFAQIAVRLGANGHTIAGIAGRLGV-TRETVYEWARRHPEFSDALQRSK 140 >gi|326404206|ref|YP_004284288.1| putative transposase [Acidiphilium multivorum AIU301] gi|325051068|dbj|BAJ81406.1| putative transposase [Acidiphilium multivorum AIU301] Length = 100 Score = 37.5 bits (85), Expect = 0.75, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 31/74 (41%), Gaps = 12/74 (16%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRW-----------IKKDLKLQEAY 68 +S E GIL + +G + V R+ G+ +TFY+W ++ +L++ Sbjct: 5 RFSEEQIIGILKEQESGMSTAEVCRRHGI-SSATFYKWKAKFGGLEVSDARRLKQLEDEN 63 Query: 69 TEALQCRLDLLAEE 82 + D + + Sbjct: 64 ARLKRLLADAVLDN 77 >gi|307299920|ref|ZP_07579705.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti BL225C] gi|306904809|gb|EFN35392.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti BL225C] Length = 342 Score = 37.5 bits (85), Expect = 0.80, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYR-WIKKDLKLQEAYTEALQCRLDLLAEE 82 ++ A ILD + G+ G + GM Y F W+ K +L + +A + + ++ Sbjct: 250 DVNANILDMIKAGEVFGSINPNQGMQGYMGFLLLWLAKHPELIDPMNDAKRSGFNPMSIP 309 Query: 83 LLEEPAPT-AEELANPVFYSKMRDRK 107 +++ + A+ ++ K R+ Sbjct: 310 VVDNGLSIVTADNADDFYWDKYLKRR 335 >gi|307316728|ref|ZP_07596171.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti AK83] gi|306897926|gb|EFN28669.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti AK83] Length = 342 Score = 37.5 bits (85), Expect = 0.80, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYR-WIKKDLKLQEAYTEALQCRLDLLAEE 82 ++ A ILD + G+ G + GM Y F W+ K +L + +A + + ++ Sbjct: 250 DVNANILDMIKAGEVFGSINPNQGMQGYMGFLLLWLAKHPELIDPMNDAKRSGFNPMSIP 309 Query: 83 LLEEPAPT-AEELANPVFYSKMRDRK 107 +++ + A+ ++ K R+ Sbjct: 310 VVDNGLSIVTADNADDFYWDKYLKRR 335 >gi|16264234|ref|NP_437026.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti 1021] gi|15140359|emb|CAC48886.1| putative ABC transporter periplasmic sugar-binding protein [Sinorhizobium meliloti 1021] Length = 333 Score = 37.5 bits (85), Expect = 0.80, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYR-WIKKDLKLQEAYTEALQCRLDLLAEE 82 ++ A ILD + G+ G + GM Y F W+ K +L + +A + + ++ Sbjct: 241 DVNANILDMIKAGEVFGSINPNQGMQGYMGFLLLWLAKHPELIDPMNDAKRSGFNPMSIP 300 Query: 83 LLEEPAPT-AEELANPVFYSKMRDRK 107 +++ + A+ ++ K R+ Sbjct: 301 VVDNGLSIVTADNADDFYWDKYLKRR 326 >gi|301161275|emb|CBW20813.1| putative DNA methylase [Bacteroides fragilis 638R] Length = 1828 Score = 37.5 bits (85), Expect = 0.88, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 55/162 (33%), Gaps = 40/162 (24%) Query: 55 YRWIKKDLKLQEAYTEALQCR--LDLLAE-----ELLEEPAPTAEELA------------ 95 +R+ K +L Y + R D+ + E++ PT E+ A Sbjct: 1214 FRYFIKVPELAAFYAQITDYRTAQDIGIDRPEKNEIMHNIPPTPEQEAFIAKLVEFAKTG 1273 Query: 96 --NPVFYSKMRDRKQRMGTFLLEKLS-----------NQKYGPRVSVESKHTIDLRPAIE 142 + K+ +R+++ + ++ +YG V ++ H Sbjct: 1274 NAELLGREKLSEREEKAKMLIATDMARKMSLDLRLIDPNRYGDHVDNKASH------CAA 1327 Query: 143 RLREHYKHLKPIDSERIP--HKSTEKPLEIVESSIAEQSIIE 182 ++ E+Y+ + T KP E S ++ ++E Sbjct: 1328 KIAEYYQKFNEQKGTQFVFSDLGTYKPGEWNPYSEIKRKLVE 1369 >gi|288924999|ref|ZP_06418935.1| metalloendopeptidase PepO [Prevotella buccae D17] gi|288338189|gb|EFC76539.1| metalloendopeptidase PepO [Prevotella buccae D17] Length = 679 Score = 37.5 bits (85), Expect = 0.90, Method: Composition-based stats. Identities = 12/85 (14%), Positives = 40/85 (47%), Gaps = 1/85 (1%) Query: 89 PTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHY 148 P + + ++++ + +L++L ++KY P+ +VE K + + A++ +R + Sbjct: 57 PLPPAYSRFGSFDQLQEDNNKRINSILKELQSKKY-PKGTVEQKLSDFYKLAMDSVRRNK 115 Query: 149 KHLKPIDSERIPHKSTEKPLEIVES 173 + + P+ + ++ + + Sbjct: 116 EGIAPVKALLDEMEAATTKEALQKL 140 >gi|167577457|ref|ZP_02370331.1| BopA protein [Burkholderia thailandensis TXDOH] Length = 513 Score = 37.1 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 51/154 (33%), Gaps = 15/154 (9%) Query: 4 LVKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLK 63 L AK G + Y +A + L + +P + WI +D Sbjct: 287 LRDARTVKEGAKTGELKYQMISKKENCASMAL-RVLRAGGAEHFVPYAAA---WISEDPN 342 Query: 64 LQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKY 123 AY A+Q R+D L ++ + A ++R + + Sbjct: 343 RAHAYARAVQARIDALNQQRADVERHCA----------RLRGSASVRQAWRAFSEAGNAS 392 Query: 124 GPRVSVES-KHTIDLRPAIERLREHYKHLKPIDS 156 G ++ E+ + RL EH + ++ I + Sbjct: 393 GSPLADEAGRGRASAHTRQARLDEHAREVERIGA 426 >gi|148243905|ref|YP_001220144.1| transposase IS3/IS911 family protein [Acidiphilium cryptum JF-5] gi|146400468|gb|ABQ29002.1| transposase IS3/IS911 family protein [Acidiphilium cryptum JF-5] Length = 100 Score = 37.1 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 31/74 (41%), Gaps = 12/74 (16%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRW-----------IKKDLKLQEAY 68 +S E GIL + +G + V R+ G+ +TFY+W ++ +L++ Sbjct: 5 RFSEEQIIGILKEQESGMSTAEVCRRHGI-SSATFYKWKAKFGGLEVSDARRLKQLEDEN 63 Query: 69 TEALQCRLDLLAEE 82 + D + + Sbjct: 64 ARLKRLLADAVLDN 77 >gi|332829030|gb|EGK01694.1| hypothetical protein HMPREF9455_01842 [Dysgonomonas gadei ATCC BAA-286] Length = 166 Score = 37.1 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 25/56 (44%), Gaps = 2/56 (3%) Query: 20 YYSPELFAGILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQC 74 Y+ ++ I D ++ + + + KV + ST+Y W+K + E +A Sbjct: 3 KYNQKIVNRICDLISKDSYTIAEICEKVKI-SESTYYEWLKTKSEFSEDIKKAQDK 57 >gi|300741774|ref|ZP_07071795.1| putative nuclease SbcCD, D subunit [Rothia dentocariosa M567] gi|300380959|gb|EFJ77521.1| putative nuclease SbcCD, D subunit [Rothia dentocariosa M567] Length = 395 Score = 37.1 bits (84), Expect = 1.2, Method: Composition-based stats. Identities = 12/55 (21%), Positives = 22/55 (40%) Query: 101 SKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLKPID 155 ++ R+ + + L+ + YG + VE H L R+RE + D Sbjct: 115 VELIRDDIRVRIYAIPYLNPRYYGRHLDVEPTHAAVLGEVCTRIREDNEKRNAGD 169 >gi|188993950|ref|YP_001928202.1| putative DNA methylase [Porphyromonas gingivalis ATCC 33277] gi|188593630|dbj|BAG32605.1| putative DNA methylase [Porphyromonas gingivalis ATCC 33277] Length = 1828 Score = 36.7 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 58/162 (35%), Gaps = 40/162 (24%) Query: 55 YRWIKKDLKLQEAYTEALQCRL--DLLAE-----ELLEEPAPTAEEL---ANPVFYSK-- 102 +R+ K +L Y + R D+ + E++ PT E+ A V ++K Sbjct: 1214 FRYFIKVPELAAFYAQITDYRTAKDIGIDRPEKNEIMHNIPPTPEQEEFIAKLVEFAKTG 1273 Query: 103 ---------MRDRKQRMGTFLLEKLSNQ-----------KYGPRVSVESKHTIDLRPAIE 142 + +R+++ + ++ + KYG V ++ H Sbjct: 1274 NAELLGRAPLSEREEKAKMLIATDMARKMSLDLRLIDPDKYGDHVDNKASH------CAA 1327 Query: 143 RLREHYKHLKPIDSERIP--HKSTEKPLEIVESSIAEQSIIE 182 ++ E+Y+ + T KP E S ++ ++E Sbjct: 1328 KIAEYYRKFNEQKGTQFVFSDLGTYKPGEWNPYSEIKRKLVE 1369 >gi|330794621|ref|XP_003285376.1| hypothetical protein DICPUDRAFT_149273 [Dictyostelium purpureum] gi|325084646|gb|EGC38069.1| hypothetical protein DICPUDRAFT_149273 [Dictyostelium purpureum] Length = 2141 Score = 36.7 bits (83), Expect = 1.4, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 42/125 (33%), Gaps = 14/125 (11%) Query: 52 STFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMG 111 + +++ + + ++ Y + D + + ++ +L +F ++ Sbjct: 1790 KSIFKFYLQCFEFRKKY-KDRVKNADKIEDHIISSFMTLVMKLNENLFK----PLFIKVL 1844 Query: 112 TFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIV 171 + L Q+ G ES+ + E++ K + S KP++ Sbjct: 1845 DWALTPQQQQQNGNHHDEESEEDDNSDSEEEQVSNKKKKVMNGKS---------KPVQQQ 1895 Query: 172 ESSIA 176 E S Sbjct: 1896 EKSKD 1900 >gi|167750592|ref|ZP_02422719.1| hypothetical protein EUBSIR_01569 [Eubacterium siraeum DSM 15702] gi|167656518|gb|EDS00648.1| hypothetical protein EUBSIR_01569 [Eubacterium siraeum DSM 15702] Length = 379 Score = 36.7 bits (83), Expect = 1.5, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 43/108 (39%), Gaps = 6/108 (5%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEAL-QCRLDLL 79 Y+ E ++D+V +G+ H+L G+P STFY W++ + Q A R L Sbjct: 4 YTQEEKQAVIDRVISGEPSAHILADTGIP-KSTFYSWLRIYQEEQNAANRRKVDIRNFHL 62 Query: 80 AEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRV 127 E + E L + + R + E+L KY V Sbjct: 63 LENKVARLESIVEILKSAPCTPR---APLRQRLYAAEQLYG-KYNVHV 106 >gi|332828952|gb|EGK01635.1| hypothetical protein HMPREF9455_02167 [Dysgonomonas gadei ATCC BAA-286] Length = 236 Score = 36.7 bits (83), Expect = 1.5, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y ++ I+ + ++ + + + TFY W K+ + +EA EA+ R D Sbjct: 1 MKYKEKIATEIIRMIEQDLCSISEICKSFKI-SRKTFYEWKKEKPEFKEAVEEAIDHRED 59 Query: 78 LLA------------------EELLEEP---APTAEELANPVFYSKMRDRKQRMGTFLLE 116 ++ E++ P P + V K R ++L+ Sbjct: 60 VMIASARIGLKQLLEGYVQKKEKITYVPDKNDPVNDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ + RP + +++ + Sbjct: 120 REERKKDKDRLLASER-----RPLVIEVQDEETRRQ 150 >gi|332883330|gb|EGK03613.1| hypothetical protein HMPREF9456_01680 [Dysgonomonas mossii DSM 22836] Length = 224 Score = 36.7 bits (83), Expect = 1.5, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%) Query: 20 YYSPELFAGILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ L I+ + + ++ + + + TFY W K + EA A++CR D Sbjct: 3 KYTERLVERIISFIEEDTYSISEICNALNI-NRKTFYEWRKTKPEFNEAVERAMECRDDK 61 Query: 79 L 79 L Sbjct: 62 L 62 >gi|296412184|ref|XP_002835805.1| hypothetical protein [Tuber melanosporum Mel28] gi|295629600|emb|CAZ79962.1| unnamed protein product [Tuber melanosporum] Length = 456 Score = 36.7 bits (83), Expect = 1.6, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 46/117 (39%), Gaps = 11/117 (9%) Query: 79 LAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFL-------LEKLSNQKYGPRVSVES 131 +AEELL++ AP AE N + ++ + + +YG V + Sbjct: 147 IAEELLDDVAPRAEGERNSIGIDGGGGGGEQYIYNEGQFNGEYIGRRYANEYGEHVQEDG 206 Query: 132 KH--TIDLRPAIERLREHYKHLKPIDSERIPHKSTEKP--LEIVESSIAEQSIIEHN 184 H T + + RL E ++SE K+ ++ + + + E SI+ N Sbjct: 207 DHRMTGEEEGILSRLEELRAENTRLESELATLKAQQEAELVSLRQVKDGETSIVRSN 263 >gi|56552762|ref|YP_163601.1| beta-lactamase domain-containing protein [Zymomonas mobilis subsp. mobilis ZM4] gi|241762498|ref|ZP_04760575.1| beta-lactamase domain protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|260753508|ref|YP_003226401.1| beta-lactamase [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|56544336|gb|AAV90490.1| beta-lactamase domain protein [Zymomonas mobilis subsp. mobilis ZM4] gi|241373000|gb|EER62667.1| beta-lactamase domain protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|258552871|gb|ACV75817.1| beta-lactamase domain protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 546 Score = 36.7 bits (83), Expect = 1.6, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 51/147 (34%), Gaps = 8/147 (5%) Query: 24 ELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK------DLKLQEAYTEALQCRLD 77 L A + V + +A+ HV G P YRWI+ +L+ +A Sbjct: 345 ALAAKGIRMVTDRQAMIHVSGHPGRPDLQAMYRWIRPEMLIPVHGELRHMAEQARLGLAT 404 Query: 78 LLAEELLEEPAPTAE-ELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTID 136 + L++ PVF S + + ++ + G R + I Sbjct: 405 GIPRSLVQGNGDLIRLAPKGPVFVSHENTGRLLLDGDVILSANGPTIGERRKMAINGLIS 464 Query: 137 LRPAIERLREHYKHLKPIDSERIPHKS 163 + A+++ H K I +P ++ Sbjct: 465 VGLAVDK-HNHLKGHPTISLHGVPIEA 490 >gi|332828978|gb|EGK01654.1| hypothetical protein HMPREF9455_02054 [Dysgonomonas gadei ATCC BAA-286] Length = 231 Score = 36.4 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y ++ I+ + ++ + + + TFY W K + EA EA+ R D Sbjct: 1 MKYKEKIATEIIRMIEQDLCSISEICKSYKI-SRKTFYEWKKVKPEFGEAVEEAIDHRED 59 Query: 78 LLA------------------EELLEEP---APTAEELANPVFYSKMRDRKQRMGTFLLE 116 ++ E++ P P + V K R ++L+ Sbjct: 60 VMIASARIGLKQLLEGYVQKKEKITYVPDKNDPVNDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ + RP + +++ + Sbjct: 120 REERKKDKDRLLASER-----RPLVIEVQDEETRRQ 150 >gi|332827847|gb|EGK00582.1| hypothetical protein HMPREF9455_03225 [Dysgonomonas gadei ATCC BAA-286] Length = 246 Score = 36.4 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y ++ I+ + ++ + + + TFY W K+ + +EA EA+ R D Sbjct: 1 MKYKEKIATEIIRMIEQDLCSISEICKSFKI-SRKTFYEWKKEKPEFKEAVEEAIDHRED 59 Query: 78 LLA------------------EELLEEP---APTAEELANPVFYSKMRDRKQRMGTFLLE 116 ++ E++ P P + V K R ++L+ Sbjct: 60 VMIASARIGLKQLLEGYVQKKEKVTYVPDKNDPLNDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ + RP + +++ + Sbjct: 120 REERKKDKDRLLASER-----RPLVIEVQDEETRRQ 150 >gi|160933467|ref|ZP_02080855.1| hypothetical protein CLOLEP_02313 [Clostridium leptum DSM 753] gi|156867344|gb|EDO60716.1| hypothetical protein CLOLEP_02313 [Clostridium leptum DSM 753] Length = 381 Score = 36.4 bits (82), Expect = 1.9, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 29/72 (40%), Gaps = 12/72 (16%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQE-----------A 67 + ++ E + + +G+ + + + +P STFYRWI+ + Sbjct: 1 MKFTKEEKIAFVKRYQDGETVIKICNENQIP-RSTFYRWIQDYQQTVTDTGTVVTPQEFQ 59 Query: 68 YTEALQCRLDLL 79 Y + +L+ + Sbjct: 60 YLKRRINKLEDM 71 >gi|326387445|ref|ZP_08209054.1| beta-lactamase-like protein [Novosphingobium nitrogenifigens DSM 19370] gi|326208101|gb|EGD58909.1| beta-lactamase-like protein [Novosphingobium nitrogenifigens DSM 19370] Length = 550 Score = 36.0 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 32/86 (37%), Gaps = 12/86 (13%) Query: 33 VANGKALGHVLRKVGMPKYSTFYRWIKK------DLKLQEAYTEALQCRLDLLAEELLEE 86 V + +++ HV G P+ Y W++ +++ +A R + +A +L++ Sbjct: 354 VTDRQSMIHVSGHPGRPELMKLYEWLRPQMLVPVHGEIRHMDEQARLGRSEGIARTILQK 413 Query: 87 ------PAPTAEELANPVFYSKMRDR 106 AP V ++ Sbjct: 414 NGDLVRLAPNGPHKITEVRAGRLVLD 439 >gi|197119615|ref|YP_002140042.1| hypothetical protein Gbem_3249 [Geobacter bemidjiensis Bem] gi|197088975|gb|ACH40246.1| hypothetical protein Gbem_3249 [Geobacter bemidjiensis Bem] Length = 213 Score = 36.0 bits (81), Expect = 2.7, Method: Composition-based stats. Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 12/99 (12%) Query: 5 VKKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKL 64 VKK A R +K Y I + G+ TFY W+ D K Sbjct: 6 VKKGGAAPRQRKFLCAYGESSTFNI----------SEACKAAGI-GRRTFYNWLTDDSKF 54 Query: 65 QEAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKM 103 + + E + RLD + E L A ++ +F +K Sbjct: 55 KTDFEELTESRLDAI-ESALHSRAVIEKDTTALIFLAKT 92 >gi|290968650|ref|ZP_06560188.1| hypothetical protein HMPREF0889_0290 [Megasphaera genomosp. type_1 str. 28L] gi|290781303|gb|EFD93893.1| hypothetical protein HMPREF0889_0290 [Megasphaera genomosp. type_1 str. 28L] Length = 195 Score = 35.6 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 44/140 (31%), Gaps = 18/140 (12%) Query: 36 GKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAY---------------TEALQCRLDLLA 80 G + + K+G+ Y W+KK + ++ + T+ + + + Sbjct: 57 GWSAEKIADKIGV-SKQCVYGWLKKYPEFKKIWYECVDDIIYTAEKGMTDLTRPQTRKIK 115 Query: 81 EELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRP- 139 E + + ++LE + +Y R SV+ I + Sbjct: 116 ETITYTDKDGHITGRRERETVTTDPPDFKACKYMLENFAPDEYKDRQSVQVSGDISVTAI 175 Query: 140 -AIERLREHYKHLKPIDSER 158 L + K K D E+ Sbjct: 176 DIAGILAKARKKAKENDDEK 195 >gi|282878961|ref|ZP_06287725.1| peptidase family M13 [Prevotella buccalis ATCC 35310] gi|281298960|gb|EFA91365.1| peptidase family M13 [Prevotella buccalis ATCC 35310] Length = 667 Score = 35.6 bits (80), Expect = 3.1, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 41/92 (44%), Gaps = 3/92 (3%) Query: 89 PTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHY 148 P + + ++ + + +L +L + Y + ++E K + + A++ R + Sbjct: 45 PLPAAYSRFGSFDQLAEDNNKRINTILGELQKKTY-KKGTMEQKLSDFYKLAMDADRRNK 103 Query: 149 KHLKPIDSERIPHKS--TEKPLEIVESSIAEQ 178 + +KP+ ++ T+K LE ++ A Q Sbjct: 104 EGIKPVKPLLDEMEAAKTKKDLEQLQLKYAAQ 135 >gi|326404151|ref|YP_004284233.1| putative transposase for insertion sequence element [Acidiphilium multivorum AIU301] gi|325051013|dbj|BAJ81351.1| putative transposase for insertion sequence element [Acidiphilium multivorum AIU301] Length = 88 Score = 35.6 bits (80), Expect = 3.3, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 31/74 (41%), Gaps = 12/74 (16%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRW-----------IKKDLKLQEAY 68 +S E GIL + +G + V R+ G+ +TFY+W ++ +L++ Sbjct: 5 RFSEEQIIGILKEQESGMSTAEVCRRHGI-SSATFYKWKAKFGGLEVSDARRLKQLEDEN 63 Query: 69 TEALQCRLDLLAEE 82 + D + + Sbjct: 64 ARLKRLLADAVLDN 77 >gi|218508778|ref|ZP_03506656.1| putative phage-related protein [Rhizobium etli Brasil 5] Length = 75 Score = 35.6 bits (80), Expect = 3.3, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 3/59 (5%) Query: 93 ELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHL 151 E + S++R + ++ EKL +KYG +V +E T + +E + + L Sbjct: 16 ENGEALRRSQLRISTR---QWIAEKLKPKKYGAKVELEHGVTSGVAELLEAINGKTRGL 71 >gi|120603621|ref|YP_968021.1| transposase IS3/IS911 family protein [Desulfovibrio vulgaris DP4] gi|120563850|gb|ABM29594.1| transposase IS3/IS911 family protein [Desulfovibrio vulgaris DP4] Length = 88 Score = 35.6 bits (80), Expect = 3.3, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKD 61 ++ E GIL Q G + + R+ GM +TFY+W K Sbjct: 5 RFTEEKIIGILKQAEAGMTVTSLCRQHGMSD-ATFYKWRSKY 45 >gi|308488189|ref|XP_003106289.1| hypothetical protein CRE_15283 [Caenorhabditis remanei] gi|308254279|gb|EFO98231.1| hypothetical protein CRE_15283 [Caenorhabditis remanei] Length = 510 Score = 35.6 bits (80), Expect = 3.7, Method: Composition-based stats. Identities = 19/62 (30%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 124 GPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSIIEH 183 G V+ KH I+ +P I R + + ++I K TE+P E VE S E+ + Sbjct: 121 GELVATLQKHFINPKPTIPRRPQPKFTVTAEMRQKIALKDTEQP-ECVELSSDEEGRATN 179 Query: 184 NN 185 N Sbjct: 180 TN 181 >gi|332886223|gb|EGK06467.1| hypothetical protein HMPREF9456_00341 [Dysgonomonas mossii DSM 22836] Length = 237 Score = 35.2 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 30/62 (48%), Gaps = 2/62 (3%) Query: 20 YYSPELFAGILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ L I+ + + ++ + + + + + S FY W K + ++A A++ R + Sbjct: 3 KYTERLAEKIVSLIEEDTYSITEICKHLKITRKS-FYEWRDKKPEFRKAIEAAMESRDEA 61 Query: 79 LA 80 LA Sbjct: 62 LA 63 >gi|307205501|gb|EFN83818.1| USP6 N-terminal-like protein [Harpegnathos saltator] Length = 1089 Score = 35.2 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 3/123 (2%) Query: 65 QEAYTEALQCRLDLLAE-ELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLLEK-LSNQK 122 +E Y A Q L A +LEE A AE A ++ R ++ E+ L + Sbjct: 258 EERYARAKQENATLQARILMLEEAAKDAETRAEERLQAEQRRHREWACRLERERQLQLEN 317 Query: 123 YGPRVSVESKHTIDLRPAIERLREHYKHLKPIDSERIPHKSTEKPLEIVESSIAEQSIIE 182 Y ++ + LR I R+RE + + + R+ + E E + +E+ + Sbjct: 318 YAIKLQAVELDSSSLRDEIARVREQLERARADKT-RLENDLQEARREADAARESERHAVS 376 Query: 183 HNN 185 N Sbjct: 377 RAN 379 >gi|332828274|gb|EGK00986.1| hypothetical protein HMPREF9455_02775 [Dysgonomonas gadei ATCC BAA-286] Length = 215 Score = 35.2 bits (79), Expect = 4.1, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 23/57 (40%), Gaps = 2/57 (3%) Query: 20 YYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCR 75 Y EL I+ + + + V + G+ TFY W+ + + EA+ R Sbjct: 3 KYGAELTEKIVSLIEDEFFTVSQVCKATGI-SRQTFYYWMDTKEEFRREIEEAMAHR 58 >gi|332884242|gb|EGK04510.1| hypothetical protein HMPREF9456_00837 [Dysgonomonas mossii DSM 22836] Length = 237 Score = 35.2 bits (79), Expect = 4.7, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 30/62 (48%), Gaps = 2/62 (3%) Query: 20 YYSPELFAGILDQVA-NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDL 78 Y+ L I+ + + ++ + + + + + S FY W K + ++A A++ R + Sbjct: 3 KYTERLAEKIVSLIEEDTYSITEICKHLKITRKS-FYEWRDKKPEFRKAIEAAIESRDET 61 Query: 79 LA 80 LA Sbjct: 62 LA 63 >gi|224543178|ref|ZP_03683717.1| hypothetical protein CATMIT_02378 [Catenibacterium mitsuokai DSM 15897] gi|224523965|gb|EEF93070.1| hypothetical protein CATMIT_02378 [Catenibacterium mitsuokai DSM 15897] Length = 1051 Score = 35.2 bits (79), Expect = 4.7, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 49/123 (39%), Gaps = 12/123 (9%) Query: 58 IKKDLKLQEAYTEALQC--RLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGTFLL 115 +K++ +L+ Y E + R D+ EE + ++ + K K + + + Sbjct: 356 LKREPELRSKYEELKEKDNRADIKHEEFVRYCTQETDKTKKDLDNRKTLSDKTKEISDI- 414 Query: 116 EKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKH----LKPIDSERIPHKSTEKPLEIV 171 K+ + H + + ++ + +E K LK +D+E + +S K ++ Sbjct: 415 -----NKHTDDSIEQLNHIKNSKKSLVKKQEEGKKELDTLKDVDTEAVVLESQRKDFKLK 469 Query: 172 ESS 174 S Sbjct: 470 VES 472 >gi|332828026|gb|EGK00748.1| hypothetical protein HMPREF9455_03022 [Dysgonomonas gadei ATCC BAA-286] Length = 236 Score = 35.2 bits (79), Expect = 4.7, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y ++ I+ + ++ + + + TFY W K + EA EA+ R D Sbjct: 1 MKYKEKIATEIIRMIEQDLCSISEICKSFKI-SRKTFYEWKKVKPEFGEAVEEAIDHRED 59 Query: 78 LLA------------------EELLEEP---APTAEELANPVFYSKMRDRKQRMGTFLLE 116 ++ E++ P P + V K R ++L+ Sbjct: 60 VMIASARIGLKQLLEGYVQKKEKITYVPDKNDPLNDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ + RP + +++ + Sbjct: 120 REERKKDKDRLLASER-----RPLVIEVQDEETKRQ 150 >gi|325271819|ref|ZP_08138284.1| hypothetical protein G1E_03165 [Pseudomonas sp. TJI-51] gi|324103046|gb|EGC00428.1| hypothetical protein G1E_03165 [Pseudomonas sp. TJI-51] Length = 140 Score = 34.8 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 14/59 (23%), Positives = 21/59 (35%) Query: 8 AKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQE 66 A K A + Y + IL+ +G +L + G P Y T W+ E Sbjct: 2 ASKPAAAHQAKRYTQDDRRVIILEYFNSGLSLTKFCKVKGRPSYQTMKLWLCISPDFDE 60 >gi|140845|sp|P17985|YIA3_RHISP RecName: Full=Insertion element ISR1 uncharacterized 10 kDa protein A3 gi|48671|emb|CAA29830.1| unnamed protein product [Rhizobium sp.] Length = 88 Score = 34.8 bits (78), Expect = 5.7, Method: Composition-based stats. Identities = 15/76 (19%), Positives = 30/76 (39%), Gaps = 12/76 (15%) Query: 20 YYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRW-----------IKKDLKLQEAY 68 ++ E GIL + G A V R+ G+ +TFY+W ++ L++ Sbjct: 5 RFTEEQIIGILREQEAGVATAEVCRRHGV-SSATFYKWKAKFGGLDVSEARRLKALEDEN 63 Query: 69 TEALQCRLDLLAEELL 84 + D + + + Sbjct: 64 ARLKRMLADAMLDNVA 79 >gi|289551570|ref|YP_003472474.1| hypothetical protein SLGD_02259 [Staphylococcus lugdunensis HKU09-01] gi|289181101|gb|ADC88346.1| hypothetical protein SLGD_02259 [Staphylococcus lugdunensis HKU09-01] Length = 142 Score = 34.4 bits (77), Expect = 7.2, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 31/104 (29%), Gaps = 9/104 (8%) Query: 35 NGKALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLDLLAEELLEE-------P 87 +G + R G+ + T +W + + EA + + D E L + Sbjct: 25 DGLTDEQIARNFGV-NFKTVMKWKEDHSEFMEAIKKGKEV-SDYELENALHKRATGYYYE 82 Query: 88 APTAEELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVES 131 T V K F L+ KY +V E Sbjct: 83 EETVTNAGKVVKVKKYEHANPTSLIFALKNRLPHKYRDKVEQEI 126 >gi|332828769|gb|EGK01461.1| hypothetical protein HMPREF9455_02294 [Dysgonomonas gadei ATCC BAA-286] Length = 241 Score = 34.4 bits (77), Expect = 7.3, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y + + I+ ++ + + + TFY W K + +EA EA+ R D Sbjct: 1 MKYKEKTASEIIRMTEQDLCSISEICKSFKI-SRKTFYEWKKMKPEFKEAVEEAIDHRED 59 Query: 78 LLA------------------EELLEEP---APTAEELANPVFYSKMRDRKQRMGTFLLE 116 ++ E++ P P + V K R ++L+ Sbjct: 60 VMIASARIGLKQLLEGYVQKKEKITYVPDKNDPVNDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ + RP + +++ + Sbjct: 120 REERKKDKDRLLASER-----RPLVIEVQDEETKRQ 150 >gi|312891882|ref|ZP_07751386.1| transposase IS3/IS911 family protein [Mucilaginibacter paludis DSM 18603] gi|311295615|gb|EFQ72780.1| transposase IS3/IS911 family protein [Mucilaginibacter paludis DSM 18603] Length = 103 Score = 34.4 bits (77), Expect = 7.8, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 40/107 (37%), Gaps = 19/107 (17%) Query: 6 KKAKKAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDLKLQ 65 KK + A R Y E +L ++ G+++ V +G+ +RW Sbjct: 4 KKTETARR------KYDDEFKRDVLKMISGGRSVQDVSNSLGLNTGM-VHRW-------- 48 Query: 66 EAYTEALQCRLDLLAEELLEEPAPTAEELANPVFYSKMRDRKQRMGT 112 Q + D+ ++ L E P + + +++R+ +Q Sbjct: 49 ----RGEQNQADVTSKSLAEGHGPEVSRGDHELVKARLREVEQERDI 91 >gi|85374276|ref|YP_458338.1| hypothetical protein ELI_07245 [Erythrobacter litoralis HTCC2594] gi|84787359|gb|ABC63541.1| hypothetical protein ELI_07245 [Erythrobacter litoralis HTCC2594] Length = 302 Score = 34.0 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 43/107 (40%), Gaps = 5/107 (4%) Query: 21 YSPELFAGILDQVANGKALGHVLRKVGMPKYSTF-YRWIKKDLKLQEAYTEALQCRLDLL 79 ++PE + +A+ ++ + V M + R + + + A+ AL+C + L Sbjct: 34 WTPERQRDFIAALADTGSVEAACKAVNMSTVGAYRIRRLPEGASFRAAWDAALECGVQRL 93 Query: 80 AEELLEEPAPTAEELANP---VFYSKMRDRKQRMGTFLLEKLSNQKY 123 E++ + A E K R R+ F+L + ++Y Sbjct: 94 -EDVAMDRALNGVEETLHYHGQEVGKRRKYNDRLLMFMLRNRAPERY 139 >gi|226223094|ref|YP_002757201.1| transposase [Listeria monocytogenes Clip81459] gi|225875556|emb|CAS04259.1| Putative transposase [Listeria monocytogenes serotype 4b str. CLIP 80459] Length = 88 Score = 34.0 bits (76), Expect = 9.3, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 1/44 (2%) Query: 19 IYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKKDL 62 I Y E+ +L + G + + ++ G+P+Y F++W+K Sbjct: 4 IEYGEEIQKSLLVLYSRGSTIQSICKEYGIPRYE-FHKWMKLHD 46 >gi|207108484|ref|ZP_03242646.1| GTP-binding protein Era [Helicobacter pylori HPKX_438_CA4C1] Length = 143 Score = 34.0 bits (76), Expect = 9.3, Method: Composition-based stats. Identities = 29/123 (23%), Positives = 50/123 (40%), Gaps = 15/123 (12%) Query: 38 ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEA-LQCRLDLLAEELLEE----PAPTAE 92 AL + K P F + + D K+++ Y E + D L++E+ E E Sbjct: 31 ALLECISKHLSPSAWLFEKDLMSDEKMRDIYKEIIRESLFDFLSDEIPYESDVMIDKFIE 90 Query: 93 ELANPVFYSKMRDRKQRMGTFLLEKLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 E Y+++ K+R +KY R+ +K + IE L++ YK +K Sbjct: 91 EECIDKVYARIIVEKER----------PKKYMQRLVNNTKTDFFDKALIEELKKRYKIVK 140 Query: 153 PID 155 I Sbjct: 141 YIQ 143 >gi|332829129|gb|EGK01793.1| hypothetical protein HMPREF9455_01941 [Dysgonomonas gadei ATCC BAA-286] Length = 237 Score = 34.0 bits (76), Expect = 9.3, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 28/156 (17%) Query: 19 IYYSPELFAGILDQVANGK-ALGHVLRKVGMPKYSTFYRWIKKDLKLQEAYTEALQCRLD 77 + Y+ + I++ + + + + + + TFY W K + +E EA R + Sbjct: 1 MKYNNKTAERIVEMIEQDLYGVSEICKILRI-SRQTFYDWKKNKPEFKEEVEEAFIRREE 59 Query: 78 LLA------------------EELLEEPA---PTAEELANPVFYSKMRDRKQRMGTFLLE 116 +L E++ P P + V K R ++L+ Sbjct: 60 VLIANARIGLKQLLEGYVQKKEKVTYVPDKNNPVEDVEKCRVVEKKFCPPSIRAIKYVLD 119 Query: 117 KLSNQKYGPRVSVESKHTIDLRPAIERLREHYKHLK 152 + +K R+ K RP + +++ + Sbjct: 120 REERRKEQQRLESREK-----RPLVIEVQDEETRRQ 150 >gi|297580361|ref|ZP_06942288.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536007|gb|EFH74841.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 107 Score = 34.0 bits (76), Expect = 9.4, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 10 KAVRAKKGCIYYSPELFAGILDQVANGKALGHVLRKVGMPKYSTFYRWIKK 60 K R ++ IL Q G + + R+ GM +TFY+W K Sbjct: 14 KRRRFFMKTSKFTDSQIMAILKQAEAGTPVRELCREHGM-SSATFYKWRAK 63 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.300 0.120 0.308 Lambda K H 0.267 0.0375 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,756,737,008 Number of Sequences: 14124377 Number of extensions: 92347369 Number of successful extensions: 337906 Number of sequences better than 10.0: 268 Number of HSP's better than 10.0 without gapping: 115 Number of HSP's successfully gapped in prelim test: 248 Number of HSP's that attempted gapping in prelim test: 337629 Number of HSP's gapped (non-prelim): 434 length of query: 185 length of database: 4,842,793,630 effective HSP length: 130 effective length of query: 55 effective length of database: 3,006,624,620 effective search space: 165364354100 effective search space used: 165364354100 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 76 (34.0 bits)