BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781182|ref|YP_003065595.1| hypothetical protein CLIBASIA_05445 [Candidatus Liberibacter asiaticus str. psy62] (94 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781182|ref|YP_003065595.1| hypothetical protein CLIBASIA_05445 [Candidatus Liberibacter asiaticus str. psy62] gi|254040859|gb|ACT57655.1| hypothetical protein CLIBASIA_05445 [Candidatus Liberibacter asiaticus str. psy62] Length = 94 Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD Sbjct: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Query: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG Sbjct: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 >gi|315122553|ref|YP_004063042.1| hypothetical protein CKC_04025 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495955|gb|ADR52554.1| hypothetical protein CKC_04025 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 246 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 45/93 (48%), Positives = 70/93 (75%) Query: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 + DDQG+ I+F L ++ + L++ NLI E++EWSN +R+ GNKAVHEG++ IE+++ Sbjct: 154 ITDDQGKAIDFSNNINLKNKIKSLREQNLITRELYEWSNHIRLAGNKAVHEGEAHIEDAN 213 Query: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRD 93 EC EFV+LFC I+FTLPALI++ K + ++S + Sbjct: 214 ECFEFVHLFCHILFTLPALIEQNKLINSDKSTE 246 >gi|254503691|ref|ZP_05115842.1| hypothetical protein SADFL11_3730 [Labrenzia alexandrii DFL-11] gi|222439762|gb|EEE46441.1| hypothetical protein SADFL11_3730 [Labrenzia alexandrii DFL-11] Length = 242 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 25/68 (36%), Positives = 38/68 (55%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 L R L+ +I + + +W++ +RI+GN A HEG E E + F+ LF DIVF L Sbjct: 173 LISRINDLRDQGVITQGLADWAHHIRIDGNLAAHEGVGDQEAVREYIGFLRLFLDIVFAL 232 Query: 77 PALIKEKK 84 P I ++ Sbjct: 233 PERIAARR 240 >gi|218688916|ref|YP_002397128.1| hypothetical protein ECED1_1107 [Escherichia coli ED1a] gi|218426480|emb|CAR07308.1| hypothetical protein ECED1_1107 [Escherichia coli ED1a] Length = 224 Score = 53.5 bits (127), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 27/70 (38%), Positives = 44/70 (62%), Gaps = 3/70 (4%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ--SSIEESDECLEFVNLFCDIVF 74 LS R + + + LI E++ EW++ VRI+ NKAVH + + IE S + L F +F F Sbjct: 146 LSQRIQMIYKKGLITEQMKEWAHIVRIDANKAVHTDEVFTPIEAS-QILSFTEMFLVYAF 204 Query: 75 TLPALIKEKK 84 TLPA+++ ++ Sbjct: 205 TLPAMVEARR 214 >gi|188533144|ref|YP_001906941.1| Hypothetical protein, probable Cecropin family protein [Erwinia tasmaniensis Et1/99] gi|188028186|emb|CAO96044.1| Hypothetical protein, probable Cecropin family protein [Erwinia tasmaniensis Et1/99] Length = 232 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 1/69 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH-EGQSSIEESDECLEFVNLFCDIVFT 75 L R LK+ +I +E++EW++ VR++GN+ H E + + + L F F FT Sbjct: 156 LQKRIEKLKEKGVITKEMYEWADIVRLDGNEQTHSEDEFDPQSAKAVLAFTETFLLYAFT 215 Query: 76 LPALIKEKK 84 LP +++EK+ Sbjct: 216 LPEMVREKR 224 >gi|212703623|ref|ZP_03311751.1| hypothetical protein DESPIG_01668 [Desulfovibrio piger ATCC 29098] gi|212672953|gb|EEB33436.1| hypothetical protein DESPIG_01668 [Desulfovibrio piger ATCC 29098] Length = 104 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 21/68 (30%), Positives = 36/68 (52%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 L DR L + +I + EW++ +R GN+A H+ + S +E+ E + F +F F L Sbjct: 33 LYDRIDNLYKKGVITASLKEWASIIRRAGNEAAHDMEGSPDEAGELVAFTRIFLQFTFEL 92 Query: 77 PALIKEKK 84 P +I + Sbjct: 93 PDIISRTR 100 >gi|123442587|ref|YP_001006564.1| hypothetical protein YE2343 [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122089548|emb|CAL12396.1| hypothetical phage protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 197 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 44/78 (56%), Gaps = 5/78 (6%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVFT 75 LS R L I E++ +W++ VRI+ N AVH ++ + +E++E + F +F FT Sbjct: 124 LSQRISMLYGKGKITEQMKDWAHIVRIDSNGAVHSDEAFTKDEAEEVIGFTEVFLIYSFT 183 Query: 76 LPALIKEKKSTHPNQSRD 93 LP ++ K+ N SR+ Sbjct: 184 LPEMVTAKQ----NASRE 197 >gi|320322284|gb|EFW78378.1| hypothetical protein PsgB076_22996 [Pseudomonas syringae pv. glycinea str. B076] gi|320331941|gb|EFW87877.1| hypothetical protein PsgRace4_00025 [Pseudomonas syringae pv. glycinea str. race 4] gi|330882657|gb|EGH16806.1| hypothetical protein Pgy4_27600 [Pseudomonas syringae pv. glycinea str. race 4] Length = 146 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 2/61 (3%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD--ECLEFVNLFCDIVF 74 LS + +K+ LI E +FEWS+ +R+ GN+A H SI + D + +EF N D +F Sbjct: 70 LSLSLKKMKEDGLIDERLFEWSDALRVVGNEAAHGVGISIAQPDARDTIEFTNAILDYLF 129 Query: 75 T 75 + Sbjct: 130 S 130 >gi|330965893|gb|EGH66153.1| hypothetical protein PSYAC_14870 [Pseudomonas syringae pv. actinidiae str. M302091] Length = 201 Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 123 KDGSLYSRIEAAATHHLITPEMASWAHEIRLDANDQRHSDEDASMPSEAEASKAVEFATA 182 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 183 LAQFLFVLPARVARGRA 199 >gi|171779682|ref|ZP_02920638.1| hypothetical protein STRINF_01519 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281784|gb|EDT47218.1| hypothetical protein STRINF_01519 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 244 Score = 38.9 bits (89), Expect = 0.21, Method: Compositional matrix adjust. Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 5/82 (6%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDI 72 L+++ YL + + IE I + + VR+ GN AVH GQ ++++ E L FVNL D Sbjct: 158 LNNKIGYLVRQGMPIE-IQQMLDSVRVVGNNAVHPGQIDLKDNKELAASLLTFVNLIVDN 216 Query: 73 VFTLPALIKEKKSTHPNQSRDG 94 + P IK + P R Sbjct: 217 RISQPKKIKSVYDSLPESYRKA 238 >gi|301386145|ref|ZP_07234563.1| hypothetical protein PsyrptM_26064 [Pseudomonas syringae pv. tomato Max13] Length = 201 Score = 38.9 bits (89), Expect = 0.22, Method: Compositional matrix adjust. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 123 KDGSLYSRIEAAATHHLITAEMASWAHEIRLDANDQRHSDEDASLPSEAEASKAVEFAMA 182 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 183 LAQFLFVLPARVARGRA 199 >gi|213971540|ref|ZP_03399651.1| hypothetical protein PSPTOT1_0852 [Pseudomonas syringae pv. tomato T1] gi|302060707|ref|ZP_07252248.1| hypothetical protein PsyrptK_12010 [Pseudomonas syringae pv. tomato K40] gi|213923732|gb|EEB57316.1| hypothetical protein PSPTOT1_0852 [Pseudomonas syringae pv. tomato T1] Length = 221 Score = 38.1 bits (87), Expect = 0.36, Method: Compositional matrix adjust. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 143 KDGSLYSRIEAAATHHLITAEMASWAHEIRLDANDQRHSDEDASLPSEAEASKAVEFAMA 202 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 203 LAQFLFVLPARVARGRA 219 >gi|257452213|ref|ZP_05617512.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] gi|317058756|ref|ZP_07923241.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] gi|313684432|gb|EFS21267.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] Length = 1088 Score = 37.0 bits (84), Expect = 0.80, Method: Compositional matrix adjust. Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 +RI+ K M SDR LK++ LI E+I + +R +GNKAVH G EE+ E L Sbjct: 52 ERISDSKKNMASDRLLALKKYELIPEDIEKILTTLRKKGNKAVH-GIYGDEETAETL 107 >gi|332533299|ref|ZP_08409165.1| hypothetical protein PH505_ao00110 [Pseudoalteromonas haloplanktis ANT/505] gi|332037181|gb|EGI73637.1| hypothetical protein PH505_ao00110 [Pseudoalteromonas haloplanktis ANT/505] Length = 266 Score = 36.6 bits (83), Expect = 1.0, Method: Compositional matrix adjust. Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 13/95 (13%) Query: 4 DQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE-- 61 + G++I+ + ++S L Q L + VR+ GN+AVH G+ +++ E Sbjct: 171 ETGKKIDKDIASLVSKGLNPLVQQAL---------DIVRVVGNEAVHPGEIDFKDNKEIA 221 Query: 62 --CLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 VNL CD + T P +KE P +G Sbjct: 222 LKLFGLVNLICDQMITHPKQVKELYGDLPKDKLEG 256 >gi|253583388|ref|ZP_04860586.1| type I restriction enzyme EcoKI subunit R [Fusobacterium varium ATCC 27725] gi|251833960|gb|EES62523.1| type I restriction enzyme EcoKI subunit R [Fusobacterium varium ATCC 27725] Length = 1088 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 20/64 (31%), Positives = 33/64 (51%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 +++N + G DR + LK ++LI E+I +R +GN AVH + E++ L V Sbjct: 52 EKLNEPESGSQLDRIKILKTYDLIPEDIENILQLIRKKGNTAVHNMSGNESEAETLLSLV 111 Query: 67 NLFC 70 C Sbjct: 112 VKLC 115 >gi|149004391|ref|ZP_01829131.1| hypothetical protein CGSSp14BS69_07031 [Streptococcus pneumoniae SP14-BS69] gi|149004569|ref|ZP_01829262.1| hypothetical protein CGSSp14BS69_06037 [Streptococcus pneumoniae SP14-BS69] gi|147757543|gb|EDK64568.1| hypothetical protein CGSSp14BS69_06037 [Streptococcus pneumoniae SP14-BS69] gi|147757640|gb|EDK64658.1| hypothetical protein CGSSp14BS69_07031 [Streptococcus pneumoniae SP14-BS69] Length = 236 Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust. Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 4/66 (6%) Query: 33 EIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDIVFTLPALIKEKKSTHP 88 EI + + VR+ GN AVH GQ I+++ E L F+NL D T P I + + P Sbjct: 165 EIQQMLDSVRVIGNNAVHPGQIDIKDNKELALSLLSFINLIVDNRITQPKKILDIYNLLP 224 Query: 89 NQSRDG 94 + R+ Sbjct: 225 DSYRNS 230 >gi|257463918|ref|ZP_05628304.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. D12] gi|317061445|ref|ZP_07925930.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. D12] gi|313687121|gb|EFS23956.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. D12] Length = 1087 Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust. Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 + I+ K M SDR LK++ LI E+I + +R +GNKAVH G EE+ E L Sbjct: 52 EHISDSKKNMASDRLLVLKKYELIPEDIEKILTTLRKKGNKAVH-GSYGDEETAETL 107 >gi|78067675|ref|YP_370444.1| hypothetical protein Bcep18194_A6206 [Burkholderia sp. 383] gi|77968420|gb|ABB09800.1| hypothetical protein Bcep18194_A6206 [Burkholderia sp. 383] Length = 210 Score = 35.4 bits (80), Expect = 2.3, Method: Compositional matrix adjust. Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 5/75 (6%) Query: 7 QRINFEKCG---MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG--QSSIEESDE 61 +R+ EK G ML+ LK +I + + EW++ +R+E N H +++ E++ + Sbjct: 113 ERLIKEKTGKPQMLAKGLADLKSRGVIDQRLHEWADALRVERNIGAHASDVETTKEDAQD 172 Query: 62 CLEFVNLFCDIVFTL 76 ++F D V+TL Sbjct: 173 IIDFTVAIFDYVYTL 187 >gi|223934041|ref|ZP_03625994.1| conserved hypothetical protein [Streptococcus suis 89/1591] gi|223897298|gb|EEF63706.1| conserved hypothetical protein [Streptococcus suis 89/1591] Length = 239 Score = 35.4 bits (80), Expect = 2.8, Method: Compositional matrix adjust. Identities = 23/60 (38%), Positives = 30/60 (50%), Gaps = 4/60 (6%) Query: 33 EIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDIVFTLPALIKEKKSTHP 88 EI + + VR+ GN AVH GQ I+++ E L FVNL D + P I E P Sbjct: 168 EIQQMLDSVRVIGNNAVHPGQIEIQDNKELALSLLNFVNLITDSQISQPKKIAEIYGLLP 227 >gi|78485663|ref|YP_391588.1| hypothetical protein Tcr_1319 [Thiomicrospira crunogena XCL-2] gi|78363949|gb|ABB41914.1| hypothetical protein Tcr_1319 [Thiomicrospira crunogena XCL-2] Length = 232 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 17/68 (25%), Positives = 37/68 (54%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 L R +L ++ E + + S ++ +GN HEG S E++++ L+F ++ + ++T Sbjct: 155 LGLRLPWLFDKGVLPEALRDLSGCIKDDGNDGAHEGTLSEEDAEDLLDFTSVLLERIYTE 214 Query: 77 PALIKEKK 84 P ++ K Sbjct: 215 PERLRLAK 222 >gi|257465992|ref|ZP_05630303.1| type I restriction enzyme EcoKI subunit R [Fusobacterium gonidiaformans ATCC 25563] gi|315917148|ref|ZP_07913388.1| type I restriction enzyme EcoKI subunit R [Fusobacterium gonidiaformans ATCC 25563] gi|313691023|gb|EFS27858.1| type I restriction enzyme EcoKI subunit R [Fusobacterium gonidiaformans ATCC 25563] Length = 1088 Score = 33.5 bits (75), Expect = 9.1, Method: Compositional matrix adjust. Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 1/48 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 M SDR LK++ LI E+I + +R +GNKAVH G EE+ E L Sbjct: 61 MASDRLLALKKYELIPEDIEKILTTLRKKGNKAVH-GIYGDEETAETL 107 Searching..................................................done Results from round 2 >gi|315122553|ref|YP_004063042.1| hypothetical protein CKC_04025 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495955|gb|ADR52554.1| hypothetical protein CKC_04025 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 246 Score = 144 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 45/93 (48%), Positives = 70/93 (75%) Query: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 + DDQG+ I+F L ++ + L++ NLI E++EWSN +R+ GNKAVHEG++ IE+++ Sbjct: 154 ITDDQGKAIDFSNNINLKNKIKSLREQNLITRELYEWSNHIRLAGNKAVHEGEAHIEDAN 213 Query: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRD 93 EC EFV+LFC I+FTLPALI++ K + ++S + Sbjct: 214 ECFEFVHLFCHILFTLPALIEQNKLINSDKSTE 246 >gi|254781182|ref|YP_003065595.1| hypothetical protein CLIBASIA_05445 [Candidatus Liberibacter asiaticus str. psy62] gi|254040859|gb|ACT57655.1| hypothetical protein CLIBASIA_05445 [Candidatus Liberibacter asiaticus str. psy62] Length = 94 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD Sbjct: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Query: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG Sbjct: 61 ECLEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 >gi|254503691|ref|ZP_05115842.1| hypothetical protein SADFL11_3730 [Labrenzia alexandrii DFL-11] gi|222439762|gb|EEE46441.1| hypothetical protein SADFL11_3730 [Labrenzia alexandrii DFL-11] Length = 242 Score = 103 bits (258), Expect = 6e-21, Method: Composition-based stats. Identities = 25/72 (34%), Positives = 39/72 (54%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDI 72 + L R L+ +I + + +W++ +RI+GN A HEG E E + F+ LF DI Sbjct: 169 ENRKLISRINDLRDQGVITQGLADWAHHIRIDGNLAAHEGVGDQEAVREYIGFLRLFLDI 228 Query: 73 VFTLPALIKEKK 84 VF LP I ++ Sbjct: 229 VFALPERIAARR 240 >gi|212703623|ref|ZP_03311751.1| hypothetical protein DESPIG_01668 [Desulfovibrio piger ATCC 29098] gi|212672953|gb|EEB33436.1| hypothetical protein DESPIG_01668 [Desulfovibrio piger ATCC 29098] Length = 104 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 21/68 (30%), Positives = 36/68 (52%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 L DR L + +I + EW++ +R GN+A H+ + S +E+ E + F +F F L Sbjct: 33 LYDRIDNLYKKGVITASLKEWASIIRRAGNEAAHDMEGSPDEAGELVAFTRIFLQFTFEL 92 Query: 77 PALIKEKK 84 P +I + Sbjct: 93 PDIISRTR 100 >gi|188533144|ref|YP_001906941.1| Hypothetical protein, probable Cecropin family protein [Erwinia tasmaniensis Et1/99] gi|188028186|emb|CAO96044.1| Hypothetical protein, probable Cecropin family protein [Erwinia tasmaniensis Et1/99] Length = 232 Score = 98.2 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 1/69 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH-EGQSSIEESDECLEFVNLFCDIVFT 75 L R LK+ +I +E++EW++ VR++GN+ H E + + + L F F FT Sbjct: 156 LQKRIEKLKEKGVITKEMYEWADIVRLDGNEQTHSEDEFDPQSAKAVLAFTETFLLYAFT 215 Query: 76 LPALIKEKK 84 LP +++EK+ Sbjct: 216 LPEMVREKR 224 >gi|218688916|ref|YP_002397128.1| hypothetical protein ECED1_1107 [Escherichia coli ED1a] gi|218426480|emb|CAR07308.1| hypothetical protein ECED1_1107 [Escherichia coli ED1a] Length = 224 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 25/69 (36%), Positives = 43/69 (62%), Gaps = 1/69 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ-SSIEESDECLEFVNLFCDIVFT 75 LS R + + + LI E++ EW++ VRI+ NKAVH + + E+ + L F +F FT Sbjct: 146 LSQRIQMIYKKGLITEQMKEWAHIVRIDANKAVHTDEVFTPIEASQILSFTEMFLVYAFT 205 Query: 76 LPALIKEKK 84 LPA+++ ++ Sbjct: 206 LPAMVEARR 214 >gi|123442587|ref|YP_001006564.1| hypothetical protein YE2343 [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122089548|emb|CAL12396.1| hypothetical phage protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 197 Score = 83.2 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 5/83 (6%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ-SSIEESDECLEFVNLFC 70 LS R L I E++ +W++ VRI+ N AVH + + +E++E + F +F Sbjct: 119 SNKEQLSQRISMLYGKGKITEQMKDWAHIVRIDSNGAVHSDEAFTKDEAEEVIGFTEVFL 178 Query: 71 DIVFTLPALIKEKKSTHPNQSRD 93 FTLP ++ K+ N SR+ Sbjct: 179 IYSFTLPEMVTAKQ----NASRE 197 >gi|295688019|ref|YP_003591712.1| hypothetical protein Cseg_0582 [Caulobacter segnis ATCC 21756] gi|295429922|gb|ADG09094.1| hypothetical protein Cseg_0582 [Caulobacter segnis ATCC 21756] Length = 237 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 43/92 (46%), Gaps = 2/92 (2%) Query: 4 DQGQRINFEK-CGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDE 61 D G +I F G L + L + I + EW++ VR+ GN H+ + +++ Sbjct: 137 DLGLKIRFPDLKGDLYKKVDKLASDHEIPLSLAEWAHEVRVIGNDGAHDLDGCNTDDAQA 196 Query: 62 CLEFVNLFCDIVFTLPALIKEKKSTHPNQSRD 93 +FV+ +F+LP +I ++ ++ + Sbjct: 197 AHDFVDAVLRYLFSLPGMIAARRRIETSEETE 228 >gi|213971540|ref|ZP_03399651.1| hypothetical protein PSPTOT1_0852 [Pseudomonas syringae pv. tomato T1] gi|302060707|ref|ZP_07252248.1| hypothetical protein PsyrptK_12010 [Pseudomonas syringae pv. tomato K40] gi|213923732|gb|EEB57316.1| hypothetical protein PSPTOT1_0852 [Pseudomonas syringae pv. tomato T1] Length = 221 Score = 61.2 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS----SIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 143 KDGSLYSRIEAAATHHLITAEMASWAHEIRLDANDQRHSDEDASLPSEAEASKAVEFAMA 202 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 203 LAQFLFVLPARVARGRA 219 >gi|301386145|ref|ZP_07234563.1| hypothetical protein PsyrptM_26064 [Pseudomonas syringae pv. tomato Max13] Length = 201 Score = 60.8 bits (146), Expect = 5e-08, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS----SIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 123 KDGSLYSRIEAAATHHLITAEMASWAHEIRLDANDQRHSDEDASLPSEAEASKAVEFAMA 182 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 183 LAQFLFVLPARVARGRA 199 >gi|330965893|gb|EGH66153.1| hypothetical protein PSYAC_14870 [Pseudomonas syringae pv. actinidiae str. M302091] Length = 201 Score = 60.4 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS----SIEESDECLEFVNL 68 K G L R H+LI E+ W++ +R++ N H + S E+ + +EF Sbjct: 123 KDGSLYSRIEAAATHHLITPEMASWAHEIRLDANDQRHSDEDASMPSEAEASKAVEFATA 182 Query: 69 FCDIVFTLPALIKEKKS 85 +F LPA + ++ Sbjct: 183 LAQFLFVLPARVARGRA 199 >gi|218663727|ref|ZP_03519657.1| hypothetical protein RetlI_32940 [Rhizobium etli IE4771] Length = 233 Score = 58.1 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 34/70 (48%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 G L+ + + L ++ + +W++ +RI GN H+ + E+ F + F + Sbjct: 160 GTLAGKIKALVSQGVLPASLGDWADEIRIVGNDGAHDDGVNREDLKAARMFCDSFLRYLI 219 Query: 75 TLPALIKEKK 84 TLP I+ ++ Sbjct: 220 TLPKEIELRR 229 >gi|261493748|ref|ZP_05990265.1| hypothetical protein COK_2151 [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261310593|gb|EEY11779.1| hypothetical protein COK_2151 [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 234 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLEFVNLFCDIVFT 75 L+ R L + +++ ++ +R +A H S++E +E F LF FT Sbjct: 157 LNKRIEKLFDDGKLTKDLKNFALHIRSLSAEASHTYNDFSVDELEELRLFTQLFLRYTFT 216 Query: 76 LPALIKEKKSTHPNQSRD 93 LPA+I P++SR+ Sbjct: 217 LPAMI-------PDESRE 227 >gi|320322284|gb|EFW78378.1| hypothetical protein PsgB076_22996 [Pseudomonas syringae pv. glycinea str. B076] gi|320331941|gb|EFW87877.1| hypothetical protein PsgRace4_00025 [Pseudomonas syringae pv. glycinea str. race 4] gi|330882657|gb|EGH16806.1| hypothetical protein Pgy4_27600 [Pseudomonas syringae pv. glycinea str. race 4] Length = 146 Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 2/78 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIE--ESDECLEFVNLFCDIV 73 LS + +K+ LI E +FEWS+ +R+ GN+A H SI ++ + +EF N D + Sbjct: 69 NLSLSLKKMKEDGLIDERLFEWSDALRVVGNEAAHGVGISIAQPDARDTIEFTNAILDYL 128 Query: 74 FTLPALIKEKKSTHPNQS 91 F+ ++ K +S Sbjct: 129 FSYRDRFEQFKKRRSGES 146 >gi|167578796|ref|ZP_02371670.1| hypothetical protein BthaT_11678 [Burkholderia thailandensis TXDOH] Length = 96 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH-EGQSSIEESDECLEFVNLFCDIVFT 75 L R L I +++ W++ VR++GN A+H E + + E + E +EF L ++T Sbjct: 22 LEKRIDKLADAGKITQDLKTWAHRVRLDGNDALHEEEEFTRESATELMEFTRLLLTYLYT 81 Query: 76 LPALIKEK 83 LP I+ + Sbjct: 82 LPEKIRLR 89 >gi|134288672|ref|YP_001111134.1| gp55, hypothetical protein [Burkholderia phage phi644-2] gi|134132057|gb|ABO60854.1| gp55, hypothetical protein [Burkholderia phage phi644-2] Length = 104 Score = 52.4 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH-EGQSSIEESDECLEFVNLFCDIVFT 75 L R L I +++ W++ VR++GN A+H E + + E + E +EF L ++T Sbjct: 22 LEKRIDRLADAGKITQDLKIWAHRVRLDGNDALHEEEEFTRESATELMEFTRLLLTYLYT 81 Query: 76 LPALIKEK 83 LP I+ + Sbjct: 82 LPEKIRLR 89 >gi|78067675|ref|YP_370444.1| hypothetical protein Bcep18194_A6206 [Burkholderia sp. 383] gi|77968420|gb|ABB09800.1| hypothetical protein Bcep18194_A6206 [Burkholderia sp. 383] Length = 210 Score = 52.0 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 2/71 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH--EGQSSIEESDECLEFVNLFCDIV 73 ML+ LK +I + + EW++ +R+E N H + +++ E++ + ++F D V Sbjct: 125 MLAKGLADLKSRGVIDQRLHEWADALRVERNIGAHASDVETTKEDAQDIIDFTVAIFDYV 184 Query: 74 FTLPALIKEKK 84 +TL ++ + Sbjct: 185 YTLAEKYEKYR 195 >gi|262393969|ref|YP_003285823.1| hypothetical protein VEA_003198 [Vibrio sp. Ex25] gi|262337563|gb|ACY51358.1| hypothetical protein VEA_003198 [Vibrio sp. Ex25] Length = 215 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 2/69 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH--EGQSSIEESDECLEFVNLFCDIV 73 DR L + I + W++ +R GN A H S E++ +CL+ + Sbjct: 139 NFYDRLESLHKAGHIDARLLSWAHGIRALGNDAAHAITANVSKEDAKDCLDLTEALLIYI 198 Query: 74 FTLPALIKE 82 ++L +E Sbjct: 199 YSLGHRFEE 207 >gi|254228158|ref|ZP_04921587.1| conserved hypothetical protein [Vibrio sp. Ex25] gi|151939231|gb|EDN58060.1| conserved hypothetical protein [Vibrio sp. Ex25] Length = 201 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 2/69 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH--EGQSSIEESDECLEFVNLFCDIV 73 DR L + I + W++ +R GN A H S E++ +CL+ + Sbjct: 125 NFYDRLESLHKAGHIDARLLSWAHGIRALGNDAAHAITANVSKEDAKDCLDLTEALLIYI 184 Query: 74 FTLPALIKE 82 ++L +E Sbjct: 185 YSLGHRFEE 193 >gi|284030151|ref|YP_003380082.1| hypothetical protein Kfla_2207 [Kribbella flavida DSM 17836] gi|283809444|gb|ADB31283.1| hypothetical protein Kfla_2207 [Kribbella flavida DSM 17836] Length = 187 Score = 50.4 bits (119), Expect = 7e-05, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 35/74 (47%), Gaps = 4/74 (5%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSI---EESDECLEFVNLFC 70 G L + L+Q ++I E + E ++ +R GN+ H + E+++E L ++ Sbjct: 96 GTLMSKIDGLRQADVISEAMKEAAHEIRFAGNEVAHGDLVTDPLTREDAEEVLGLMDAII 155 Query: 71 DIVFTLPALIKEKK 84 V+ PA + + Sbjct: 156 LRVYQEPAQVARVR 169 >gi|222085947|ref|YP_002544479.1| hypothetical protein Arad_2333 [Agrobacterium radiobacter K84] gi|221723395|gb|ACM26551.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 142 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 34/71 (47%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 G L+ + + L + + + +W++ +R+ GN H+ + +E F + F + Sbjct: 69 GTLAAKIKTLVGGGELPKSLGDWADEIRLIGNDGAHDDGVTRDELKAARMFCDSFLRYLI 128 Query: 75 TLPALIKEKKS 85 TLP + ++S Sbjct: 129 TLPTEVALRRS 139 >gi|78485663|ref|YP_391588.1| hypothetical protein Tcr_1319 [Thiomicrospira crunogena XCL-2] gi|78363949|gb|ABB41914.1| hypothetical protein Tcr_1319 [Thiomicrospira crunogena XCL-2] Length = 232 Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 37/69 (53%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFT 75 L R +L ++ E + + S ++ +GN HEG S E++++ L+F ++ + ++T Sbjct: 154 NLGLRLPWLFDKGVLPEALRDLSGCIKDDGNDGAHEGTLSEEDAEDLLDFTSVLLERIYT 213 Query: 76 LPALIKEKK 84 P ++ K Sbjct: 214 EPERLRLAK 222 >gi|312831426|emb|CBY17606.1| unnamed protein product [Escherichia coli LF82] Length = 232 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 2/82 (2%) Query: 4 DQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 +QG + L R +L ++L+ E + E + V+ +GN HEG +++ Sbjct: 145 EQGPAQKIRRSLGL--RMEWLFDNHLLPEALRELAECVKDDGNDGAHEGILDKAAAEDLE 202 Query: 64 EFVNLFCDIVFTLPALIKEKKS 85 +F LF + ++T P + E K+ Sbjct: 203 DFTYLFLERLYTEPQRLIEAKT 224 >gi|226313133|ref|YP_002773027.1| hypothetical protein BBR47_35460 [Brevibacillus brevis NBRC 100599] gi|226096081|dbj|BAH44523.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 243 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 32/67 (47%), Gaps = 2/67 (2%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH--EGQSSIEESDECLEFVNLFCDIVF 74 L + + L ++ + E ++ +R GN+A H E + S + ++F ++ D V+ Sbjct: 142 LYHKLKDLSDKGVLPPIVNEMASVLRELGNEAAHGDEREFSDDLISSMIKFTHVILDYVY 201 Query: 75 TLPALIK 81 LP + Sbjct: 202 NLPDKLS 208 >gi|260579753|ref|ZP_05847610.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258602105|gb|EEW15425.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 221 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 8/77 (10%) Query: 2 KDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFE-WSNFVRIEGNKAVHEGQSSIEE-S 59 K+D+ +FE+C L ++ + I + W++ +R+ GN A HE +S + + Sbjct: 141 KNDRSWAPSFEECVNF------LVNEGILTQRIKDSWADSIRLWGNAATHELKSVRQSTA 194 Query: 60 DECLEFVNLFCDIVFTL 76 + +EF + + F Sbjct: 195 LKAIEFTQMILRMAFEF 211 >gi|218782532|ref|YP_002433850.1| hypothetical protein Dalk_4704 [Desulfatibacillum alkenivorans AK-01] gi|218763916|gb|ACL06382.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 233 Score = 45.4 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 34/65 (52%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 L R +L + + E + + S +R +GN H G + E++++ LEF + + ++T Sbjct: 155 LGYRLPWLFDNGYLPEGLRDLSTCLREDGNDGAHAGNLTHEDAEDLLEFTTILLERIYTE 214 Query: 77 PALIK 81 P ++ Sbjct: 215 PEKLR 219 >gi|53718204|ref|YP_107190.1| hypothetical protein BPSL0564 [Burkholderia pseudomallei K96243] gi|167737075|ref|ZP_02409849.1| hypothetical protein Bpse14_03366 [Burkholderia pseudomallei 14] gi|167814189|ref|ZP_02445869.1| hypothetical protein Bpse9_03543 [Burkholderia pseudomallei 91] gi|52208618|emb|CAH34554.1| hypothetical protein BPSL0564 [Burkholderia pseudomallei K96243] Length = 202 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 35/68 (51%), Gaps = 2/68 (2%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH--EGQSSIEESDECLEFVNLFCDIVF 74 L+ L+ +I E + W++ +R+E N H +++ +++++ ++F D V+ Sbjct: 126 LARGLAELRAQGVIDERLHAWADALRVERNIGAHASNTETTKDDAEDIIDFTVAIFDYVY 185 Query: 75 TLPALIKE 82 TL ++ Sbjct: 186 TLAERYQK 193 >gi|154508189|ref|ZP_02043831.1| hypothetical protein ACTODO_00683 [Actinomyces odontolyticus ATCC 17982] gi|153797823|gb|EDN80243.1| hypothetical protein ACTODO_00683 [Actinomyces odontolyticus ATCC 17982] Length = 184 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 3/82 (3%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH---EGQSSIEESDECLEFVNLFCD 71 G L + L++ LI + S+ VR+ GN+ H + E+ D+ LEF++ + Sbjct: 98 GTLKQKIDSLEESRLINPTLAGLSHQVRLMGNEMAHGDLDAAIDAEDCDDLLEFMSALLE 157 Query: 72 IVFTLPALIKEKKSTHPNQSRD 93 ++ P + ++ + + + Sbjct: 158 EIYQRPISLARRQELNRQRKAE 179 >gi|282901658|ref|ZP_06309574.1| hypothetical protein CRC_03078 [Cylindrospermopsis raciborskii CS-505] gi|281193421|gb|EFA68402.1| hypothetical protein CRC_03078 [Cylindrospermopsis raciborskii CS-505] Length = 322 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 32/62 (51%), Gaps = 1/62 (1%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSIEESDECLEFVNLFCDIV 73 G L ++ + +K+ +I + +++W++ +R+ N VH+ ++ + F D + Sbjct: 243 GSLENKLKKMKEQEIIDQNLYDWADRLRVTENDFVHKNITFGATDAQYIINFTYTVIDYI 302 Query: 74 FT 75 FT Sbjct: 303 FT 304 >gi|302388494|ref|YP_003824316.1| SH3 type 3 domain protein [Clostridium saccharolyticum WM1] gi|302199122|gb|ADL06693.1| SH3 type 3 domain protein [Clostridium saccharolyticum WM1] Length = 377 Score = 43.1 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 29/52 (55%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 G L+D L + I + + + +R+ GNKAVHEG S +++E ++ + Sbjct: 55 GDLADSIDQLFEGRFISQSAKDHYHRIRVLGNKAVHEGDDSPYDANEAVQLL 106 >gi|229195090|ref|ZP_04321865.1| Type III restriction protein res subunit [Bacillus cereus m1293] gi|228588319|gb|EEK46362.1| Type III restriction protein res subunit [Bacillus cereus m1293] Length = 1068 Score = 43.1 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 42/82 (51%), Gaps = 9/82 (10%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNLF 69 DR + L++ L+ +E+++ + +R +GN A HE G +++E+ L F+ ++ Sbjct: 65 DRIQTLRREGLLEKELYDMFDALRKKGNNAAHEAGYGTVKEAQALLLMAFRLGIWFMEVY 124 Query: 70 CDIVFTLPALIKEKKSTHPNQS 91 D F P ++ +K + S Sbjct: 125 GDWDFEAPEYVEPEKEEKVDAS 146 >gi|282896260|ref|ZP_06304282.1| hypothetical protein CRD_01142 [Raphidiopsis brookii D9] gi|281198756|gb|EFA73635.1| hypothetical protein CRD_01142 [Raphidiopsis brookii D9] Length = 325 Score = 43.1 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 32/68 (47%), Gaps = 1/68 (1%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIV 73 L + +K+ +I + +++W++ +R+ N VH+ + ++ + F D + Sbjct: 243 VSLEKKLNKMKEQEIIDQNLYDWADRLRVTENDFVHKSITCGATDAQHIINFTYTVTDYI 302 Query: 74 FTLPALIK 81 FT I+ Sbjct: 303 FTYRKKIE 310 >gi|42779916|ref|NP_977163.1| type I restriction enzyme EcoKI subunit R [Bacillus cereus ATCC 10987] gi|42735834|gb|AAS39771.1| type I restriction-modification system, R subunit [Bacillus cereus ATCC 10987] Length = 1068 Score = 43.1 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 42/82 (51%), Gaps = 9/82 (10%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNLF 69 DR + L++ L+ +E+++ + +R +GN A HE G +++E+ L F+ ++ Sbjct: 65 DRIQTLRREGLLEKELYDMFDALRKKGNNAAHEAGYGTVKEAQALLLMSFRLGIWFMEVY 124 Query: 70 CDIVFTLPALIKEKKSTHPNQS 91 D F P ++ +K + S Sbjct: 125 GDWDFEAPEYVEPEKEEKVDAS 146 >gi|239629139|ref|ZP_04672170.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA] gi|239519285|gb|EEQ59151.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA] Length = 330 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 37/80 (46%), Gaps = 3/80 (3%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 G LSD L + I + + + +RI GNKAVHEG + ++++ L V+ Sbjct: 53 GDLSDTIDQLYEGQWINKATKDNYHTIRILGNKAVHEGDDAAYDANQAY---QLLTQEVY 109 Query: 75 TLPALIKEKKSTHPNQSRDG 94 +S+ P+Q+ G Sbjct: 110 VFANEFSGGRSSRPSQASRG 129 >gi|82617326|emb|CAI64238.1| hypothetical protein [uncultured archaeon] gi|268323035|emb|CBH36623.1| conserved hypothetical protein [uncultured archaeon] Length = 215 Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 29/63 (46%), Gaps = 1/63 (1%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVFTLPAL 79 YL++ + + W + +R GNKA H +S + ++ L F +++ + + Sbjct: 148 LSYLEEQGFVTPPMKGWVDLIRQHGNKATHSLESPDKKRAESTLMFTAELLRLIYEMEHM 207 Query: 80 IKE 82 K+ Sbjct: 208 SKQ 210 >gi|82617194|emb|CAI64101.1| hypothetical protein [uncultured archaeon] Length = 215 Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 29/63 (46%), Gaps = 1/63 (1%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVFTLPAL 79 YL++ + + W + +R GNKA H +S + ++ L F +++ + + Sbjct: 148 LSYLEEQGFVTPPMKGWVDLIRQHGNKATHSLESPDKKRAESTLMFTAELLRLIYEMEHM 207 Query: 80 IKE 82 K+ Sbjct: 208 SKQ 210 >gi|266621661|ref|ZP_06114596.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479] gi|288866665|gb|EFC98963.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479] Length = 326 Score = 42.3 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 41/92 (44%), Gaps = 12/92 (13%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN----LFC 70 G L+D L + + I + + + +R+ GNKAVH+G S +++E + ++ F Sbjct: 55 GDLADSIDQLFEGHWISQATKDHYHRIRVLGNKAVHDGNDSPYDANEAFQLLSQEATAFA 114 Query: 71 DI--------VFTLPALIKEKKSTHPNQSRDG 94 DI T P +S+ P Q G Sbjct: 115 DIYSGRRRSTTPTRPQQRPASRSSQPAQRSTG 146 >gi|90592606|ref|YP_529866.1| hypothetical protein [Lactobacillus phage KC5a] gi|116629261|ref|YP_814433.1| hypothetical protein LGAS_0601 [Lactobacillus gasseri ATCC 33323] gi|116629321|ref|YP_814493.1| hypothetical protein LGAS_0663 [Lactobacillus gasseri ATCC 33323] gi|311111441|ref|ZP_07712838.1| conserved hypothetical protein [Lactobacillus gasseri MV-22] gi|89891935|gb|ABD78808.1| hypothetical protein [Lactobacillus phage KC5a] gi|116094843|gb|ABJ59995.1| hypothetical protein LGAS_0601 [Lactobacillus gasseri ATCC 33323] gi|116094903|gb|ABJ60055.1| hypothetical protein LGAS_0663 [Lactobacillus gasseri ATCC 33323] gi|311066595|gb|EFQ46935.1| conserved hypothetical protein [Lactobacillus gasseri MV-22] Length = 205 Score = 42.3 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 35/70 (50%), Gaps = 1/70 (1%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSI-EESDECLEFVNLFC 70 + + YL +++ + + +W + +R GN+A HE Q + E++ + ++F + Sbjct: 127 KDNLKFVEYVNYLNENHFVSVKSHDWVDQIRKYGNEATHEIQVNTKEDAQKIIKFCEMIL 186 Query: 71 DIVFTLPALI 80 + + P+ I Sbjct: 187 KMNYEYPSEI 196 >gi|291167071|gb|EFE29117.1| hypothetical protein HMPREF0389_01039 [Filifactor alocis ATCC 35896] Length = 217 Score = 42.3 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG--QSSIEESDECLEFVNLFCDIVF 74 L + L ++ E + +R GN A H + + + EC+EF+++ + ++ Sbjct: 137 LELKIADLVNKKILPEMMNSACWILRQLGNDAAHADDVEFTELDVKECIEFISIIINYLY 196 Query: 75 TLPALIKEKKSTHPNQSRD 93 ++P I + KS ++ Sbjct: 197 SMPIRIDQLKSKIEDRKTK 215 >gi|266619616|ref|ZP_06112551.1| type I restriction-modification system, R subunit [Clostridium hathewayi DSM 13479] gi|288868818|gb|EFD01117.1| type I restriction-modification system, R subunit [Clostridium hathewayi DSM 13479] Length = 1088 Score = 42.0 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 29/65 (44%) Query: 8 RINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 R+ F K R L + L+ ++ + + +R + NKAVHE S ++ L+ + Sbjct: 53 RLPFPKDNTSISRIDTLYREGLLTNDLTDILHLMRKKRNKAVHENYESESDAKILLQMAH 112 Query: 68 LFCDI 72 C Sbjct: 113 SLCQW 117 >gi|329667173|gb|AEB93121.1| hypothetical protein LJP_0795 [Lactobacillus johnsonii DPC 6026] Length = 202 Score = 42.0 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 33/70 (47%), Gaps = 1/70 (1%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ-SSIEESDECLEFVNLFC 70 + + YL + + EW + +R GN+A HE Q ++ +++ + ++F + Sbjct: 127 KDNCKFVEYVNYLADSHFVSVRSHEWVDQIRKYGNEATHEIQVNTQQDAQKIIKFCEMIL 186 Query: 71 DIVFTLPALI 80 + + P+ I Sbjct: 187 KMNYEYPSEI 196 >gi|91783377|ref|YP_558583.1| hypothetical protein Bxe_A2440 [Burkholderia xenovorans LB400] gi|91687331|gb|ABE30531.1| hypothetical protein Bxe_A2440 [Burkholderia xenovorans LB400] Length = 235 Score = 41.6 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 3/91 (3%) Query: 4 DQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 D+ +N + L R ++ ++N++ + + + V+ +GN H+G S ++++ Sbjct: 140 DETPGLNAQGRRSLGLRMNWMFENNILPAALQDLAQCVKDDGNDGAHDGTLSAVDAEDLQ 199 Query: 64 EFVNLFCDIVFTLP---ALIKEKKSTHPNQS 91 EF + ++T P + KE+++ +S Sbjct: 200 EFTFELLERLYTEPKRLEIAKERRAARHAKS 230 >gi|218901961|ref|YP_002449795.1| Type III restriction enzyme, res subunit [Bacillus cereus AH820] gi|218535465|gb|ACK87863.1| Type III restriction enzyme, res subunit [Bacillus cereus AH820] Length = 1068 Score = 41.6 bits (96), Expect = 0.038, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 9/82 (10%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNLF 69 DR + L++ L+ +E+++ + +R +GN A HE G +++E+ L F+ ++ Sbjct: 65 DRMQTLRREGLLEKELYDMFDALRKKGNNAAHEAGYGTVKEAQALLLMAFRLGIWFMEVY 124 Query: 70 CDIVFTLPALIKEKKSTHPNQS 91 D F P I+ +K + S Sbjct: 125 GDWDFEAPEYIEPEKEEKVDVS 146 >gi|300361369|ref|ZP_07057546.1| conserved hypothetical protein [Lactobacillus gasseri JV-V03] gi|300353988|gb|EFJ69859.1| conserved hypothetical protein [Lactobacillus gasseri JV-V03] Length = 205 Score = 40.8 bits (94), Expect = 0.055, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 35/70 (50%), Gaps = 1/70 (1%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSI-EESDECLEFVNLFC 70 + + YL +++ + + +W + +R GN+A HE Q + E++ + ++F + Sbjct: 127 KDNLKFVEYVNYLNENHFVSVKSHDWVDQIRKYGNEANHEIQVNTKEDAQKIIKFCEMIL 186 Query: 71 DIVFTLPALI 80 + + P+ I Sbjct: 187 KMNYEYPSEI 196 >gi|184155147|ref|YP_001843487.1| hypothetical protein LAF_0671 [Lactobacillus fermentum IFO 3956] gi|183226491|dbj|BAG27007.1| conserved hypothetical protein [Lactobacillus fermentum IFO 3956] Length = 229 Score = 40.8 bits (94), Expect = 0.056, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVF 74 YL ++ + E+ EW + +R EGN A H + + ++ L+FV + I F Sbjct: 126 VDYLVENGYVPEKSREWIDKIRTEGNSATHNQTAKNKSDAQRILDFVQMLLLINF 180 >gi|323697975|ref|ZP_08109887.1| Type I site-specific deoxyribonuclease [Desulfovibrio sp. ND132] gi|323457907|gb|EGB13772.1| Type I site-specific deoxyribonuclease [Desulfovibrio desulfuricans ND132] Length = 1122 Score = 40.8 bits (94), Expect = 0.070, Method: Composition-based stats. Identities = 12/49 (24%), Positives = 22/49 (44%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLF 69 R L L+ +I + + +R GN A HE + + + + L F + Sbjct: 74 IRRLDDEGLLTPKINQVLHILRKAGNVAAHELEGEPQHALDALRFARIL 122 >gi|149179563|ref|ZP_01858098.1| hypothetical protein PM8797T_03705 [Planctomyces maris DSM 8797] gi|148841598|gb|EDL56026.1| hypothetical protein PM8797T_03705 [Planctomyces maris DSM 8797] Length = 238 Score = 40.4 bits (93), Expect = 0.075, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 38/94 (40%), Gaps = 9/94 (9%) Query: 4 DQGQRINFEKCG------MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSI 56 DQG+ + G L + L LI + + + +R GN + H Q + Sbjct: 144 DQGKLVRERATGKIVLRSNLEGKINGLWMKGLISKNQSKVLHQLRRLGNDSAHALDQPPL 203 Query: 57 EESDECLEFVNLFCDIVFTLPALIKEKKSTHPNQ 90 + +EC+E + V+ P L+ K+ + + Sbjct: 204 KLIEECIEALEHLLIQVYDQPELL--KRLMNRKK 235 >gi|90022721|ref|YP_528548.1| methyl-accepting chemotaxis sensory transducer [Saccharophagus degradans 2-40] gi|89952321|gb|ABD82336.1| methyl-accepting chemotaxis sensory transducer [Saccharophagus degradans 2-40] Length = 232 Score = 40.4 bits (93), Expect = 0.078, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIE-----ESDECLEFVNLFCDIV 73 D + + + +I +W++ +RI GN H + +++ +S+E +FV+ F Sbjct: 147 DLIKQINSLASLPGDIKDWAHQIRIFGNWGAHPDRDNLKNIEATDSEEVHDFVSKFFMYT 206 Query: 74 FTLPALIKEKKSTHPNQ 90 F +P +K + + Sbjct: 207 FIMPEKVKLSRIRRDEK 223 >gi|167771151|ref|ZP_02443204.1| hypothetical protein ANACOL_02506 [Anaerotruncus colihominis DSM 17241] gi|167666821|gb|EDS10951.1| hypothetical protein ANACOL_02506 [Anaerotruncus colihominis DSM 17241] Length = 80 Score = 40.4 bits (93), Expect = 0.080, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 31/60 (51%), Gaps = 1/60 (1%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLEFVNLFCDIVFTLPAL 79 YL+ + I + W + +R GNK VH+ +++ E++ + + F+ ++ +P L Sbjct: 19 IDYLEANGYIGVQNKAWVDKIRTIGNKYVHQLDEATEEDARKVILFLKQLLGNLYEMPQL 78 >gi|160935960|ref|ZP_02083334.1| hypothetical protein CLOBOL_00855 [Clostridium bolteae ATCC BAA-613] gi|158441202|gb|EDP18919.1| hypothetical protein CLOBOL_00855 [Clostridium bolteae ATCC BAA-613] Length = 331 Score = 40.4 bits (93), Expect = 0.080, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 28/52 (53%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 G LSD L + I + + + +RI GNKAVHEG + ++++ + + Sbjct: 56 GDLSDTIDQLYEGQWINKATKDNYHTIRILGNKAVHEGDDTAYDANQAFQLL 107 >gi|260845531|ref|YP_003223309.1| hypothetical protein ECO103_3438 [Escherichia coli O103:H2 str. 12009] gi|257760678|dbj|BAI32175.1| hypothetical protein ECO103_3438 [Escherichia coli O103:H2 str. 12009] Length = 209 Score = 40.4 bits (93), Expect = 0.084, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCD 71 ++D + L + L I + ++ RI GN+AVH G+ S+E+ + + +N+ D Sbjct: 119 NINDDIKSLVEKGLPPR-IQQAADICRIVGNQAVHPGEISLEDDPQLTHGLFKLLNIIVD 177 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 178 DRITRPKEIEAMFQSMPEGPRQG 200 >gi|206577454|ref|YP_002236657.1| hypothetical protein KPK_0785 [Klebsiella pneumoniae 342] gi|206566512|gb|ACI08288.1| conserved hypothetical protein [Klebsiella pneumoniae 342] Length = 210 Score = 40.0 bits (92), Expect = 0.096, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ + L + L + I + ++ RI GN+AVH G+ + + + +N+ D Sbjct: 119 NINADIKSLVEKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLAHGLFKLLNIIVD 177 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 178 DRITRPKEIEAMFQSMPEGPRQG 200 >gi|309793931|ref|ZP_07688356.1| conserved hypothetical protein [Escherichia coli MS 145-7] gi|308122338|gb|EFO59600.1| conserved hypothetical protein [Escherichia coli MS 145-7] Length = 219 Score = 40.0 bits (92), Expect = 0.11, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ R L Q L + I + ++ RI GN+AVH G+ + + + +N+ D Sbjct: 121 NINKDIRSLVQKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLAHGLFKLLNIIVD 179 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 180 DRITRPKEIEAMFQSMPEGPRQG 202 >gi|283788456|ref|YP_003368321.1| hypothetical protein ROD_49471 [Citrobacter rodentium ICC168] gi|282951910|emb|CBG91628.1| hypothetical protein ROD_49471 [Citrobacter rodentium ICC168] Length = 209 Score = 39.6 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCD 71 ++D + L + L I + ++ RI GN+AVH G+ S+++ + + +N+ D Sbjct: 119 NINDDIKSLVEKGLPPR-IQQAADICRIVGNQAVHPGEISLDDDPQLTHGLFKLLNIIVD 177 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 178 DRITRPKEIEAMFQSMPEGPRQG 200 >gi|323963885|gb|EGB59379.1| hypothetical protein ERJG_04731 [Escherichia coli M863] gi|327252357|gb|EGE64029.1| hypothetical protein ECSTEC7V_3209 [Escherichia coli STEC_7v] Length = 73 Score = 39.6 bits (91), Expect = 0.12, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 34/65 (52%) Query: 21 TRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTLPALI 80 +L ++L+ E + E + V+ +GN HEG +++ +F LF + ++T P + Sbjct: 1 MEWLFDNHLLPEALRELAECVKDDGNDGAHEGILDKAAAEDLEDFTYLFLERLYTEPQRL 60 Query: 81 KEKKS 85 E K+ Sbjct: 61 IEAKT 65 >gi|332533299|ref|ZP_08409165.1| hypothetical protein PH505_ao00110 [Pseudoalteromonas haloplanktis ANT/505] gi|332037181|gb|EGI73637.1| hypothetical protein PH505_ao00110 [Pseudoalteromonas haloplanktis ANT/505] Length = 266 Score = 39.6 bits (91), Expect = 0.14, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 13/95 (13%) Query: 4 DQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDEC- 62 + G++I+ + ++S L Q L + VR+ GN+AVH G+ +++ E Sbjct: 171 ETGKKIDKDIASLVSKGLNPLVQQAL---------DIVRVVGNEAVHPGEIDFKDNKEIA 221 Query: 63 ---LEFVNLFCDIVFTLPALIKEKKSTHPNQSRDG 94 VNL CD + T P +KE P +G Sbjct: 222 LKLFGLVNLICDQMITHPKQVKELYGDLPKDKLEG 256 >gi|323486841|ref|ZP_08092159.1| N-acetylmuramoyl-L-alanine amidase [Clostridium symbiosum WAL-14163] gi|323690859|ref|ZP_08105153.1| hypothetical protein HMPREF9475_00014 [Clostridium symbiosum WAL-14673] gi|323399854|gb|EGA92234.1| N-acetylmuramoyl-L-alanine amidase [Clostridium symbiosum WAL-14163] gi|323505078|gb|EGB20846.1| hypothetical protein HMPREF9475_00014 [Clostridium symbiosum WAL-14673] Length = 293 Score = 39.6 bits (91), Expect = 0.15, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 3/70 (4%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 G L D L +I + E + +R GNKA+HE +S +++ +L V+ Sbjct: 58 GNLIDMIDGLYDDGIISKTTCEHYHKIRTIGNKAIHEEDNSAYNANQA---HHLLSQEVY 114 Query: 75 TLPALIKEKK 84 T E++ Sbjct: 115 TFANDYNERR 124 >gi|260856986|ref|YP_003230877.1| hypothetical protein ECO26_3952 [Escherichia coli O26:H11 str. 11368] gi|257755635|dbj|BAI27137.1| hypothetical protein ECO26_3952 [Escherichia coli O26:H11 str. 11368] Length = 174 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ R L Q L + I + ++ RI GN+AVH G+ + + + +N+ D Sbjct: 76 NINKDIRSLVQKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLAHGLFKLLNIIVD 134 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 135 DRITRPKEIEAMFQSMPEGPRQG 157 >gi|297582111|ref|ZP_06944029.1| predicted protein [Vibrio cholerae RC385] gi|297533631|gb|EFH72474.1| predicted protein [Vibrio cholerae RC385] Length = 248 Score = 39.3 bits (90), Expect = 0.17, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 1/71 (1%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLEFVNLFCDIVF 74 L + L + L+ E + R GNKAVHE S+EE ++ V + ++ Sbjct: 170 NLEGQISGLHEKGLLTEAHSSILHEHRFMGNKAVHELDMPSLEELKIAIDIVEHTLENIY 229 Query: 75 TLPALIKEKKS 85 LP E ++ Sbjct: 230 ELPEKASELRA 240 >gi|114777354|ref|ZP_01452351.1| hypothetical protein SPV1_13679 [Mariprofundus ferrooxydans PV-1] gi|114552136|gb|EAU54638.1| hypothetical protein SPV1_13679 [Mariprofundus ferrooxydans PV-1] Length = 212 Score = 39.3 bits (90), Expect = 0.19, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 5/86 (5%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSS----IEESDECLEFVN 67 E +++ L + L + + + VR+ GN+AVH G E + VN Sbjct: 121 ESGKNINNDIAALVKKGL-NPTLQKSLDVVRVIGNEAVHPGTIDLNDEPETAIALFNLVN 179 Query: 68 LFCDIVFTLPALIKEKKSTHPNQSRD 93 + + T P +I+ + P++ R+ Sbjct: 180 IITQAMITQPKMIESLYESLPDEKRN 205 >gi|225420319|ref|ZP_03762622.1| hypothetical protein CLOSTASPAR_06662 [Clostridium asparagiforme DSM 15981] gi|225041005|gb|EEG51251.1| hypothetical protein CLOSTASPAR_06662 [Clostridium asparagiforme DSM 15981] Length = 145 Score = 38.9 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 16/47 (34%), Positives = 25/47 (53%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE 61 G LSD L + I + + +RI GNKAVHEG + ++++ Sbjct: 56 GDLSDTIDQLYEGRWISRNTKDHYHNIRILGNKAVHEGDDTAYDANQ 102 >gi|320450644|ref|YP_004202740.1| hypothetical protein TSC_c15760 [Thermus scotoductus SA-01] gi|320150813|gb|ADW22191.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 148 Score = 38.9 bits (89), Expect = 0.23, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 37/88 (42%), Gaps = 9/88 (10%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVH---------EGQSSIEESDECLEF 65 G L+DR + + + + ++ + +R+ GN A H +GQ E EF Sbjct: 58 GSLADRLKRAHEEGKLTTQTYKLAGVLRLAGNAAAHYELWKIDPSQGQEDREMILALFEF 117 Query: 66 VNLFCDIVFTLPALIKEKKSTHPNQSRD 93 +N + + P ++E + + R+ Sbjct: 118 LNEVTEELIAKPKRLEEMEQKLSRKLRE 145 >gi|229542891|ref|ZP_04431951.1| type III restriction protein res subunit [Bacillus coagulans 36D1] gi|229327311|gb|EEN92986.1| type III restriction protein res subunit [Bacillus coagulans 36D1] Length = 1071 Score = 38.5 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 9/75 (12%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSIEESDECLE--------FVNLF 69 DR L++ LI E+ + +R +GN+A+HE + + EE+ L F+ ++ Sbjct: 67 DRIHMLRREGLIEPELMDIFETLRRKGNQAMHEALKFTTEEAKALLRLAFRLSIWFMEVY 126 Query: 70 CDIVFTLPALIKEKK 84 + F P I+ K+ Sbjct: 127 GEWDFQAPEYIEPKE 141 >gi|311111507|ref|ZP_07712904.1| conserved hypothetical protein [Lactobacillus gasseri MV-22] gi|311066661|gb|EFQ47001.1| conserved hypothetical protein [Lactobacillus gasseri MV-22] Length = 187 Score = 38.5 bits (88), Expect = 0.34, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 30/60 (50%), Gaps = 1/60 (1%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSI-EESDECLEFVNLFC 70 + + YL +++ + + +W + +R GN+A HE Q + E++ + ++F + Sbjct: 127 KDNLKFVEYVNYLNENHFVSVKSHDWVDQIRKYGNEATHEIQVNTKEDAQKIIKFCEMIL 186 >gi|301643791|ref|ZP_07243828.1| conserved hypothetical protein [Escherichia coli MS 146-1] gi|301077824|gb|EFK92630.1| conserved hypothetical protein [Escherichia coli MS 146-1] Length = 150 Score = 38.1 bits (87), Expect = 0.35, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCD 71 ++ + L + L + I + ++ RI GN+AVH G+ S+++ + + +N+ D Sbjct: 59 NINADIKSLVEKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLTHGLFKLLNIIVD 117 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 118 DRITRPKEIEAMFQSMPEGPRQG 140 >gi|295401702|ref|ZP_06811669.1| type III restriction protein res subunit [Geobacillus thermoglucosidasius C56-YS93] gi|294976322|gb|EFG51933.1| type III restriction protein res subunit [Geobacillus thermoglucosidasius C56-YS93] Length = 1123 Score = 38.1 bits (87), Expect = 0.37, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 38/84 (45%), Gaps = 9/84 (10%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE- 64 + I E+ +R R L +I +EI++ +R++GN+AVH + E+ L Sbjct: 94 EEIREERGTDQQERLRVLFYEQIIPKEIYDLLTVIRLKGNEAVHNPSYGEVNEAKALLHM 153 Query: 65 -------FVNLFCDIVFTLPALIK 81 F+ ++ D F P I+ Sbjct: 154 AFRIAVWFMEVYGDWSFQAPEYIE 177 >gi|213022360|ref|ZP_03336807.1| hypothetical protein Salmonelentericaenterica_06794 [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 217 Score = 38.1 bits (87), Expect = 0.40, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 37/83 (44%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ R L + L + I + ++ RI GN+AVH G+ + + + +N+ D Sbjct: 119 NINTDIRSLVKKGLPVR-IQQAADICRIVGNQAVHPGEINLDDDPKLAHGLFKLLNIIVD 177 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P +++ + P R G Sbjct: 178 DQITRPKELEDMFLSMPEGPRQG 200 >gi|240146119|ref|ZP_04744720.1| type I restriction-modification system, R subunit [Roseburia intestinalis L1-82] gi|257201772|gb|EEV00057.1| type I restriction-modification system, R subunit [Roseburia intestinalis L1-82] Length = 1091 Score = 37.7 bits (86), Expect = 0.47, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 27/65 (41%) Query: 8 RINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 RI R L++ L+ ++ + + +R NKAVHE S+ E LE Sbjct: 53 RIQLPYDNTAVTRIDNLQREGLLTRDLTDILHALRKARNKAVHENYESVSECKILLEMAY 112 Query: 68 LFCDI 72 C+ Sbjct: 113 SLCEW 117 >gi|323141888|ref|ZP_08076749.1| helicase C-terminal domain protein [Phascolarctobacterium sp. YIT 12067] gi|322413635|gb|EFY04493.1| helicase C-terminal domain protein [Phascolarctobacterium sp. YIT 12067] Length = 1091 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 34/69 (49%), Gaps = 7/69 (10%) Query: 2 KDDQGQRINFEKCGM-------LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS 54 +D + +F+ M L ++ R L+ I E++ + +R+ GN+A HEG Sbjct: 42 EDIVNKMFDFDNIVMPYQMKDTLVNKIRLLENEEYITEDLARIMHRLRLGGNEARHEGTD 101 Query: 55 SIEESDECL 63 S++++ L Sbjct: 102 SLQKAKLLL 110 >gi|187734081|ref|YP_001881654.1| hypothetical protein SbBS512_E3301 [Shigella boydii CDC 3083-94] gi|187431073|gb|ACD10347.1| conserved hypothetical protein [Shigella boydii CDC 3083-94] Length = 212 Score = 37.7 bits (86), Expect = 0.49, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 35/83 (42%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ R L Q L + I + ++ RI GN+AVH G+ + + + +N+ Sbjct: 114 NINADIRSLVQKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLAHGLFKLLNIIVT 172 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 173 EQITRPKEIEAMFQSMPEGPRQG 195 >gi|312972898|ref|ZP_07787071.1| hypothetical protein EC182770_3443 [Escherichia coli 1827-70] gi|310332840|gb|EFQ00054.1| hypothetical protein EC182770_3443 [Escherichia coli 1827-70] Length = 132 Score = 37.7 bits (86), Expect = 0.54, Method: Composition-based stats. Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 8/74 (10%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFTL 76 ++D L Q L EEI + + R+ GN AVH G+ +I++S E +F L Sbjct: 45 INDDIATLVQKGL-PEEIQQALDICRVVGNNAVHPGEINIDDSPEIAA-------SLFGL 96 Query: 77 PALIKEKKSTHPNQ 90 LI E++ T P + Sbjct: 97 INLIVEERITRPQK 110 >gi|138894433|ref|YP_001124886.1| type I restriction enzyme EcoKI subunit R [Geobacillus thermodenitrificans NG80-2] gi|134265946|gb|ABO66141.1| Type I restriction enzyme EcoKI R protein [Geobacillus thermodenitrificans NG80-2] Length = 1081 Score = 37.7 bits (86), Expect = 0.54, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 38/84 (45%), Gaps = 9/84 (10%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE- 64 + I E+ +R R L +I +EI++ +R++GN+AVH + E+ L Sbjct: 52 EEIREERGTDQQERLRILFYEQIIPKEIYDLFTVIRLKGNEAVHNPSYGEVNEAKALLHM 111 Query: 65 -------FVNLFCDIVFTLPALIK 81 F+ ++ D F P I+ Sbjct: 112 AFRIAVWFMEVYGDWSFQTPEYIE 135 >gi|256024627|ref|ZP_05438492.1| hypothetical protein E4_14714 [Escherichia sp. 4_1_40B] Length = 213 Score = 37.7 bits (86), Expect = 0.56, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 38/83 (45%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCD 71 ++ + L + L I + ++ RI GN+AVH G+ S+++ + + +N+ D Sbjct: 123 NINADIKSLVEKGLPPR-IQQAADVCRIVGNQAVHPGEISLDDDPQLTHGLFKLLNIIVD 181 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 182 DRITRPKEIEAMFQSMPEGPRQG 204 >gi|52144400|ref|YP_082428.1| hypothetical protein BCZK0824 [Bacillus cereus E33L] gi|51977869|gb|AAU19419.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 754 Score = 37.3 bits (85), Expect = 0.70, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 3/75 (4%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVFT 75 L D+ YL + I E+ + VR GNKA H+G I + + + ++ ++ Sbjct: 65 LYDKISYLAKEGYITAEVQRDFDTVRFTGNKAAHDGSFNDISAAFKLHKVMHNIAVWLYE 124 Query: 76 L--PALIKEKKSTHP 88 + P +K HP Sbjct: 125 VYSPEQLKIPAYEHP 139 >gi|300947579|ref|ZP_07161753.1| conserved hypothetical protein [Escherichia coli MS 116-1] gi|300452830|gb|EFK16450.1| conserved hypothetical protein [Escherichia coli MS 116-1] Length = 210 Score = 36.9 bits (84), Expect = 0.84, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 37/82 (45%), Gaps = 5/82 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCD 71 ++ + L + L I + ++ RI GN+AVH G+ S+++ + + +N+ D Sbjct: 119 NINKDIKSLVEKGLPPR-IQQAADICRIVGNQAVHPGEISLDDDPQLTHGLFKLLNIIVD 177 Query: 72 IVFTLPALIKEKKSTHPNQSRD 93 T P I+ + P R Sbjct: 178 DRITRPKEIEAMFQSMPEGPRQ 199 >gi|228926089|ref|ZP_04089167.1| hypothetical protein bthur0010_8110 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228833582|gb|EEM79141.1| hypothetical protein bthur0010_8110 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 754 Score = 36.9 bits (84), Expect = 0.84, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 3/75 (4%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQS-SIEESDECLEFVNLFCDIVFT 75 L D+ YL + I E+ + VR GNKA H+G I + + + ++ ++ Sbjct: 65 LYDKISYLAKEGYITAEVQRDFDTVRFTGNKAAHDGSFNDISAAFKLHKVMHNIAVWLYE 124 Query: 76 L--PALIKEKKSTHP 88 + P +K HP Sbjct: 125 VYSPEQLKIPAYEHP 139 >gi|120600671|ref|YP_965245.1| Sel1 domain-containing protein [Shewanella sp. W3-18-1] gi|146291428|ref|YP_001181852.1| Sel1 domain-containing protein [Shewanella putrefaciens CN-32] gi|120560764|gb|ABM26691.1| Sel1 domain protein repeat-containing protein [Shewanella sp. W3-18-1] gi|145563118|gb|ABP74053.1| Sel1 domain protein repeat-containing protein [Shewanella putrefaciens CN-32] Length = 484 Score = 36.9 bits (84), Expect = 0.92, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 2/74 (2%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLE 64 Q +RI F L DR L Q LI + + +R +GN+ H + + + ++ L Sbjct: 55 QDKRIEFSSP-NLYDRIEQLNQKRLIDVKTTRALHRLRADGNRGAHPEKYHLTQ-EQLLA 112 Query: 65 FVNLFCDIVFTLPA 78 V TL Sbjct: 113 LAQKTIKDVLTLVE 126 >gi|319428341|gb|ADV56415.1| Sel1 domain protein repeat-containing protein [Shewanella putrefaciens 200] Length = 484 Score = 36.9 bits (84), Expect = 0.93, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 2/74 (2%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLE 64 Q +RI F L DR L Q LI + + +R +GN+ H + + + ++ L Sbjct: 55 QDKRIEFSSP-NLYDRIEQLNQKRLIDVKTTRALHRLRADGNRGAHPEKYHLTQ-EQLLA 112 Query: 65 FVNLFCDIVFTLPA 78 V TL Sbjct: 113 LAQKTIKDVLTLVE 126 >gi|323154740|gb|EFZ40938.1| hypothetical protein ECEPECA14_3348 [Escherichia coli EPECa14] Length = 108 Score = 36.9 bits (84), Expect = 0.94, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 5/83 (6%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCD 71 ++ R L Q L + I + ++ RI GN+AVH G+ + + + +N+ D Sbjct: 10 NINKDIRSLVQKGLPVR-IQQAADICRIVGNQAVHPGEISLDDDPQLAHGLFKLLNIIVD 68 Query: 72 IVFTLPALIKEKKSTHPNQSRDG 94 T P I+ + P R G Sbjct: 69 DRITRPKEIEAMFQSMPEGPRQG 91 >gi|260663335|ref|ZP_05864226.1| conserved hypothetical protein [Lactobacillus fermentum 28-3-CHN] gi|260552187|gb|EEX25239.1| conserved hypothetical protein [Lactobacillus fermentum 28-3-CHN] Length = 204 Score = 36.9 bits (84), Expect = 0.94, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ-SSIEESDECLEFVNLFCDIVFT 75 YL +H+ EW + +R GN+A HE + ++ EE+ ++F + + + Sbjct: 133 FIQYVGYLNEHHYAGARSDEWVDQIRQFGNQANHEIRINTKEEAKRIIKFCEMILKLNYE 192 Query: 76 LPALIKEKKSTH 87 P++ + S + Sbjct: 193 YPSIAYDNDSNN 204 >gi|282900741|ref|ZP_06308683.1| hypothetical protein CRC_02103 [Cylindrospermopsis raciborskii CS-505] gi|281194541|gb|EFA69496.1| hypothetical protein CRC_02103 [Cylindrospermopsis raciborskii CS-505] Length = 190 Score = 36.9 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 8/58 (13%), Positives = 29/58 (50%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 L ++ ++Q +I ++ W++ +R++ + + ++ + +E +L + +F Sbjct: 116 LKEKLEIMRQQEIINHHLYSWASNLRLQDLSHEVDINFNQNDAQQIVELTDLLIEYIF 173 >gi|269976585|ref|ZP_06183570.1| type I restriction enzyme EcoKI R protein [Mobiluncus mulieris 28-1] gi|269935386|gb|EEZ91935.1| type I restriction enzyme EcoKI R protein [Mobiluncus mulieris 28-1] Length = 1085 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 24/48 (50%) Query: 20 RTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 R L++ L+ +++ + +R NKAVHEG + + L ++ Sbjct: 65 RIDTLQREGLLPQDVVNVLHILRKARNKAVHEGWGDTDTATRFLPVIH 112 >gi|289583327|ref|YP_003481737.1| hypothetical protein Nmag_3626 [Natrialba magadii ATCC 43099] gi|289532825|gb|ADD07175.1| conserved hypothetical protein [Natrialba magadii ATCC 43099] Length = 213 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 33/82 (40%), Gaps = 4/82 (4%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ----SSIEESDECLEFVNLFCDI 72 L L I E + + + VR+ GN VH G+ E + E VN+ ++ Sbjct: 126 LYQNIGDLVDDGQIDERVQQALDSVRVTGNDYVHAGEIYNPDDREVALRLFELVNIIVEL 185 Query: 73 VFTLPALIKEKKSTHPNQSRDG 94 T LI+E S P + G Sbjct: 186 TITREKLIEEAYSDIPENKKKG 207 >gi|223934041|ref|ZP_03625994.1| conserved hypothetical protein [Streptococcus suis 89/1591] gi|223897298|gb|EEF63706.1| conserved hypothetical protein [Streptococcus suis 89/1591] Length = 239 Score = 36.6 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 35/70 (50%), Gaps = 5/70 (7%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDI 72 L+ + L + I EI + + VR+ GN AVH GQ I+++ E L FVNL D Sbjct: 153 LNKKIGNLVAKGMPI-EIQQMLDSVRVIGNNAVHPGQIEIQDNKELALSLLNFVNLITDS 211 Query: 73 VFTLPALIKE 82 + P I E Sbjct: 212 QISQPKKIAE 221 >gi|323136161|ref|ZP_08071243.1| Type I site-specific deoxyribonuclease [Methylocystis sp. ATCC 49242] gi|322398235|gb|EFY00755.1| Type I site-specific deoxyribonuclease [Methylocystis sp. ATCC 49242] Length = 1106 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 13/51 (25%), Positives = 25/51 (49%) Query: 9 INFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEES 59 + ++ + R L+Q L+ + + +F R GN AVHE + E++ Sbjct: 37 LGIDRRANFDEMLRVLRQEGLLPRQAADVFHFFRKVGNLAVHENSGTPEQA 87 >gi|261419105|ref|YP_003252787.1| type I restriction enzyme EcoKI subunit R [Geobacillus sp. Y412MC61] gi|319765922|ref|YP_004131423.1| type I site-specific deoxyribonuclease [Geobacillus sp. Y412MC52] gi|261375562|gb|ACX78305.1| type III restriction protein res subunit [Geobacillus sp. Y412MC61] gi|317110788|gb|ADU93280.1| Type I site-specific deoxyribonuclease [Geobacillus sp. Y412MC52] Length = 1080 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 11/85 (12%) Query: 18 SDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNL 68 DR + L +I +EI++ +R++GN+AVH+ + E+ L F+ + Sbjct: 63 QDRLKVLLYEQIIPKEIYDIFTMIRLKGNQAVHDPNYGDVHEAKTLLHMAFRLAVWFMEV 122 Query: 69 FCDIVFTLPALIKEKKSTHPNQSRD 93 + D F P + ++S D Sbjct: 123 YGDWSFEAPQY--REPLPSSSESTD 145 >gi|297530926|ref|YP_003672201.1| type I site-specific deoxyribonuclease [Geobacillus sp. C56-T3] gi|297254178|gb|ADI27624.1| Type I site-specific deoxyribonuclease [Geobacillus sp. C56-T3] Length = 1080 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 11/85 (12%) Query: 18 SDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNL 68 DR + L +I +EI++ +R++GN+AVH+ + E+ L F+ + Sbjct: 63 QDRLKVLLYEQIIPKEIYDIFTMIRLKGNQAVHDPNYGDVHEAKTLLHMAFRLAVWFMEV 122 Query: 69 FCDIVFTLPALIKEKKSTHPNQSRD 93 + D F P + ++S D Sbjct: 123 YGDWSFEAPQY--REPLPSSSESTD 145 >gi|91787108|ref|YP_548060.1| hypothetical protein Bpro_1211 [Polaromonas sp. JS666] gi|91696333|gb|ABE43162.1| conserved hypothetical protein [Polaromonas sp. JS666] Length = 199 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 2/68 (2%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG--QSSIEESDECLEFVNLFCDIVF 74 L+ + LK +I E IF W +R N H + + E++ + L+F C+ V+ Sbjct: 124 LAAGLKKLKDDGVIDERIFNWGEALRENRNLGAHATAIKVTKEDARDLLDFGLAICEYVY 183 Query: 75 TLPALIKE 82 L Sbjct: 184 VLNEKFNR 191 >gi|251798710|ref|YP_003013441.1| type I restriction enzyme EcoKI subunit R [Paenibacillus sp. JDR-2] gi|247546336|gb|ACT03355.1| type III restriction protein res subunit [Paenibacillus sp. JDR-2] Length = 1083 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 9/84 (10%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDE---- 61 + ++ K DR LK+ +LI ++I ++ + +R GNKAVHE G S E+ Sbjct: 52 EEMDETKELTQVDRLSQLKRDDLISDDIVDYLHTLRKIGNKAVHESGYGSTREAQASTLL 111 Query: 62 ----CLEFVNLFCDIVFTLPALIK 81 + F+ ++ D F P I+ Sbjct: 112 AFRLSVWFMQVYGDWNFQAPDYIE 135 >gi|157373391|ref|YP_001471991.1| Sel1 domain-containing protein [Shewanella sediminis HAW-EB3] gi|157315765|gb|ABV34863.1| Sel1 domain protein repeat-containing protein [Shewanella sediminis HAW-EB3] Length = 472 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 2/82 (2%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 +RI F+ L DR L + +I + +R +GN+ H + + S++ LE Sbjct: 52 KRITFDSP-NLYDRIEMLNRKRVINVRTTRALHKLRGDGNRGAHPEKYHL-TSEQLLELS 109 Query: 67 NLFCDIVFTLPALIKEKKSTHP 88 + + +L A + + +T+P Sbjct: 110 EKSIEKLLSLVASLFTQVTTNP 131 >gi|300118619|ref|ZP_07056357.1| type I restriction enzyme EcoKI subunit R [Bacillus cereus SJ1] gi|298724008|gb|EFI64712.1| type I restriction enzyme EcoKI subunit R [Bacillus cereus SJ1] Length = 1437 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 15/85 (17%) Query: 21 TRYLKQHNLIIEEI---FEWSNFVRIEGNKAVHEGQ-SSIEES--------DECLEFVNL 68 L + ++ EE+ FEW +R GNKAVHE S+E++ D + F+ L Sbjct: 68 IHKLYREGILNEEMHFRFEW---IRKMGNKAVHEANFGSVEDALKAHKLTYDLAVWFMEL 124 Query: 69 FCDIVFTLPALIKEKKSTHPNQSRD 93 + D+ F PA K T + D Sbjct: 125 YGDVNFKAPAYASPKAVTEQKVNTD 149 >gi|288559579|ref|YP_003423065.1| dnd system-associated protein 3 [Methanobrevibacter ruminantium M1] gi|288542289|gb|ADC46173.1| dnd system-associated protein 3 [Methanobrevibacter ruminantium M1] Length = 749 Score = 35.8 bits (81), Expect = 1.8, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 1/58 (1%) Query: 18 SDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL-EFVNLFCDIVF 74 R L +I +I++ N +R NKAVH S IE++ L ++ L C + Sbjct: 65 KKRLEMLGYKGIISYDIYKRLNHIRKIRNKAVHGHLSDIEDNANILHAYLYLICAYFY 122 >gi|229096946|ref|ZP_04227915.1| hypothetical protein bcere0020_21930 [Bacillus cereus Rock3-29] gi|228686556|gb|EEL40465.1| hypothetical protein bcere0020_21930 [Bacillus cereus Rock3-29] Length = 244 Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats. Identities = 19/70 (27%), Positives = 37/70 (52%), Gaps = 2/70 (2%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDI 72 K + + LK+ + I + +F+ + VR+ GN AVH G+ I+++ + + LF + Sbjct: 165 KGNNIYQNIKLLKERDNINDVVFDALDAVRLVGNNAVHPGKIKIDDNPKIA--ITLFWLL 222 Query: 73 VFTLPALIKE 82 F + LI + Sbjct: 223 NFIVEELISK 232 >gi|188589957|ref|YP_001921313.1| hypothetical protein CLH_1932 [Clostridium botulinum E3 str. Alaska E43] gi|188500238|gb|ACD53374.1| conserved hypothetical protein [Clostridium botulinum E3 str. Alaska E43] Length = 800 Score = 35.4 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 1/48 (2%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSIEES 59 +R R L++ ++ E I + + VR+ GNKA H + +E + Sbjct: 59 NTITQVERLRKLEEEGILTENIDKLFHVVRLLGNKAAHSNLEGELEAA 106 >gi|259503492|ref|ZP_05746394.1| conserved hypothetical protein [Lactobacillus antri DSM 16041] gi|259168570|gb|EEW53065.1| conserved hypothetical protein [Lactobacillus antri DSM 16041] Length = 204 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 1/70 (1%) Query: 13 KCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ-SSIEESDECLEFVNLFCD 71 + YL +H+ +W + +R GN+A HE + ++ EE+ ++F + Sbjct: 130 DNLRFIEYVDYLNEHHYAGVRSEQWVDQIRQFGNQANHEIRINTKEEAQRIIKFCEMILK 189 Query: 72 IVFTLPALIK 81 + + P++ Sbjct: 190 LNYEYPSIAS 199 >gi|149004391|ref|ZP_01829131.1| hypothetical protein CGSSp14BS69_07031 [Streptococcus pneumoniae SP14-BS69] gi|149004569|ref|ZP_01829262.1| hypothetical protein CGSSp14BS69_06037 [Streptococcus pneumoniae SP14-BS69] gi|147757543|gb|EDK64568.1| hypothetical protein CGSSp14BS69_06037 [Streptococcus pneumoniae SP14-BS69] gi|147757640|gb|EDK64658.1| hypothetical protein CGSSp14BS69_07031 [Streptococcus pneumoniae SP14-BS69] Length = 236 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 5/81 (6%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDI 72 L+ + L + I EI + + VR+ GN AVH GQ I+++ E L F+NL D Sbjct: 150 LNTQIGSLVSKGMPI-EIQQMLDSVRVIGNNAVHPGQIDIKDNKELALSLLSFINLIVDN 208 Query: 73 VFTLPALIKEKKSTHPNQSRD 93 T P I + + P+ R+ Sbjct: 209 RITQPKKILDIYNLLPDSYRN 229 >gi|297584947|ref|YP_003700727.1| hypothetical protein Bsel_2662 [Bacillus selenitireducens MLS10] gi|297143404|gb|ADI00162.1| hypothetical protein Bsel_2662 [Bacillus selenitireducens MLS10] Length = 397 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 15/38 (39%), Positives = 19/38 (50%) Query: 14 CGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE 51 C L DR R LK H E++ + +R GN A HE Sbjct: 62 CENLLDRIRLLKSHGKCSEDVLNAIHEIRKAGNSAAHE 99 >gi|291485261|dbj|BAI86336.1| hypothetical protein BSNT_04129 [Bacillus subtilis subsp. natto BEST195] Length = 1068 Score = 35.4 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 38/84 (45%), Gaps = 11/84 (13%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLE--------FVNLF 69 DR L++ L+ E+ + +R +GN A+HE +EE+ L+ F+ ++ Sbjct: 64 DRINTLRRERLLEPELIDILEALRRKGNVAMHEADYGKVEEAKALLQLTFRLSIWFMEVY 123 Query: 70 CDIVFTLPALIKEKKSTHPNQSRD 93 D F P + ++ QS + Sbjct: 124 GDWDFQAPEYTELQEQV--EQSTE 145 >gi|253583388|ref|ZP_04860586.1| type I restriction enzyme EcoKI subunit R [Fusobacterium varium ATCC 27725] gi|251833960|gb|EES62523.1| type I restriction enzyme EcoKI subunit R [Fusobacterium varium ATCC 27725] Length = 1088 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 20/64 (31%), Positives = 33/64 (51%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 +++N + G DR + LK ++LI E+I +R +GN AVH + E++ L V Sbjct: 52 EKLNEPESGSQLDRIKILKTYDLIPEDIENILQLIRKKGNTAVHNMSGNESEAETLLSLV 111 Query: 67 NLFC 70 C Sbjct: 112 VKLC 115 >gi|217971401|ref|YP_002356152.1| Sel1 domain-containing protein repeat-containing protein [Shewanella baltica OS223] gi|217496536|gb|ACK44729.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS223] Length = 491 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Q ++I F L DR L Q +I + + +R +GN+ H + + ++ Sbjct: 55 QDKQIAFSSP-NLYDRIEQLNQQRVIDVKTTRALHRLRADGNRGAHPEKYHLTQAQ 109 >gi|288576378|ref|ZP_05978685.2| conserved hypothetical protein [Neisseria mucosa ATCC 25996] gi|288565657|gb|EFC87217.1| conserved hypothetical protein [Neisseria mucosa ATCC 25996] Length = 233 Score = 35.0 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCD 71 E+ ++ R L + ++ ++ + ++ +RI GN AVH GQ E+ D+ +F Sbjct: 124 EEGKNINTDIRSLVKKEVLSGQVVKVADTLRITGNNAVHPGQIVDEDFDKVAA--KMFDL 181 Query: 72 IVFTLPALIKEKK 84 I F + I E K Sbjct: 182 INFIVKKAITEPK 194 >gi|171779682|ref|ZP_02920638.1| hypothetical protein STRINF_01519 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281784|gb|EDT47218.1| hypothetical protein STRINF_01519 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 244 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 5/70 (7%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE----CLEFVNLFCDI 72 L+++ YL + + I EI + + VR+ GN AVH GQ ++++ E L FVNL D Sbjct: 158 LNNKIGYLVRQGMPI-EIQQMLDSVRVVGNNAVHPGQIDLKDNKELAASLLTFVNLIVDN 216 Query: 73 VFTLPALIKE 82 + P IK Sbjct: 217 RISQPKKIKS 226 >gi|74316944|ref|YP_314684.1| hypothetical protein Tbd_0926 [Thiobacillus denitrificans ATCC 25259] gi|74056439|gb|AAZ96879.1| hypothetical protein Tbd_0926 [Thiobacillus denitrificans ATCC 25259] Length = 216 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 1/80 (1%) Query: 15 GMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSIEESDECLEFVNLFCDIV 73 G S + L++H I + E + GN A H G Q + ++ + ++ V V Sbjct: 135 GSFSAKLDALEKHGAIGAKSKEILHAALDAGNAASHRGYQPTTDDINAVMDIVENLLQAV 194 Query: 74 FTLPALIKEKKSTHPNQSRD 93 + L L + K P +S+ Sbjct: 195 YHLKTLAESLKKATPTRSQK 214 >gi|283797584|ref|ZP_06346737.1| N-acetylmuramoyl-L-alanine amidase [Clostridium sp. M62/1] gi|291074693|gb|EFE12057.1| N-acetylmuramoyl-L-alanine amidase [Clostridium sp. M62/1] Length = 306 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 8/61 (13%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE--------GQSSIEESDECLEFVNL 68 L D L ++ +I + E + +R+ GNKAVHE GQ+ S E F N Sbjct: 59 LVDMIDALYENGIINKTTCEHYHKIRMIGNKAVHEEDNLAYNAGQAYHLLSQEIYTFAND 118 Query: 69 F 69 F Sbjct: 119 F 119 >gi|150388682|ref|YP_001318731.1| type I restriction enzyme EcoKI subunit R [Alkaliphilus metalliredigens QYMF] gi|149948544|gb|ABR47072.1| type III restriction protein, res subunit [Alkaliphilus metalliredigens QYMF] Length = 1086 Score = 35.0 bits (79), Expect = 3.7, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 26/49 (53%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 +R + LK+ LI + I + +RI NKAVH G S E+ LE + Sbjct: 65 NRIKLLKKEGLISQNIDDIIYSLRIARNKAVHSGYDSFEDCVILLEMGH 113 >gi|282897432|ref|ZP_06305434.1| hypothetical protein CRD_02356 [Raphidiopsis brookii D9] gi|281198084|gb|EFA72978.1| hypothetical protein CRD_02356 [Raphidiopsis brookii D9] Length = 190 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 7/58 (12%), Positives = 29/58 (50%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVF 74 L ++ ++Q +I ++ W++ ++++ + + ++ + +E +L + +F Sbjct: 116 LKEKLEIMRQQEIINHHLYNWASNLKLQDLSCDVDINFNQNDAQQIVELTDLVIEYIF 173 >gi|315650928|ref|ZP_07903969.1| N-acetylmuramoyl-L-alanine amidase [Eubacterium saburreum DSM 3986] gi|315486842|gb|EFU77183.1| N-acetylmuramoyl-L-alanine amidase [Eubacterium saburreum DSM 3986] Length = 293 Score = 35.0 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 13/43 (30%), Positives = 22/43 (51%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEES 59 L+D L N+I ++ E + +R+ GNKAVH+ + Sbjct: 58 LADIIDSLYTSNIISQKSCEHYHKIRVIGNKAVHDNSDDSYSA 100 >gi|302874004|ref|YP_003842637.1| Type I site-specific deoxyribonuclease [Clostridium cellulovorans 743B] gi|307689747|ref|ZP_07632193.1| type I restriction enzyme EcoKI subunit R [Clostridium cellulovorans 743B] gi|302576861|gb|ADL50873.1| Type I site-specific deoxyribonuclease [Clostridium cellulovorans 743B] Length = 1085 Score = 34.6 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 16/51 (31%), Positives = 29/51 (56%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFV 66 ++R + LK+ +LI E+I + +RI+ N A H+G +E++ LE Sbjct: 63 NHNNRIKLLKKEDLIPEDIDNILHMLRIKRNSAAHQGYEDVEKAKVQLELT 113 >gi|227820719|ref|YP_002824689.1| type I restriction enzyme EcoKI subunit R [Sinorhizobium fredii NGR234] gi|227339718|gb|ACP23936.1| putative restriction endonuclease type I with R subunit / type III with Res subunit [Sinorhizobium fredii NGR234] Length = 1140 Score = 34.6 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 22/49 (44%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 D R L ++ E+ + + VR GN+A H S E+ L+F Sbjct: 69 DLLRRLATQQILPREVADIFHAVRKSGNEATHNLAGSPTEALAALKFCR 117 >gi|228964014|ref|ZP_04125144.1| hypothetical protein bthur0004_8740 [Bacillus thuringiensis serovar sotto str. T04001] gi|228795666|gb|EEM43143.1| hypothetical protein bthur0004_8740 [Bacillus thuringiensis serovar sotto str. T04001] Length = 1430 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 15/85 (17%) Query: 21 TRYLKQHNLIIEEI---FEWSNFVRIEGNKAVHEGQ-SSIEES--------DECLEFVNL 68 L + ++ EE+ FEW +R GNKAVHE SIE++ D + F+ L Sbjct: 68 IHKLYREGILNEEMHFRFEW---IRKMGNKAVHEANFGSIEDALKAHKLTYDLAVWFMEL 124 Query: 69 FCDIVFTLPALIKEKKSTHPNQSRD 93 + D+ F PA K + + D Sbjct: 125 YGDVNFKAPAYASPKANAEQKVNTD 149 >gi|160877378|ref|YP_001556694.1| Sel1 domain-containing protein [Shewanella baltica OS195] gi|160862900|gb|ABX51434.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS195] gi|315269581|gb|ADT96434.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS678] Length = 491 Score = 34.6 bits (78), Expect = 4.0, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Q ++I F L DR L Q +I + + +R +GN+ H + + ++ Sbjct: 55 QDKQIAFSSP-NLYDRIEQLNQQRVIDVKTTRALHRLRADGNRGAHPEKYHLTQAQ 109 >gi|209526219|ref|ZP_03274749.1| type III restriction protein res subunit [Arthrospira maxima CS-328] gi|209493316|gb|EDZ93641.1| type III restriction protein res subunit [Arthrospira maxima CS-328] Length = 1128 Score = 34.6 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 22/49 (44%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 D + LK +I E+ + + +RI GN+A HE + L+ Sbjct: 67 DLLKRLKFQRVISPEVADLFHQIRIAGNRATHEYIGDHRSALTMLKMAR 115 >gi|194466767|ref|ZP_03072754.1| conserved hypothetical protein [Lactobacillus reuteri 100-23] gi|194453803|gb|EDX42700.1| conserved hypothetical protein [Lactobacillus reuteri 100-23] Length = 202 Score = 34.6 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 1/66 (1%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHE-GQSSIEESDECLEFVNLFCDIVFT 75 YL +++ +W N +R GN+A HE +S EE+ ++F + + + Sbjct: 132 FIQYVNYLNENHYAGARSEQWINQIRQFGNQANHEIIINSKEEAQRIIKFCEMILKLNYE 191 Query: 76 LPALIK 81 P++ Sbjct: 192 YPSIAS 197 >gi|295115044|emb|CBL35891.1| Bacterial SH3 domain. [butyrate-producing bacterium SM4/1] Length = 306 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 20/37 (54%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ 53 L D L ++ +I + E + +R+ GNKAVHE Sbjct: 59 LVDMIDALYENGIINKTTCEHYHKIRMIGNKAVHEED 95 >gi|257452213|ref|ZP_05617512.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] gi|317058756|ref|ZP_07923241.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] gi|313684432|gb|EFS21267.1| type I restriction enzyme EcoKI subunit R [Fusobacterium sp. 3_1_5R] Length = 1088 Score = 34.6 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 9/86 (10%) Query: 7 QRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLE-- 64 +RI+ K M SDR LK++ LI E+I + +R +GNKAVH E ++ L Sbjct: 52 ERISDSKKNMASDRLLALKKYELIPEDIEKILTTLRKKGNKAVHGIYGDEETAETLLSMA 111 Query: 65 ------FVNLF-CDIVFTLPALIKEK 83 F ++ D+ FT +I +K Sbjct: 112 VKVAAWFQEVYGSDLSFTSEEIIYQK 137 >gi|295089932|emb|CBK76039.1| Bacterial SH3 domain. [Clostridium cf. saccharolyticum K10] Length = 306 Score = 34.6 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 20/37 (54%) Query: 17 LSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQ 53 L D L ++ +I + E + +R+ GNKAVHE Sbjct: 59 LVDMIDALYENGIINKTTCEHYHKIRMIGNKAVHEED 95 >gi|254455122|ref|ZP_05068557.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198263532|gb|EDY87804.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 276 Score = 34.6 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 37/83 (44%), Gaps = 11/83 (13%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLE 64 +G+ I+ + L L ++I + +R+ GN++VH G+ + + + Sbjct: 186 EGKSID--------KDIKALVAKGL-SQKIQRALDVLRVVGNESVHPGELDMRDDSQTA- 235 Query: 65 FVNLFCDIVFTLPALIKEKKSTH 87 + LF + A+I E+K + Sbjct: 236 -LKLFNVFNLIVNAMISEEKLIN 257 >gi|331270258|ref|YP_004396750.1| hypothetical protein CbC4_2086 [Clostridium botulinum BKT015925] gi|329126808|gb|AEB76753.1| conserved hypothetical protein [Clostridium botulinum BKT015925] Length = 790 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 9/66 (13%) Query: 19 DRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEG-QSSIEESDECLE--------FVNLF 69 +R R L+ +I EI + + VR+ GNKA HE + +E + + FV + Sbjct: 65 ERLRKLENEGVIEGEIDKLFHTVRLLGNKAAHEDVEGELEVALNIHKNIYKITCWFVESY 124 Query: 70 CDIVFT 75 D F Sbjct: 125 IDYNFE 130 >gi|89901611|ref|YP_524082.1| hypothetical protein Rfer_2839 [Rhodoferax ferrireducens T118] gi|89346348|gb|ABD70551.1| hypothetical protein Rfer_2839 [Rhodoferax ferrireducens T118] Length = 271 Score = 34.6 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 24/72 (33%), Gaps = 1/72 (1%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSS-IEESDECLEFVNLFC 70 K L + L ++ + +R GN + HE + + V+ Sbjct: 152 AKGNNLLKQIDDLVSLGILTPGRASVLHQIRTLGNLSAHEAAPHTPAQLGLAMAVVDHLL 211 Query: 71 DIVFTLPALIKE 82 + V+ LP + Sbjct: 212 EEVYILPEKTQR 223 >gi|223934465|ref|ZP_03626386.1| conserved hypothetical protein [bacterium Ellin514] gi|223896928|gb|EEF63368.1| conserved hypothetical protein [bacterium Ellin514] Length = 212 Score = 34.2 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 1/46 (2%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDE 61 ++D R L Q L E I + + VR+ GN+AVH G+ + + E Sbjct: 123 NINDDIRSLVQKGL-DERIQQSLDIVRVIGNEAVHPGEMDLRDKTE 167 >gi|114776879|ref|ZP_01451922.1| hypothetical protein SPV1_11706 [Mariprofundus ferrooxydans PV-1] gi|114552965|gb|EAU55396.1| hypothetical protein SPV1_11706 [Mariprofundus ferrooxydans PV-1] Length = 247 Score = 34.2 bits (77), Expect = 5.4, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 33/69 (47%), Gaps = 3/69 (4%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFT 75 +++ + L L +I + + +R+ GN AVH GQ +E+ E + LF + F Sbjct: 158 NINNDIKALVAEGL-SPKIQQALDLLRVVGNNAVHPGQIDLEDGREIA--LRLFHVLNFI 214 Query: 76 LPALIKEKK 84 +I + K Sbjct: 215 ADEMISKPK 223 >gi|168232881|ref|ZP_02657939.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468971|ref|ZP_03074955.1| type I restriction enzyme EcoKI R protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194455335|gb|EDX44174.1| type I restriction enzyme EcoKI R protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205333006|gb|EDZ19770.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 1169 Score = 34.2 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPACENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|161617831|ref|YP_001591796.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161367195|gb|ABX70963.1| hypothetical protein SPAB_05695 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 1169 Score = 34.2 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPACENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|16767770|ref|NP_463385.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|16423093|gb|AAL23344.1| endonuclease R [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|312915623|dbj|BAJ39597.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 1169 Score = 34.2 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPACENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|167991324|ref|ZP_02572423.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|168243976|ref|ZP_02668908.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194449776|ref|YP_002048549.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197262099|ref|ZP_03162173.1| type I restriction enzyme EcoKI R protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|194408080|gb|ACF68299.1| type I restriction enzyme EcoKI R protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197240354|gb|EDY22974.1| type I restriction enzyme EcoKI R protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205330415|gb|EDZ17179.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|205336991|gb|EDZ23755.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|261249611|emb|CBG27481.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267996884|gb|ACY91769.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301161009|emb|CBW20546.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|323132868|gb|ADX20298.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332991335|gb|AEF10318.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 1169 Score = 34.2 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPACENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|126172445|ref|YP_001048594.1| Sel1 domain-containing protein [Shewanella baltica OS155] gi|125995650|gb|ABN59725.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS155] Length = 491 Score = 34.2 bits (77), Expect = 6.1, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 2/63 (3%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLE 64 Q + I F L DR L Q +I + + +R +GN+ H + + ++ + L Sbjct: 55 QDKHIVFSSP-NLYDRIEQLNQQRVIDVKTTRALHRLRADGNRGAHPEKYHLTQA-QLLA 112 Query: 65 FVN 67 V Sbjct: 113 LVQ 115 >gi|157375342|ref|YP_001473942.1| hypothetical protein Ssed_2205 [Shewanella sediminis HAW-EB3] gi|157317716|gb|ABV36814.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 219 Score = 34.2 bits (77), Expect = 6.5, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCD 71 EK +++ R L +N + + + ++ VRI GN AVH G+ S ++ D Sbjct: 125 EKGDNINEDIRALASNNTLPPLVVKVADTVRITGNNAVHPGEMSDDDFDHIAS------- 177 Query: 72 IVFTLPALIKEKKSTHPNQ 90 +F L I +K T PN+ Sbjct: 178 KMFELLNFIVKKGITEPNE 196 >gi|304412306|ref|ZP_07393914.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS183] gi|307306090|ref|ZP_07585835.1| Sel1 domain protein repeat-containing protein [Shewanella baltica BA175] gi|304349341|gb|EFM13751.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS183] gi|306910963|gb|EFN41390.1| Sel1 domain protein repeat-containing protein [Shewanella baltica BA175] Length = 491 Score = 33.9 bits (76), Expect = 6.8, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Q ++I F L DR L Q +I + + +R EGN+ H + + ++ Sbjct: 55 QDKQIAFSSP-NLYDRIEQLNQQRVIDVKTTRALHRLRAEGNRGAHPEKYHLTQAQ 109 >gi|213648687|ref|ZP_03378740.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. J185] Length = 1032 Score = 33.9 bits (76), Expect = 7.3, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|153002642|ref|YP_001368323.1| Sel1 domain-containing protein [Shewanella baltica OS185] gi|151367260|gb|ABS10260.1| Sel1 domain protein repeat-containing protein [Shewanella baltica OS185] Length = 491 Score = 33.9 bits (76), Expect = 7.3, Method: Composition-based stats. Identities = 13/56 (23%), Positives = 24/56 (42%), Gaps = 1/56 (1%) Query: 5 QGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESD 60 Q + I F L DR L Q +I + + +R +GN+ H + + ++ Sbjct: 55 QDKHIVFSSP-NLYDRIEQLNQQRVIDVKTTRALHRLRADGNRGAHPEKYHLTQAQ 109 >gi|295135275|ref|YP_003585951.1| hypothetical protein ZPR_3439 [Zunongwangia profunda SM-A87] gi|294983290|gb|ADF53755.1| conserved hypothetical protein [Zunongwangia profunda SM-A87] Length = 298 Score = 33.9 bits (76), Expect = 7.6, Method: Composition-based stats. Identities = 11/43 (25%), Positives = 21/43 (48%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEE 58 L++R L ++ + + + +R GN A+HE + EE Sbjct: 140 NLAERIDGLSKNGHLTTSESKRLHSIRFLGNDALHEMEVPKEE 182 >gi|315181344|gb|ADT88257.1| conserved hypothetical protein [Vibrio furnissii NCTC 11218] gi|315182792|gb|ADT89705.1| hypothetical protein vfu_B01540 [Vibrio furnissii NCTC 11218] Length = 107 Score = 33.9 bits (76), Expect = 8.0, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 8/78 (10%) Query: 16 MLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVNLFCDIVFT 75 ++D L ++ L ++ I + + VR+ GN AVH G+ S+E+ + + +F Sbjct: 18 NINDDIAVLVKNGLPVK-IQQALDVVRVVGNNAVHPGEISLEDHPQVVT-------ALFN 69 Query: 76 LPALIKEKKSTHPNQSRD 93 L +I + + T P Q + Sbjct: 70 LINMIVDNQITQPKQVAE 87 >gi|170718591|ref|YP_001783794.1| exonuclease V subunit alpha [Haemophilus somnus 2336] gi|168826720|gb|ACA32091.1| ATP-dependent exoDNAse (exonuclease V) alpha subunit-like protein - helicase superfamily I member [Haemophilus somnus 2336] Length = 937 Score = 33.9 bits (76), Expect = 8.1, Method: Composition-based stats. Identities = 17/49 (34%), Positives = 24/49 (48%) Query: 9 INFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIE 57 I LSDR R + +EI + + +RI GNKAVH S ++ Sbjct: 54 IELVDNESLSDRMRKADFKTYVPKEIQQKLHLLRIAGNKAVHTSFSHLD 102 >gi|319788902|ref|YP_004090217.1| Type I site-specific deoxyribonuclease [Ruminococcus albus 7] gi|315450769|gb|ADU24331.1| Type I site-specific deoxyribonuclease [Ruminococcus albus 7] Length = 1086 Score = 33.9 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 10/81 (12%) Query: 19 DRTRYLKQHNLIIEE--IFEWSNFVRIEGNKAVHEGQSSIEESDECLE--------FVNL 68 +R + LK+ LI I + +R++ N AVH+ + S++ + L F+ + Sbjct: 65 NRIKILKREGLIDRGGRIDDILYSLRMKRNDAVHKYEDSVDTAKSLLRMAFRLAVWFMEV 124 Query: 69 FCDIVFTLPALIKEKKSTHPN 89 + D F P + + P+ Sbjct: 125 YGDYSFKAPDFVMPQNEPIPD 145 >gi|213621071|ref|ZP_03373854.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 110 Score = 33.9 bits (76), Expect = 8.4, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|168821021|ref|ZP_02833021.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205342336|gb|EDZ29100.1| type III restriction enzyme [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320088961|emb|CBY98717.1| type I restriction-modification system, R subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 1169 Score = 33.9 bits (76), Expect = 8.5, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|56416310|ref|YP_153385.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197365233|ref|YP_002144870.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|56130567|gb|AAV80073.1| subunit R of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197096710|emb|CAR62333.1| subunit R of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] Length = 1169 Score = 33.5 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|16763332|ref|NP_458949.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29144810|ref|NP_808152.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|213428390|ref|ZP_03361140.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] gi|213864870|ref|ZP_03386989.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|25289195|pir||AD1069 type 1 site-specific deoxyribonuclease (EC 3.1.21.3) - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16505641|emb|CAD03371.1| subunit R of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Typhi] gi|29140449|gb|AAO72012.1| subunit R of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] Length = 1169 Score = 33.5 bits (75), Expect = 8.9, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|289810393|ref|ZP_06541022.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 195 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|213583087|ref|ZP_03364913.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] Length = 409 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|213419771|ref|ZP_03352837.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 192 Score = 33.5 bits (75), Expect = 9.2, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|213022348|ref|ZP_03336795.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 548 Score = 33.5 bits (75), Expect = 9.3, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|289207690|ref|YP_003459756.1| efflux transporter RND family, MFP subunit [Thioalkalivibrio sp. K90mix] gi|288943321|gb|ADC71020.1| efflux transporter, RND family, MFP subunit [Thioalkalivibrio sp. K90mix] Length = 349 Score = 33.5 bits (75), Expect = 9.4, Method: Composition-based stats. Identities = 10/65 (15%), Positives = 23/65 (35%) Query: 8 RINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECLEFVN 67 R F + +R + + ++ FE + + + +EE+ E L + Sbjct: 102 RARFNEARSEFNRIQDVFDRGVVSRSEFERAEAELQSARARLRSAEGRVEEAREQLGYTE 161 Query: 68 LFCDI 72 +F Sbjct: 162 VFAPY 166 >gi|298369610|ref|ZP_06980927.1| conserved hypothetical protein [Neisseria sp. oral taxon 014 str. F0314] gi|298282167|gb|EFI23655.1| conserved hypothetical protein [Neisseria sp. oral taxon 014 str. F0314] Length = 220 Score = 33.5 bits (75), Expect = 9.6, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 36/87 (41%), Gaps = 4/87 (4%) Query: 12 EKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSS----IEESDECLEFVN 67 E+ ++ R L + + + ++ +RI GN AVH GQ S + + + + +N Sbjct: 121 EEGKNINTDIRSLVNKGVFSGRVVQVADTLRITGNNAVHPGQISDADFDKAAAKMFDLIN 180 Query: 68 LFCDIVFTLPALIKEKKSTHPNQSRDG 94 T P + E P +R+ Sbjct: 181 FIVKKAITEPKELDELYQLMPENARNA 207 >gi|289824087|ref|ZP_06543684.1| type I restriction enzyme EcoKI subunit R [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] Length = 391 Score = 33.5 bits (75), Expect = 9.7, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 28/58 (48%) Query: 6 GQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEGNKAVHEGQSSIEESDECL 63 GQ ++ C D R L + I + I + +R GN AVHE + ++++ CL Sbjct: 50 GQLLDIPVCENQHDLLRELGKIAFIDDSILSVFHKLRRIGNLAVHEFHNDLDDAQMCL 107 >gi|46446740|ref|YP_008105.1| isoamylase [Candidatus Protochlamydia amoebophila UWE25] gi|46400381|emb|CAF23830.1| probable isoamylase [Candidatus Protochlamydia amoebophila UWE25] Length = 670 Score = 33.5 bits (75), Expect = 9.8, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 1/45 (2%) Query: 1 MKDDQGQRINFEKCGMLSDRTRYLKQHNLIIEEIFEWSNFVRIEG 45 M D+QG +NF CG + + + II+ + W +R++G Sbjct: 279 MIDEQGNYLNFSGCGNTFNANHPIVKE-FIIQSLRYWVTEMRVDG 322 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.314 0.136 0.363 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,635,596,587 Number of Sequences: 14124377 Number of extensions: 50878499 Number of successful extensions: 125251 Number of sequences better than 10.0: 174 Number of HSP's better than 10.0 without gapping: 110 Number of HSP's successfully gapped in prelim test: 93 Number of HSP's that attempted gapping in prelim test: 125107 Number of HSP's gapped (non-prelim): 209 length of query: 94 length of database: 4,842,793,630 effective HSP length: 64 effective length of query: 30 effective length of database: 3,938,833,502 effective search space: 118165005060 effective search space used: 118165005060 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 75 (33.5 bits)